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Title Of The Invention 

NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO BACTEROIDES 
FRAGILIS FOR DIAGNOSTICS AND THERAPEUTICS 

INVENTOR: Gary L.Breton 

Related Applications: 

This application claims the benefit of U.S. Provisional Application Serial Number 
60/128,705, filed April 9, 1999, the entire teachings of which are incorporated herein by 
reference. 



Field Of The Invention 

The invention relates to isolated nucleic acids and polypeptides derived from 
Bacteroidesfragilis that are useful as molecular targets for diagnostics, prophylaxis and 
treatment of pathological conditions, as well as materials and methods for the diagnosis, 
prevention, and amelioration of pathological conditions resulting from bacterial 
infection. 
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BACKGROUND OF THE INVENTION 

The genus Bacteroides is a member of the family Bacteroidaceae. They are 
Gram-negative, obligately anaerobic, nonsporeforming rods. The genus contains at least 
39 species, and are often isolated from sewage as well as the digestive tract of man, 
5 animals, and insects. Bacteroides fragilis was first described in 1898 by Veillon and 
Zuber, but was called Bacillus fragilis. In 1919, Castellani and Chalmers transferred it to 
the Bacteroides genus. The "5. fragilis group" refers to the saccharoclastic bacteroids 
that grow well in bile. Members of this group were previously subspecies of B fragilis 
and include B. fragilis, B. distasonis, B. ovatus, B. thetaiotaomicron, and B.vulgatus 

10 (Castellani and Chalmers. 1984. Genus I. Bacteroides 1919, 959. Krieg and Holt (editors) 
In Bergey's Manual of Systematic Bacteriology, 1 :604-63 1). 

Bacteroides fragilis accounts for only 1% of the normal flora of the human colon, 
but is the most common anaerobe isolated from clinical specimens. It is associated with 
soft tissue infections, abscesses and bacteremia (Moncrief J., et al> 1998. Infect. Immun. 

15 66:1735-1739). B. fragilis has also been associated with infection of the skeletal muscle 
(Katagiri, K., et al 9 1996. J, Dermatology. 23:129-132), and meningitis (Aucher, P., et al, 
1996. Eur. J. Clin. Microbiol. Infect. Dis. 15:820-823). The B. fragilis group is 
responsible for 65% of all anaerobic bacteremia cases, with mortality rates in excess of 
19% (Redondo, M., et al, 1995. Clinical Infectious Disease. 20:1492-1496). 

20 In 1 984, strains of B. fragilis were found to cause diarrhea in newborn lambs 

(Myers, L,, et al> 1984. Infect. Immun. 44:241-244). Subsequently, it has been shown that 
B fragilis is associated with diarrhea in other livestock and young children. These strains 
are called enterotoxigenic strains, because they produced a 20KD metalloprotease 
enterotoxin with intestinal secretory activity (Moncrief J., et al, 1995. Infect. Immun. 

25 63:175-181). 

There has been an increase in antibiotic resistance within the Bacteroides fragilis 
group. While there is still excellent activity of many antibiotics, even some of the most 
potent agents, the carbapenems and the 6-lactamase-inhibitor combinations, are losing 
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activity (Snydman,D., etal, 1996. Clinical Infectious Diseases. 23:S54-65). The 
cefoxitin resistance rate has increased from 0% in 1987 to 22% in 1995 (Bianchini, H., 
etal, 1997. Clinical Infectious Diseases. 25;S268-269). Resistance to metronidazole, co- 
amoxiclav, and imipenem is rare, but strains have been found that are resistant to one or 
5 all of these antibiotics. (Turner,R, et al, 1995/The Lancet. 345:1275-1277). Clindaycin 
resistance has been shown to be transferred between strains by either plasmid or 
transposon mechanisms. (Dalmau, D., et al y 1997. Clinical Infectious Diseases. 24:874- 
877). The increasing resistance to antibiotics commonly used against Bacteroides 
species may eventually lead to failures of these treatments. 
10 Sequencing and analysis of this genome is crucial for the identification of 

essential genes for development of drug targets and to reduce the emerging health threat 
this organism poses. 

SUMMARY OF THE INVENTION 
15 The present invention fulfills the need for diagnostic tools and therapeutics by 

providing bacterial-specific compositions and methods for detecting Bacteroides species 
including B. fragilis , as well as compositions and methods useful for treating and 
preventing Bacteroides infection, in particular, B, fragilis infection, in vertebrates 
including mammals. 

20 The present invention encompasses isolated nucleic acids and polypeptides 

derived from B. fragilis that are useful as reagents for diagnosis of bacterial disease, 
components of effective antibacterial vaccines, and/or as targets for antibacterial drugs 
including anti-fi. fragilis drugs. They can also be used to detect the presence of B. 
fragilis and other Bacteroides species in a sample; and in screening compounds for the 

25 ability to interfere with the B. fragilis life cycle or to inhibit B, fragilis infection. They 

also have use as biocontrol agents for plants. 

In one aspect, the invention features compositions of nucleic acids corresponding 

to entire coding sequences of B. fragilis proteins, including surface or secreted proteins 

or parts thereof, nucleic acids capable of binding mRNA from B. fragilis proteins to 
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block protein translation, and methods for producing B. fragilis proteins or parts thereof 
using peptide synthesis and recombinant DNA techniques. This invention also features 
antibodies and nucleic acids useful as probes to detect 5. fragilis infection. In addition, 
vaccine compositions and methods for the protection or treatment of infection by B. 
5 fragilis are within the scope of this invention. 

The nucleotide sequences provided in SEQ ID NO: 1 - SEQ ID NO: 5222, a 
fragment thereof, or a nucleotide sequence at least about 99.5% identical to a sequence 
contained within SEQ ID NO: 1 - SEQ ID NO: 5222 may be "provided" in a variety of 
medias to facilitate use thereof. As used herein, "provided" refers to a manufacture, 

10 other than an isolated nucleic acid molecule, which contains a nucleotide sequence of the 
present invention, i.e., the nucleotide sequence provided in SEQ ID NO: 1 - SEQ ID NO: 
5222, a fragment thereof, or a nucleotide sequence at least about 99.5% identical to a 
sequence contained within SEQ ID NO: 1 - SEQ ID NO: 5222. Uses for and methods for 
providing nucleotide sequences in a variety of media is well known in the art (see e.g., 

15 EPO Publication No. EP 0 756 006). 

In one application of this embodiment, a nucleotide sequence of the present 
invention can be recorded on computer readable media. As used herein, "computer 
readable media" refers to any media which can be read and accessed directly by a 
computer. Such media include, but are not limited to: magnetic storage media, such as 

20 floppy discs, hard disc storage media, and magnetic tape; optical storage media such as 
CD-ROM; electrical storage media such as RAM and ROM; and hybrids of these 
categories such as magnetic/optical storage media. A person skilled in the art can 
readily appreciate how any of the presently known computer readable media can be used 
to create a manufacture comprising computer readable media having recorded thereon a 

25 nucleotide sequence of the present invention. 

As used herein, "recorded" refers to a process for storing information on 
computer readable media. A person skilled in the art can readily adopt any of the 
presently known methods for recording information on computer readable media to 
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generate manufactures comprising the nucleotide sequence information of the present 
invention. 

A variety of data storage structures are available to a person skilled in the art for 
creating a computer readable media having recorded thereon a nucleotide sequence of the 
5 present invention. The choice of the data storage structure will generally be based on the 
means chosen to access the stored information. In addition, a variety of data processor 
programs and formats can be used to store the nucleotide sequence information of the 
present invention on computer readable media. The sequence information can be 
represented in a word processing text file, formatted in commercially-available software 
10 such as WordPerfect and Microsoft Word, or represented in the form of an ASCII file, 
stored in a database application, such as DB2, Sybase, Oracle, or the like. A person 
skilled in the art can readily adapt any number of data processor structuring formats (e.g. 
text file or database) in order to obtain computer readable media having recorded thereon 
the nucleotide sequence information of the present invention. 
15 By providing the nucleotide sequence of SEQ ID NO: 1 - SEQ ID NO: 5222, a 

fragment thereof, or a nucleotide sequence at least about 99.5% identical to SEQ ID NO: 
1 - SEQ ID NO: 5222 in computer readable form, a person skilled in the art can routinely 
access the coding sequence information for a variety of purposes. Computer software is 
publicly available which allows a person skilled in the art to access sequence information 
20 provided in a computer readable media. Examples of such computer software include 
programs of the "Staden Package", "DNA Star", "MacVector", GCG "Wisconsin 
Package" (Genetics Computer Group, Madison, WI) and "NCBI Toolbox" (National 
Center For Biotechnology Information). Suitable programs are described, for example, 
in Martin J. Bishop, ed., Guide to Human Genome Computing, 2d Edition, Academic 
25 Press, San Diego, CA. (1998); and Leonard F. Peruski, Jr., and Anne Harwood Peruski, 
The Internet and the New Biology: Tools for Genomic and Molecular Research, 
American Society for Microbiology, Washington, D.C. (1997). 
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Computer algorithms enable the identification of B. fragilis open reading frames 
(ORFs) within SEQ ID NO: 1 - SEQ ID NO: 5222 which contain homology to ORFs or 
proteins from other organisms. Examples of such similarity-search algorithms include 
the BLAST [Altschul et al., J. Mol. Biol. 215:403-410 (1990)] and Smith- Waterman 
5 [Smith and Waterman (1 98 1) Advances in Applied Mathematics, 2:482-489] search 
algorithms. Suitable search algorithms are described, for example, in Martin J. Bishop, 
ed., Guide to Human Genome Computing, 2d Edition, Academic Press, San Diego, CA. 
(1998); and Leonard F. Peruski, Jr., and Anne Harwood Peruski, The Internet and the 
New Biology: Tools for Genomic and Molecular Research, American Society for 
10 Microbiology, Washington, D.C. (1 997). Such algorithms are utilized on computer 
systems as exemplified below. The ORFs so identified represent protein encoding 
fragments within the B. fragilis genome and are useful in producing commercially 
important proteins such as enzymes used in fermentation reactions and in the production 
of commercially useful metabolites. 
1 5 The present invention further provides systems, particularly computer-based 

systems, which contain the sequence information described herein. Such systems are 
designed to identify commercially important fragments of the B. fragilis genome. As 
used herein, "a computer-based system" refers to the hardware means, software means, 
and data storage means used to analyze the nucleotide sequence information of the 
20 present invention. The minimum hardware means of the computer-based systems of the 
present invention comprises a central processing unit (CPU), input means, output means, 
and data storage means. A person skilled in the art can readily appreciate that any one of 
the currently available computer-based systems is suitable for use in the present 
invention. The computer-based systems of the present invention comprise a data storage 
25 means having stored therein a nucleotide sequence of the present invention and the 

necessary hardware means and software means for supporting and implementing a search 
means. As used herein, "data storage means" refers to memory which can store 
nucleotide sequence information of the present invention, or a memory access means 



which can access manufactures having recorded thereon the nucleotide sequence 
information of the present invention. 

As used herein, "search means" refers to one or more programs which are 
implemented on the computer-based system to compare a target sequence or target 
5 structural motif with the sequence information stored within the data storage means. 
Search means are used to identify fragments or regions of the B. fragilis genome which 
are similar to, or "match", a particular target sequence or target motif. A variety of 
known algorithms are known in the art and have been disclosed publicly, and a variety of 
commercially available software for conducting homology-based similarity searches are 
10 available and can be used in the computer-based systems of the present invention. 
Examples of such software includes, but is not limited to, FASTA (GCG Wisconsin 
Package), Bic_SW (Compugen Bioccelerator), BLASTN2, BLASTP2, BLASTX2 
(NCBI) and Motifs (GCG). Suitable software programs are described, for example, in 
Martin J. Bishop, ed., Guide to Human Genome Computing, 2d Edition, Academic Press, 
15 San Diego, CA. (1 998); and Leonard F. Peruski, Jr., and Anne Harwood Peruski, The 
Internet and the New Biology: Tools for Genomic and Molecular Research, American 
Society for Microbiology, Washington, D.C. (1 997). A person skilled in the art can 
readily recognize that any one of the available algorithms or implementing software 
packages for conducting homology searches can be adapted for use in the present 
20 computer-based systems. 

As used herein, a "target sequence" can be any DNA or amino acid sequence of 
six or more nucleotides or two or more amino acids. A person skilled in the art can 
readily recognize that the longer a target sequence is, the less likely a target sequence will 
be present as a random occurrence in the database. The most preferred sequence length of 
25 a target sequence is from about 10 to 1 00 amino acids or from about 30 to 300 nucleotide 
residues. However, it is well recognized that many genes are longer than 500 amino 
acids, or 1.5 kb in length, and that commercially important fragments of the B. fragilis 
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genome, such as sequence fragments involved in gene expression and protein processing, 
will often be shorter than 30 nucleotides. 

As used herein, "a target structural motif," or "target motif," refers to any 
rationally selected sequence or combination of sequences in which the sequence(s) are 
5 chosen based on a specific functional domain or three-dimensional configuration which 
is formed upon the folding of the target polypeptide. There are a variety of target motifs 
known in the art. Protein target motifs include, but are not limited to, enzymatic active 
sites, membrane-spanning regions, and signal sequences. Nucleic acid target motifs 
include, but are not limited to, promoter sequences, hairpin structures and inducible 

10 expression elements (protein binding sequences). 

A variety of structural formats for the input and output means can be used to 
input and output the information in the computer-based systems of the present invention. 
A preferred format for an output means ranks fragments of the B. fragilis genome 
possessing varying degrees of homology to the target sequence or target motif. Such 

1 5 presentation provides a person skilled in the art with a ranking of sequences which 
contain various amounts of the target sequence or target motif and identifies the degree 
of homology contained in the identified fragment. 

A variety of comparing means can be used to compare a target sequence or target 
motif with the data storage means to identify sequence fragments of the B. fragilis 

20 genome. In the present examples, implementing software which implement the 

BLASTP2 and bic_SW algorithms (Altschul et al. ? J MoL Biol. 215:403-410 (1990); 
Compugen Biocellerator) was used to identify open reading frames within the B. fragilis 
genome. A person skilled in the art can readily recognize that any one of the publicly 
available homology search programs can be used as the search means for the computer- 

25 based systems of the present invention. Suitable programs are described, for example, in 
Martin J. Bishop, ed., Guide to Human Genome Computing, 2d Edition, Academic Press, 
San Diego, CA. (1998); and Leonard F. Peruski, Jr., and Anne Harwood Peruski, The 
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Internet and the New Biology: Tools for Genomic and Molecular Research, American 
Society for Microbiology, Washington, D.C. (1997). 

The invention features B, fragilis polypeptides, preferably a substantially pure 
preparation of an B. fragilis polypeptide, or a recombinant B. fragilis polypeptide. In 
5 preferred embodiments: the polypeptide has biological activity; the polypeptide has an 
amino acid sequence at least about 60%, 70%, 80%, 90%, 95%, 98%, or 99% identical to 
an amino acid sequence of the invention contained in the Sequence Listing, preferably it 
has about 65% sequence identity with an amino acid sequence of the invention contained 
in the Sequence Listing, and most preferably it has about 92% to about 99% sequence 

10 identity with an amino acid sequence of the invention contained in the Sequence Listing; 
the polypeptide has an amino acid sequence essentially the same as an amino acid 
sequence of the invention contained in the Sequence Listing; the polypeptide is at least 
about 5, 10, 20, 50, 100, or 150 amino acid residues in length; the polypeptide includes at 
least about 5, preferably at least about 10, more preferably at least about 20, still more 

15 preferably at least about 50, 100, or 150 contiguous amino acid residues of the invention 
contained in the Sequence Listing, In yet another preferred embodiment, the amino acid 
sequence which differs in sequence identity by about 7% to about 8% from the B. fragilis 
amino acid sequences of the invention contained in the Sequence Listing is also 
encompassed by the invention. 

20 In preferred embodiments: the B. fragilis polypeptide is encoded by a nucleic 

acid of the invention contained in the Sequence Listing, or by a nucleic acid having at 
least about 60%, 70%, 80%, 90%, 95%, 98%, or 99% homology with a nucleic acid of 
the invention contained in the Sequence Listing. 

In a preferred embodiment, the subject B. fragilis polypeptide differs in amino 

25 acid sequence at about 1, 2, 3, 5, 10 or more residues from a sequence of the invention 
contained in the Sequence Listing. The differences, however, are such that the B. fragilis 
polypeptide exhibits an B, fragilis biological activity, e.g., the B. fragilis polypeptide 
retains a biological activity of a naturally occurring B. fragilis enzyme. 
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In preferred embodiments, the polypeptide includes all or a fragment of an amino 
acid sequence of the invention contained in the Sequence Listing; fused, in reading 
frame, to additional amino acid residues, preferably to residues encoded by genomic 
DNA 5' or 3 f to the genomic DNA which encodes a sequence of the invention contained 
5 in the Sequence Listing. 

In yet other preferred embodiments, the B, fragilis polypeptide is a recombinant 
fusion protein having a first 5. fragilis polypeptide portion and a second polypeptide 
portion, e.g., a second polypeptide portion having an amino acid sequence unrelated to B. 
fragilis . The second polypeptide portion can be, e.g., any of glutathione-S-transferase, a 
10 DNA binding domain, or a polymerase activating domain. In preferred embodiment the 
fusion protein can be used in a two-hybrid assay. 

Polypeptides of the invention include those which arise as a result of alternative 
transcription events, alternative RNA splicing events, and alternative translational and 
postranslational events. 
15 In a preferred embodiment, the encoded B. fragilis polypeptide differs (e.g., by 

amino acid substitution, addition or deletion of at least one amino acid residue) in amino 
acid sequence at about 1, 2, 3, 5, 10 or more residues, from a sequence of the invention 
contained in the Sequence Listing. The differences, however, are such that: the B. 
fragilis encoded polypeptide exhibits an B. fragilis biological activity, e.g., the encoded 
20 B. fragilis enzyme retains a biological activity of a naturally occurring B. fragilis . 

In preferred embodiments, the encoded polypeptide includes all or a fragment of 
an amino acid sequence of the invention contained in the Sequence Listing; fused, in 
reading frame, to additional amino acid residues, preferably to residues encoded by 
genomic DNA 5 1 or 3' to the genomic DNA which encodes a sequence of the invention 
25 contained in the Sequence Listing. 

The B. fragilis strain, 14062, from which genomic sequences have been 
sequenced, has been deposited on July 20, 1998, in the American Type Culture 
Collection and assigned the ATCC designation # 202158. 
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Included in the invention are: allelic variations; natural mutants; induced 
mutants; proteins encoded by DNA that hybridize under high or low stringency 
conditions to a nucleic acid which encodes a polypeptide of the invention contained in 
the Sequence Listing (for definitions of high and low stringency see Current Protocols in 
5 Molecular Biology, John Wiley & Sons, New York, 1 989, 6.3.1 -6.3.6, hereby 

incorporated by reference); and, polypeptides specifically bound by antisera to B. fragilis 
polypeptides, especially by antisera to an active site or binding domain of B. fragilis 
polypeptide. The invention also includes fragments, preferably biologically active 
fragments. These and other polypeptides are also referred to herein as B. fragilis 
0 polypeptide analogs or variants. 

The invention further provides nucleic acids, e.g., RNA or DNA and their 
respective complements, encoding a polypeptide of the invention. This includes double 
stranded nucleic acids as well as coding and antisense single strands. 

In preferred embodiments, the subject B. fragilis nucleic acid will include a 
5 transcriptional regulatory sequence, e.g., at least one of a transcriptional promoter or 
transcriptional enhancer sequence, operably linked to the B. fragilis gene sequence, e.g., 
to render the B. fragilis gene sequence suitable for expression in a recombinant host cell. 

In yet a further preferred embodiment, the nucleic acid which encodes an B. 
fragilis polypeptide of the invention, hybridizes under stringent conditions to a nucleic 
acid probe corresponding to at least about 8 consecutive nucleotides of the invention 
contained in the Sequence Listing; more preferably to at least about 12 consecutive 
nucleotides of the invention contained in the Sequence Listing; still more preferably to at 
least about 20 consecutive nucleotides of the invention contained in the Sequence 
Listing; most preferably to at least about 40 consecutive nucleotides of the invention 
contained in the Sequence Listing. 

In another aspect, the invention provides a substantially pure nucleic acid having 
a nucleotide sequence which encodes an B. fragilis polypeptide. In preferred 
embodiments: the encoded polypeptide has biological activity; the encoded polypeptide 
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has an amino acid sequence at least about 60%, 70%, 80%, 90%, 95%, 98% or 99% 
homologous to an amino acid sequence of the invention contained in the Sequence 
Listing; the encoded polypeptide has an amino acid sequence essentially the same as an 
amino acid sequence of the invention contained in the Sequence Listing; the encoded 
5 polypeptide is at least about 5, 10, 20, 50, 100, or 150 amino acids in length; the encoded 
polypeptide comprises at least about 5, preferably at least about 10, more preferably at 
least about 20, still more preferably at least about 50, 100, or 150 contiguous amino acids 
of the invention contained in the Sequence Listing. 

In another aspect, the invention encompasses: a vector including a nucleic acid 

10 which encodes an B. fragilis polypeptide or an B, fragilis polypeptide variant as 

described herein; a host cell transfected with the vector; and a method of producing a 
recombinant B. fragilis polypeptide or B. fragilis polypeptide variant; including culturing 
the cell, e.g., in a cell culture medium, and isolating an B. fragilis or B. fragilis 
polypeptide variant, e.g., from the cell or from the cell culture medium. 

15 One embodiment of the invention is directed to substantially isolated nucleic 

acids. Nucleic acids of the invention include sequences comprising at least about 8 
nucleotides in length, more preferably at least about 12 nucleotides in length, even more 
preferably at least about 15-20 nucleotides in length, that correspond to a subsequence of 
any one of SEQ ID NO: 1 - SEQ ID NO: 5222 or complements thereof. Alternatively, 

20 the nucleic acids comprise sequences contained within any ORF (open reading frame), 
including a complete protein-coding sequence, of which any of SEQ ID NO: 1 - SEQ ID 
NO: 5222 forms a part. The invention encompasses sequence-conservative variants and 
function-conservative variants of these sequences. The nucleic acids may be DNA, 
RNA, DNA/RNA duplexes, protein-nucleic acid (PNA), or derivatives thereof. 

25 In another aspect, the invention features a purified recombinant nucleic acid 

having at least about 50%, 60%, 70%, 80%, 90%, 95%, 98%, or 99% sequence identity 
or % homology with a sequence of the invention contained in the Sequence Listing 
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The invention also encompasses recombinant DNA (including DNA cloning and 
expression vectors) comprising these B.fragilis -derived sequences; host cells 
comprising such DNA, including fungal, bacterial, yeast, plant, insect, and mammalian 
host cells; and methods for producing expression products comprising RNA and 
5 polypeptides encoded by the B. fragilis sequences. These methods are carried out by 
incubating a host cell comprising an B. fragilis -derived nucleic acid sequence under 
conditions in which the sequence is expressed. The host cell may be native or 
recombinant. The polypeptides can be obtained by (a) harvesting the incubated cells to 
produce a cell fraction and a medium fraction; and (b) recovering the B. fragilis 
10 polypeptide from the cell fraction, the medium fraction, or both. The polypeptides can 
also be made by in vitro translation. 

In another aspect, the invention features nucleic acids capable of binding mRNA 
of B. fragilis . Such nucleic acid is capable of acting as antisense nucleic acid to control 
the translation of mRNA of B. fragilis . A further aspect features a nucleic acid which is 
1 5 capable of binding specifically to an B. fragilis nucleic acid. These nucleic acids are also 
referred to herein as complements and have utility as probes and as capture reagents. 

In another aspect, the invention features an expression system comprising an open 
reading frame corresponding to B. fragilis nucleic acid. The nucleic acid further 
comprises a control sequence compatible with an intended host. The expression system 
20 is useful for making polypeptides corresponding to B. fragilis nucleic acid. 

In another aspect, the invention encompasses: a vector including a nucleic acid 
which encodes an B. fragilis polypeptide or an B. fragilis polypeptide variant as 
described herein; a host cell transfected with the vector; and a method of producing a 
recombinant B. fragilis polypeptide or B. fragilis polypeptide variant; including culturing 
25 the cell, e.g., in a cell culture medium, and isolating the B. fragilis or B. fragilis 
polypeptide variant, e.g., from the cell or from the cell culture medium. 

In yet another embodiment of the invention encompasses reagents for detecting 
bacterial infection, including B. fragilis infection, which comprise at least one B. fragilis 
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-derived nucleic acid defined by any one of SEQ ID NO: 1 - SEQ ID NO: 5222, or 
sequence-conservative or function-conservative variants thereof. Alternatively, the 
diagnostic reagents comprise nucleotide sequences that are contained within any open 
reading frames (ORFs), including preferably complete protein-coding sequences, 
5 contained within any of SEQ ID NO: 1 - SEQ ID NO: 5222, or polypeptide sequences 
contained within any of SEQ ID NO: 5223 - SEQ ID NO: 10444, or polypeptides of 
which any of the above sequences forms a part, or antibodies directed against any of the 
above peptide sequences or function-conservative variants and/or fragments thereof. 
The invention further provides antibodies, preferably monoclonal antibodies, 
10 which specifically bind to the polypeptides of the invention. Methods are also provided 
for producing antibodies in a host animal. The methods of the invention comprise 
immunizing an animal with at least one B. fragilis -derived immunogenic component, 
wherein the immunogenic component comprises one or more of the polypeptides 
encoded by any one of SEQ ID NO: 1 - SEQ ID NO: 5222 or sequence-conservative or 
15 function-conservative variants thereof; or polypeptides that are contained within any 
ORFs, including complete protein-coding sequences, of which any of SEQ ID NO: 1 - 
SEQ ID NO: 5222 forms a part; or polypeptide sequences contained within any of SEQ 
ID NO: 5223 - SEQ ID NO: 10444; or polypeptides of which any of SEQ ID NO: 5223 - 
SEQ ID NO: 10444 forms a part. Host animals include any warm blooded animal, 
20 including without limitation mammals and birds. Such antibodies have utility as 
reagents for immunoassays to evaluate the abundance and distribution of B. fragilis - 
specific antigens. 

In yet another aspect, the invention provides diagnostic methods for detecting B. 
fragilis antigenic components or anti-A fragilis antibodies in a sample. B. fragilis 
25 antigenic components may be detected by known processes, including but not limited to 
detection by a process comprising: (i) contacting a sample suspected to contain a 
bacterial antigenic component with a bacterial-specific antibody, under conditions in 
which a stable antigen-antibody complex can form between the antibody and bacterial 
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antigenic components in the sample; and (ii) detecting any antigen-antibody complex 
formed in step (i), wherein detection of an antigen-antibody complex indicates the 
presence of at least one bacterial antigenic component in the sample. In different 
embodiments of this method, the antibodies used are directed against a sequence encoded 
5 by any of SEQ ID NO: 1 - SEQ ID NO: 5222 or sequence-conservative or function- 
conservative variants thereof, or against a polypeptide sequence contained in any of SEQ 
ID NO: 5223 - SEQ ID NO: 10444 or function-conservative variants thereof 

In yet another aspect, the invention provides a method for detecting antibacterial- 
specific antibodies in a sample, which comprises: (i) contacting a sample suspected to 
10 contain antibacterial-specific antibodies with an B. fragilis antigenic component, under 
conditions in which a stable antigen-antibody complex can form between the B. fragilis 
antigenic component and antibacterial antibodies in the sample; and (ii) detecting any 
antigen-antibody complex formed in step (i), wherein detection of an antigen-antibody 
complex indicates the presence of antibacterial antibodies in the sample. In different 
15 embodiments of this method, the antigenic component is encoded by a sequence 
contained in any of SEQ ID NO: 1 - SEQ ID NO: 5222 or sequence-conservative and 
function-conservative variants thereof, or is a polypeptide sequence contained in any of 
SEQ ID NO: 5223 - SEQ ID NO: 10444 or function-conservative variants thereof 
In another aspect, the invention features a method of generating vaccines for 
20 immunizing an individual against B. fragilis. The method includes: immunizing a 
subject with an B. fragilis polypeptide, e.g., a surface or secreted polypeptide, or a 
combination of such peptides or active portion(s) thereof, and a pharmaceutical^ 
acceptable carrier. Such vaccines have therapeutic and prophylactic utilities. 

In another aspect, the invention features a method of evaluating a compound, e.g., 
25 a polypeptide, e.g., a fragment of a host cell polypeptide, for the ability to bind an B. 
fragilis polypeptide. The method includes contacting the compound to be evaluated with 
an B. fragilis polypeptide and determining if the compound binds or otherwise interacts 
with the B. fragilis polypeptide. Compounds which bind or otherwise interact with B. 

-15- 



2709.1001-001 



fragilis polypeptides are candidates as modulators, including activators and inhibitors, of 
the bacterial life cycle. These assays can be performed in vitro or in vivo. 

In another aspect, the invention features a method of evaluating a compound, e.g., 
a polypeptide, e.g., a fragment of a host cell polypeptide, for the ability to bind an B. 
5 fragilis nucleic acid, e.g., DNA or RNA. The method includes contacting the compound 
to be evaluated with an B. fragilis nucleic acid and determining if the compound binds or 
otherwise interacts with the B. fragilis nucleic acid. Compounds which bind B. fragilis 
are candidates as modultors, including activators and inhibitors, of the bacterial life 
cycle. These assays can be performed in vitro or in vivo. 

10 A particularly preferred embodiment of the invention is directed to a method of 

screening test compounds for anti-bacterial activity, which method comprises: selecting 
as a target a bacterial specific sequence, which sequence is essential to the viability of a 
bacterial species; contacting a test compound with said target sequence; and selecting 
those test compounds which bind to said target sequence as potential anti-bacterial 

15 candidates. In one embodiment, the target sequence selected is specific to a single 
species, or even a single strain, such as, for example, the strain B. fragilis 14062. In a 
second embodiment, the target sequence is common to at least two species of bacteria. 
In a third embodiment, the target sequence is common to a family of bacteria. The target 
sequence may be a nucleic acid sequence or a polypeptide sequence. Methods employing 

20 sequences common to more than one species of microorganism may be used to screen 
candidates for broad spectrum anti-bacterial activity. 

The invention also provides methods for preventing or treating disease caused by 
certain bacteria, including B. fragilis , which are carried out by administering to an 
animal in need of such treatment, in particular a warm-blooded vertebrate, including but 

25 not limited to birds and mammals, a compound that specifically inhibits or interferes 
with the function of a bacterial polypeptide or nucleic acid. In a particularly preferred 
embodiment, the mammal to be treated is human. 
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DETAILED DESCRIPTION OF THE INVENTION 

The sequences of the present invention include the specific nucleic acid and 
amino acid sequences set forth in the Sequence Listing that forms a part of the present 
specification, and which are designated SEQ ID NO: 1 - SEQ ID NO: 10444. Use of the 
terms "SEQ ID NO: 1 - SEQ ID NO: 5222 ", " SEQ ID NO: 5223 - SEQ ID NO: 10444, 
"the sequences depicted in Table 2", etc., is intended, for convenience, to refer to each 
individual SEQ ID NO individually, and is not intended to refer to the genus of these 
sequences unless such reference would be indicated. In other words, it is a shorthand for 
listing all of these sequences individually. The invention encompasses each sequence 
individually, as well as any combination thereof. 

DEFINITIONS 

"Nucleic acid" or "polynucleotide" as used herein refers to purine- and 
pyrimidine-containing polymers of any length, either polyribonucleotides or 
polydeoxyribonucleotides or mixed polyribo-polydeoxyribo nucleotides. This includes 
single- and double-stranded molecules, i.e., DNA-DNA, DNA-RNA and RNA-RNA 
hybrids, as well as "protein nucleic acids" (PNA) formed by conjugating bases to an 
amino acid backbone. This also includes nucleic acids containing modified bases. 

A nucleic acid or polypeptide sequence that is "derived from" a designated 
sequence refers to a sequence that corresponds to a region of the designated sequence. 
For nucleic acid sequences, this encompasses sequences that are homologous or 
complementary to the sequence, as well as "sequence-conservative variants" and 
"function-conservative variants." For polypeptide sequences, this encompasses 
"function-conservative variants." Sequence-conservative variants are those in which a 
change of one or more nucleotides in a given codon position results in no alteration in the 
amino acid encoded at that position. Function-conservative variants are those in which a 
given amino acid residue in a polypeptide has been changed without altering the overall 
conformation and function of the native polypeptide, including, but not limited to, 
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replacement of an amino acid with one having similar physico-chemical properties (such 
as, for example, acidic, basic, hydrophobic, and the like). "Function-conservative" 
variants also include any polypeptides that have the ability to elicit antibodies specific to 
a designated polypeptide. 

An "A fragilis -derived" nucleic acid or polypeptide sequence may or may not be 
present in other bacterial species, and may or may not be present in all B. fragilis strains. 
This term is intended to refer to the source from which the sequence was originally 
isolated. Thus, an B. fragilis -derived polypeptide, as used herein, may be used, e.g., as a 
target to screen for a broad spectrum antibacterial agent, to search for homologous 
proteins in other species of bacteria or in eukaryotic organisms such asbacteria humans, 
etc. 

A purified or isolated polypeptide or a substantially pure preparation of a 
polypeptide are used interchangeably herein and, as used herein, mean a polypeptide that 
has been separated from other proteins, lipids, and nucleic acids with which it naturally 
occurs. Preferably, the polypeptide is also separated from substances, e.g., antibodies or 
gel matrix, e.g., polyacrylamide, which are used to purify it. Preferably, the polypeptide 
constitutes at least about 10, 20, 50 70, 80 or 95% dry weight of the purified preparation. 
Preferably, the preparation contains sufficient polypeptide to allow protein sequencing; at 
least about 1, 10, or preferably 100 mg of polypeptide. 

A purified preparation of cells refers to, in the case of plant or animal cells, an in 
vitro preparation of cells and not an entire intact plant or animal. In the case of cultured 
cells or microbial cells, it consists of a preparation of at least about 10%, more preferably 
at least about 50%, of the subject cells. 

A purified or isolated or a substantially pure nucleic acid, e.g., a substantially 
pure DNA, (are terms used interchangeably herein) is a nucleic acid which is one or both 
of the following: not immediately contiguous with both of the coding sequences with 
which it is immediately contiguous (i.e., one at the 5' end and one at the 3' end) in the 
naturally-occurring genome of the organism from which the nucleic acid is derived; or 
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which is substantially free of a nucleic acid with which it occurs in the organism from 
which the nucleic acid is derived. The term includes, for example, a recombinant DNA 
which is incorporated into a vector, e.g., into an autonomously replicating plasmid or 
virus, or into the genomic DNA of a prokaryote or eukaryote, or which exists as a 
separate molecule (e.g., a cDNA or a genomic DNA fragment produced by PCR or 
restriction endonuclease treatment) independent of other DNA sequences. Substantially 
pure DNA also includes a recombinant DNA which is part of a hybrid gene encoding 
additional B. fragilis DNA sequence. 

A "contig" as used herein is a nucleic acid representing a continuous stretch of 
genomic sequence of an organism. 

An "open reading frame", also referred to herein as ORF, is a region of nucleic 
acid which encodes a polypeptide. This region may represent a portion of a coding 
sequence or a total sequence and can be determined from a stop to stop codon or from a 
start to stop codon. 

As used herein, a "coding sequence" is a nucleic acid which is transcribed into 
messenger RNA and/or translated into a polypeptide when placed under the control of 
appropriate regulatory sequences. The boundaries of the coding sequence are determined 
by a translation start codon at the five prime terminus and a translation stop code at the 
three prime terminus. A coding sequence can include but is not limited to messenger 
RNA, synthetic DNA, and recombinant nucleic acid sequences. 

A "complement" of a nucleic acid as used herein refers to an anti -parallel or 
antisense sequence that participates in Watson-Crick base-pairing with the original 
sequence. 

A "gene product" is a protein or structural RNA which is specifically encoded by 

a gene. 

As used herein, the term "probe" refers to a nucleic acid, peptide or other 
chemical entity which specifically binds to a molecule of interest. Probes are often 
associated with or capable of associating with a label. A label is a chemical moiety 
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capable of detection. Typical labels comprise dyes, radioisotopes, luminescent and 
chemiluminescent moieties, fluorophores, enzymes, precipitating agents, amplification 
sequences, and the like. Similarly, a nucleic acid, peptide or other chemical entity which 
specifically binds to a molecule of interest and immobilizes such molecule is referred 
5 herein as a "capture ligand". Capture ligands are typically associated with or capable of 
associating with a support such as nitro-cellulose, glass, nylon membranes, beads, 
particles and the like. The specificity of hybridization is dependent on conditions such as 
the base pair composition of the nucleotides, and the temperature and salt concentration 
of the reaction. These conditions are readily discernable to one of ordinary skill in the art 

10 using routine experimentation. 

"Homologous" refers to the sequence similarity or sequence identity between two 
polypeptides or between two nucleic acid molecules. When a position in both of the two 
compared sequences is occupied by the same base or amino acid monomer subunit, e.g., 
if a position in each of two DNA molecules is occupied by adenine, then the molecules 

15 are homologous at that position. The percent of homology between two sequences is a 
function of the number of matching or homologous positions shared by the two 
sequences divided by the number of positions compared x 100. For example, if 6 of 10 
of the positions in two sequences are matched or homologous then the two sequences are 
60% homologous. By way of example, the DNA sequences ATTGCC and TATGGC 

20 share 50% homology. Generally, a comparison is made when two sequences are aligned 
to give maximum homology. 

Nucleic acids are hybridizable to each other when at least one strand of a nucleic 
acid can anneal to the other nucleic acid under defined stringency conditions. Stringency 
of hybridization is determined by: (a) the temperature at which hybridization and/or 

25 washing is performed; and (b) the ionic strength and polarity of the hybridization and 
washing solutions. Hybridization requires that the two nucleic acids contain 
complementary sequences; depending on the stringency of hybridization, however, 
mismatches may be tolerated. Typically, hybridization of two sequences at high 
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stringency (such as, for example, in a solution of 0.5X SSC, at 65° C) requires that the 
sequences be essentially completely homologous. Conditions of intermediate stringency 
(such as, for example, 2X SSC at 65 0 C) and low stringency (such as, for example 2X 
SSC at 55° C) require correspondingly less overall complementarity between the 
5 hybridizing sequences. (IX SSC is 0. 1 5 M NaCl, 0.01 5 M Na citrate). 

The terms peptides, proteins, and polypeptides are used interchangeably herein. 

As used herein, the term "surface protein' 1 refers to all surface accessible proteins, 
e.g. inner and outer membrane proteins, proteins adhering to the cell wall, and secreted 
proteins. 

10 A polypeptide has B. fragilis biological activity if it has one, two or preferably 

more of the following properties: (1) if when expressed in the course of an B. fragilis 
infection, it can promote, or mediate the attachment of B. fragilis to a cell; (2) it has an 
enzymatic activity, structural or regulatory function characteristic of an B. fragilis 
protein; (3) the gene which encodes it can rescue a lethal mutation in an B. fragilis gene. 

15 A polypeptide has biological activity if it is an antagonist, agonist, or super-agonist of a 
polypeptide having one of the above-listed properties. 

A biologically active fragment or analog is one having an in vivo or in vitro 
activity which is characteristic of the B. fragilis polypeptides of the invention contained 
in the Sequence Listing, or of other naturally occurring B. fragilis polypeptides, e.g., one 

20 or more of the biological activities described herein. Especially preferred are fragments 
which exist in vivo, e.g., fragments which arise from post transcriptional processing or 
which arise from translation of alternatively spliced RNA's. Fragments include those 
expressed in native or endogenous cells as well as those made in expression systems, 
e.g., in CHO (Chinese Hamster Ovary) cells. Because peptides such as B, fragilis 

25 polypeptides often exhibit a range of physiological properties and because such 

properties may be attributable to different portions of the molecule, a useful B. fragilis 
fragment or B. fragilis analog is one which exhibits a biological activity in any biological 
assay for B. fragilis activity. The fragment or analog possesses about 10%, preferably 
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about 40%, more preferably about 60%, 70%, 80% or 90% or greater of the activity of B. 
fragilis , in any in vivo or in vitro assay. 

Analogs can differ from naturally occurring B. fragilis polypeptides in amino acid 
sequence or in ways that do not involve sequence, or both. Non-sequence modifications 

5 include changes in acetylation, methylation, phosphorylation, carboxylation, or 

glycosylation. Preferred analogs include B. fragilis polypeptides (or biologically active 
fragments thereof) whose sequences differ from the wild-type sequence by one or more 
conservative amino acid substitutions or by one or more non-conservative amino acid 
substitutions, deletions, or insertions which do not substantially diminish the biological 

10 activity of the B. fragilis polypeptide. Conservative substitutions typically include the 
substitution of one amino acid for another with similar characteristics, e.g., substitutions 
within the following groups: valine, glycine; glycine, alanine; valine, isoleucine, leucine; 
aspartic acid, glutamic acid; asparagine, glutamine; serine, threonine; lysine, arginine; 
and phenylalanine, tyrosine. Other conservative substitutions can be made in view of the 



15 table below. 
TABLE 1 

CONSERVATIVE AMINO ACID REPLACEMENTS 



For Amino Acid 


Code 


Replace with any of 


Alanine 


A 


D-Ala, Gly, beta-Ala, L-Cys, D-Cys 


Arginine 


R 


D-Arg, Lys, D-Lys, homo-Arg, D-homo-Arg, Met, He, 
D-Met, D-Ile, Orn, D-Orn 


Asparagine 


N 


D-Asn, Asp, D-Asp, Glu, D-Glu, Gin, D-Gln 


Aspartic Acid 


D 


D-Asp, D-Asn, Asn, Glu, D-Glu, Gin, D-Gln 


Cysteine 


C 


D-Cys, S-Me-Cys, Met, D-Met, Thr, D-Thr 


Glutamine 


Q 


D-Gln, Asn, D-Asn, Glu, D-Glu, Asp, D-Asp 


Glutamic Acid 


E 


D-Glu, D-Asp, Asp, Asn, D-Asn, Gin, D-Gln 


Glycine 


G 


Ala, D-Ala, Pro, D-Pro, (3-Ala, Acp 


Isoleucine 


I 


D-Ile, Val, D-Val, Leu, D-Leu, Met, D-Met 
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Leucine 


L 


D-Leu, Val, D-Val, Leu, D-Leu, Met, D-Met 


T v^inp 


K 


D-T/vs Ayp D-Arp homo- Are D-homo-Ars? Met D- 
Met, He, D-Ile, Orn, D-Orn 


Methionine 


M 


D-Met, S-Me-Cys, He, D-Ile, Leu, D-Leu, Val, D-Val 


Phenylalanine 


F 


D-Phe, Tyr, D-Thr, L-Dopa, His, D-His, Trp, D-Trp, 
Trans-3,4, or 5-phenylproline, cis-3,4, or 5- 
phenylproline 


Proline 


P 


D-Pro, L-I-thioazolidine-4-carboxylic acid, D-or L-l- 
oxazolidine-4-carboxylic acid 


Serine 


S 


D-Ser, Thr, D-Thr, allo-Thr, Met, D-Met, Met(O), 
D-Met(O), L-Cys, D-Cys 


Threonine 


T 


D-Thr, Ser, D-Ser, allo-Thr, Met, D-Met, Met(0), 
D-Met(O), Val, D-Val 


Tyrosine 


Y 


D-Tyr, Phe, D-Phe, L-Dopa, His, D-His 


Valine 


V 


D-Val, Leu, D-Leu, He, D-Ile, Met, D-Met 



Other analogs within the invention are those with modifications which increase 
peptide stability; such analogs may contain, for example, one or more non-peptide bonds 
(which replace the peptide bonds) in the peptide sequence. Also included are: analogs 
5 that include residues other than naturally occurring L-amino acids, e.g., D-amino acids or 
non-naturally occurring or synthetic amino acids, e.g., [3 or y amino acids; and cyclic 
analogs. 

As used herein, the term "fragment", as applied to an B. fragilis analog, will 
ordinarily be at least about 20 residues, more typically at least about 40 residues, 

10 preferably at least about 60 residues in length. Fragments of B. fragilis polypeptides can 
be generated by methods known to those skilled in the art. The ability of an Bacteroides 
fragment to exhibit a biological activity of B. fragilis polypeptide can be assessed by 
methods known to those skilled in the art as described herein. Also included are B. 
fragilis polypeptides containing residues that are not required for biological activity of 

15 the peptide or that result from alternative mRNA splicing or alternative protein 
processing events. 
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An "immunogenic component" as used herein is a moiety, such as an B, fragilis 
polypeptide, analog or fragment thereof, that is capable of eliciting a humoral and/or 
cellular immune response in a host animal. 

An "antigenic component" as used herein is a moiety, such as an B. fragilis 
5 polypeptide, analog or fragment thereof, that is capable of binding to a specific antibody 
with sufficiently high affinity to form a detectable antigen-antibody complex. 

The term "antibody" as used herein is intended to include fragments thereof 
which are specifically reactive with B. fragilis polypeptides. 

As used herein, the term "cell-specific promoter" means a DNA sequence that 
10 serves as a promoter, i.e., regulates expression of a selected DNA sequence operably 
linked to the promoter, and which effects expression of the selected DNA sequence in 
specific cells of a tissue. The term also covers so-called "leaky" promoters, which 
regulate expression of a selected DNA primarily in one tissue, but cause expression in 
other tissues as well. 

15 Misexpression, as used herein, refers to a non-wild type pattern of gene 

expression. It includes: expression at non-wild type levels, i.e., over or under expression; 
a pattern of expression that differs from wild type in terms of the time or stage at which 
the gene is expressed, e.g., increased or decreased expression (as compared with wild 
type) at a predetermined developmental period or stage; a pattern of expression that 

20 differs from wild type in terms of increased expression (as compared with wild type) in a 
predetermined cell type or tissue type; a pattern of expression that differs from wild type 
in terms of the splicing size, amino acid sequence, post-translational modification, or 
biological activity of the expressed polypeptide; a pattern of expression that differs from 
wild type in terms of the effect of an environmental stimulus or extracellular stimulus on 

25 expression of the gene, e.g., a pattern of increased or decreased expression (as compared 
with wild type) in the presence of an increase or decrease in the strength of the stimulus. 

As used herein, "host cells" and other such terms denoting microorganisms or 
higher eukaryotic cell lines cultured as unicellular entities refers to cells which can 
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become or have been used as recipients for a recombinant vector or other transfer DNA, 
and include the progeny of the original cell which has been transfected. It is understood 
by individuals skilled in the art that the progeny of a single parental cell may not 
necessarily be completely identical in genomic or total DNA compliment to the original 
5 parent, due to accident or deliberate mutation. 

As used herein, the term "control sequence" refers to a nucleic acid having a base 
sequence which is recognized by the host organism to effect the expression of encoded 
sequences to which they are ligated. The nature of such control sequences differs 
depending upon the host organism; in prokaryotes, such control sequences generally 

10 include a promoter, ribosomal binding site, terminators, and in some cases operators; in 
eukaryotes, generally such control sequences include promoters, terminators and in some 
instances, enhancers. The term control sequence is intended to include at a minimum, all 
components whose presence is necessary for expression, and may also include additional 
components whose presence is advantageous, for example, leader sequences. 

15 As used herein, the term "operably linked" refers to sequences joined or ligated to 

function in their intended manner. For example, a control sequence is operably linked to 
coding sequence by ligation in such a way that expression of the coding sequence is 
achieved under conditions compatible with the control sequence and host cell. 

The "metabolism" of a substance, as used herein, means any aspect of the 

20 expression, function, action, or regulation of the substance. The metabolism of a 
substance includes modifications, e.g., covalent or non-covalent modifications of the 
substance. The metabolism of a substance includes modifications, e.g., covalent or non- 
covalent modification, the substance induces in other substances. The metabolism of a 
substance also includes changes in the distribution of the substance. The metabolism of a 

25 substance includes changes the substance induces in the distribution of other substances. 
A "sample" as used herein refers to a biological sample, such as, for example, 
tissue or fluid isloated from an individual (including without limitation plasma, serum, 
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cerebrospinal fluid, lymph, tears, saliva and tissue sections) or from in vitro cell culture 
constituents, as well as samples from the environment. 

Technical and scientific terms used herein have the meanings commonly 
understood by one of ordinary skill in the art to which the present invention pertains, 

5 unless otherwise defined. Reference is made herein to various methodologies known to 
those of skill in the art. Publications and other materials setting forth such known 
methodologies to which reference is made are incorporated herein by reference in their 
entireties as though set forth in full. The practice of the invention will employ, unless 
otherwise indicated, conventional techniques of chemistry, molecular biology, 

10 microbiology, recombinant DNA, and immunology, which are within the skill of the art. 
Such techniques are explained fully in the literature. See e.g., Sambrook, Fritsch, and 
Maniatis, Molecular Cloning; Laboratory Manual 2nd ed. (1989); DNA Cloning, 
Volumes I and II (D.N Glover ed. 1985); Oligonucleotide Synthesis (M.J. Gait ed, 1984); 
Nucleic Acid Hybridization (B.D. Hames & S.J. Higgins eds. 1984); the series, Methods 

15 in Enzymoloqy (Academic Press, Inc.), particularly Vol. 154 and Vol. 155 (Wu and 
Grossman, eds.); PCR-A Practical Approach (McPherson, Quirke, and Taylor, eds., 
1991); Immunology, 2d Edition, 1989, Roitt et ah, C.V. Mosby Company, and New 
York; Advanced Immunology, 2d Edition, 1991, Male et ah, Grower Medical Publishing, 
New York.; DNA Cloning: A Practical Approach, Volumes I and II, 1985 (D.N. Glover 

20 ed.); Oligonucleotide Synthesis, 1984, (M.L. Gait ed); Transcription and Translation, 
1984 (Hames and Higgins eds.); Animal Cell Culture, 1986 (R.I. Freshney ed.); 
Immobilized Cells and Enzymes, 1986 (IRL Press); Perbal, 1984, A Practical Guide to 
Molecular Cloning; Gene Transfer Vectors for Mammalian Cells, 1987 (J. H. Miller and 
M. P. Calos eds., Cold Spring Harbor Laboratory); Martin J. Bishop, ed., Guide to 

25 Human Genome Computing, 2d Edition, Academic Press, San Diego, CA. (1998); and 
Leonard F. Peruski, Jr., and Anne Harwood Peruski, The Internet and the New Biology: 
Tools for Genomic and Molecular Research, American Society for Microbiology, 
Washington, D.C. (1997). 
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Any suitable materials and/or methods known to those of skill can be utilized in 
carrying out the present invention; however, preferred materials and/or methods are 
described. Materials, reagents and the like to which reference is made in the following 
description and examples are obtainable from commercial sources, unless otherwise 
5 noted. 

B. FRAGILIS GENOMIC SEQUENCE 

This invention provides nucleotide sequences of the genome of B. fragilis which 
thus comprises a DNA sequence library of B. fragilis genomic DNA. The detailed 

10 description that follows provides nucleotide sequences of B. fragilis 9 and also describes 
how the sequences were obtained and how ORFs and protein-coding sequences were 
identified. Also described are compositions and methods of using the disclosed B. 
fragilis sequences in methods including diagnostic and therapeutic applications. 
Furthermore, the library can be used as a database for identification and comparison of 

15 medically important sequences in this and other strains of B. fragilis . 

To determine the genomic sequence of B. fragilis , DNA from strain 14062 of B. 
fragilis was isolated after Zymolyase digestion, sodium dodecyl sulfate lysis, potassium 
acetate precipitation, phenolxhloroform extractionand ethanol precipitation (Soil, D.R., 
T. Srikantha and S.R. Lockhart: Characterizing Developmentally Regulated Genes in B. 

20 fragilis . In Microbial Genome Methods. K.W. Adolph, editor. CRC Press. New York, 
p 17-37.). DNA was sheared hydrodynamically using an HPLC (Oefher, et. al, 1996) to 
an insert size of 2000-3000 bp. After size fractionation by gel electrophoresis the 
fragments were blunt-ended, ligated to adapter oligonucleotides and cloned into the 
pGTC (Thomann) vector to construct a "shotgun" subclone library. 

25 DNA sequencing was achieved using established ABI sequencing methods on 

ABB 77 automated DNA sequencers. The cloning and sequencing procedures are 
described in more detail in the Exemplification. 
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Individual sequence reads were assembled using PHRAP (P. Green, Abstracts of 
DOE Human Genome Program Contractor-Grantee Workshop V, Jan. 1996, p. 157). The 
average contig length was about 3-4 kb. 

All subsequent steps were based on sequencing by ABI377 automated DNA 
5 sequencing methods. The cloning and sequencing procedures are described in more 
detail in the Exemplification. 

A variety of approaches may be used to order the contigs so as to obtain a 
continuous sequence representing the entire B, fragilis genome. Synthetic 
oligonucleotides are designed that are complementary to sequences at the end of each 

10 contig. These oligonucleotides may be hybridized to libaries of B, fragilis genomic DNA 
in, for example, lambda phage vectors or plasmid vectors to identify clones that contain 
sequences corresponding to the junctional regions between individual contigs. Such 
clones are then used to isolate template DNA and the same oligonucleotides are used as 
primers in polymerase chain reaction (PCR) to amplify junctional fragments, the 

15 nucleotide sequence of which is then determined. 

The B. fragilis sequences were analyzed for the presence of open reading frames 
(ORFs) comprising at least 180 nucleotides. As a result of the analysis of ORFs based on 
stop-to-stop codon reads, it should be understood that these ORFs may not correspond to 
the ORF of a naturally-occurring B. fragilis polypeptide. These ORFs may contain start 

20 codons which indicate the initiation of protein synthesis of a naturally-occurring B. 

fragilis polypeptide. Such start codons within the ORFs provided herein were identified 
by those of ordinary skill in the relevant art, and the resulting ORF and the encoded B. 
fragilis polypeptide is within the scope of this invention. For example, within the ORFs 
a codon such as AUG or GUG (encoding methionine or valine) which is part of the 

25 initiation signal for protein synthesis were identified and the portion of an ORF to 
corresponding to a naturally-occurring B. fragilis polypeptide was recognized. The 
predicted coding regions were defined by evaluating the coding potential of such 
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sequences with the program GENEMARK™ (Borodovsky and Mclninch, 1993, Comp. . 
17:123). 

Each predicted ORF amino acid sequence was compared with all sequences 
found in current GENBANK, SWISS-PROT, and PIR databases using the BLAST 

5 algorithm. BLAST identifies local alignments occurring by chance between the ORF 
sequence and the sequence in the databank (Altschal et al., 1990, L Mol. Biol 215:403- 
410). Homologous ORFs (probabilities less than 10" 5 by chance) andORF's that are 
probably non-homologous (probabilities greater than 10" 5 by chance) but have good 
codon usage were identified. Both homologous, sequences and non-homologous 

10 sequences with good codon usage, are likely to encode proteins and are encompassed by 
the invention. 

B. FRAGILIS NUCLEIC ACIDS 

The present invention provides a library of B. fragilis -derived nucleic acid 

15 sequences. The libraries provide probes, primers, and markers which are used as markers 
in epidemiological studies. The present invention also provides a library of B. fragilis - 
derived nucleic acid sequences which comprise or encode targets for therapeutic drugs. 

The nucleic acids of this invention may be obtained directly from the DNA of the 
above referenced B. fragilis strain by using the polymerase chain reaction (PCR). See 

20 "PCR, A Practical Approach" (McPherson, Quirke, and Taylor, eds., IRL Press, Oxford, 
UK, 1991) for details about the PCR. High fidelity PCRis used to ensure a faithful DNA 
copy prior to expression. In addition, the authenticity of amplified products is verified by 
conventional sequencing methods. Clones carrying the desired sequences described in 
this invention may also be obtained by screening the libraries by means of the PCR or by 

25 hybridization of synthetic oligonucleotide probes to filter lifts of the library colonies or 
plaques as known in the art (see, e.g., Sambrook et al, Molecular Cloning, A Laboratory 
Manual 2nd edition, 1989, Cold Spring Harbor Press, NY). 
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It is also possible to obtain nucleic acids encoding B. fragilis polypeptides from a 
cDNA library in accordance with protocols herein described. A cDNA encoding an B. 
fragilis polypeptide can be obtained by isolating total mRNA from an appropriate strain. 
Double stranded cDNAs can then be prepared from the total mRNA. Subsequently, the 
5 cDNAs can be inserted into a suitable plasmid or viral (e.g., bacteriophage) vector using 
any one of a number of known techniques. Genes encoding B. fragilis polypeptides can 
also be cloned using established polymerase chain reaction techniques in accordance with 
the nucleotide sequence information provided by the invention. The nucleic acids of the 
invention can be DNA or RNA. Preferred nucleic acids of the invention are contained in 

10 the Sequence Listing. 

The nucleic acids of the invention can also be chemically synthesized using 
standard techniques. Various methods of chemically synthesizing polydeoxynucleotides 
are known, including solid-phase synthesis which, like peptide synthesis, has been fully 
automated in commercially available DNA synthesizers (See e.g., Itakura et al. U.S. 

15 Patent No. 4,598,049; Caruthers et al. U.S. Patent No. 4,458,066; and Itakura U.S. Patent 
Nos. 4,401,796 and 4,373,071, incorporated by reference herein). 

In another example, DNA can be chemically synthesized using, e.g., the 
phosphoramidite solid support method of Matteucci et al, 1981, J. Am. Chem. Soc. 
103:3185, the method of Yoo et al, 1989, J. Biol Chem. 764:17078, or other well 

20 known methods. This can be done by sequentially linking a series of oligonucleotide 
cassettes comprising pairs of synthetic oligonucleotides, as described below. 

Nucleic acids isolated or synthesized in accordance with features of the present 
invention are useful, by way of example, without limitation, as probes, primers, capture 
ligands, antisense genes and for developing expression systems for the synthesis of 

25 proteins and peptides corresponding to such sequences. As probes, primers, capture 
ligands and antisense agents, the nucleic acid normally consists of all or part 
(approximately twenty or more nucleotides for specificity as well as the ability to form 
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stable hybridization products) of the nucleic acids of the invention contained in the 
Sequence Listing. These uses are described in further detail below. 

PROBES 

5 A nucleic acid isolated or synthesized in accordance with the sequence of the 

invention contained in the Sequence Listing can be used as a probe to specifically detect 
B. fragilis . With the sequence information set forth in the present application, sequences 
of twenty or more nucleotides are identified which provide the desired inclusivity and 
exclusivity with respect to B. fragilis , and extraneous nucleic acids likely to be 

10 encountered during hybridization conditions. More preferably, the sequence will 
comprise at least about twenty to thirty nucleotides to convey stability to the 
hybridization product formed between the probe and the intended target molecules. 

Sequences larger than 1000 nucleotides in length are difficult to synthesize but 
can be generated by recombinant DNA techniques. Individuals skilled in the art will 

15 readily recognize that the nucleic acids, for use as probes, can be provided with a label to 
facilitate detection of a hybridization product. 

Nucleic acid isolated and synthesized in accordance with the sequence of the 
invention contained in the Sequence Listing can also be useful as probes to detect 
homologous regions (especially homologous genes) of other Bacteroides species using 

20 appropriate stringency hybridization conditions as described herein. 

CAPTURE LIGAND 

For use as a capture ligand, the nucleic acid selected in the manner described 
above with respect to probes, can be readily associated with a support. The manner in 
25 which nucleic acid is associated with supports is well known. Nucleic acid having 
twenty or more nucleotides in a sequence of the invention contained in the Sequence 
Listing have utility to separate B. fragilis nucleic acid from one strain from the nucleic 
acid of other another strain as well as from other organisms. Nucleic acid having twenty 
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or more nucleotides in a sequence of the invention contained in the Sequence Listing can 
also have utility to separate other Bacteroides species from each other and from other 
organisms. Preferably, the sequence will comprise at least about twenty nucleotides to 
convey stability to the hybridization product formed between the probe and the intended 
5 target molecules. Sequences larger than 1000 nucleotides in length are difficult to 
synthesize but can be generated by recombinant DNA techniques. 

PRIMERS 

Nucleic acid isolated or synthesized in accordance with the sequences described 

10 herein have utility as primers for the amplification of B. fragilis nucleic acid. These 
nucleic acids may also have utility as primers for the amplification of nucleic acids in 
other Bacteroides species. With respect to polymerase chain reaction (PCR) techniques, 
nucleic acid sequences of > 10-15 nucleotides of the invention contained in the Sequence 
Listing have utility in conjunction with suitable enzymes and reagents to create copies of 

15 B. fragilis nucleic acid. More preferably, the sequence will comprise twenty or more 
nucleotides to convey stability to the hybridization product formed between the primer 
and the intended target molecules. Binding conditions of primers greater than 100 
nucleotides are more difficult to control to obtain specificity. High fidelity PCR can be 
used to ensure a faithful DNA copy prior to expression. In addition, amplified products 

20 can be checked by conventional sequencing methods. 

The copies can be used in diagnostic assays to detect specific sequences, 
including genes from B. fragilis and/or other Bacteroides species. The copies can also 
be incorporated into cloning and expression vectors to generate polypeptides 
corresponding to the nucleic acid synthesized by PCR, as is described in greater detail 

25 herein. 

The nucleic acids of the present invention find use as templates for the 
recombinant production of B. fragilis -derived peptides or polypeptides 
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ANTISENSE 

Nucleic acid or nucleic acid-hybridizing derivatives isolated or synthesized in 
accordance with the sequences described herein have utility as antisense agents to 
prevent the expression of B. fragilis genes. These sequences also have utility as 
5 antisense agents to prevent expression of genes of other Bacteroides species. 

In one embodiment, nucleic acid or derivatives corresponding to B. fragilis 
nucleic acids is loaded into a suitable carrier such as a liposome or bacteriophage for 
introduction into bacterial cells. For example, a nucleic acid having twenty or more 
nucleotides is capable of binding to bacteria nucleic acid or bacteria messenger RNA. 
10 Preferably, the antisense nucleic acid is comprised of 20 or more nucleotides to provide 
necessary stability of a hybridization product of non-naturally occurring nucleic acid and 
bacterial nucleic acid and/or bacterial messenger RNA. Nucleic acid having a sequence 
greater than 1000 nucleotides in length is difficult to synthesize but can be generated by 
recombinant DNA techniques. Methods for loading antisense nucleic acid in liposomes 
15 is known in the art as exemplified by U.S. Patent 4,241,046 issued December 23, 1980 to 
Papahadjopoulos et al. 

The present invention encompasses isolated polypeptides and nucleic acids 
derived from B. fragilis that are useful as reagents for diagnosis of bacterial infection, 
components of effective anti-bacterial vaccines, and/or as targets for anti-bacterial drugs, 
20 including anti-5. fragilis drugs. 

EXPRESSION OF B FRAGILIS NUCLEIC ACIDS 

Table 2, which is appended herewith and which forms part of the present 
specification, provides a list of open reading frames (ORFs) in both strands and a 
25 putative identification of the particular function of a polypeptide which is encoded by 
each ORF, based on the homology match (determined by the BLASTP2 algorithm) of the 
predicted polypeptide with known proteins encoded by ORFs in other organisms. An 
ORF is a region of nucleic acid which encodes a polypeptide. This region may represent 
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a portion of a coding sequence or a total sequence and was determined from stop to stop 
codons. The first column contains a designation for the ORF ("ORF Name"). The second 
and third columns list the SEQ ID numbers for the nucleic acid ("NT ID") and amino 
acid ( M AA ID") sequences corresponding to each ORF 5 respectively. The fourth and fifth 
5 columns list the length of the nucleic acid ORF ("NT Length") and the length of the 
amino acid ORF ("AA Length "), respectively. The nucleotide sequence corresponding 
to each ORF begins at the first nucleotide immediately following a stop codon and ends 
at the nucleotide immediately preceding the next downstream stop codon in the same 
reading frame. It will be recognized by one skilled in the art that the natural translation 

10 initiation sites will correspond to ATG, GTG, or TTG codons located within the ORFs. 
The natural initiation sites depend not only on the sequence of a start codon but also on 
the context of the DNA sequence adjacent to the start codon. Usually, a recognizable 
ribosome binding site is found within 20 nucleotides upstream from the initiation codon. 
In some cases where genes are translationally coupled and coordinately expressed 

15 together in "operons", ribosome binding sites are not present, but the initiation codon of a 
downstream gene may occur very close to, or overlap, the stop codon of the an upstream 
gene in the same operon. The correct start codons can be generally identified without 
undue experimentation because only a few codons need be tested. It is recognized that 
the translational machinery in bacteria initiates all polypeptide chains with the amino 

20 acid methionine, regardless of the sequence of the start codon. In some cases, 

polypeptides are post-translationally modified, resulting in an N-terminal amino acid 
other than methionine in vivo. The sixth and seventh columns provide metrics for 
assessing the likelihood of the homology match (determined by the BLASTP2 
algorithm), as is known in the art, to the genes indicated in the description frame 

25 ("Description") defined further below. These genes in the Description were identified 
when the designated ORF was compared against a comprehensive non-redundant protein 
database. Specifically, the sixth column represents the Blast Score ("Score") for the 
match (a higher score is a better match), and the seventh column represents the 
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probability ("Probability") for the match (the probability that such a match can have 
occurred by chance; the lower the value, the more likely the match is valid). If a 
BLASTP2 score of less than 100 was obtained, no value is reported in the table. The 
remaining fields below the columns contain additional information relating to the 

5 potential function of the sequence based on the BLASTP2 analysis. Where a match was 
discovered, the field "Protein name" list the protein's name identified from the match. In 
addition, one skilled in the art would be able to identify the match and elucidate its 
function using the "Locus name" and where available the accession number, "Acc#" from 
the database. Lastly, one skilled in the art would appreciate the "Description" field to 

10 further describe the potential function of the protein based on this analysis. This 

information allows one of ordinary skill in the art to determine a potential use for each 
identified coding sequence and, as a result, allows to use the polypeptides of the present 
invention for commercial and industrial purposes. 

Using the information provided in SEQ ID NO: 1 - SEQ ID NO: 5222, SEQ ID 

15 NO: 5223 - SEQ ID NO: 10444 and in Table 2 together with routine cloning and 

sequencing methods, one of ordinary skill in the art will be able to clone and sequence all 
the nucleic acid fragments of interest including open reading frames (ORFs) encoding a 
large variety of proteins of B. fragilis . 

Nucleic acid isolated or synthesized in accordance with the sequences described 

20 herein have utility to generate polypeptides. The nucleic acid of the invention 

exemplified in SEQ ID NO: 1 - SEQ ID NO: 5222 and in Table 2 or fragments of said 
nucleic acid encoding active portions of B, fragilis polypeptides can be cloned into 
suitable vectors or used to isolate nucleic acid. The isolated nucleic acid is combined 
with suitable DNA linkers and cloned into a suitable vector. 

25 The function of a specific gene or operon can be ascertained by expression in a 

bacterial strain under conditions where the activity of the gene product(s) specified by the 
gene or operon in question can be specifically measured. Alternatively, a gene product 
may be produced in large quantities in an expressing strain for use as an antigen, an 
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industrial reagent, for structural studies, etc. This expression can be accomplished in a 
mutant strain which lacks the activity of the gene to be tested, or in a strain that does not 
produce the same gene product(s). This includes, but is not limited to, Eucaryotic species 
such as the yeast Saccharomyces cerevisiae, Methanobacterium strains or other Archaea, 
5 and Eubacteria such as E. coli, B. Subtilis, S. Aureus, S. Pneumonia or Pseudomonas 
putida. In some cases the expression host will utilize the natural B. fragilis promoter 
whereas in others, it will be necessary to drive the gene with a promoter sequence 
derived from the expressing organism (e.g., an E. coli beta-galactosidase promoter for 
expression inE. coli). 

10 To express a gene product using the natural B. fragilis promoter, a procedure such 

as the following can be used. A restriction fragment containing the gene of interest, 
together with its associated natural promoter element and regulatory sequences 
(identified using the DNA sequence data) is cloned into an appropriate recombinant 
plasmid containing an origin of replication that functions in the host organism and an 

15 appropriate selectable marker. This can be accomplished by a number of procedures 
known to those skilled in the art. It is most preferably done by cutting the plasmid and 
the fragment to be cloned with the same restriction enzyme to produce compatible ends 
that can be ligated to join the two pieces together. The recombinant plasmid is 
introduced into the host organism by, for example, electroporation and cells containing 

20 the recombinant plasmid are identified by selection for the marker on the plasmid. 

Expression of the desired gene product is detected using an assay specific for that gene 
product. 

In the case of a gene that requires a different promoter, the body of the gene 
(coding sequence) is specifically excised and cloned into an appropriate expression 
25 plasmid. This subcloning can be done by several methods, but is most easily 
accomplished by PCR amplification of a specific fragment and ligation into an 
expression plasmid after treating the PCR product with a restriction enzyme or 
exonuclease to create suitable ends for cloning. 
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A suitable host cell for expression of a gene can be any procaryotic or eucaryotic 
cell. Suitable methods for transforming host cells can be found in Sambrook et al 
(Molecular Cloning: A Laboratory Manual 2nd Edition, Cold Spring Harbor Laboratory 
Press (1989)), and other laboratory textbooks. 

5 For example, a host cell transfected with a nucleic acid vector directing 

expression of a nucleotide sequence encoding an B. fragilis polypeptide can be cultured 
under appropriate conditions to allow expression of the polypeptide to occur. Suitable 
media for cell culture are well known in the art. Polypeptides of the invention can be 
isolated from cell culture medium, host cells, or both using techniques known in the art 

10 for purifying proteins including ion-exchange chromatography, gel filtration 

chromatography, ultrafiltration, electrophoresis, and immunoaffmity purification with 
antibodies specific for such polypeptides. Additionally, in many situations, polypeptides 
can be produced by chemical cleavage of a native protein (e.g., tryptic digestion) and the 
cleavage products can then be purified by standard techniques. 

15 In the case of membrane bound proteins, these can be isolated from a host cell by 

contacting a membrane-associated protein fraction with a detergent forming a solubilized 
complex, where the membrane-associated protein is no longer entirely embedded in the 
membrane fraction and is solubilized at least to an extent which allows it to be 
chromatographically isolated from the membrane fraction. Chromatographic techniques 

20 which can be used in the final purification step are known in the art and include 

hydrophobic interaction, lectin affinity, ion exchange, dye affinity and immunoaffmity. 

One strategy to maximize recombinant B. fragilis peptide expression in E. coli is 
to express the protein in a host bacteria with an impaired capacity to proteolytically 
cleave the recombinant protein (Gottesman, S., Gene Expression Technology: Methods 

25 inEnzymology 185 , Academic Press, San Diego, California (1990) 1 19-128). Another 
strategy would be to alter the nucleic acid encoding an B. fragilis peptide to be inserted 
into an expression vector so that the individual codons for each amino acid would be 
those preferentially utilized in highly expressed E. coli proteins (Wada et al., (1992) Nuc. 

-37- 



2709.1001-001 



Acids Res. 20:2111-2118). Such alteration of nucleic acids of the invention can be 

carried out by standard DNA synthesis techniques. 

The nucleic acids of the invention can also be chemically synthesized using 

standard techniques. Various methods of chemically synthesizing polydeoxynucleotides 
5 are known, including solid-phase synthesis which, like peptide synthesis, has been fully 

automated in commercially available DNA synthesizers (See, e.g., Itakura et al. U.S. 

Patent No. 4,598,049; Caruthers et al. U.S. Patent No. 4,458,066; and Itakura U.S. Patent 

Nos. 4,401,796 and 4,373,071, incorporated by reference herein). 

The present invention provides a library of B. fragilis -derived nucleic acid 
10 sequences. The libraries provide probes, primers, and markers which can be used as 

markers in epidemiological studies. The present invention also provides a library of B. 

fragilis -derived nucleic acid sequences which comprise or encode targets for therapeutic 

drugs. 

Nucleic acids comprising any of the sequences disclosed herein or sub-sequences 
15 thereof can be prepared by standard methods using the nucleic acid sequence information 
provided in SEQ ID NO: 1 - SEQ ID NO: 5222. For example, DNA can be chemically 
synthesized using, e.g., the phosphoramidite solid support method of Matteucci et al, 
1981, 1 Am. Chem. Soc. 103:3185, the method of Yoo et al, 1989, J. Biol. Chem. 
764: 17078, or other well known methods. This can be done by sequentially linking a 
20 series of oligonucleotide cassettes comprising pairs of synthetic oligonucleotides, as 
described below. 

Of course, due to the degeneracy of the genetic code, many different nucleotide 
sequences can encode polypeptides having the amino acid sequences defined by SEQ ID 
NO: 5223 - SEQ ID NO: 10444 or sub-sequences thereof. The codons can be selected 
25 for optimal expression in prokaryotic or eukaryotic systems. Such degenerate variants 
are also encompassed by this invention. 

Insertion of nucleic acids (typically DNAs) encoding the polypeptides of the 
invention into a vector is easily accomplished when the termini of both the DNAs and the 
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vector comprise compatible restriction sites. If this cannot be done, it may be necessary 
to modify the termini of the DNAs and/or vector by digesting back single-stranded DNA 
overhangs generated by restriction endonuclease cleavage to produce blunt ends, or to 
achieve the same result by filling in the single-stranded termini with an appropriate DNA 
5 polymerase. 

Alternatively, any site desired may be produced, e.g., by ligating nucleotide 
sequences (linkers) onto the termini. Such linkers may comprise specific oligonucleotide 
sequences that define desired restriction sites. Restriction sites can also be generated by 
the use of the polymerase chain reaction (PCR). See, e.g., Saiki et al, 1988, Science 

10 239:48. The cleaved vector and the DNA fragments may also be modified if required by 
homopolymeric tailing. 

The nucleic acids of the invention may be isolated directly from cells. 
Alternatively, the polymerase chain reaction (PCR) method can be used to produce the 
nucleic acids of the invention, using either chemically synthesized strands or genomic 

15 material as templates. Primers used for PCR can be synthesized using the sequence 
information provided herein and can further be designed to introduce appropriate new 
restriction sites, if desirable, to facilitate incorporation into a given vector for 
recombinant expression. 

The nucleic acids of the present invention may be flanked by natural B. fragilis 

20 regulatory sequences, or may be associated with heterologous sequences, including 
promoters, enhancers, response elements, signal sequences, polyadenylation sequences, 
introns, 5'- and 3'- noncoding regions, and the like. The nucleic acids may also be 
modified by many means known in the art. Non-limiting examples of such modifications 
include methylation, "caps", substitution of one or more of the naturally occurring 

25 nucleotides with an analog, internucleotide modifications such as, for example, those 
with uncharged linkages (e.g., methyl phosphonates, phosphotriesters, 
phosphoroamidates, carbamates, etc.) and with charged linkages (e.g., phosphorothioates, 
phosphorodithioates, etc.). Nucleic acids may contain one or more additional covalently 
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linked moieties, such as 5 for example, proteins (e.g., nucleases, toxins, antibodies, signal 
peptides, poly-L-lysine, etc.), intercalated (e.g., acridine, psoralen, etc.), chelators (e.g., 
metals, radioactive metals, iron, oxidative metals, etc.), and alkylators. PNAs are also 
included. The nucleic acid may be derivatized by formation of a methyl or ethyl 

5 phosphotriester or an alkyl phosphoramidate linkage. Furthermore, the nucleic acid 
sequences of the present invention may also be modified with a label capable of 
providing a detectable signal, either directly or indirectly. Exemplary labels include 
radioisotopes, fluorescent molecules, biotin, and the like. 

The invention also provides nucleic acid vectors comprising the disclosed B. 

10 fragilis -derived sequences or derivatives or fragments thereof. A large number of 
vectors, including plasmid and bacterial vectors, have been described for replication 
and/or expression in a variety of eukaryotic and prokaryotic hosts, and may be used for 
cloning or protein expression. 

The encoded B. fragilis polypeptides may be expressed by using many known 

15 vectors, such as pUC plasmids, pET plasmids (Novagen, Inc., Madison, WI), or pRSET 
or pREP (Invitrogen, San Diego, CA), and many appropriate host cells, using methods 
disclosed or cited herein or otherwise known to those skilled in the relevant art. The 
particular choice of vector/host is not critical to the practice of the invention. 

Recombinant cloning vectors will often include one or more replication systems 

20 for cloning or expression, one or more markers for selection in the host, e.g. antibiotic 
resistance, and one or more expression cassettes. The inserted B. fragilis coding 
sequences may be synthesized by standard methods, isolated from natural sources, or 
prepared as hybrids, etc. Ligation of the B, fragilis coding sequences to transcriptional 
regulatory elements and/or to other amino acid coding sequences may be achieved by 

25 known methods. Suitable host cells may be transformed/transfected/infected as 
appropriate by any suitable method including electroporation, CaCl 2 mediated DNA 
uptake, bacterial infection, microinjection, microprojectile, or other established methods. 
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Appropriate host cells include bacteria, archebacteria, fungi, especially yeast, and 
plant and animal cells, especially mammalian cells. Of particular interest are B. fragilis , 
E. coli, B. Subtilis, Saccharomyces cerevisiae, Saccharomyces carlsbergensis, 
Schizosaccharomyces pombi, SF9 cells, CI 29 cells, 293 cells, Neurospora, and CHO 

5 cells, COS cells, HeLa cells, and immortalized mammalian myeloid and lymphoid cell 
lines. Preferred replication systems include Ml 3, ColEl, SV40, baculovirus, lambda, 
adenovirus, and the like. A large number of transcription initiation and termination 
regulatory regions have been isolated and shown to be effective in the transcription and 
translation of heterologous proteins in the various hosts. Examples of these regions, 

10 methods of isolation, manner of manipulation, etc. are known in the art. Under 

appropriate expression conditions, host cells can be used as a source of recombinantly 
produced B. fragilis -derived peptides and polypeptides. 

Advantageously, vectors may also include a transcription regulatory element (i.e., 
a promoter) operably linked to the B. fragilis portion. The promoter may optionally 

15 contain operator portions and/or ribosome binding sites. Non-limiting examples of 
bacterial promoters compatible with E. coli include: b-lactamase (penicillinase) 
promoter; lactose promoter; tryptophan (top) promoter; araBAD (arabinose) operon 
promoter; lambda-derived Pi promoter and N gene ribosome binding site; and the hybrid 
tac promoter derived from sequences of the top and lac UV5 promoters. Non-limiting 

20 examples of yeast promoters include 3-phosphoglycerate kinase promoter, 

glyceraldehyde-3 -phosphate dehydrogenase (GAPDH) promoter, galactokinase (GAL1) 
promoter, galactoepimerase promoter, and alcohol dehydrogenase (ADH) promoter. 
Suitable promoters for mammalian cells include without limitation viral promoters such 
as that from Simian Virus 40 (SV40), Rous sarcoma virus (RSV), adenovirus (ADV), 

25 and bovine papilloma virus (BPV). Mammalian cells may also require terminator 
sequences, polyA addition sequences and enhancer sequences to increase expression. 
Sequences which cause amplification of the gene may also be desirable. Furthermore, 
sequences that facilitate secretion of the recombinant product from cells, including, but 
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not limited to, bacteria, yeast, and animal cells, such as secretory signal sequences and/or 
prohormone pro region sequences, may also be included. These sequences are well 
described in the art. 

Nucleic acids encoding wild-type or variant B. fragilis -derived polypeptides may 
5 also be introduced into cells by recombination events. For example, such a sequence can 
be introduced into a cell, and thereby effect homologous recombination at the site of an 
endogenous gene or a sequence with substantial identity to the gene. Other 
recombination-based methods such as nonhomologous recombinations or deletion of 
endogenous genes by homologous recombination may also be used. 
10 The nucleic acids of the present invention find use as templates for the 

recombinant production of B. fragilis -derived peptides or polypeptides. 

IDENTIFICATION AND USE OF B. FRAGILIS NUCLEIC ACID SEQUENCES 
The disclosed B. fragilis polypeptide and nucleic acid sequences, or other 

15 sequences that are contained within ORFs, including complete protein-coding sequences, 
of which any of the disclosed B. fragilis -specific sequences forms a part, are useful as 
target components for diagnosis and/or treatment of B, fragilis - caused infection 

It will be understood that the sequence of an entire protein-coding sequence of 
which each disclosed nucleic acid sequence forms a part can be isolated and identified 

20 based on each disclosed sequence. This can be achieved, for example, by using an 
isolated nucleic acid encoding the disclosed sequence, or fragments thereof, to prime a 
sequencing reaction with genomic B. fragilis DNA as template; this is followed by 
sequencing the amplified product. The isolated nucleic acid encoding the disclosed 
sequence, or fragments thereof, can also be hybridized to B. fragilis genomic libraries to 

25 identify clones containing additional complete segments of the protein-coding sequence 
of which the shorter sequence forms a part. Then, the entire protein-coding sequence, or 
fragments thereof, or nucleic acids encoding all or part of the sequence, or sequence- 
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conservative or function-conservative variants thereof, may be employed in practicing 
the present invention. 

Preferred sequences are those that are useful in diagnostic and/or therapeutic 
applications. Diagnostic applications include without limitation nucleic-acid-based and 
5 antibody-based methods for detecting bacterial infection. Therapeutic applications 
include without limitation vaccines, passive immunotherapy, and drug treatments 
directed against gene products that are both unique to bacteria and essential for growth 
and/or replication of bacteria. 

10 IDENTIFICATION OF NUCLEIC ACIDS ENCODING VACCINE COMPONENTS 
AND TARGETS FOR AGENTS EFFECTIVE AGAINST B. FRAGILIS 

The disclosed B. fragilis genome sequence includes segments that direct the 
synthesis of ribonucleic acids and polypeptides, as well as origins of replication, 
promoters, other types of regulatory sequences, and intergenic nucleic acids. The 

15 invention encompasses nucleic acids encoding immunogenic components of vaccines 
and targets for agents effective against B. fragilis . Identification of said immunogenic 
components involved in the determination of the function of the disclosed sequences, 
which can be achieved using a variety of approaches. Non-limiting examples of these 
approaches are described briefly below. 

20 

HOMOLOGY TO KNOWN SEQUENCES: 

Computer-assisted comparison of the disclosed B. fragilis sequences with 
previously reported sequences present in publicly available databases is useful for 
identifying functional B. fragilis nucleic acid and polypeptide sequences. It will be 
25 understood that protein-coding sequences, for example, may be compared as a whole, 
and that a high degree of sequence homology between two proteins (such as, for 
example, >80-90%) at the amino acid level indicates that the two proteins also possess 
some degree of functional homology, such as, for example, among enzymes involved in 
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metabolism, DNA synthesis, or cell wall synthesis, and proteins involved in transport, 
cell division, etc. In addition, many structural features of particular protein classes have 
been identified and correlate with specific consensus sequences, such as, for example, 
binding domains for nucleotides, DNA, metal ions, and other small molecules; sites for 

5 covalent modifications such as phosphorylation, acylation, and the like; sites of 

proteimprotein interactions, etc. These consensus sequences may be quite short and thus 
may represent only a fraction of the entire protein-coding sequence. Identification of 
such a feature in an B. fragilis sequence is therefore useful in determining the function of 
the encoded protein and identifying useful targets of antibacterial drugs. 

10 Of particular relevance to the present invention are structural features that are 

common to secretory, transmembrane, and surface proteins, including secretion signal 
peptides and hydrophobic transmembrane domains. B. fragilis proteins identified as 
containing putative signal sequences and/or transmembrane domains are useful as 
immunogenic components of vaccines. 

15 Targets for therapeutic drugs according to the invention include, but are not 

limited to, polypeptides of the invention, whether unique to B. fragilis or not, that are 
essential for growth and/or viability of B. fragilis under at least one growth condition. 
Polypeptides essential for growth and/or viability can be determined by examining the 
effect of deleting and/or disrupting the genes, i.e., by so-called gene "knockout". 

20 Alternatively, genetic footprinting can be used (Smith et al, 1995, Proc. Natl Acad Sci 
USA 92:5479-6433; Published International Application WO 94/26933; U.S. Patent No. 
5,612,180). Still other methods for assessing essentiality includes the ability to isolate 
conditional lethal mutations in the specific gene (e.g., temperature sensitive mutations). 
Other useful targets for therapeutic drugs, which include polypeptides that are not 

25 essential for growth or viability per se but lead to loss of viability of the cell, can be used 
to target therapeutic agents to cells. 
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STRAIN-SPECIFIC SEQUENCES: 

Because of the evolutionary relationship between different B. fragilis strains, it is 
believed that the presently disclosed B. fragilis sequences are useful for identifying, 
and/or discriminating between, previously known and new B. fragilis strains. It is 

5 believed that other B. fragilis strains will exhibit at least about 70% sequence homology 
with the presently disclosed sequence. Systematic and routine analyses of DNA 
sequences derived from samples containing B. fragilis strains, and comparison with the 
present sequence allows for the identification of sequences that can be used to 
discriminate between strains, as well as those that are common to all B. fragilis strains. 

10 In one embodiment, the invention provides nucleic acids, including probes, and peptide 
and polypeptide sequences that discriminate between different strains of B. fragilis . 
Strain-specific components can also be identified functionally by their ability to elicit or 
react with antibodies that selectively recognize one or more B. fragilis strains. 

In another embodiment, the invention provides nucleic acids, including probes, 

15 and peptide and polypeptide sequences that are common to all B. fragilis strains but are 
not found in other bacterial species. 

B. FRAGILIS POLYPEPTIDES 

This invention encompasses isolated B. fragilis polypeptides encoded by the 

20 disclosed B. fragilis genomic sequences, including the polypeptides of the invention 
contained in the Sequence Listing. Polypeptides of the invention are preferably at least 
about 5 amino acid residues in length. Using the DNA sequence information provided 
herein, the amino acid sequences of the polypeptides encompassed by the invention can 
be deduced using methods well-known in the art. It will be understood that the sequence 

25 of an entire nucleic acid encoding an B. fragilis polypeptide can be isolated and 

identified based on an ORF that encodes only a fragment of the cognate protein-coding 
region. This can be achieved, for example, by using the isolated nucleic acid encoding 
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the ORF, or fragments thereof, to prime a polymerase chain reaction with genomic B. 
fragilis DNA as template; this is followed by sequencing the amplified product. 

The polypeptides of the present invention, including function-conservative 
variants of the disclosed ORFs, may be isolated from wild-type or mutant B. fragilis 

5 cells, or from heterologous organisms or cells (including, but not limited to, bacteria, 
fungi, insect, plant, and mammalian cells) including B. fragilis into which an B. fragilis - 
derived protein-coding sequence has been introduced and expressed. Furthermore, the 
polypeptides may be part of recombinant fusion proteins. 

B. fragilis polypeptides of the invention can be chemically synthesized using 

10 commercially automated procedures such as those referenced herein , including, without 
limitation, exclusive solid phase synthesis, partial solid phase methods, fragment 
condensation or classical solution synthesis. The polypeptides are preferably prepared by 
solid phase peptide synthesis as described by Merrifield, 1963, 1 Am. Chem. Soc. 
85:2149. The synthesis is carried out with amino acids that are protected at the alpha- 

15 amino terminus. Trifunctional amino acids with labile side-chains are also protected 
with suitable groups to prevent undesired chemical reactions from occurring during the 
assembly of the polypeptides. The alpha-amino protecting group is selectively removed 
to allow subsequent reaction to take place at the amino-terminus. The conditions for the 
removal of the alpha-amino protecting group do not remove the side-chain protecting 

20 groups. 

Methods for polypeptide purification are well-known in the art, including, 
without limitation, preparative disc-gel electrophoresis, isoelectric focusing, HPLC, 
reversed-phase HPLC, gel filtration, ion exchange and partition chromatography, and 
countercurrent distribution. For some purposes, it is preferable to produce the 
25 polypeptide in a recombinant system in which the B. fragilis protein contains an 
additional sequence tag that facilitates purification, such as, but not limited to, a 
polyhistidine sequence. The polypeptide can then be purified from a crude lysate of the 
host cell by chromatography on an appropriate solid-phase matrix. Alternatively, 
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antibodies produced against an B. fragilis protein or against peptides derived therefrom 
can be used as purification reagents. Other purification methods are possible. 

The present invention also encompasses derivatives and homologues of B. 
fragilis -encoded polypeptides. For some purposes, nucleic acid sequences encoding the 

5 peptides may be altered by substitutions, additions, or deletions that provide for 

functionally equivalent molecules, i.e., function-conservative variants. For example, one 
or more amino acid residues within the sequence can be substituted by another amino 
acid of similar properties, such as, for example, positively charged amino acids (arginine, 
lysine, and histidine); negatively charged amino acids (aspartate and glutamate); polar 

10 neutral amino acids; and non-polar amino acids. 

The isolated polypeptides may be modified by, for example, phosphorylation, 
sulfation, acylation, or other protein modifications. They may also be modified with a 
label capable of providing a detectable signal, either directly or indirectly, including, but 
not limited to, radioisotopes and fluorescent compounds. 

15 To identify B. fragilis -derived polypeptides for use in the present invention, 

essentially the complete genomic sequence of a virulent, methicillin-resistant isolate of 
Bacteroides fragilis isolate was analyzed. While, in very rare instances, a nucleic acid 
sequencing error may be revealed, resolving a rare sequencing error is well within the art, 
and such an occurrence will not prevent one skilled in the art from practicing the 

20 invention. 

Also encompassed are any B. fragilis polypeptide sequences that are contained 
within the open reading frames (ORFs), including complete protein-coding sequences, of 
which any of SEQ ID NO: 1 - SEQ ID NO: 5222 forms a part. Table 2, which is 
appended herewith and which forms part of the present specification, provides a putative 
25 identification of the particular function of a polypeptide which is encoded by each ORF, 
based on the homology match (determined by the BLAST algorithm) of the predicted 
polypeptide with known proteins encoded by ORFs in other organisms. As a result, one 
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skilled in the art can use the polypeptides of the present invention for commercial and 
industrial purposes consistent with the type of putative identification of the polypeptide. 

The present invention provides a library of B. fragilis -derived polypeptide 
sequences, and a corresponding library of nucleic acid sequences encoding the 
5 polypeptides, wherein the polypeptides themselves, or polypeptides contained within 
ORFs of which they form a part, comprise sequences that are contemplated for use as 
components of vaccines. Non-limiting examples of such sequences are listed by SEQ ID 
NO in Table 2, which is appended herewith and which forms part of the present 
specification. 

10 The present invention also provides a library of B. fragilis -derived polypeptide 

sequences, and a corresponding library of nucleic acid sequences encoding the 
polypeptides, wherein the polypeptides themselves, or polypeptides contained within 
ORFs of which they form a part, comprise sequences lacking homology to any known 
prokaryotic or eukaryotic sequences. Such libraries provide probes, primers, and markers 

15 which can be used to diagnose B. fragilis infection, including use as markers in 

epidemiological studies. Non-limiting examples of such sequences are listed by SEQ ID 
NO in Table 2, which is appended hereto and part hereof. 

The present invention also provides a library of B. fragilis -derived polypeptide 
sequences, and a corresponding library of nucleic acid sequences encoding the 

20 polypeptides, wherein the polypeptides themselves, or polypeptides contained within 
ORFs of which they form a part, comprise targets for therapeutic drugs. 

SPECIFIC EXAMPLE: DETERMINATION OF BACTEROIDES PROTEIN 
ANTIGENS FOR ANTIBODY AND VACCINE DEVELOPMENT 
25 The selection of Bacteroides protein antigens for vaccine development can be 

derived from the nucleic acids encoding B. fragilis polypeptides. First, the ORFs can be 
analyzed for homology to other known exported or membrane proteins and analyzed 
using the discriminant analysis described by Klein, et al. (Klein, P., Kanehsia, M., and 
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DeLisi, C. (1985) Biochimica et Biophysica Acta 815, 468-476) for predicting exported 
and membrane proteins. 

Homology searches can be performed using the BLAST algorithm contained in 
the Wisconsin Sequence Analysis Package (Genetics Computer Group, University 

5 Research Park, 575 Science Drive, Madison, WI 53711) to compare each predicted ORF 
amino acid sequence with all sequences found in the current GenBank, SWISS-PROT 
and PIR databases. BLAST searches for local alignments between the ORF and the 
databank sequences and reports a probability score which indicates the probability of 
finding this sequence by chance in the database. ORFs with significant homology (e.g. 

1 0 probabilities lower than 1x10 that the homology is only due to random chance) to 
membrane or exported proteins represent protein antigens for vaccine development. 
Possible functions can be provided to B. fragilis genes based on sequence homology to 
genes cloned in other organisms. 

Discriminant analysis (Klein, et al. supra) can be used to examine the ORF amino 

15 acid sequences. This algorithm uses the intrinsic information contained in the ORF 
amino acid sequence and compares it to information derived from the properties of 
known membrane and exported proteins. This comparison predicts which proteins will 
be exported, membrane associated or cytoplasmic. ORF amino acid sequences identified 
as exported or membrane associated by this algorithm are likely protein antigens for 

20 vaccine development. 

PRODUCTION OF FRAGMENTS AND ANALOGS OF R FRAGILIS NUCLEIC 
ACIDS AND POLYPEPTIDES 

Based on the discovery of the B. fragilis gene products of the invention provided 
25 in the Sequence Listing, one skilled in the art can alter the disclosed structure of B. 
fragilis genes, e.g., by producing fragments or analogs, and test the newly produced 
structures for activity. Examples of techniques known to those skilled in the relevant art 
which allow the production and testing of fragments and analogs are discussed below. 
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These, or analogous methods can be used to make and screen libraries of polypeptides, 
e.g., libraries of random peptides or libraries of fragments or analogs of cellular proteins 
for the ability to bind B. fragilis polypeptides. Such screens are useful for the 
identification of inhibitors of B. fragilis . 

5 

GENERATION OF FRAGMENTS 

Fragments of a protein can be produced in several ways, e.g., recombinant^, by 
proteolytic digestion, or by chemical synthesis. Internal or terminal fragments of a 
polypeptide can be generated by removing one or more nucleotides from one end (for a 

10 terminal fragment) or both ends (for an internal fragment) of a nucleic acid which 
encodes the polypeptide. Expression of the mutagenized DNA produces polypeptide 
fragments. Digestion with "end-nibbling" endonucleases can thus generate DNAs which 
encode an array of fragments. DNAs which encode fragments of a protein can also be 
generated by random shearing, restriction digestion or a combination of the above- 

15 discussed methods. 

Fragments can also be chemically synthesized using techniques known in the art 
such as conventional Merrifield solid phase f-Moc or t-Boc chemistry. For example, 
peptides of the present invention may be arbitrarily divided into fragments of desired 
length with no overlap of the fragments, or divided into overlapping fragments of a 

20 desired length. 

ALTERATION OF NUCLEIC ACIDS AND POLYPEPTIDES: RANDOM METHODS 

Amino acid sequence variants of a protein can be prepared by random 
mutagenesis of DNA which encodes a protein or a particular domain or region of a 
25 protein. Useful methods include PCR mutagenesis and saturation mutagenesis. A library 
of random amino acid sequence variants can also be generated by the synthesis of a set of 
degenerate oligonucleotide sequences. (Methods for screening proteins in a library of 
variants are elsewhere herein). 
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PCR MUTAGENESIS 

In PCR mutagenesis, reduced Taq polymerase fidelity is used to introduce 

random mutations into a cloned fragment of DNA (Leung et al., 1989, Technique 1:11- 

5 1 5). The DNA region to be mutagenized is amplified using the polymerase chain 

reaction (PCR) under conditions that reduce the fidelity of DNA synthesis by Taq DNA 

2+ 

polymerase, e.g., by using a dGTP/dATP ratio of five and adding Mn to the PCR 
reaction. The pool of amplified DNA fragments are inserted into appropriate cloning 
vectors to provide random mutant libraries. 

10 

SATURATION MUTAGENESIS 

Saturation mutagenesis allows for the rapid introduction of a large number of 
single base substitutions into cloned DNA fragments (Mayers et al, 1985, Science 
229:242). This technique includes generation of mutations, e.g., by chemical treatment 

15 or irradiation of single-stranded DNA in vitro, and synthesis of a complimentary DNA 
strand. The mutation frequency can be modulated by modulating the severity of the 
treatment, and essentially all possible base substitutions can be obtained. Because this 
procedure does not involve a genetic selection for mutant fragments both neutral 
substitutions, as well as those that alter function, are obtained. The distribution of point 

20 mutations is not biased toward conserved sequence elements. 

DEGENERATE OLIGONUCLEOTIDES 

A library of homologs can also be generated from a set of degenerate 
oligonucleotide sequences. Chemical synthesis of a degenerate sequences can be carried 
25 out in an automatic DNA synthesizer, and the synthetic genes then ligated into an 

appropriate expression vector. The synthesis of degenerate oligonucleotides is known in 
the art (see for example, Narang, SA (1983) Tetrahedron 39:3; Itakura et al. (1981) 
Recombinant DNA, Proc 3rd Cleveland Sympos, Macromolecules, ed. AG Walton, 
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Amsterdam: Elsevier pp273 -289; Itakura et al. (1984) Annu. Rev. Biochem, 53:323; 
Itakura et al. (1984) Science 198:1056; Ike et al. (1983) Nucleic Acid Res. 1 1 :477. Such 
techniques have been employed in the directed evolution of other proteins (see, for 
example, Scott et al (1990) Science 249:386-390; Roberts et al. (1992) PNAS 89:2429- 
5 2433; Devlin et al. (1990) Science 249: 404-406; Cwirla et al. (1990) PNAS 87: 6378- 
6382; as well as U.S. Patents Nos. 5,223,409, 5,198,346, and 5,096,815). 

ALTERATION OF NUCLEIC ACIDS AND POLYPEPTIDES: METHODS FOR 
DIRECTED MUTAGENESIS 

10 Non-random or directed, mutagenesis techniques can be used to provide specific 

sequences or mutations in specific regions. These techniques can be used to create 
variants which include, e.g., deletions, insertions, or substitutions, of residues of the 
known amino acid sequence of a protein. The sites for mutation can be modified 
individually or in series, e.g., by (1) substituting first with conserved amino acids and 

15 then with more radical choices depending upon results achieved, (2) deleting the target 
residue, or (3) inserting residues of the same or a different class adjacent to the located 
site, or combinations of options 1-3. 

ALANINE SCANNING MUTAGENESIS 

20 Alanine scanning mutagenesis is a useful method for identification of certain 

residues or regions of the desired protein that are preferred locations or domains for 
mutagenesis, Cunningham and Wells (Science 244:1081-1085, 1989). In alanine 
scanning, a residue or group of target residues are identified (e.g., charged residues such 
as Arg, Asp, His, Lys, and Glu) and replaced by a neutral or negatively charged amino 

25 acid (most preferably alanine or polyalanine). Replacement of an amino acid can affect 
the interaction of the amino acids with the surrounding aqueous environment in or 
outside the cell. Those domains demonstrating functional sensitivity to the substitutions 
are then refined by introducing further or other variants at or for the sites of substitution. 
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Thus, while the site for introducing an amino acid sequence variation is predetermined, 
the nature of the mutation per se need not be predetermined. For example, to optimize 
the performance of a mutation at a given site, alanine scanning or random mutagenesis 
may be conducted at the target codon or region and the expressed desired protein subunit 
5 variants are screened for the optimal combination of desired activity, 

OLIGONUCLEOTIDE-MEDIATED MUTAGENESIS 
Oligonucleotide-mediated mutagenesis is a useful method for preparing 
substitution, deletion, and insertion variants of DNA, see, e.g., Adelman et al., (DNA 

10 2: 1 83, 1 983). Briefly, the desired DNA is altered by hybridizing an oligonucleotide 
encoding a mutation to a DNA template, where the template is the single-stranded form 
of a plasmid or bacteriophage containing the unaltered or native DNA sequence of the 
desired protein. After hybridization, a DNA polymerase is used to synthesize an entire 
second complementary strand of the template that will thus incorporate the 

15 oligonucleotide primer, and will code for the selected alteration in the desired protein 
DNA. Generally, oligonucleotides of at least about 25 nucleotides in length are used. 
An optimal oligonucleotide will have 12 to 15 nucleotides that are completely 
complementary to the template on either side of the nucleotide(s) coding for the 
mutation. This ensures that the oligonucleotide will hybridize properly to the single- 

20 stranded DNA template molecule. The oligonucleotides are readily synthesized using 
techniques known in the art such as that described by Crea et al. (Proc. Natl Acad Set 
USA, 75: 5765[1978]). 

CASSETTE MUTAGENESIS 
25 Another method for preparing variants, cassette mutagenesis, is based on the 

technique described by Wells et al. (Gene, 34:3 15[1 985]). The starting material is a 
plasmid (or other vector) which includes the protein subunit DNA to be mutated. The 
codon(s) in the protein subunit DNA to be mutated are identified. There must be a 
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unique restriction endonuclease site on each side of the identified mutation site(s). If no 
such restriction sites exist, they may be generated using the above-described 
oligonucleotide-mediated mutagenesis method to introduce them at appropriate locations 
in the desired protein subunit DNA. After the restriction sites have been introduced into 
the plasmid, the plasmid is cut at these sites to linearize it. A double- stranded 
oligonucleotide encoding the sequence of the DNA between the restriction sites but 
containing the desired mutation(s) is synthesized using standard procedures. The two 
strands are synthesized separately and then hybridized together using standard 
techniques. This double-stranded oligonucleotide is referred to as the cassette. This 
cassette is designed to have 3' and 5' ends that are comparable with the ends of the 
linearized plasmid, such that it can be directly ligated to the plasmid. This plasmid now 
contains the mutated desired protein subunit DNA sequence. 

COMBINATORIAL MUTAGENESIS 

Combinatorial mutagenesis can also be used to generate mutants (Ladner et al., 
WO 88/06630). In this method, the amino acid sequences for a group of homologs or 
other related proteins are aligned, preferably to promote the highest homology possible. 
All of the amino acids which appear at a given position of the aligned sequences can be 
selected to create a degenerate set of combinatorial sequences. The variegated library of 
variants is generated by combinatorial mutagenesis at the nucleic acid level, and is 
encoded by a variegated gene library. For example, a mixture of synthetic 
oligonucleotides can be enzymatically ligated into gene sequences such that the 
degenerate set of potential sequences are expressible as individual peptides, or 
alternatively, as a set of larger fusion proteins containing the set of degenerate sequences. 



-54- 



2709.1001-001 



OTHER MODIFICATIONS OF B. FRAGILIS NUCLEIC ACIDS AND 
POLYPEPTIDES 

It is possible to modify the structure of an B. fragilis polypeptide for such 
purposes as increasing solubility, enhancing stability (e.g., shelf life ex vivo and 

5 resistance to proteolytic degradation in vivo). A modified B. fragilis protein or peptide 
can be produced in which the amino acid sequence has been altered, such as by amino 
acid substitution, deletion, or addition as described herein. 

An B. fragilis peptide can also be modified by substitution of cysteine residues 
preferably with alanine, serine, threonine, leucine or glutamic acid residues to minimize 

10 dimerization via disulfide linkages. In addition, amino acid side chains of fragments of 
the protein of the invention can be chemically modified. Another modification is 
cyclization of the peptide. 

In order to enhance stability and/or reactivity, an B. fragilis polypeptide can be 
modified to incorporate one or more polymorphisms in the amino acid sequence of the 

15 protein resulting from any natural allelic variation. Additionally, D-amino acids, non- 
natural amino acids, or non-amino acid analogs can be substituted or added to produce a 
modified protein within the scope of this invention. Furthermore, an B. fragilis 
polypeptide can be modified using polyethylene glycol (PEG) according to the method of 
A. Sehon and co-workers (Wie et al, supra) to produce a protein conjugated with PEG. 

20 In addition, PEG can be added during chemical synthesis of the protein. Other 
modifications of B. fragilis proteins include reduction/alkylation (Tarr, Methods of 
Protein Microcharacterization, J. E. Silver ed., Humana Press, Clifton NJ 155-194 
(1986)); acylation (Tarr, supra); chemical coupling to an appropriate carrier (Mishell and 
Shiigi, eds, Selected Methods in Cellular Immunology, WH Freeman, San Francisco, CA 

25 (1980), U.S. Patent 4,939,239; or mild formalin treatment (Marsh, (1971) Int. Arch of 
Allergy andAppl Immunol, 41: 199-215). 

To facilitate purification and potentially increase solubility of an B. fragilis 
protein or peptide, it is possible to add an amino acid fusion moiety to the peptide 
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backbone. For example, hexa-histidine can be added to the protein for purification by 
immobilized metal ion affinity chromatography (Hochuli, E. et al., (1988) 
Bio/Technology, 6: 1321 - 1325). In addition, to facilitate isolation of peptides free of 
irrelevant sequences, specific endoprotease cleavage sites can be introduced between the 

5 sequences of the fusion moiety and the peptide. 

To potentially aid proper antigen processing of epitopes within an B. fragilis 
polypeptide, canonical protease sensitive sites can be engineered between regions, each 
comprising at least one epitope via recombinant or synthetic methods. For example, 
charged amino acid pairs, such as KK or RR, can be introduced between regions within a 

10 protein or fragment during recombinant construction thereof. The resulting peptide can 
be rendered sensitive to cleavage by cathepsin and/or other trypsin-like enzymes which 
would generate portions of the protein containing one or more epitopes. In addition, such 
charged amino acid residues can result in an increase in the solubility of the peptide. 

15 PRIMARY METHODS FOR SCREENING POLYPEPTIDES AND ANALOGS 

Various techniques are known in the art for screening generated mutant gene 
products. Techniques for screening large gene libraries often include cloning the gene 
library into replicable expression vectors, transforming appropriate cells with the 
resulting library of vectors, and expressing the genes under conditions in which detection 

20 of a desired activity, e.g., in this case, binding to B. fragilis polypeptide or an interacting 
protein, facilitates relatively easy isolation of the vector encoding the gene whose product 
was detected. Each of the techniques described below is amenable to high through-put 
analysis for screening large numbers of sequences created, e.g., by random mutagenesis 
techniques. 

25 

TWO HYBRID SYSTEMS 

Two hybrid assays such as the system described below (as with the other 
screening methods described herein), can be used to identify polypeptides, e.g., 
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fragments or analogs of a naturally-occurring B. fragilis polypeptide, e.g., of cellular 
proteins, or of randomly generated polypeptides which bind to an B. fragilis protein. 
(The B, fragilis domain is used as the bait protein and the library of variants are 
expressed as prey fusion proteins.) In an analogous fashion, a two hybrid assay (as with 
5 the other screening methods described herein), can be used to find polypeptides which 
bind an B. fragilis polypeptide. 

DISPLAY LIBRARIES 

In one approach to screening assays, the Bacteroides peptides are displayed on the 

10 surface of a cell or viral particle, and the ability of particular cells or viral particles to 
bind an appropriate receptor protein via the displayed product is detected in a "panning 
assay". For example, the gene library can be cloned into the gene for a surface 
membrane protein of a bacterial cell, and the resulting fusion protein detected by panning 
(Ladner et al., WO 88/06630; Fuchs et al. (1991) Bio/Technology 9:1370-1371; and 

15 Goward et al (1992) TIBS 18:136-140). In a similar fashion, a detectably labeled ligand 
can be used to score for potentially functional peptide homologs. Fluorescently labeled 
ligands, e.g., receptors, can be used to detect homologs which retain ligand-binding 
activity. The use of fluorescently labeled ligands, allows cells to be visually inspected 
and separated under a fluorescence microscope, or, where the morphology of the cell 

20 permits, to be separated by a fluorescence-activated cell sorter. 

A gene library can be expressed as a fusion protein on the surface of a viral 

particle. For instance, in the filamentous phage system, foreign peptide sequences can be 

expressed on the surface of infectious phage, thereby conferring two significant benefits. 

First, since these phage can be applied to affinity matrices at concentrations well over 
13 

25 10 phage per milliliter, a large number of phage can be screened at one time. Second, 
since each infectious phage displays a gene product on its surface, if a particular phage is 
recovered from an affinity matrix in low yield, the phage can be amplified by another 
round of infection. The group of almost identical E. coli filamentous phages, M13, fd., 
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and fl, are most often used in phage display libraries. Either of the phage gill or gVIII 
coat proteins can be used to generate fusion proteins without disrupting the ultimate 
packaging of the viral particle. Foreign epitopes can be expressed at the NH2-terminal 
end of pill and phage bearing such epitopes recovered from a large excess of phage 

5 lacking this epitope (Ladner et al PCT publication WO 90/02909; Garrard et al., PCT 
publication WO 92/09690; Marks et aL (1992) J. Biol Chern. 267:16007-16010; 
Griffiths et al. (1993) EMBO J 12:725-734; Clackson et al. (1991) Nature 352:624-628; 
and Barbas et al. (1992) PNAS 89:4457-4461). 

A common approach uses the maltose receptor of E. coli (the outer membrane 

10 protein, LamB) as a peptide fusion partner (Charbit et al (1 986) EMBO 5, 3029-3037). 
Oligonucleotides have been inserted into plasmids encoding the LamB gene to produce 
peptides fused into one of the extracellular loops of the protein. These peptides are 
available for binding to ligands, e.g., to antibodies, and can elicit an immune response 
when the cells are administered to animals. Other cell surface proteins, e.g., OmpA 

15 (Schorr et al. (1991) Vaccines 91, pp. 387-392), PhoE (Agterberg, et al. (1990) Gene 88, 
37-45), and PAL (Fuchs et al. (1991) Bio/Tech 9, 1369-1372), as well as large bacterial 
surface structures have served as vehicles for peptide display. Peptides can be fused to 
pilin, a protein which polymerizes to form the pilus-a conduit for interbacterial exchange 
of genetic information (Thiry et al (1989) Appl Environ. Microbiol 55, 984-993). 

20 Because of its role in interacting with other cells, the pilus provides a useful support for 
the presentation of peptides to the extracellular environment. Another large surface 
structure used for peptide display is the bacterial motive organ, the flagellum. Fusion of 
peptides to the subunit protein flagellin offers a dense array of many peptide copies on 
the host cells (Kuwajima et al. (1988) Bio/Tech. 6, 1080-1083). Surface proteins of other 

25 bacterial species have also served as peptide fusion partners. Examples include the 

Staphylococcus protein A and the outer membrane IgA protease of Neisseria (Hansson et 
al. (1992) J. Bacteriol 174, 4239-4245 and Klauser et al. (1990) EMBO J, 9, 1991- 
1999). 
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In the filamentous phage systems and the LamB system described above, the 
physical link between the peptide and its encoding DNA occurs by the containment of 
the DNA within a particle (cell or phage) that carries the peptide on its surface. 
Capturing the peptide captures the particle and the DNA within. An alternative scheme 

5 uses the DNA-binding protein Lad to form a link between peptide and DNA (Cull et al 
(1992) PNAS USA 89:1865-1869). This system uses a plasmid containing the Lad gene 
with an oligonucleotide cloning site at its 3 '-end. Under the controlled induction by 
arabinose, a Lacl-peptide fusion protein is produced. This fusion retains the natural 
ability of Lad to bind to a short DNA sequence known as LacO operator (LacO). By 

10 installing two copies of LacO on the expression plasmid, the Lacl-peptide fusion binds 
tightly to the plasmid that encoded it. Because the plasmids in each cell contain only a 
single oligonucleotide sequence and each cell expresses only a single peptide sequence, 
the peptides become specifically and stablely associated with the DNA sequence that 
directed its synthesis. The cells of the library are gently lysed and the peptide-DNA 

15 complexes are exposed to a matrix of immobilized receptor to recover the complexes 
containing active peptides. The associated plasmid DNA is then reintroduced into cells 
for amplification and DNA sequencing to determine the identity of the peptide ligands. 
As a demonstration of the practical utility of the method, a large random library of 
dodecapeptides was made and selected on a monoclonal antibody raised against the 

20 opioid peptide dynorphin B. A cohort of peptides was recovered, all related by a 

consensus sequence corresponding to a six-residue portion of dynorphin B. (Cull et al. 
(1992) Proc. Natl Acad Sci U.S.A. 89-1869) 

This scheme, sometimes referred to as peptides-on-plasmids, differs in two 
important ways from the phage display methods. First, the peptides are attached to the 

25 C-terminus of the fusion protein, resulting in the display of the library members as 
peptides having free carboxy termini. Both of the filamentous phage coat proteins, pill 
and pVIII, are anchored to the phage through their C-termini, and the guest peptides are 
placed into the outward-extending N-terminal domains. In some designs, the phage- 
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displayed peptides are presented right at the amino terminus of the fusion protein. 
(Cwirla, et al. (1990) Proc. Natl. Acad. Set U.S.A. 87, 6378-6382) A second difference 
is the set of biological biases affecting the population of peptides actually present in the 
libraries. The Lad fusion molecules are confined to the cytoplasm of the host cells. The 

5 phage coat fusions are exposed briefly to the cytoplasm during translation but are rapidly 
secreted through the inner membrane into the periplasmic compartment, remaining 
anchored in the membrane by their C-terminal hydrophobic domains, with the N-termini, 
containing the peptides, protruding into the periplasm while awaiting assembly into 
phage particles. The peptides in the Lad and phage libraries may differ significantly as a 

10 result of their exposure to different proteolytic activities. The phage coat proteins require 
transport across the inner membrane and signal peptidase processing as a prelude to 
incorporation into phage. Certain peptides exert a deleterious effect on these processes 
and are underrepresented in the libraries (Gallop et al. (1994) J. Med. Chern. 37(9):1233- 
1251). These particular biases are not a factor in the LacI display system. 

15 The number of small peptides available in recombinant random libraries is 

7 9 . 
enormous. Libraries of 10 -10 independent clones are routinely prepared. Libraries as 

large as 10 1 1 recombinants have been created, but this size approaches the practical limit 

for clone libraries. This limitation in library size occurs at the step of transforming the 

DNA containing randomized segments into the host bacterial cells. To circumvent this 

20 limitation, an in vitro system based on the display of nascent peptides in polysome 

complexes has recently been developed. This display library method has the potential of 

producing libraries 3-6 orders of magnitude larger than the currently available 

phage/phagemid or plasmid libraries. Furthermore, the construction of the libraries, 

expression of the peptides, and screening, is done in an entirely cell-free format. 

25 In one application of this method (Gallop et al. (1994) J. Med. Chem. 37(9): 1233- 

12 

1251), a molecular DNA library encoding 1 0 decapeptides was constructed and the 
library expressed in an E. coli S30 in vitro coupled transcription/translation system. 
Conditions were chosen to stall the ribosomes on the mRNA, causing the accumulation 
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of a substantial proportion of the RNA in polysomes and yielding complexes containing 
nascent peptides still linked to their encoding RNA. The polysomes are sufficiently 
robust to be affinity purified on immobilized receptors in much the same way as the more 
conventional recombinant peptide display libraries are screened. RNA from the bound 

5 complexes is recovered, converted to cDNA, and amplified by PCR to produce a 
template for the next round of synthesis and screening. The polysome display method 
can be coupled to the phage display system. Following several rounds of screening, 
cDNA from the enriched pool of polysomes was cloned into a phagemid vector. This 
vector serves as both a peptide expression vector, displaying peptides fused to the coat 

10 proteins, and as a DNA sequencing vector for peptide identification. By expressing the 
polysome-derived peptides on phage, one can either continue the affinity selection 
procedure in this format or assay the peptides on individual clones for binding activity in 
a phage ELISA, or for binding specificity in a completion phage ELISA (Barret, et al. 
(1992) Anal Biochem 204,357-364). To identify the sequences of the active peptides 

15 one sequences the DNA produced by the phagemid host. 

SECONDARY SCREENING OF POLYPEPTIDES AND ANALOGS 

The high through-put assays described above can be followed by secondary 
screens in order to identify further biological activities which will, e.g., allow one skilled 

20 in the art to differentiate agonists from antagonists. The type of a secondary screen used 
will depend on the desired activity that needs to be tested. For example, an assay can be 
developed in which the ability to inhibit an interaction between a protein of interest and 
its respective ligand can be used to identify antagonists from a group of peptide 
fragments isolated though one of the primary screens described above. 

25 Therefore, methods for generating fragments and analogs and testing them for 

activity are known in the art. Once the core sequence of interest is identified, it is routine 
for one skilled in the art to obtain analogs and fragments. 
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PEPTIDE MIMETICS OF B. FRAGILIS POLYPEPTIDES 

The invention also provides for reduction of the protein binding domains of the 
subject B. fragilis polypeptides to generate mimetics, e.g. peptide or non-peptide agents. 
The peptide mimetics are able to disrupt binding of a polypeptide to its counter ligand, 

5 e.g., in the case of an B. fragilis polypeptide binding to a naturally occurring ligand. The 
critical residues of a subject B. fragilis polypeptide which are involved in molecular 
recognition of a polypeptide can be determined and used to generate B. fragilis -derived 
peptidomimetics which competitively or noncompetitively inhibit binding of the B. 
fragilis polypeptide with an interacting polypeptide (see, for example, European patent 

10 applications EP-412/762A and EP-B31,080A). 

For example, scanning mutagenesis can be used to map the amino acid residues 
of a particular B. fragilis polypeptide involved in binding an interacting polypeptide, 
peptidomimetic compounds (e.g. diazepine or isoquinoline derivatives) can be generated 
which mimic those residues in binding to an interacting polypeptide, and which therefore 

15 can inhibit binding of an B. fragilis polypeptide to an interacting polypeptide and thereby 
interfere with the function of B. fragilis polypeptide. For instance, non-hydrolyzable 
peptide analogs of such residues can be generated using benzodiazepine (e.g., see 
Freidinger et al. in Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM 
Publisher: Leiden, Netherlands, 1988), azepine (e.g., see Huffman et al. in Peptides: 

20 Chemistry and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 
1988), substituted gama lactam rings (Garvey et al. in Peptides: Chemistry and Biology, 
G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), keto-methylene 
pseudopeptides (Ewenson et al. (1986) J Med Chem 29:295; and Ewenson et al. in 
Peptides: Structure and Function (Proceedings of the 9th American Peptide Symposium) 

25 Pierce Chemical Co. Rockland, IL, 1985), b-turn dipeptide cores (Nagai et al (1985) 
Tetrahedron Lett 26:647; and Sato et al (1986) J Chem Soc Perkin Trans 1:1231), and b- 
aminoalcohols (Gordon et al. (1985) Biochem Biophys Res Commun 126:419; and et al. 
(1986) Biochem Biophys Res Commun 134:71). 
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VACCINE FORMULATIONS FOR B. FRAGILIS NUCLEIC ACIDS AND 
POLYPEPTIDES 

This invention also features vaccine compositions for protection against infection 
5 by B. fragilis or for treatment of B. fragilis infection. In one embodiment, the vaccine 
compositions contain one or more immunogenic components such as a surface protein 
from B. fragilis , or portion thereof, and a pharmaceutical^ acceptable carrier. Nucleic 
acids within the scope of the invention are exemplified by the nucleic acids of the 
invention contained in the Sequence Listing which encode B. fragilis surface proteins. 
10 Any nucleic acid encoding an immunogenic B. fragilis protein, or portion thereof, which 
is capable of expression in a cell, can be used in the present invention. These vaccines 
have therapeutic and prophylactic utilities. 

One aspect of the invention provides a vaccine composition for protection against 
infection by B. fragilis which contains at least one immunogenic fragment of an B. 
1 5 fragilis protein and a pharmaceutically acceptable carrier. Preferred fragments include 
peptides of at least about 10 amino acid residues in length, preferably about 10-20 amino 
acid residues in length, and more preferably about 12-16 amino acid residues in length. 

Immunogenic components of the invention can be obtained, for example, by 
screening polypeptides recombinantly produced from the corresponding fragment of the 
20 nucleic acid encoding the full-length B. fragilis protein. In addition, fragments can be 
chemically synthesized using techniques known in the art such as conventional 
Merrifield solid phase f-Moc or t-Boc chemistry. 

In one embodiment, immunogenic components are identified by the ability of the 
peptide to stimulate T cells. Peptides which stimulate T cells, as determined by, for 
25 example, T cell proliferation or cytokine secretion are defined herein as comprising at 
least one T cell epitope. T cell epitopes are believed to be involved in initiation and 
perpetuation of the immune response to the protein allergen which is responsible for the 
clinical symptoms of allergy. These T cell epitopes are thought to trigger early events at 
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the level of the T helper cell by binding to an appropriate HLA molecule on the surface 
of an antigen presenting cell, thereby stimulating the T cell subpopulation with the 
relevant T cell receptor for the epitope. These events lead to T cell proliferation, 
lymphokine secretion, local inflammatory reactions, recruitment of additional immune 

5 cells to the site of antigen/T cell interaction, and activation of the B cell cascade, leading 
to the production of antibodies. A T cell epitope is the basic element, or smallest unit of 
recognition by a T cell receptor, where the epitope comprises amino acids essential to 
receptor recognition (e.g., approximately 6 or 7 amino acid residues). Amino acid 
sequences which mimic those of the T cell epitopes are within the scope of this 

10 invention. 

Screening immunogenic components can be accomplished using one or more of 
several different assays. For example, in vitro, peptide T cell stimulatory activity is 
assayed by contacting a peptide known or suspected of being immunogenic with an 
antigen presenting cell which presents appropriate MHC molecules in a T cell culture. 

15 Presentation of an immunogenic B. fragilis peptide in association with appropriate MHC 
molecules to T cells in conjunction with the necessary co-stimulation has the effect of 
transmitting a signal to the T cell that induces the production of increased levels of 
cytokines, particularly of interleukin-2 and interleukin-4. The culture supernatant can be 
obtained and assayed for interleukin-2 or other known cytokines. For example, any one 

20 of several conventional assays for interleukin-2 can be employed, such as the assay 

described in Proc. Natl Acad Sci USA, 86: 1333 (1989) the pertinent portions of which 

are incorporated herein by reference. A kit for an assay for the production of interferon is 

also available from Genzyme Corporation (Cambridge, MA). 

Alternatively, a common assay for T cell proliferation entails measuring tritiated 

25 thymidine incorporation. The proliferation of T cells can be measured in vitro by 

3 

determining the amount of H-labeled thymidine incorporated into the replicating DNA 
of cultured cells. Therefore, the rate of DNA synthesis and, in turn, the rate of cell 
division can be quantified. 

-64- 



2709.1001-001 



Vaccine compositions of the invention containing immunogenic components 
(e.g., B. fragilis polypeptide or fragment thereof or nucleic acid encoding an B. fragilis 
polypeptide or fragment thereof) preferably include a pharmaceutically acceptable 
carrier. The term "pharmaceutically acceptable carrier" refers to a carrier that does not 
5 cause an allergic reaction or other untoward effect in patients to whom it is administered. 
Suitable pharmaceutically acceptable carriers include, for example, one or more of water, 
saline, phosphate buffered saline, dextrose, glycerol, ethanol and the like, as well as 
combinations thereof. Pharmaceutically acceptable carriers may further comprise minor 
amounts of auxiliary substances such as wetting or emulsifying agents, preservatives or 
10 buffers, which enhance the shelf life or effectiveness of the antibody. For vaccines of the 
invention containing B. fragilis polypeptides, the polypeptide is co-administered with a 
suitable adjuvant. 

It will be apparent to those of skill in the art that the therapeutically effective 
amount of DNA or protein of this invention will depend, inter alia, upon the 

15 administration schedule, the unit dose of antibody administered, whether the protein or 
DNA is administered in combination with other therapeutic agents, the immune status 
and health of the patient, and the therapeutic activity of the particular protein or DNA. 

Vaccine compositions are conventionally administered parenterally, e.g., by 
injection, either subcutaneously or intramuscularly. Methods for intramuscular 

20 immunization are described by Wolff et al (1990) Science 247: 1465-1468 and by 
Sedegah et al. (1994) Immunology 91 : 9866-9870. Other modes of administration 
include oral and pulmonary formulations, suppositories, and transdermal applications. 
Oral immunization is preferred over parenteral methods for inducing protection against 
infection by B. fragilis . Cainet al. (1993) Vaccine 11: 637-642. Oral formulations 

25 include such normally employed excipients as, for example, pharmaceutical grades of 
mannitol, lactose, starch, magnesium stearate, sodium saccharine, cellulose, magnesium 
carbonate, and the like. 



-65- 



2709.1001-001 



The vaccine compositions of the invention can include an adjuvant, including, but 
not limited to aluminum hydroxide; N-acetyl-muramyl--L-threonyl-D-isoglutamine (thr- 
MDP); N-acetyl-nor-muramyl-L-alanyl-D-isoglutamine (CGP 1 1637, referred to as nor- 
MDP) ; N-acetylmuramyl-L-alanyl-D-isoglutaminyl-L-alanine-2-( 1 ? -2'-dipalmitoyl-sn- 

5 glycero-3-hydroxyphos-phoryloxy)-ethylamine (CGP 19835A, referred to a MTP-PE); 
RIBI, which contains three components from bacteria; monophosphoryl lipid A; 
trehalose dimycoloate; cell wall skeleton (MPL + TDM + CWS) in a 2% squalene/Tween 
80 emulsion; and cholera toxin. Others which may be used are non-toxic derivatives of 
cholera toxin, including its B subunit, and/or conjugates or genetically engineered fusions 

10 of the B, fragilis polypeptide with cholera toxin or its B subunit, procholeragenoid, 

fungal polysaccharides, including schizophyllan, muramyl dipeptide, muramyl dipeptide 
derivatives, phorbol esters, labile toxin of E. coli, non-5, fragilis bacterial lysates, block 
polymers or saponins. 

Other suitable delivery methods include biodegradable microcapsules or immuno- 

15 stimulating complexes (ISCOMs), cochleates, or liposomes, genetically engineered 

attenuated live vectors such as viruses or bacteria, and recombinant (chimeric) virus-like 
particles, e.g., bluetongue. The amount of adjuvant employed will depend on the type of 
adjuvant used. For example, when the mucosal adjuvant is cholera toxin, it is suitably 
used in an amount of 5 mg to 50 mg, for example 10 mg to 35 mg. When used in the 

20 form of microcapsules, the amount used will depend on the amount employed in the 
matrix of the microcapsule to achieve the desired dosage. The determination of this 
amount is within the skill of a person of ordinary skill in the art. 

Carrier systems in humans may include enteric release capsules protecting the 
antigen from the acidic environment of the stomach, and including B, fragilis polypeptide 

25 in an insoluble form as fusion proteins. Suitable carriers for the vaccines of the invention 
are enteric coated capsules and polylactide-glycolide microspheres. Suitable diluents are 
0.2 N NaHC0 3 and/or saline. 
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Vaccines of the invention can be administered as a primary prophylactic agent in 
adults or in children, as a secondary prevention, after successful eradication of B. fragilis 
in an infected host, or as a therapeutic agent in the aim to induce an immune response in 
a susceptible host to prevent infection by B. fragilis . The vaccines of the invention are 
5 administered in amounts readily determined by persons of ordinary skill in the art. Thus, 
for adults a suitable dosage will be in the range of 10 mg to 10 g, preferably 10 mg to 100 
mg. A suitable dosage for adults will also be in the range of 5 mg to 500 mg. Similar 
dosage ranges will be applicable for children. Those skilled in the art will recognize that 
the optimal dose may be more or less depending upon the patient's body weight, disease, 

10 the route of administration, and other factors. Those skilled in the art will also recognize 
that appropriate dosage levels can be obtained based on results with known oral vaccines 
such as, for example, a vaccine based on an E. coli lysate (6 mg dose daily up to total of 
540 mg) and with an enterotoxigenic E. coli purified antigen (4 doses of 1 mg) 
(Schulman et al., J. Urol 150:917-921 (1993); Boedecker et al., American 

15 Gastroenterological Assoc. 999:A-222 (1993)). The number of doses will depend upon 
the disease, the formulation, and efficacy data from clinical trials. Without intending any 
limitation as to the course of treatment, the treatment can be administered over 3 to 8 
doses for a primary immunization schedule over 1 month (Boedeker, American 
Gastroenterological Assoc. 888:A-222 (1993)). 

20 In a preferred embodiment, a vaccine composition of the invention can be based 

on a killed whole E. coli preparation with an immunogenic fragment of an B, fragilis 
protein of the invention expressed on its surface or it can be based on an E. coli lysate, 
wherein the killed E. coli acts as a carrier or an adjuvant. 

It will be apparent to those skilled in the art that some of the vaccine 

25 compositions of the invention are useful only for preventing B. fragilis infection, some 
are useful only for treating B. fragilis infection, and some are useful for both preventing 
and treating B. fragilis infection. In a preferred embodiment, the vaccine composition of 
the invention provides protection against B. fragilis infection by stimulating humoral 
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and/or cell-mediated immunity against B. fragilis . It should be understood that 
amelioration of any of the symptoms of B. fragilis infection is a desirable clinical goal, 
including a lessening of the dosage of medication used to treat B. fragilis -caused disease, 
or an increase in the production of antibodies in the serum or mucous of patients. 

5 

ANTIBODIES REACTIVE WITH B. FRAGILIS POLYPEPTIDES 

The invention also includes antibodies specifically reactive with the subject B. 
fragilis polypeptide. Anti-protein/anti-peptide antisera or monoclonal antibodies can be 
made by standard protocols (See, for example, Antibodies: A Laboratory Manual ed. by 

10 Harlow and Lane (Cold Spring Harbor Press: 1988)). A mammal such as a mouse, a 
hamster or rabbit can be immunized with an immunogenic form of the peptide. 
Techniques for conferring immunogenicity on a protein or peptide include conjugation to 
carriers or other techniques well known in the art. An immunogenic portion of the 
subject B. fragilis polypeptide can be administered in the presence of adjuvant. The 

1 5 progress of immunization can be monitored by detection of antibody titers in plasma or 
serum. Standard ELISA or other immunoassays can be used with the immunogen as 
antigen to assess the levels of antibodies. 

In a preferred embodiment, the subject antibodies are immunospecific for 
antigenic determinants of the B. fragilis polypeptides of the invention, e.g. antigenic 

20 determinants of a polypeptide of the invention contained in the Sequence Listing, or a 
closely related human or non-human mammalian homolog (e.g., 90% homologous, more 
preferably at least about 95% homologous). In yet a further preferred embodiment of the 
invention, the anti-5. fragilis antibodies do not substantially cross react (i.e., react 
specifically) with a protein which is for example, less than 80% percent homologous to a 

25 sequence of the invention contained in the Sequence Listing. By "not substantially cross 
react", it is meant that the antibody has a binding affinity for a non-homologous protein 
which is less than 10 percent, more preferably less than 5 percent, and even more 
preferably less than 1 percent, of the binding affinity for a protein of the invention 
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contained in the Sequence Listing. In a most preferred embodiment, there is no cross- 
reactivity between bacterial and mammalian antigens. 

The term antibody as used herein is intended to include fragments thereof which 
are also specifically reactive with B. fragilis polypeptides. Antibodies can be fragmented 

5 using conventional techniques and the fragments screened for utility in the same manner 
as described above for whole antibodies. For example, F(ab')2 fragments can be 
generated by treating antibody with pepsin. The resulting F(ab')2 fragment can be treated 
to reduce disulfide bridges to produce Fab ! fragments. The antibody of the invention is 
further intended to include bispecific and chimeric molecules having an anti-5. fragilis 

10 portion. 

Both monoclonal and polyclonal antibodies (Ab) directed against B. fragilis 
polypeptides or B. fragilis polypeptide variants, and antibody fragments such as Fab T and 
F(ab')2, can be used to block the action of B. fragilis polypeptide and allow the study of 
the role of a particular B. fragilis polypeptide of the invention in aberrant or unwanted 

15 intracellular signaling, as well as the normal cellular function of the B. fragilis and by 
microinjection of anti-5. fragilis polypeptide antibodies of the present invention. 

Antibodies which specifically bind B. fragilis epitopes can also be used in 
immunohistochemical staining of tissue samples in order to evaluate the abundance and 
pattern of expression of B. fragilis antigens. Anti-5. fragilis polypeptide antibodies can 

20 be used diagnostically in immuno-precipitation and immuno-blotting to detect and 

evaluate B. fragilis levels in tissue or bodily fluid as part of a clinical testing procedure. 
Likewise, the ability to monitor B. fragilis polypeptide levels in an individual can allow 
determination of the efficacy of a given treatment regimen for an individual afflicted with 
such a disorder. The level of an B. fragilis polypeptide can be measured in cells found in 

25 bodily fluid, such as in urine samples or can be measured in tissue, such as produced by 
gastric biopsy. Diagnostic assays using anti-5. fragilis antibodies can include, for 
example, immunoassays designed to aid in early diagnosis of 5. fragilis infections. The 
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present invention can also be used as a method of detecting antibodies contained in 
samples from individuals infected by this bacterium using specific B. fragilis antigens. 

Another application of anti-5. fragilis polypeptide antibodies of the invention is 
in the immunological screening of cDNA libraries constructed in expression vectors such 

5 as Jlgt 1 1 , Xgtl 8-23 , XZAP, and ^ORF8 . Messenger libraries of this type, having coding 
sequences inserted in the correct reading frame and orientation, can produce fusion 
proteins. For instance, Xgtl 1 will produce fusion proteins whose amino termini consist 
of B-galactosidase amino acid sequences and whose carboxy termini consist of a foreign 
polypeptide. Antigenic epitopes of a subject B. fragilis polypeptide can then be detected 

10 with antibodies, as, for example, reacting nitrocellulose filters lifted from infected plates 
with anti-5. fragilis polypeptide antibodies. Phage, scored by this assay, can then be 
isolated from the infected plate. Thus, the presence of B. fragilis gene homologs can be 
detected and cloned from other species, and alternate isoforms (including splicing 
variants) can be detected and cloned. 

15 

KITS CONTAINING NUCLEIC ACIDS, POLYPEPTIDES OR ANTIBODIES OF THE 
INVENTION 

The nucleic acid, polypeptides and antibodies of the invention can be combined 
with other reagents and articles to form kits. Kits for diagnostic purposes typically 

20 comprise the nucleic acid, polypeptides or antibodies in vials or other suitable vessels. 
Kits typically comprise other reagents for performing hybridization reactions, polymerase 
chain reactions (PCR), or for reconstitution of lyophilized components, such as aqueous 
media, salts, buffers, and the like. Kits may also comprise reagents for sample 
processing such as detergents, chaotropic salts and the like. Kits may also comprise 

25 immobilization means such as particles, supports, wells, dipsticks and the like. Kits may 
also comprise labeling means such as dyes, developing reagents, radioisotopes, 
fluorescent agents, luminescent or chemiluminescent agents, enzymes, intercalating 
agents and the like. With the nucleic acid and amino acid sequence information provided 

-70- 



2709.1001-001 



herein, individuals skilled in art can readily assemble kits to serve their particular 
purpose. Kits further can include instructions for use. 

BIO CHIP TECHNOLOGY 
5 The nucleic acid sequence of the present invention may be used to detect B. 

fragilis or other species of Bacteroides acid sequence using bio chip technology. Bio 
chips containing arrays of nucleic acid sequence can also be used to measure expression 
of genes of B. fragilis or other species of Bacteroides. For example, to diagnose a patient 
with a B. fragilis or other Bacteroides infection, a sample from a human or animal can be 

10 used as a probe on a bio chip containing an array of nucleic acid sequence from the 
present invention. In addition, a sample from a disease state can be compared to a 
sample from a non-disease state which would help identify a gene that is up-regulated or 
expressed in the disease state. This would provide valuable insight as to the mechanism 
by which the disease manifests. Changes in gene expression can also be used to identify 

15 critical pathways involved in drug transport or metabolism, and may enable the 

identification of novel targets involved in virulence or host cell interactions involved in 
maintenance of an infection. Procedures using such techniques have been described by 
Brown et ah, 1995, Science 270: 467-470. 

Bio chips can also be used to monitor the genetic changes of potential therapeutic 

20 compounds including, deletions, insertions or mismatches. Once the therapeutic is added 
to the patient, changes to the genetic sequence can be evaluated for its efficacy. In 
addition, the nucleic acid sequence of the present invention can be used to determine 
essential genes in cell cycling. As described in Iyer et aL, 1999 {Science, 283:83-87 ) 
genes essential in the cell cycle can be identified using bio chips. Furthermore, the 

25 present invention provides nucleic acid sequence which can be used with bio chip 
technology to understand regulatory networks in bacteria, measure the response to 
environmental signals or drugs as in drug screening, and study virulence induction. 

-71- 



2709.1001-001 



(Mons et al 9 1998, Nature Biotechnology, 16: 45-48. Patents teaching this technology 
include U.S. Patents 5445934, 5744305, and 5800992. 

DRUG SCREENING ASSAYS USING B. FRAGILIS POLYPEPTIDES 

5 By making available purified and recombinant B. fragilis polypeptides, the 

present invention provides assays which can be used to screen for drugs which are either 
agonists or antagonists of the normal cellular function, in this case, of the subject B. 
fragilis polypeptides, or of their role in intracellular signaling. Such inhibitors or 
potentiators may be useful as new therapeutic agents to combat B. fragilis infections in 

10 humans. A variety of assay formats will suffice and, in light of the present inventions, 
will be comprehended by the person skilled in the art. 

In many drug screening programs which test libraries of compounds and natural 
extracts, high throughput assays are desirable in order to maximize the number of 
compounds surveyed in a given period of time. Assays which are performed in cell-free 

15 systems, such as may be derived with purified or semi-purified proteins, are often 

preferred as "primary" screens in that they can be generated to permit rapid development 
and relatively easy detection of an alteration in a molecular target which is mediated by a 
test compound. Moreover, the effects of cellular toxicity and/or bioavailability of the test 
compound can be generally ignored in the in vitro system, the assay instead being 

20 focused primarily on the effect of the drug on the molecular target as may be manifest in 
an alteration of binding affinity with other proteins or change in enzymatic properties of 
the molecular target. Accordingly, in an exemplary screening assay of the present 
invention, the compound of interest is contacted with an isolated and purified B, fragilis 
polypeptide. 

25 Screening assays can be constructed in vitro with a purified B. fragilis 

polypeptide or fragment thereof, such as an B. fragilis polypeptide having enzymatic 
activity, such that the activity of the polypeptide produces a detectable reaction product. 
The efficacy of the compound can be assessed by generating dose response curves from 
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data obtained using various concentrations of the test compound. Moreover, a control 
assay can also be performed to provide a baseline for comparison. Suitable products 
include those with distinctive absorption, fluorescence, or chemi-luminescence 
properties, for example, because detection may be easily automated. A variety of 
5 synthetic or naturally occurring compounds can be tested in the assay to identify those 
which inhibit or potentiate the activity of the B. fragilis polypeptide. Some of these 
active compounds may directly, or with chemical alterations to promote membrane 
permeability or solubility, also inhibit or potentiate the same activity (e.g., enzymatic 
activity) in whole, live B, fragilis cells. 

10 

OVEREXPRESSION ASSAYS 

Overexpression assays are based on the premise that overproduction of a protein 
would lead to a higher level of resistance to compounds that selectively interfere with the 
function of that protein. Overexpression assays may be used to identify compounds that 

15 interfere with the function of virtually any type of protein, including without limitation 
enzymes, receptors, DNA- or RNA-binding proteins, or any proteins that are directly or 
indirectly involved in regulating cell growth. 

Typically, two bacterial strains are constructed. One contains a single copy of the 
gene of interest, and a second contains several copies of the same gene. Identification of 

20 useful inhibitory compounds of this type of assay is based on a comparison of the activity 
of a test compound in inhibiting growth and/or viability of the two strains. The method 
involves constructing a nucleic acid vector that directs high level expression of a 
particular target nucleic acid. The vectors are then transformed into host cells in single 
or multiple copies to produce strains that express low to moderate and high levels of 

25 protein encoding by the target sequence (strain A and B, respectively). Nucleic acid 
comprising sequences encoding the target gene can, of course, be directly integrated into 
the host cell. 
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Large numbers of compounds (or crude substances which may contain active 
compounds) are screened for their effect on the growth of the two strains. Agents which 
interfere with an unrelated target equally inhibit the growth of both strains. Agents 
which interfere with the function of the target at high concentration should inhibit the 

5 growth of both strains. It should be possible, however, to titrate out the inhibitory effect 
of the compound in the overexpressing strain. That is, if the compound is affecting the 
particular target that is being tested, it should be possible to inhibit the growth of strain A 
at a concentration of the compound that allows strain B to grow. 

Alternatively, a bacterial strain is constructed that contains the gene of interest 

10 under the control of an inducible promoter. Identification of useful inhibitory agents 
using this type of assay is based on a comparison of the activity of a test compound in 
inhibiting growth and/or viability of this strain under both inducing and non-inducing 
conditions. The method involves constructing a nucleic acid vector that directs high- 
level expression of a particular target nucleic acid. The vector is then transformed into 

15 host cells that are grown under both non-inducing and inducing conditions (conditions A 
and B, respectively). 

Large numbers of compounds (or crude substances which may contain active 
compounds) are screened for their effect on growth under these two conditions. Agents 
that interfere with the function of the target should inhibit growth under both conditions. 

20 It should be possible, however, to titrate out the inhibitory effect of the compound in the 
overexpressing strain. That is, if the compound is affecting the particular target that is 
being tested, it should be possible to inhibit growth under condition A at a concentration 
that allows the strain to grow under condition B. 

25 LIGAND-BINDING ASSAYS 

Many of the targets according to the invention have functions that have not yet 
been identified. Ligand-binding assays are useful to identify inhibitor compounds that 
interfere with the function of a particular target, even when that function is unknown. 
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These assays are designed to detect binding of test compounds to particular targets. The 
detection may involve direct measurement of binding. Alternatively, indirect indications 
of binding may involve stabilization of protein structure or disruption of a biological 
function. Non-limiting examples of useful ligand-binding assays are detailed below. 

5 A useful method for the detection and isolation of binding proteins is the 

Biomolecular Interaction Assay (BIAcore) system developed by Pharmacia Biosensor 
and described in the manufacturer's protocol (LKB Pharmacia, Sweden). The BIAcore 
system uses an affinity purified anti-GST antibody to immobilize GST-fusion proteins 
onto a sensor chip. The sensor utilizes surface plasmon resonance which is an optical 

10 phenomenon that detects changes in refractive indices. In accordance with the practice of 
the invention, a protein of interest is coated onto a chip and test compounds are passed 
over the chip. Binding is detected by a change in the refractive index (surface plasmon 
resonance). 

A different type of ligand-binding assay involves scintillation proximity assays 

15 (SPA, described in U.S. Patent No. 4,568,649). 

Another type of ligand binding assay, also undergoing development, is based on 
the fact that proteins containing mitochondrial targeting signals are imported into isolated 
mitochondria in vitro (Hurt et aL, 1985, Embo 1 4:2061-2068; Eilers and Schatz, Nature, 
1986, 322:228-231). In a mitochondrial import assay, expression vectors are constructed 

20 in which nucleic acids encoding particular target proteins are inserted downstream of 
sequences encoding mitochondrial import signals. The chimeric proteins are synthesized 
and tested for their ability to be imported into isolated mitochondria in the absence and 
presence of test compounds. A test compound that binds to the target protein should 
inhibit its uptake into isolated mitochondria in vitro. 

25 Another ligand-binding assay is the yeast two-hybrid system (Fields and Song, 

1989, Nature 340:245-246). The yeast two-hybrid system takes advantage of the 
properties of the GAL4 protein of the yeast Saccharomyces cerevisiae. The GAL4 
protein is a transcriptional activator required for the expression of genes encoding 
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enzymes of galactose utilization. This protein consists of two separable and functionally 
essential domains: an N-terminal domain which binds to specific DNA sequences 
(UAS G ); and a C-terminal domain containing acidic regions, which is necessary to 
activate transcription. The native GAL4 protein, containing both domains, is a potent 

5 activator of transcription when yeast are grown on galactose media. The N-terminal 
domain binds to DNA in a sequence-specific manner but is unable to activate 
transcription. The C-terminal domain contains the activating regions but cannot activate 
transcription because it fails to be localized to UAS G . In the two-hybrid system, a system 
of two hybrid proteins containing parts of GAL4: (1) a GAL4 DNA-binding domain 

10 fused to a protein X' and (2) a GAL4 activation region fused to a protein 'Y 1 . If X and Y 
can form a protein-protein complex and reconstitute proximity of the GAL4 domains, 
transcription of a gene regulated by UAS G occurs. Creation of two hybrid proteins, each 
containing one of the interacting proteins X and Y, allows the activation region of UAS G 
to be brought to its normal site of action. 

15 The binding assay described in Fodor et aL, 1991, Science 251:767-773, which 

involves testing the binding affinity of test compounds for a plurality of defined polymers 
synthesized on a solid substrate, may also be useful. 

Compounds which bind to the polypeptides of the invention are potentially useful 
as antibacterial agents for use in therapeutic compositions. 

20 Pharmaceutical formulations suitable for antibacterial therapy comprise the 

antibacterial agent in conjunction with one or more biologically acceptable carriers. 
Suitable biologically acceptable carriers include, but are not limited to, phosphate- 
buffered saline, saline, deionized water, or the like. Preferred biologically acceptable 
carriers are physiologically or pharmaceutical^ acceptable carriers. 

25 The antibacterial compositions include an antibacterial effective amount of active 

agent. Antibacterial effective amounts are those quantities of the antibacterial agents of 
the present invention that afford prophylactic protection against bacterial infections or 
which result in amelioration or cure of an existing bacterial infection. This antibacterial 
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effective amount will depend upon the agent, the location and nature of the infection, and 
the particular host. The amount can be determined by experimentation known in the art, 
such as by establishing a matrix of dosages and frequencies and comparing a group of 
experimental units or subjects to each point in the matrix. 
5 The antibacterial active agents or compositions can be formed into dosage unit 

forms, such as for example, creams, ointments, lotions, powders, liquids, tablets, 
capsules, suppositories, sprays, aerosols or the like. If the antibacterial composition is 
formulated into a dosage unit form, the dosage unit form may contain an antibacterial 
effective amount of active agent. Alternatively, the dosage unit form may include less 
10 than such an amount if multiple dosage unit forms or multiple dosages are to be used to 
administer a total dosage of the active agent. Dosage unit forms can include, in addition, 
one or more excipient(s), diluent(s), disintegrant(s), lubricant(s), plasticizer(s), 
colorant(s), dosage vehicle(s), absorption enhancer(s), stabilizer(s), bactericide(s), or the 
like. 

15 For general information concerning formulations, see, e.g., Gilman et al. (eds.), 

1990, Goodman and Oilman's: The Pharmacological Basis of Therapeutics, 8th ed., 
Pergamon Press; and Remington's Pharmaceutical Sciences, 17th ed., 1990, Mack 
Publishing Co,, Easton, PA; Avis et al. (eds.), 1993, Pharmaceutical Dosage Forms: 
Parenteral Medications, Dekker, New York; Lieberman et al (eds.), 1990, 

20 Pharmaceutical Dosage Forms: Disperse Systems, Dekker, New York. 

The antibacterial agents and compositions of the present invention are useful for 
preventing or treating B. fragilis infections. Infection prevention methods incorporate a 
prophylactically effective amount of an antibacterial agent or composition. A 
prophylactically effective amount is an amount effective to prevent B, fragilis infection 

25 and will depend upon the specific bacterial strain, the agent, and the host. These 

amounts can be determined experimentally by methods known in the art and as described 
above. 
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B. fragilis infection treatment methods incorporate a therapeutically effective 
amount of an antibacterial agent or composition. A therapeutically effective amount is 
an amount sufficient to ameliorate or eliminate the infection. The prophylactically and/or 
therapeutically effective amounts can be administered in one administration or over 
5 repeated administrations. Therapeutic administration can be followed by prophylactic 
administration, once the initial bacterial infection has been resolved. 

The antibacterial agents and compositions can be administered topically or 
systemically. Topical application is typically achieved by administration of creams, 
ointments, lotions, or sprays as described above. Systemic administration includes both 
10 oral and parental routes. Parental routes include, without limitation, subcutaneous, 
intramuscular, intraperitoneal, intravenous, transdermal, inhalation and intranasal 
administration. 

EXEMPLIFICATION 

15 

CLONING AND SEQUENCING B. FRAGILIS GENOMIC SEQUENCE 

This invention provides nucleotide sequences of the genome of B. fragilis which 
thus comprises a DNA sequence library of B. fragilis genomic DNA. The detailed 
description that follows provides nucleotide sequences of B. fragilis , and also describes 
20 how the sequences were obtained and how ORFs (Open Reading Frames) and protein- 
coding sequences can be identified. Also described are methods of using the disclosed B. 
fragilis sequences in methods including diagnostic and therapeutic applications. 
Furthermore, the library can be used as a database for identification and comparison of 
medically important sequences in this and other strains of B. fragilis as well as other 
25 species of Bacteroides. 

Chromosomal DNA from strain 14062 of B. fragilis was isolated after Zymolyase 
digestion, sodium dodecyl sulfate lysis, potassium acetate precipitation, 
phenolxhloroform extraction and ethanol precipitation (Soil, D.R., T. Srikantha and S.R. 
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Lockhart: Characterizing Developmentally Regulated Genes in B. fragilis . In Microbial 
Genome Methods. K.W. Adolph, editor. CRC Press. New York, p 17-37.). Genomic B. 
fragilis DNA was hydrodynamically sheared in an HPLC and then separated on a 
standard 1% agarose gel. Fractions corresponding to 2500-3000 bp in length were 
5 excised from the gel and purifed by the GeneClean procedure (Biol 01, Inc.). 

The purified DNA fragments were then blunt-ended using T4 DNA polymerase. 
The healed DNA was then ligated to unique &fXI-linker adapters (5'- 
GTCTTCACCACGGGG-3 ' and 5 '-GTGGTGAAGAC-3 5 in 100-1000 fold molar 
excess). These linkers are complimentary to the itaXI-cut pGTC vector, while the 
10 overhang is not self-complimentary. Therefore, the linkers will not concatermerize nor 
will the cut- vector religate itself easily. The linker-adapted inserts were separated from 
the unincorporated linkers on a 1% agarose gel and purified using GeneClean. The 
linker-adapted inserts were then ligated to BstXl-ont vector to construct a "shotgun" 
sublclone libraries. 

15 Only major modifications to the protocols are highlighted. Briefly, the library 

was then transformed into DH5a competent cells (Gibco/BRL, DH5a transformation 
protocol). It was assessed by plating onto antibiotic plates containing ampicillin and 
IPTG/Xgal. The plates were incubated overnight at 37°C. Transformants were then used 
for plating of clones and picking for sequencing. The cultures were grown overnight at 

20 37°C. DNA was purified using a silica bead DNA preparation (Engelstein, 1996) 
method. In this manner, 25 jig of DNA was obtained per clone. 

These purified DNA samples were then sequenced using primarily ABI dye- 
terminator chemistry. All subsequent steps were based on sequencing by ABB 77 
automated DNA sequencing methods. The ABI dye terminator sequence reads were run 

25 on ABB 77 machines and the data was transferred to UNIX machines following lane 
tracking of the gels. Base calls and quality scores were determined using the program 
PHRED (Ewing et al, 1998, Genome Res. 8: 175-185; Ewing and Green, 1998, Genome 
Res. 8: 685-734). Reads were assembled using PHRAP (P. Green, Abstracts of DOE 
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Human Genome Program Contractor-Grantee Workshop V, Jan. 1996, p. 157) with 
default program parameters and quality scores. The initial assembly was done at 7.8 fold 
coverage and yielded 223 contigs. 

Finishing can follow the initial assembly. Missing mates (sequences from clones 

5 that only gave reads from one end of the Bacteroides DNA inserted in the plasmid) can 
be identified and sequenced with ABI technology to allow the identification of additional 
overlapping contigs. 

End-sequencing of randomly picked genomic lambda was also performed. 
Sequencing on a both sides was done for all lambda sequences. The lambda library 

10 backbone helped to verify the integrity of the assembly and allowed closure of some of 
the physical gaps. Primers for walking off the ends of contigs would be selected using 
pick_primer (a GTC program) near the ends of the clones to facilitate gap closure. These 
walks can be sequenced using the selected clones and primers. These data are then 
reassembled with PHRAP. Additional sequencing using PCR-generated templates and 

15 screened and/or unscreened lambda templates can be done in addition. 

To identify B. fragilis polypeptides the complete genomic sequence of B. fragilis 
were analyzed essentially as follows: First, all possible stop-to- stop open reading frames 
(ORFs) greater than 180 nucleotides in all six reading frames were translated into amino 
acid sequences. Second, the identified ORFs were analyzed for homology to known 

20 (archeabacter, prokaryotic and eukaryotic) protein sequences. Third, the coding potential 
of non-homologous sequences were evaluated with the program GENEMARKTM 
(Borodovsky and Mclninch, 1993, Comp. Chem. 17:123). 

IDENTIFICATION, CLONING AND EXPRESSION OF B. FRAGILIS NUCLEIC 
25 ACIDS 

Expression and purification of the B. fragilis polypeptides of the invention can be 
performed essentially as outlined below. 
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To facilitate the cloning, expression and purification of membrane and secreted 
proteins from B. fragilis , a gene expression system, such as the pET System (Novagen), 
for cloning and expression of recombinant proteins in E. coli, is selected. Also, a DNA 
sequence encoding a peptide tag, the His-Tag, is fused to the 3' end of DNA sequences 
5 of interest in order to facilitate purification of the recombinant protein products. The 3 ' 
end is selected for fusion in order to avoid alteration of any 5' terminal signal sequence. 

PCR AMPLIFICATION AND CLONING OF NUCLEIC ACIDS CONTAINING ORF'S 
ENCODING ENZYMES 

10 Nucleic acids chosen (for example, from the nucleic acids set forth in SEQ ID 

NO: 1 - SEQ ID NO: 5222 for cloning from the 14062 strain of B. fragilis are prepared 
for amplification cloning by polymerase chain reaction (PCR). Synthetic oligonucleotide 
primers specific for the 5 f and 3 7 ends of open reading frames (ORFs) are designed and 
purchased from GibcoBRL Life Technologies (Gaithersburg, MD, USA). All forward 

15 primers (specific for the 5 f end of the sequence) are designed to include an Ncol cloning 
site at the extreme 5 1 terminus. These primers are designed to permit initiation of 
protein translation at a methionine residue followed by a valine residue and the coding 
sequence for the remainder of the native B. fragilis DNA sequence. All reverse primers 
(specific for the 3 ; end of any B. fragilis ORF) include a EcoRI site at the extreme 5 1 

20 terminus to permit cloning of each B. fragilis sequence into the reading frame of the 
pET-28b. The pET-28b vector provides sequence encoding an additional 20 carboxy- 
terminal amino acids including six histidine residues (at the extreme C-terminus), which 
comprise the His-Tag. 

Genomic DNA prepared from the 14062 strain of B. fragilis is used as the source 

25 of template DNA for PCR amplification reactions (Current Protocols in Molecular 
Biology, John Wiley and Sons, Inc., F. Ausubel et ah, eds., 1994). To amplify a DNA 
sequence containing an B. fragilis ORF, genomic DNA (50 nanograms) is introduced 
into a reaction vial containing 2 mM MgCl2 ? 1 micromolar synthetic oligonucleotide 
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primers (forward and reverse primers) complementary to and flanking a defined B. 
fragilis ORF, 0.2 mM of each deoxynucleotide triphosphate; dATP, dGTP, dCTP, dTTP 
and 2.5 units of heat stable DNA polymerase (Amplitaq, Roche Molecular Systems, Inc., 
Branchburg, NJ, USA) in a final volume of 100 microliters. 

5 Upon completion of thermal cycling reactions, each sample of amplified DNA is 

washed and purified using the Qiaquick Spin PCR purification kit (Qiagen, Gaithersburg, 
MD, USA). All amplified DNA samples are subjected to digestion with the restriction 
endonucleases, e.g., Ncol and EcoRI (New England BioLabs, Beverly, MA, 
USA)(Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et 

10 al., eds., 1994). DNA samples are then subjected to electrophoresis on 1 .0 % NuSeive 
(FMC BioProducts, Rockland, ME USA) agarose gels. DNA is visualized by exposure 
to ethidium bromide and long wave uv irradiation. DNA contained in slices isolated 
from the agarose gel is purified using the Bio 101 GeneClean Kit protocol (Bio 101 
Vista, CA, USA). 

15 

CLONING OF B. FRAGILIS NUCLEIC ACIDS INTO AN EXPRESSION VECTOR 
The pET-28b vector is prepared for cloning by digestion with restriction 

endonucleases, e.g., Ncol and EcoRI (Current Protocols in Molecular Biology, John 

Wiley and Sons, Inc., F. Ausubel et al., eds., 1994). The pET-28a vector, which encodes 
20 a His-Tag that can be fused to the 5 1 end of an inserted gene, is prepared by digestion 

with appropriate restriction endonucleases. 

Following digestion, DNA inserts are cloned (Current Protocols in Molecular 

Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994) into the previously 

digested pET-28b expression vector. Products of the ligation reaction are then used to 
25 transform the BL21 strain of E. coli (Current Protocols in Molecular Biology, John Wiley 

and Sons, Inc., F. Ausubel et al, eds., 1994) as described below. 
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TRANSFORMATION OF COMPETENT BACTERIA WITH RECOMBINANT 
PLASMIDS 

Competent bacteria, E coli strain BL21 or E. coli strain BL21(DE3), are 
transformed with recombinant pET expression plasmids carrying the cloned B. fragilis 
5 sequences according to standard methods (Current Protocols in Molecular, John Wiley 
and Sons ? Inc., F. Ausubel et al, eds., 1994). Briefly, 1 microliter of ligation reaction is 
mixed with 50 microliters of electrocompetent cells and subjected to a high voltage 
pulse, after which, samples are incubated in 0.45 milliliters SOC medium (0.5% yeast 
extract, 2.0 % tryptone, 10 mM NaCl, 2.5 mM KC1, 10 mM MgC12, 10 mM MgS04 and 
10 20, mM glucose) at 37^C with shaking for 1 hour. Samples are then spread on LB agar 
plates containing 25 microgram/ml kanamycin sulfate for growth overnight. 
Transformed colonies of BL21 are then picked and analyzed to evaluate cloned inserts as 
described below. 

15 IDENTIFICATION OF RECOMBINANT EXPRESSION VECTORS WITH B. 

FRAGILIS NUCLEIC ACIDS 

Individual BL21 clones transformed with recombinant pET-28b B. fragilis ORFs 

are analyzed by PCR amplification of the cloned inserts using the same forward and 

reverse primers, specific for each B. fragilis sequence, that were used in the original PCR 
20 amplification cloning reactions. Successful amplification verifies the integration of the 

B. fragilis sequences in the expression vector (Current Protocols in Molecular Biology, 

John Wiley and Sons, Inc., F. Ausubel et al, eds., 1994). 

ISOLATION AND PREPARATION OF NUCLEIC ACIDS FROM 
25 TRANSFORMANTS 

Individual clones of recombinant pET-28b vectors carrying properly cloned B. 
fragilis ORFs are picked and incubated in 5 mis of LB broth plus 25 microgram/ml 
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kanamycin sulfate overnight. The following day plasmid DNA is isolated and purified 
using the Qiagen plasmid purification protocol (Qiagen Inc., Chatsworth, CA, USA). 

EXPRESSION OF RECOMBINANT B. FRAGILIS SEQUENCES IN E. COLI 

5 The pET vector can be propagated in any E. coli K-12 strain e.g. HMS174, 

HB101, JM109, DH5, etc. for the purpose of cloning or plasmid preparation. Hosts for 
expression include E. coli strains containing a chromosomal copy of the gene for T7 
RNA polymerase. These hosts are lysogens of bacteriophage DE3, a lambda derivative 
that carries the lad gene, the lacUVS promoter and the gene for T7 RNA polymerase. T7 

10 RNA polymerase is induced by addition of isopropyl-B-D-thiogalactoside (IPTG), and 
the T7 RNA polymerase transcribes any target plasmid, such as pET-28b, carrying its 
gene of interest. Strains used include: BL21(DE3) (Studier, F.W., Rosenberg, A.H., 
Dunn, J. J., and Dubendorff, J.W. (1990) Meth. Enzymol. 185, 60-89). 

To express recombinant B. fragilis sequences, 50 nanograms of plasmid DNA 

15 isolated as described above is used to transform competent BL21(DE3) bacteria as 
described above (provided by Novagen as part of the pET expression system kit). The 
lacZ gene (beta-galactosidase) is expressed in the pET-System as described for the B. 
fragilis recombinant constructions. Transformed cells are cultured in SOC medium for 1 
hour, and the culture is then plated on LB plates containing 25 micrograms/ml 

20 kanamycin sulfate. The following day, bacterial colonies are pooled and grown in LB 

medium containing kanamycin sulfate (25 micrograms/ml) to an optical density at 600 

nM of 0.5 to 1.0 O.D. units, at which point, 1 millimolar IPTG was added to the culture 

for 3 hours to induce gene expression of the B. fragilis recombinant DNA constructions . 

After induction of gene expression with IPTG, bacteria are pelleted by 

o 

25 centrifugation in a Sorvall RC-3B centrifuge at 3500 x g for 15 minutes at 4 C. Pellets 

are resuspended in 50 milliliters of cold 10 mM Tris-HCl, pH 8.0, 0.1 M NaCl and 0.1 

o 

mM EDTA (STE buffer). Cells are then centrifuged at 2000 x g for 20 min at 4 C. Wet 

o _ 
pellets are weighed and frozen at -80 C until ready for protein purification. 
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A variety of methodologies known in the art can be utilized to purify the isolated 
proteins. (Current Protocols in Protein Science, John Wiley and Sons, Inc., J. E. Coligan 
et al., eds., 1995). For example, the frozen cells may be thawed, resupended in buffer 
and ruptured by several passages through a small volume microfluidizer (Model M-l 10S, 

5 Microfluidics International Corporation, Newton, MA). The resultant homogenate may 
be centrifuged to yield a clear supernatant (crude extract) and following filtration the 
crude extract may be fractionated over columns. Fractions may be monitored by 
absorbance at OD28O nm - m ^ P eak f ract i° ns ma Y analyzed by SDS-PAGE 
The concentrations of purified protein preparations may be quantified 

10 spectrophotometrically using absorbance coefficients calculated from amino acid content 
(Perkins, S.J. 1986 Eur. J. Biochem. 157, 169-180). Protein concentrations are also 
measured by the method of Bradford, M.M. (1976) Anal. Biochem. 72, 248-254, and 
Lowry, O.H., Rosebrough, N., Farr, A.L. & Randall, R.J. (1951) J. Biol. Chem. 193, 
pages 265-275, using bovine serum albumin as a standard. 

1 5 SDS-polyacrylamide gels of various concentrations may be purchased from 

BioRad (Hercules, CA, USA), and stained with Coomassie blue. Molecular weight 
markers may include rabbit skeletal muscle myosin (200 kDa), E. coli (-galactosidase 
(1 16 kDa), rabbit muscle phosphorylase B (97.4 kDa), bovine serum albumin (66.2 kDa), 
ovalbumin (45 kDa), bovine carbonic anhydrase (31 kDa), soybean trypsin inhibitor 

20 (21.5 kDa), egg white lysozyme (14.4 kDa) and bovine aprotinin (6.5 kDa). 

EQUIVALENTS 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments and methods 
25 described herein. The specific embodiments described herein are offered by way of 
example only, and the invention is to limited only by the terms of the appended claims, 
along with the full scope of equivalents to which such claims are entitled. 
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TABLE 2 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length 



\222AS2B1..±2J2 



TTT 



I1.5e-i5 



Protein name 



Locus Name 



Acc# 



hypothetical protein jhpl21l 



C71832 



Description 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



10020167 cl BO 



Protein name 



Locus Name 



1.4e-65 



ACC# 



glutaminase A 



Description 



AB029552 



Aspergillus oryzae gtaA gene tor glutaminase A, complete cds. 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



Protein name 



750 



TIT 



i.8e-28 



Locus Name 



Acc# 



alpJia- 1 , 6 -mannanase 



Description 



gp:AB024331 



AB024331 



Bacillus circulans aman6 gene tor alpha- 1, 6 -mannanase, completecds . 



ORF Name 



NTID 



NT AA 

— , — ■ Score Probability 
AAID Length Length 



.j£JA4JS5ijS„.±i„5j5 



1446 



Protein name 



Locus Name 



Acc# 



Description 



86 



NT 



AA 



ORF Name 



NT ID 



12105450 c2 116 



AAID Length Length 



2367 



Score Probability 
|4.6e-243 



Protein name 



Locus Name 



immunoreactive 8 9KD antigen PG8 7 



gp:AF17b722 



Acc# 



AF175722 



Description 



Porphyromonas gmgivalis strain W50 immunoreactive 89KD antxgenPG87 gene, 
complete cds . 



ORF Name 



NT ID 



AAID 



NT AA 
— , — , Score 
Length Length 



14547327 cl 92 



3TT 



7513" 



TuT" 



Probability 
\2.1e-16 



Protein name 



Locus Name 



glutammase A 



gp:Afe02$552 



Acc# 



AB029552 



Description 



Aspergillus oryzae gtaA gene tor glutammase A, complete cds . 



NT 



AA 



ORF Name 



NTID 



iai2ii...ci...aa 



AAID Length Length 



1111 



Score Probability 
|2.6e-75 



Protein name 



Locus Name 



putative aldose 1-epimerase 



gp : SC4A7 



Acc# 



AL133423 



Description 



Streptomyces coelicolor cosmid 4A7 , 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score 



E73TT 



325§ 



7FF" 



Probability 
|l.$e-76 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



pir : JC6U27 



Acc# 



JC6027 



Description 



87 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
TJS 



Score Probability 

uTon 



Protein name 



Description 



Locus Name 



|gp:AP000y6^ 



Acc# 



AP000969 



Oryza sativa genomic dna, chromosome 1, clone : P0011D01 . 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



224576S6 c2 n2 



10 



WITT 



ITT 



TTTW 



Protein name 



Description 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



aa±am&...ca...i2A i [tt 



Length Length 
255 



Score Probability 



35 



Protein name 



Description 



Locus Name 



Acc# 



[NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



TT 



Length Length 
T5T 



Score Probability 
: 6.de-S2 



Protein name 



Locus Name 



hypothetical protein SCJ4.42C 



bir:T^7i2b 



Acc# 



T37125 



Description 



ORF Name 



Protein name 



NTID 



TT 



hypothetical protein 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



TTJT 



ITT 



Locus Name 



pir :S7604b 



5.5e-7i 



Acc# 



S76045 



88 



ORF Name 



NT ID 



NT AA 

— — Score Pro bability 
AAID Length Length 



124647811 c2 lUb 



6 .Oe-ll 



Protein name 



Locus Name 



unknown 



gp:U96771 



Acc# 



U96771 



Description 

frevoteiia bryantii putative polygalacturonase, B-l, 4- endoglucanase, and 
mannanase genes, complete cds; and unknowngenes . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



26507018 c2 119 



IT 



5237 



1284 



Protein name 



Locus Name 



sp:*XAfl_BAdiJU 



Acc# 



P42107 



Description 

HYPOTHETICAL 46.2 KB PkOTlilN IN A^NH-CT TR lNTE^NlO kEUloN 









NT 


AA 


ORF Name 


NTID 


AAID 


Length 


Length 


29A6±bJ:i...al..±l& 


16 


5238 


158 


477 













Protein name 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



endo-araJomase 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



XT" 



TUTT 



5.4e-34 



Locus Name 



gp:D85132 



Acc# 



D85132 



Description 

Bacillus subtilis DNA tor endo-arabmase , complete cas . 



89 



ORF Name 



4142127 C2 10y 



Protein name 



NTID 



ITS" 



NT 



AA 



AAID Length Length 
ITT 



Score Probability 



Locus Name 



Acc# 



Description 



NO -HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



|2.3e-17 



Locus Name 



alpha -1,6 -mannanase 



Acc# 



AB024331 



Description 



Bacillus circulans amane gene tor alpha- 1, 6 -mannanase, completecas . 



ORF Name 



Protein name 



NTID 



2TT 



S24^ 



NT 



AA 



AAID Length Length 

m — 



Score Probability 



240 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



TT 



AAID 



NT AA 

— — , Score Probability 
Length Length 



142" 



Locus Name 



Acc# 



Description 



[NO-HIT 



90 



ORF Name 



NTID 



12205438 cl 6 



Protein name 



hypothetical protein 



Description 



NT 



AA 



AAID Length Length 
— 



TZZT 



Score Probability 
TZT1 — 



Locus Name 



pir : JQ102U 



3.4e-167 



Acc# 



JQ1020 



ORF Name 



Protein name 



NTID 



ST 



NT AA 

— — Score Probability 
AAID Length Length 



S7¥T 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



FT 



Locus Name 



glucan 

1 , 4 -beta-glucosidase , : exo- 1 , 4 -beta-glucosidase 



Description 



pir : JC482b 



|2.3e-48 



Acc# 



JC4825 



ORF Name 



Protein name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



25 



5247 



gluthatione peroxidase 



T7TT 



TTTT 



FT3T 



7 ,4e-46 



Locus Name 



gp:LLAJiuy 



ACC# 



AJ000109 



Description 

Lactococcus lactis carB and gpo genes . 



91 



ORF Name 



Protein name 



NT ID 



NT 



AA 



AAID Length Length 
TH 



Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— — Score Pro bability 
Length Length 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protexn name 



Description 



NT 



AA 



NTID 



AAID 



£3" 



Length Length 



Score Probability 
Oe^ 



Locus Name 



sp:ARaJ?'_HUMAW 



Acc# 



P54793 



NT 



AA 



ORF Name 



NTID 



AAID 



5251 



Length Length 



Score Probability 



1.6e-105 



Protein name 



Description 



Locus Name 



sp:HEXA_±>Oktil 



Acc# 



P49008 



{BETA-NAHA^lil) 



92 



ORF Name 



NT ID 



NT AA 

— — Score Probability 
AAID Length Length 



24486016 t'2 2l 



1.4e-09 



Protein name 



Locus Name 



Acc# 



response regulator 



gpT^PlCTSTW 



AJ006398 



Description 

Streptococcus pneumoniae rroy and nK09 genes; two component systemuy. 





ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


2S584525_±l m lO 


51 


S253 


786 | 


2361 


2l2 


2.8e-14 



Protein name 



Locus Name 



putative secreted protein 



gp:£Cl?41 



Acc# 



AL117387 



Description 



Streptomyces coelicolor cosmid F41. 



ORF Name 



NT AA 

— — Score Prob ability 
NT ID AAID Length Length 



26.2.D.5.5.3.D....c3....y.B....... 



;5.0e-32 



Protein name 



Locus Name 



phospnonate monoester Hydrolase 



gp:BCU448!i>2 



Acc# 



U44852 



Description 

Burkholderia caryophylli PG2982 pnospnonate monoester nydroiase tpenAj gene, 
complete cds. 



ORF Name 



NT AA 

— — , Score Proba bility 
NT ID AAID Length Length 



|26.1S.17...±3....3.Q 



TBT" 



Protein name 



Locus Name 



Acc# 



Description 



iNO-tilT 



93 



ORF Name 



NTID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



2952812 c2 68 



IT 



^5" 



i.4e-34 



Protein name 

Description 
AftYLSuLPAtAfiEl fi PRfclCUR&OR, lASli} 



Locus Name 



sp:AaSE_HUMAU 



Acc# 



P51690 



NT 



AA 



ORF Name 



NTID 



AAID 



3020203 c2 67 



Length Length 



TT7T 



Score Probability 
|2.3e-« 



Protein name 



Description 



Locus Name 



sp:llfiXA_P0fe<3l 



Acc# 



P49008 



(BETA-NAWA^E) 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



1464 



i.0e-71 



Protein name 



Description 



Locus Name 



sp:M0DF_EC0Ll 



Acc# 



P31060 



pROTlSiti PtfRA) 



NT 



AA 



ORF Name 



NTID 



AAID 



4.7.2. 6.5.&&...G.i...b.fc.. 



Length Length 
T7TT 



TTTT 



Score Probability 
i.Se-125 



TUT 



Protein name 



Locus Name 



nypotnetical protein £>2uyv 



pir:H64S76 



Acc# 



H64976 



Description 



94 



NT 



AA 



ORF Name 



NT ID 



AAID 



5273452 cl 61 



Length Length 
7T7 



Score Probability 
3.4e-VS 



783 



Protein name 



Locus Name 



sp : PMGiJilcJOLl 



Acc# 



P31217 



Description 

(PGAM 1) (B£>G-DE££NDENT PGAM I) 



NT 



AA 



ORF Name 



NT ID 



AAID 



1056958-7 cl by 



Length Length 

rum — 



Score Probability 
|l.le-52 



WIT 



Protein name 



Locus Name 



melxJDiase 



gp : TEMKLA 



Acc# 



Y08557 



Description 



T . etnanolicus melA and lacA genes. 



ORF Name 



NTID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



TUT 



TTnr 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



15.5.3.15..7.6....aI...7A | FT 



Length Length 

Tim — 



Score Probability 
0.0016 



ITT 



Protein name 



Locus Name 



cytocnrome-c oxidase, chain III 



Ipir:^6yb4 



Acc# 



S36954 



Description 



95 



NT 



AA 



ORF Name 



NTID 



16532750 c2 104 



AAID Length Length 




Score Probability 
3.5e-08 



TTD" 



Protein name 



Locus Name 



tap:AC0054tf<> 



Acc# 



AC005489 



Description 



Genomic sequence tor Arabidopsis tnaliana BAC F14N23 tromCnromosome 1, 
complete sequence . 



NT 



AA 



ORF Name 



NTID 



1^28427 ti 5 



AAID Length Length 
W7T 



Score Probability 
1.3e-57 



593" 



Protein name 



Locus Name 



N utilization substance protein A 



pir :H722l3 



Acc# 



H72213 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
^ 



Score Probability 
i.ie-Si 



Protein name 



Locus Name 



sp:ABCX_CYAPA 



Acc# 



P48255 



Description 

PROBABLE ATP - DEPENDEN T TRAN SPORTER YCh l lb 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



195 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



96 



UKr iName 






AAID 


NT 




AA 
Length 




Score 


Probability 


205463b_r2_34 


46 




5268 


115 


348 




353 


3.4e-32 


Protein name 












Locus 


Name 


Acc# 


hypothetical protein b0866 


pir :B64825 


B64825 


Description 




















ORF Name 


NTID 




AAID 


NT 




AA 
Length 




Score 


Probability 


llkl2±QL(&„c±»£.l - 


47 




526^> 




1662 




742 


2.1e-73 


Protein name 












Locus 


Name 


Acc# 


probable secreted a 


Ipna-galactosicLase 






pir:T36472 


T36472 


Description 




















ORF Name 


NTID 




AAID 


NT 
Length 




AA 
Length 




Score 


Probability 


ll$±9£±l.±±..3. 


48 




5270 


44$ 


1350 




1532 


4.0e-l57 




















Protein name 












Locus 


Name 


Acc# 


L-rucose permease 


gp:AF137263 


AF137263 


Description 




BacteroicLes tnetaiotaomicron 


30S nbosomal protein al6 


-liJceprotein, rucose 




gene cluster, and RNA polymerase sigma 


factorSigZ-like protein 


(sigZ) genes, 




complete cds . 










































ORF Name 


NTID 




AAID 


NT 
Length 




AA 
Length 


Score 


Probability 


iiJimLcUi 


49 




5271 


573 


1722 




291 


5.3e-23 


Protein name 












Locus 


Name 


Acc# 


receptor antigen (RagAj 


gp:PGI130872 


AJ130872 



Description 



£>orphyromonas gmgivalis W5 0 receptor antigen (rag) locus encodinga major 
immunodominant 55kDa antigen. 



97 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


23679512_cl_6l 


50 


5272 




555 


1668 


432 


7 . 7e-42 



Protein name 



Locus Name 



Acc# 



115K outer membrane protein precursor : Susc 
protein 



Description 



pir : JC6027 



JC6027 



ORF Name 



NTID 



|24i3.D.6..7.7....ci...iiy.., 



Protein name 



NT AA „ „ , , . _ . . 
— — Score Prob ability 
AAID Length Length 



411 



Til 



i.Se-06 



Locus Name 



Acc# 



probable sigK protein 



Description 



pir :F7083O 



F70830 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



FlF" 



TF5T 



ITS" 



Protein name 



Locus Name 



unknown 



gp:U96771 



Acc# 



U96771 



Description 



Prevotelia bryantu putative polygalacturonase, B-i, 4- enaoglucanase, ana 
mannanase genes, complete cds; and unknowngenes . 



NT 



AA 



ORF Name 



NTID 



AAID 



2i8.D.Mb.:/....al...6.b. I 



FT 



Length Length 




Score Probability 



TT2 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



98 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



i'A 28 



F2~ 



] 



IT7T 



Score Probability 
0.00063 



Protein name 



Description 



Locus Name 



sp:TRHYJiAI41T 



Acc# 



P37709 



1* RlCHOHYAblN 



ORF Name 



NTID 



NT AA 
— — Score 
AAID Length Length 



126212805 ±2 29 



55 



W7T 



Probability 
2.2e-07 



Protein name 



Description 



Locus Name 



sp:YHBC_BlCJOLi 



Acc# 



P03843 



HYPO T HE TI CAL 16 . 9 KB PROTEIN IN NU^A -METV 1NTBR0BMI0 kialciloN 



ORF Name 



NTID 



NT AA 
— — Score 
AAID Length Length 



^7T 



Probability 
7.7e-i«6 



Protein name 



Description 



Locus Name 



sp:Y074_^VNVJ 



Acc# 



Q55790 



Hy^OTHE-riCAL 52.8 K D PROTHilN 5LR0074 



NT 



ORF Name 



NTID 



29ABA±0.b...±±...b± 



FT 



AAID Length Length 



AA 

— Score Probability 



3T" 



Protein name 



Description 



Locus Name 



Acc# 



INO-HIT 



99 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



30578i2ife ±1 4 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



Protein name 



FT* 



TTTT 



IT 



0.0077 



Locus Name 



Acc# 



pro.ba.ble serine proteinase 



Description 



pir:T365b2 



T36552 



ORF Name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



Protein name 



TT7T" 



Locus Name 



l.ie-40 



Acc# 



Description 



Q55792 



riVfrCrtHfiTieAt, 50.0 K D PROTEIN 5Lr0O'/6 



ORF Name 



NT AA 

— — Score Probability 
NTID AAID Length Length 



Protein name 



7ZT 



Locus Name 



Acc# 



Description 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



35442313 ti i 



2.4e-86 



Protein name 



Locus Name 



sp:FU(X)_Uc!OLl 



Acc# 



P11549 



Description 

lACTALDEiHYfiEl Kfl&UCTASfi, (PROPANE DIOL OXtDOREtitjeTAfiij) 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
2l5~ 



WW 



Score Probability 
II . 7e-103 



Protein name 



Locus Name 



L-±uculose-l -phosphate aldolase 



|gp:Atfl37263 



Acc# 



AF137263 



Description 

Bacteroides thetaiotaomicron 30S nsosomal protein si6-XiJceprotein, tucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds . 





ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


4D.0.15.15^cl^6.0. 


54 


5286 


388 1167 


135 


3.7e-06 



Protein name 



Locus Name 



transmembrane sensor 



gp:AF051691 



Acc# 



AF051691 



Description 

£>seudomonas aeruginosa stress tactor A (pstA) , ECF sigma tactor (riul) , 
transmembrane sensor (f iuR) , and hydroxamate- typef errisiderophore receptor 
(fiuA) genes, complete cds. 



ORF Name 



NTID 



NT AA „ ^ -i -i • -i ■ ■ 

— — Score Pr obability 
AAID Length Length 



419.6.aQl...t2...2.b... 



11467 



[T745~ 



|5.2e-l80 



Protein name 



Locus Name 



L-tuculose Kinase 



|gp:AFl37263 



Acc# 



AF137263 



Description 

Bacteroides thetaiotaomicron 3 us rir>osomai protein sib-iiKeprotem, tucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds. 



101 



ORF Name 



NTID 



NT AA 

— , — „ Score Probability 
AAID Length Length 



5157137 ±1 £ 



1.2e-171 



Protein name 



Locus Name 



Initiation lactor lF2-alpha 



lgp:ECAJ2£40 



Acc# 



AJ002540 



Description 



Escherichia coli (strain EcoAU93 07) miB gene encoamgtransiationai 
initiation factor IF2 . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



FT" 



4l£~ 



5.5e-ll2 



Protein name 



Locus Name 



mrS-liKe protein 



gp:MLCBJ2^ 



Acc# 



Z98741 



Description 



MycoJoacterium leprae cosmid B22 . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



FT" 



8.1e-22 



Protein name 



Locus Name 



L-tucose permease 



gp:AFi372<^ 



Acc# 



AF137263 



Description 



Bacteroides thetaiotaomicron 30S ribosomal protein si6-iikeprotein, tucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds . 



ORF Name 



NTID 



NT AA ^ _ , , . _ , . 
— — , Score Probability 
AAID Length Length 







|7.3e-62 



Protein name 



Locus Name 



sp : £l?LD_KCJOLl 



Acc# 



P32674 



Description 

F ORMATE AC E TYLTRANS FE RASE 2 , (PYRUVATE FORMATE - L YAS E 2) 



102 



ORF Name 



NTID 



ITT 



Protem name 



nypotneticai protein TM02 8 0 



Description 



NT 



AA 



AAID Length Length 

fitm — 



Score Probability 
2.7e-74 



555 



Locus Name 



pir:F72:J95 



Acc# 



F72395 



ORF Name 



15&&flai7„..cl..IUtt.. 



Protein name 



NTID 



TT 



NT AA 

— — , Score Probability 
AAID Length Length 



52ST 



25TT 



7ST 



Locus Name 



Acc# 



Description 



NO -HIT 



ORF Name 



2LD.7.3.46.3.7....aZ...8.U 



Protein name 



NT 



AA 



NTID 



AAID 



72" 



Length Length 



Score Probability 
3.5e-45 



STJ5 



Locus Name 



pro£>a£>le pyruvate tormate- lyase activating 
enzyme, pflC homolog 



Description 



|pir:A694:il 



Acc# 



A69431 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



iiaifiaii..±i...i5 


73 




52^5 


76 231 











Locus Name 



Acc# 



Description 



INO-HTT 



ORF Name 



Protein name 



NTID 



AAID 



IT 



probable competence protein ComF 



Description 



NT AA 

— — , Score Probability 
Length Length 



7W 



1AT 



2.5e-20 



Locus Name 



Ipir:i. , 7b402 



Acc# 



F75402 



103 



ORF Name 



234376^7 ti 1 



Protein name 



NT ID 



75" 



NT 



AA 



AAID Length Length 

fin — 



Score Probability 



Locus Name 



Acc# 



Description 



MO -HIT 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



2M5m:i:l.z1.j±.. t hg 



^7W 



WTUT 



1.7e-40 



Protein name 



Locus Name 



JoZlP nistidine Kinase 



gp:PPUY1824b 



Acc# 



Y18245 



Description 



Pseudomonas putida todx, tod ?, todcJl, todK, todB, toctA, todD,todE, toctG, 
todl, todH, todS, todT genes. 



ORF Name 



Protein name 



NT AA 

— — Score Pr obability 
NTID AAID Length Length 



iaam&i-.±i..A i 



7T 



5755 1 157 



Locus Name 



Acc# 



Description 



KO-HIT 



ORF Name 



Protein name 



NTID 



75" 



AAID 



NT 



Length Length 
TT4? 



AA 

— Score Probability 



Locus Name 



Acc# 



Description 



KO-HIT 



104 



ORF Name 



NTID 



NT AA ^ _ , , . -. . . 
— — , Score Probability 
AAID Length Length 



365S5%1> ti ib 



7T 



|2.7e-71 



Protein name 



Locus Name 



alpha galactosidase precursor 



gp:AF061331 



Acc# 



AF061331 



Description 



Saccharopolyspora erythraea alpna gaiactosidase precursor (melA)gene, 
complete cds. 



ORF Name 



NTID 



NT AA 

— — Score Pro bability 
AAID Length Length 



cl 61 



|4.7e-l2 



Protein name 



Locus Name 



|sp:YCNy_BAUsU 



Acc# 
P94425 



Description 

HYPO T H E TICAL 10.9 KB PROTEIN IN PHk(J -GDH INTURGENIC RKCjION 



NT 



AA 



ORF Name 



NTID 



FT" 



AAID Length Length 



TTZT 



Score Probability 
|6.3e-88 



Protein name 



Locus Name 



115K outer membrane protein precursor : Susc 
protein 



bir:JC6 027 



Acc# 



JC6027 



Description 



ORF Name 



Protein name 



NTID 



AAID 



ST 



15304 



putative aldose 1-epimerase 



Description 



NT AA 

— — Score P robability 
Length Length 



1145 



5.2e-84 



Locus Name 



|gp:Stf4A7 



Acc# 



AL133423 



Streptomyces coelicolor cosmia 4A7 , 



105 



ORF Name 



NTID 



AAID 



NT AA 

— — • Score P robability 
Length Length 



4103512 t'l 21 



3T" 



\T7T 



Protein name 



Description 



Locus Name 



Acc# 



sp:SUHBJi!C!OLl 



EXtraG£ni<!! StJPPkESSOk PROTElisf Suhb 



ORF Name 



44227£§ tl IS 



Protein name 



NTID 



84 



AAID 



— — Score Pr obability 
Length Length 



Locus Name 



Acc# 



Description 



iNO-HtT 



ORF Name 



NT 



AA 



NTID 



±$.l±Lb.0....a'L..&l 



S5" 



AAID Length Length 
1372 



Score Probability 
5.Se-7a 



Protein name 



Description 



Locus Name 



sp:XVLli^KCJOLl 



Acc# 



P09098 



D-XVLO^E-^kOToN riVMPOkTEk (b -XYLO^ii! TkANSPOkTUk) 



NT 



AA 



ORF Name 



NTID 



1ST 



AAID Length Length 
S5 - 



Score Probability 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



106 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



FT 



AAID Length Length 
£TT7 



Score Probability 



Locus Name 



Acc# 



[MO-HIT 



ORF Name 



Protein name 



Description 



NTID 



AAID 



NT AA 

— — Score Pro bability 
Length Length 



Locus Name 



Acc# 



(NO-HIT 



ORF Name 



mXlb.2...cJ....3.bl.. 



Protein name 



unknown 



Description 



NT 



AA 



NTID 



ST 



AAID Length Length 



Score Probability 
S.5e-18 



2T7 



Locus Name 



|gp:AP12bl"51" 



Acc# 



AF125164 



Bacteroides tragi lis 638R polysaccharide b IPS B2j raosyntnesisiocus , 
complete sequence; and unknown genes. 



NT 



AA 



ORF Name 



NTID AAID Length Length 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



[NO-HIT 



107 



ORF Name 



NT ID 



AAID 



NT AA 

— — Score Probability 
Length Length 



142968&S cl 248 



53TT 



2.5e-22 



Protein name 



Locus Name 



conserved Hypothetical protein AF0781 



pir:E6W47 



Acc# 



E69347 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 

rm — 



Score Probability 



F7IT 



TIF" 



i.0e-7* 



Protein name 



Description 



Locus Name 



bp:E<2UtJylbb 



Acc# 



U89166 



Eikenella corrodens lysine decarboxylase (ECOKLD) gene, completecas. 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



±4.6A&b.Ll.±2..±±L I [37 



E7T7" 



5.5e-89 



Protein name 



Locus Name 



single- strand DNA-specitic exonuclease 
homo log yrvE 



EarTESSSFTT 



Acc# 



H69980 



Description 



ORF Name 



15..7.3.5.8.8LZ...Cl...Za8... 



Protein name 



NTID 



NT AA 

— — Score Pro bability 
AAID Length Length 



S7T5" 



TZUT 



li.5e-6t 



Locus Name 



renm-bindmg protein- related protein : protein 
slrl975 :protein slrl975 



|pir:S7!>64y 



Acc# 



S75649 



Description 



108 



ORF Name 



16016075 &2 'AtiH 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



NT AA 

— — , Score Prob ability 
AAID Length Length 



ST 



TIT 



coenzyme F3 9 0 syntnetase utsA-3) Jaomolog 



Description 



TTJW 



2.3e-llb 



Locus Name 



pxr :DbybUI 



Acc# 



D69501 



ORF Name 



±6£1S.112.±±..JA 



Protein name 



NTID 



FIT? 



NT 



AA 



AAID Length Length 
STQ 



Score Probability 



TTT 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



I.4e-19 



Locus Name 



sprYHCGJ^oLl 



Acc# 



P45423 



Description 

HfrfrOTHaflCAL 45 .5 KD PROTEIN IN GLTF'-NANi' t Nl'ERGENI C REGION IDiVbJ 



109 



ORF Name 



NT ID 



NT AA 

— — Score Probability 
AAID Length Length 



Jib 



2.2e-94 



Protein name 



Locus Name 



putative epimerase/ dehydratase Wbil 



|gp:AF06407TT 



Acc# 



AF064070 



Description 

Burkhoideria pseudomallei putative dihydroorotase (pyrC) gene, partial cds; 
putative 1 -acyl-sn-glycerol- 3 -phosphateacyl transferase (plsC) , putative 
diadenosine tetraphosphatase (apaH) , complete cds; type II O-antigen 
biosynthesis gene cluster , complete sequence; putative undecaprenyl 
phosphateN-acetvlglucosaminvltransf erase , and putative 









NT 


AA 




Score 


Probability 


ORF Name 


NT ID 


AAID 


Length 


Length 






2&llSi>2 m c2...2St& 


100 


5322 


101 


306 




74 




Protein name 








Locus 


Name 


Acc# 



sp:NU3M_kAT 



P05506 



Description 

KfAPH-tJblQtJlrtONEi OXJD6fefiDtJCiTA JSEi gkAl^j 3, 



ORF Name 



NTID 



NT AA 

— — Score Probabi lity 
AAID Length Length 



[TOT" 



7TJ5" 



Protein name 



Locus Name 



Acc# 



Description 



tNO-HlT 





ORF Name 


NTID AAID 


NT 
Length 


AA 
Length 


Score 


Pr 


obability 


211&&2...G2...232 


, 102 5324 


461 


13S6 


305 




2.0e-27 



Protein name 



CapSK 



Description 



Locus Name 



gp:SAU73374 



Acc# 



U73374 



Staphylococcus aureus type 8 capsule genes, capBA, capBB, capbc, capsu, 
cap8E, cap8F, cap8G, cap8H, cap8I, cap8J, cap8K, cap8L,cap8M, cap8N, cap80, 
cap8P, complete cds. 



110 



ORF Name 



NT ID 



NT AA 

— — Score Probab ility 
AAID Length Length 



2117505 c3 317 



TUT 



3BF" 



2.7e-S6 



Protein name 



Locus Name 



Acc# 



otnA protein 



pir :S70ybB 



S70958 



Description 



ORF Name 



Protein name 



NT ID 



NT AA „ „ , , . , . . 
— — Score Probability 
AAID Length Length 



I.0e-120 



Locus Name 



Acc# 



sp:YDIJ_KC0Ll 



P77748 



Description 

HYPOTHETICAL 115.2 KB PROTEIN IN hl^-A ROD INTJkikgiWllj kUciluN 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



Protein name 



TUB" 



Locus Name 



Acc# 



Description 



ORF Name 



Protein name 



NTID 



TUT 



AAID 



NT AA 

— — Score Probabi lity 
Length Length 



TTET 



Locus Name 



Acc# 



Description 



INO-HIT 



111 



ORF Name 



Protein name 



NTID 



AAID 



TUT 



TTIT 



NT 



AA 



Length Length 

wn — 



Score Probability 



143 



Locus Name 



Acc# 



Description 



NO -HIT 



ORF Name 



Protein name 



NTID 



AAID 



TUT 



TUT 



NT AA 

— — Score P robability- 
Length Length 



&TT 



TTT 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



TUT 



NT 



AA 



Length Length 
TH3 



Score Probability 



TUT 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



23L&aaiSL7„.±1...27. 



Protein name 



NTID 



AAID 



TTT 



5332 



NT 



AA 



Length Length 
S3 - 



Score Probability 



T5T 



Locus Name 



Acc# 



Description 



MO-HfT 



ORF Name 



Protein name 



NTID 



AAID 



TTT 



T5JT 



NT AA 

— — Score Probab ility 
Length Length 



F5~ 



TTT 



Locus Name 



Acc# 



Description 



MO-HIT 



112 



NT 



AA 



ORF Name 



NT ID 



AAID 



2410^88 rl VI 



TIT 



Length Length 



Score Probability 
7.7e-83 



PIT 



Protein name 



Locus Name 



indolepyruvate oxidoreductase, aipna suJaumt I foir :G69114 



Description 



Acc# 



G69114 



ORF Name 



Protein name 



NT ID 



NT AA 

— — , Score Probability 
AAID Length Length 



ITT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



Description 



NT ID 



AAID 



NT AA 

— — Score Probability 
Length Length 



ITT" 



TUT 



i.0e-46 



Locus Name 



sp:XYLR_AMATH 



Acc# 



Q44406 



xylose: RgpREissoR 



NT 



AA 



ORF Name 



NT ID 



AAID 



rnimiEiziiii 



TIE" 



Length Length 



— Score Probability 



Protein name 



Locus Name 



Acc# 



Description 



|N0-HiT 



113 



NT 



AA 



ORF Name 



NT ID 



TT5" 



AAID Length Length 
553 



195 



Score Probability 
I.2e-27 



ITU 



Protein name 



Locus Name 



indolepyruvate terredoxin oxidoreductase, 
subunit beta (iorB) homolog 



Description 



|pir:E69bO:i 



Acc# 



E69503 



ORF Name 



NTID 



— — „ Score Prob ability 
AAID Length Length 



ITT 



4.5e-94 



Protein name 



Locus Name 



Wbpfe 



lgp:PAU50:i% 



Acc# 



U50396 



Description 



Pseudomonas aeruginosa Wzz tRoi; (wzz (roij ) gene, partial cds,WPpA (wppb) , 
WbpB (wbpB) , WbpC (wbpC) , WbpD (wbpD) , WbpE (wbpE) ,Wzy (Rfc) (wzy (rf c) ) , 
Wzx (wzx) , HisH (hisH) , HisF (hisF) , WbpG(wbpG), WbpH (wbpH) , Wbpl (wbpl) , 
WbpJ (wbpJ) , WbpK (wbpK) , WbpL(wbpL), WbpM (wbpM) and WbpN (wbpN) genes, 
complete cds, and UvrB(uvrB) crene, partial cds . 



ORF Name 



NTID 



AAID 



\2A/k2£$X±..cl...2l& J [TIF 



Protein name 



NT AA 

— — Score Proba bility 
Length Length 



89 



270 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



NT 



AA 



NTID 



L2.4.54.0.a.7S....cl...2.4S I PT? 



AAID Length Length 
I53TI — 



[775" 



Score Probability 
11314 



|1.3e-197 



Protein name 



Locus Name 



putative aminotransferase 



|gp:AFmi64 



Acc# 



AF125164 



Description 



Bacteroides tragilis 638R polysaccnande B (PS B2 ) Piosyntnesislocus , 
complete sequence; and unknown genes. 



114 



ORF Name 



NT ID 



ti 68 



TIT 



Protein name 



surtace antigen BspA 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



TIT 



TTTT 



Locus Name 



pir :T3ioy4 



|3.1e-25 



Acc# 



T31094 



ORF Name 



NT ID 



AAID 



NT AA 

— — Score Probabi lity 
Length Length 



TTT 



TIT 



T7T 



\2Ae-B6 



Protein name 

Description 
METHI0K1 YL- T RNA FOkMYLTkAN d Jj' JiikAd k! , 



Locus Name 



Acc# 



sp:FMT_BAC!yU 



ORF Name 



NTID 



NT AA ^ _ , , . , , . 

— — Score Proba bility 
AAID Length Length 



tit 



TFT 



TTT 



|6.9e-ll 



Protein name 



Locus Name 



unknown 



gp:AP04a74^ 



Acc# 



AF048749 



Description 



Bacteroides tragilis capsular polysaccharide biosynthesis operon, complete 
sequence . 



NT 



AA 



ORF Name 



NTID 



AAID 



2.5AZ9.8.X2....L3...XZ1... 



TIT 



Length Length 
TTIZ 



Score Probability 




Til 



Protein name 



Description 



Locus Name 



sp:V^7J_Mli! i rJA 



Acc# 



Q58383 



HYPOTHETICAL PROTEIN 



115 



ORF Name 



NTID 



NT AA 

— — Score Pro bability 
AAID Length Length 



|2558:ib77 c3 ibb 



1182 



T7T 



3.4e-10 



Protein name 



Locus Name 



CapSI 



jgp:SAUS1973 



ACC# 



U81973 



Description 



Staphylococcus aureus capsule gene cluster Cap5A tnrough CapbPgenes, 
complete cds . 



ORF Name 



NTID 



NT AA „ n , , . - . . 
— — Score Probab ility 
AAID Length Length 



I256675S2 C± 363 



Protein name 



chloride channel, probable, homo log 



Description 



STT 



TTTTe^T 



Locus Name 



pir :tf6$426 



Acc# 



F69426 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
HI - 



Score Probability 
2.8e-07 



ITS 



Protein name 



Locus Name 



tachylectin-3 



gp:AB017484 



Acc# 



AB017484 



Description 



Tachypleus trictentatus mRNA tor tacnyiectin-3 , complete cds. 



NT 



AA 



ORF Name 



NTID 



TIT 



AAID Length Length 
53B 



311 



Score Probability 
1.7e-23 



271 



Protein name 



Locus Name 



Acc# 



gp:ECMPL 



X03345 



Description 

E. coli npl gene tor N-acetylneurammate lyase summit (EC4.1.3 .3) . 



116 



ORF Name 



NT ID 



AAID 



NT AA 0 _ , , . . , . 
— — , Score Probability 
Length Length 



12660463b c3 ibb 



T7TT 



TTTT 



|1.3e-66 



Protein name 



Locus Name 



unknown 



[gp:AFi44^7T 



Acc# 



AF144879 



Description 



Leptospira interrogans rr£> locus, complete sequence. 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



25505313 cl 242 



490 



±rnr 



TTT 



0.0002$ 



Protein name 



Locus Name 



putative polysaccharide polymerase 



gp:£t>tf0$23$ 



Acc# 



U09239 



Description 



Streptococcus pneumoniae type 19F capsular polysaccnaricleJDiosyntnesis 
operon, ( cp s 1 9 f ABCDE FGHIJKLMNO ) genes, complete cds,and aliA gene, partial 
cds . 



ORF Name 



2a3.1.7.£&CL...cl...2Lai.. 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



T73~ 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



2S.SD.5.6.3.a...cl...2.17.. 



Protein name 



NTID 



AAID 



NT AA 

— — , Score Proba bility 
Length Length 



7TT 



2TT 



Locus Name 



Acc# 



Description 



BTO-HIT 



117 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 

m% — 



Score Probability 
UT2 



1.4e-05 



Protein name 



Locus Name 



DNA- binding protein HB 



pir :C75600 



ACC# 



C75600 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



3.i3.a;L..c:L...2asL 



TIT 



1251 



|6.5e-24 



Protein name 



Locus Name 



|sp:YY^0_lW^U 



Acc# 



P37489 



Description 

HYPOTHETICAL 4^.2 KB PROTEIN IK f COTE-T ETB INTEkGENIC REGION 



NT 



AA 



ORF Name 



NTID 



AAID 



l±&±&02h....c2Jl&0. I [13* 



Length Length 



Score Probability 
2.7e-48 



Protein name 



Description 



Locus Name 



sp : 3MG1_EC0LI 



Acc# 



P05100 



TT 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



7T 



rzrr 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



118 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



22064137 c2 294 



TUT 



Protein name 



Locus Name 



2.3e-07 



ACC# 



unknown 



|gp:AF048749 



Description 



AF048749 



Bacteroides tragilis capsular polysaccharide tuosyntnesis operon, complete 
sequence . 



ORF Name 



— — Score Probability 
NTID AAID Length Length 



32323512 rl 17 



Protein name 



TTT 



411 



1236 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



NTID 



NT AA 

— — „ Score Proba bility 
AAID Length Length 



Protein name 



TUT 



T7T 



1 . le-18 



Locus Name 



Acc# 



DNA repair protein RAD2 5 nomolog 



Description 



pir :Ffey2y4 



F69294 



ORF Name 



NTID 



AAID 



NT AA 
Length Length 



Score Probability 



12.6±l±ll..±2.„$b.., 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



119 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length ^ 



337 c3 353 



T¥TT 



Protein name 



acetyl transterase homolog 



Description 



|3.7e-5I 



Locus Name 



|pir:S70673 



Acc# 



S70673 



ORF Name 



3.40.114D.2:..±1...2D... 



Protein name 



NT ID 



AAID 



NT 



AA 



Length Length 
73" 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 
TTT 



Score Probability 



Locus Name 



Acc# 



Description 



[MO-HIT 



ORF Name 



3.ill^.7.7...±3....15.1.. 



Protein name 



NTID 



[TO" 



hypothetical protein 3 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



fTTE" 



5.7e-07 



Locus Name 



pxr :S28487 



ACC# 



S28487 



120 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



34167S67 &2 



i.ie-il>4 



Protein name 



Locus Name 



gp:AB02597U 



Acc# 



AB025970 



Description 



Plesiomonas shigelloides gene tor 0RF1P, £>RF2P, ORF3P, ORF4P,ORF5P, ORF6P, 
0RF7P , 0RF8P, 0RF9P, ORF10P, 0RF11P. 



ORF Name 



NT ID 



AAID 



NT AA 

— — Score Probab ility 
Length Length 



36361063 c2 3(30 



145 



ITTTT 



Protein name 



Locus Name 



WbpH 



gp:PAU50356 



Acc# 



U50396 



Description 



Pseudomonas aeruginosa wzz [ROD (wzz (roij ) gene, partial cas,WPpA (wppB; , 
WbpB (wbpB) , WbpC (wbpC) , WbpD (wbpD) , WbpE (wbpE),Wzy (Rfc) (wzy (rfc)), 
Wzx (wzx) , HisH (hisH) , HisF (hisF) , WbpG(wbpG), WbpH (wbpH) , Wbpl (wbpl) , 
WbpJ (wbpJ) , WbpK (wbpK) , WbpL(wbpL), WbpM (wbpM) and WbpN (wbpN) genes, 
complete cds, and UvrB(uvrB) gene , partial cds. . 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


331XQ25...jC^...^^JB 


|146 


536$ 


ISO 


543 


153 


5.4e-ll 

















Protein name 



Locus Name 



serine O-acetyltransterase, 



pir :E53402 



Acc# 



E53402 



Description 



ORF Name 



Protein name 



NTID 



TFT 



NT 



AA 



AAID Length Length 



Score Probability 
|9.5e-m 



TTW 



Locus Name 



gp:D64132 



Description 

Porphyromonas gmgivaiis PorR ana PorS genes, complete cds. 



Acc# 



D64132 



121 



ORF Name 



NT ID 



— — , Score Probability 
AAID Length Length 



I402217& c2 262 



ITS" 



J7T 



TTZT 



7.1e-07 



Protein name 



Locus Name 



Acc# 



Description 

riYfrOTHE'l'lCAL §1.2 Kft PROl ' iJlN IN APPA-CSPH iNT'ERGENIC REGION 



ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


4033l25_cl_250 


149 


5371 


226 


681 


546 


1.2e-52 



Protein name 



Locus Name 



ribulose-5-phospnate 3-epimerase nomolog yloR I |pir :B6$§79 



Acc# 



B69879 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



TUZT 



5>.0e-25 



Protein name 



Locus Name 



conserved hypothetical protein BBO'/uy 



pir :DVUIH8 



Acc# 



D70188 



Description 



ORF Name 



|4fta£a.7..7...±l..Aii.. 



Protein name 



Description 



NTID 



AAID 



1ST" 



PT7T 



NT AA 

— — Score Pr obability 
Length Length 



|1.2e-3I 



Locus Name 



|sp:MUC_BORBU 



Acc# 



051372 



PUTATIVE ENDONUCjLiilAS E 550411, 



122 



NT 



AA 



ORF Name 



NTID 



4103375 cl 241 



T5T 



AAID Length Length 



Score Probability 
4 . le-84 



Protein name 



Locus Name 



putative transterase 



gp:BBR007747 



Acc# 



AJ007747 



Description 



Bordetella oronchiseptica cosmid bjolpsi . 



NT 



AA 



ORF Name 



NTID 



14457512 13 184 



TFT 



AAID Length Length 




JIT 



Score Probability 
|2.Se-l6 



TIT 



Protein name 



Locus Name 



conserved Hypothetical protein mthb^ 



pir :F£32lO 



Acc# 



F69210 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



5T71T 



FITT 



9 . Oe-06 



Protein name 



Locus Name 



probable NADH-plastoqumone oxiaoreductase 
subunit 



pir:C71Uia 



Acc# 



C71018 



Description 



NT 



AA 



ORF Name 



NTID 



l4.7.maLti..±i...m.. 



T5T 



AAID Length Length 
2T75 — 



5777 



Score Probability 
6.3e-13 



Protein name 



probable purine NTPase PABUSI2 



Locus Name 
| |pir:F7S103 



ACC# 



F75103 



Description 



123 



ORF Name 



NT ID 



47S6250 ±2 ^ 



P3T 



Protein name 



AAID 



'5T71T 



hypothetical protein MTH6 58 



Description 



NT AA 

— , — , Score Probability 
Length Length 



T7T 



|4.3e-0^ 



Locus Name 



pir:E6S187 



Acc# 



E69187 



ORF Name 



Protein name 



NTID 



TFT 



AAID 



NT 



AA 



Length Length 

cm — 



Score Probability 



7T 



Locus Name 



Acc# 



Description 



NO -HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— — Score P robability 
Length Length 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID 



Length Length 



Score Probability 
3.0e-05 



Locus Name 



sp : CME3_BACtiU 



Acc# 



P39695 



Description 
COME OPElROtf PROTEIN 3 



124 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



5894001 ri 46 



TOT" 



3T 



1ST 



0.020 



Protein name 



Description 



Locus Name 



sp:UDS_STRPV 



Acc# 



Q07172 



(UDP-(3Lffl)ft) (UDPfllSH) 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 
— 



TUT 



Score Probability 
!8.3e-38 



Protein name 



Locus Name 



putative transferase 



gpTBBRWTTTT 



Acc# 



AJ007747 



Description 



Bordetella bronchi septica cosmid. BJdLPSI. 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



WIT 



Score Probability 
6.5e-38 



Protein name 



Locus Name 



transposase 



gp:AF038866 



Acc# 



AF038866 



Description 



Bacteroides tragilis transposon Tn5520 transposase (brpH) anamoJDilization 
protein BmpH (bmpH) genes, complete cds . 



NT 



AA 



ORF Name 



7.8.15.12.. ..£.2.. ..12. 3... 



NTID AAID Length Length 
575? 



Score Probability 



249 



Protein name 



Description 



Locus Name 



Acc# 



INO-HIT 



125 



NT 



AA 



ORF Name 



NT ID 



AAID 



462777 i± 73 



153" 



Length Length 



S7T 



Score Probability 




3.3e-43 



Protein name 

Description 
XMttftlNfi MOSt>fiORlBOStt-TRANS?ERASfi # 



Locus Name 



sp:XPT_BACSU 



Acc# 



P42085 



NT 



AA 



ORF Name 



NTID 



AAID 



cl 232 



Length Length 



2TI4" 



Score Probability 
TZ 



0.013 



Protein name 



Locus Name 



sp:HBB_PAtfPO 



Acc# 



P04244 



Description 
HEMOGLOBIN BETA CHAIN 



NT 



AA 



ORF Name 



NTID 



AAID 



Ift£3.Iftft5„.G2...I2£ I 



Length Length 
T2T" 



Score Probability 



TTT 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length ^~ 



TF7~ 



TIE" 



3.2e-07 



Protein name 



Locus Name 



actmorhodm polyJcetide dimerase-related 
protein 



pir :C72410 



Acc# 



C72410 



Description 



126 



NT 



AA 



ORF Name 



NT ID 



AAID 



10757837 ci iiS 



TSTT 



Length Length 



1248 



Score Probability 

— 



1.4e-37 



Protein name 



Description 



Locus Name 



sp:YRKO_BACSU 



Acc# 



P54442 



HYPOTHETICAL 46.4 tib PROTEIN Itf BLTR- SpOIIIC INl^kflEltflC REGION 1 



NT 



AA 



ORF Name 



NTID 



AAID 



11953533 £1 20 



157" 



Length Length 
T5 - 



Score Probability 
^ 



0.0020 



Protein name 



Description 



Locus Name 



sp:HXD3_BRARE 



ACC# 



042370 



H0ME0B0X PROTEIN H0X-D3 (FRAGMENT) 



ORF Name 



NTID 



AAID 



NT AA 
T — Ll T — « Score Probability 
Length Length =c 



[X77T 



7F" 



[2TT 



Protein name 

Description 
INO-HTT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



TTT 



AAID Length Length 

— 



Score Probability 
T5I 



1.4e-42 



Protein name 



Locus Name 



putative GTP-binding protein 



gp:ATAC004786 



Acc# 



AC004786 



Description 



Arabidopsis thaliana chromosome II BAC T20K9 genomic sequence, complete 
sequence . 



127 



NT 



AA 



ORF Name 



NTID 



14542187 il 61 



AAID Length Length 
— 



T75" 



52BT 



Score Probability 

m — 



6 .2e-49 



Protein name 

Description 
HYPOTHETICAL PROTEIN HI0318 



Locus Name 



|sp:Y3i8_HAHlN 



Acc# 



P43984 



NT 



AA 



ORF Name 



NTID 



AAID 



15125552 c3 164 



T7T 



Length Length 
HAWS 



Score Probability 

vzi — 



Protein name 



Description 



Locus Name 



Acc# 



|gp:t)$0S57 



E.coli genomic DNA, KoJaara clone #347 (44.2-44.5 mm.). 



ORF Name 



NTID 



NT AA 

_ _ — _ — _ Score Probability 
AAID Length Length ^ 



T7T" 



T7T 



6 . 9e-98 



Protein name 



Locus Name 



Isp : YODE_PSEAE 



Acc# 



Q01609 



Description 

HYPOTHETICAL 40.7 Kd PROTEliN IN OPDE 3'ftgQI0tf l0fe?2J 



NT 



AA 



ORF Name 



NTID 



2Laaia£3.si...ci..±a4 1 \tk 



AAID Length Length 

'zm — 



Score Probability 

— 



2.4e-47 



Protein name 



Locus Name 



recR protean 



pir :H75547 



Acc# 



H75547 



Description 



128 



NT 



AA 



ORF Name 



NTID 



AAID 



T7F" 



Length Length 



Score Probability 

mi — 



3.0e-88 



Protein name 

Description 
PUTATIVE AMINOTRANSFERASE B, 



Locus Name 



Sp : PATB_BACSU 



ACC# 



Q08432 



ORF Name 



NT AA 

vp"p tt\ t\. 7\ tr t — ^ -r — ^, Score Probability 
NTID AAID Length Length x - 



ttf 



Protein name 

Description 
InO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



TTF" 



AAID Length Length 
F4u"S 



Score Probability 



TUT 



TJT 



Protein name 

Description 
[NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



\^&b3A$±...a±..±5.6. I [T7^ 



Length Length 
TTT 



Score Probability 




l.6e-ll 



Protein name 

Description 
HYPOTHETICAL P20 PROTEIN 



Locus Name 



sp:VP20_BAOLI 



Acc# 



P05332 



129 



ORF Name 



c3 155 



Protein name 



NT ID 



NT 



AA 



AAID Length Length 
— 



Score Probability 



Locus Name 



Acc# 



Description 
NO-HIT 



ORF Name 



Protein name 



NT ID 



TOT" 



NT AA . . 
, ^ ,. — u — ^ Score Probability 
AAID Length Length • L 



"7TT 



TFT 



Locus Name 



Acc# 



Description 
MO-HIT 



ORF Name 



Protein name 



Description 



NTID 



NT AA 

_ _ _ — ^ T — , •■ — ^ Score Probability 
AAID Length Length L 



\219Ab.±02...a2.A&a I IXS2 



TBT" 



4.7e-i2 



Locus Name 



sp:RIBD__METJA 



Acc# 



Q58085 



PUTATIVE RIBOFLAV IN B I OSYNTHESIS ENZYM E 



ORF Name 



NTID 



NT AA 
-r — T — ,.i Score Probability 
AAID Length Length 2 - 



2&3.3A6.3..7....C1...122... 



£2T 



3.6e-16 



Protein name 



Locus Name 



cation ettlux system (czcB-liXe) 



pir:E70342 



Acc# 



E70342 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 




T7T" 



TFT 



Score Probability 




Protein name 



Locus Name 



oxicLoreductase, aldo/Keto reauctase tamily 



plrTWFUUT 



Description 



1.6e-18 



Acc# 



H72307 



130 



NT 



AA 



ORF Name 



265940S7 ci 109 



NT ID AAID Length Length 




TTT 



Score Probability 
F5B 



Protein name 



Locus Name 



oxictoreductase , aldo/keto reductase tamily 



pir:H72307 



Acc# 



H72307 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID 



3.2i.7..7.5.5....a2...112.. 



Length Length 



Score 



TTT 



Probability 
|1.3e-34 



Protein name 



Locus Name 



plant -metabolite dehydrogenase homolog yvgN 



pir :C70040 



Acc# 



C70040 



Description 



NT 



AA 



ORF Name 



ma2iaa...ci...iia i itft 



NT ID AAID Length Length 

15*05 — 



Score Probability 
FOT 



3.3e-5$ 



Protein name 



Locus Name 



oxidoreductase , aido/keto reductase tamily 



pir :H72307 



Acc# 



H72307 



Description 



NT 



AA 



ORF Name 



NT ID AAID Length Length 

5*Tu" — 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



3.5.3.i.3.aa,6....Gl...lli I 



AAID Length Length 
5¥TI 



Score Probability 
751 



2.0e-75 



Protein name 



Locus Name 



oxidoreductase, aldo/keto reductase tamily 



pir :A72308 



Acc# 



A72308 



Description 



131 



ORF Name 



NTID 



NT AA 
— — Scopes 
AAID Length Length 



3948575 c2 123 



TSU" 



fZTT 



Probability 
|l.ie-56 



Protein name 



Description 



Locus Name 



sp: YF08_METJA 



Acc# 



Q58903 



HWOttiEflCAL ABC TRANSPORT ATP -BINDING PROtEltf MJ1508 



NT 



AA 



ORF Name 



NTID 



1406417$ tl 23 



AAID Length Length 




Score Probability 

m — 



8.2e-l3 



Protein name 



Locus Name 



aspartate ammotransterase 



gpTTvFuTSTST 



Acc# 



AF035157 



Description 



Lactococcus lactxs aspartate ammotransterase (aspC) gene, completeccts . 



NT 



AA 



ORF Name 



NTID 



AllbAlS..±l..M. 



AAID Length Length 
5^T3 



1497 



Score Probability 
T7T3 



5.4e-34 



Protein name 



Locus Name 



nypothetical protein 



pir:S75887 



Acc# 



S75887 



Description 



ORF Name 



NTID 



AAID 



&4.iai3.5....t2....5.6... 



5415 



Protein name 

Description 
INO-HIT 



NT 



AA 



Length Length 
4^ 



Score Probability 



141 



Locus Name 



Acc# 



132 



ORF Name 



14486261 cl 121 



Protein name 

Description 
NO-HIT 



NT 



AA 



NTID 



AAID 



T5T 



Length Length 



Score Probability 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



AAID 



T5T 



yqge hypothetical protein 



Description 



NT 



AA 



Length Length 



Score Probability 
5.ie-22 



Locus Name 



tpir:K72114 



Acc# 



H72114 



ORF Name 



4.7.L5..7.1£5...cl...l0.0... 



Protein name 

Description 
[NO-HIT 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



7T 



Locus Name 



Acc# 



ORF Name 



6.120.±2..±t...22.. 



Protein name 

Description 
[NO-HIT 



NT 



AA 



NTID 



AAID 



Length Length 
TT2 



Score Probability 



Locus Name 



Acc# 



ORF Name 



Protein name 

Description 
[NO-HIT 



NT 



AA 



NTID 



AAID 



Length Length 
ITTT 



Score Probability 



WIT 



Locus Name 



Acc# 



133 



ORF Name 



10631882 c2 238 



Protein name 







NT 


AA 


NT ID 


AAID 


Length 


Length 


195 


5421 


61 


186 



Locus Name 



Acc# 



Description 



ORF Name 



Protein name 



NT ID 



2uTT 



AAID 



NT AA 

— — , Score Probability 
Length Length 



TTT 



3.9e-08 



Locus Name 



Acc# 



nypotnetical protein yngA 



Description 



pir :F69892 



F69892 



ORF Name 



NT ID 



TUT 



Protein name 



AAID 



NT AA 

— — Score Probability 
Length Length 



1058 



3177 



TTTTT 



6.3e-246 



Locus Name 



Acc# 



hypothetical protein mexF 



Description 



pir :T3U83U 



T30830 



ORF Name 



Protein name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



1.6e-06 



Locus Name 



Acc# 



ct46 9 nypothetical protein 



Description 



pir :D72060 



D72060 



ORF Name 
126.9.0.8.17....C2L...2.2.5... 



NTID 



flUT 



Protein name 



AAID 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



134 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



T5T 



5.0e-24 



Protein name 



Locus Name 



conserved Hypothetical protein 



pirTF7^F- 



Acc# 



F72386 



Description 



ORF Name 



NT ID 



AAID 



Protein name 



hypothetical protein aq_3 80 



Description 



NT AA 

— , — 1 Score Probability 
Length Length 



TUT 



TIT- 



LOCUS Name 



pir:A7<m4 



3.0e-0S 



Acc# 



A70334 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



liiflL7.a42Li.±i...2i I 



NT ID AAID Length Length 

— 



134 



405 



Score Probability 
8 . 9e-34 



Protein name 



Locus Name 



sp:YYAH_6ACsU 



Acc# 



P37516 



Description 

HYPOTHE T ICAL 14.4 KB PROTEIN I N TETO-EXOA INT E R GENIC REGION (Ofei? 1 *') 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



2uT 



7TT" 



2.2e-i2 



Protein name 



Locus Name 



hypothetical protein MTH93y 



|pir:G6<mb 



Acc# 



G69225 



Description 



135 



NT 



AA 



ORF Name 



NT ID 



TUT 



AAID Length Length 
213 



Score Probability 

inm 



Protein name 



Locus Name 



mannanase 



gp:U95771 



Acc# 



U96771 



Description 



Prevotella bryantii putative polygalacturonase, B-i, 4-enaogiucanase, ana 
mannanase genes, complete cds; and unknowngenes . 



NT 



AA 



ORF Name 



NT ID 



114705251 tl 15 



TUT 



AAID Length Length 
15133 — 



5UT 



Score Probability 
|TT5 



10.000^2 



Protein name 



Locus Name 



Acc# 



unknown protein 



lgp:BA(Jc!OMCjA 



Description 



Bacillus subtilis (clone p ED4) comG- (1 , 2 , 3 , 4 , 5 , b , ana 7) proteins mcomG 
operon, complete cds. 



ORF Name 



NTID 



NT AA 

— — , Score Pro bability 
AAID Length Length 



Err 



[2F5~ 



[7^8" 



|6.2e-^4 



Protein name 



Locus Name 



conserved nypotnetical protein yjKA 



pir :E6ybbi 



ACC# 
E69851 



Description 



ORF Name 



Protein name 



DNA ligase 



Description 



NTID 



NT AA 

— — Score Proba bility 
AAID Length Length 



TTT 



TUUF 



l.le-147 



Locus Name 



bp:BST0Iib7b 



Acc# 



AJ011676 



Bacillus stearotnermophilus Iig gene. 



136 



ORF Name 



NT ID 



NT AA 

— — Score Probability 
AAID Length Length 



I2.4e-3i 



Protein name 



Locus Name 



conserved nypotnetical protein AF22U1 



bir:A69Si>b 



ACC# 



A69525 



Description 



ORF Name 



NT ID 



AAID 



±6.10A6&2.±l...b±. 



TUT 



Protein name 



nypotnetical protein AF186 7 



Description 



NT AA n _ , , . . . , 
— — , Score Prob ability 
Length Length 



T3T3" 



or 



Locus Name 



pir :B6y4b.5 



3.1e-06 



Acc# 



B69483 



ORF Name 



Protein name 



Description 



NTID 



AAID 



NT AA ^ _ , , . _ . . 

— — Score P robability 
Length Length 



54TT 



T5T 



|1.2e-?5 



Locus Name 



sp : pyr£>_aOOAe 



Acc# 



066461 



(DHODEHAShl) 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID 



Ii&aa2aa5t...ca...aa7. i 



5433 



Length Length 



Score Probability 
li.Se-177 



1724 



Locus Name 



nypotnetical protein 



pir : JQ102U 



Acc# 
JQ1020 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



■iaa.7.5L2L...ti...2La.. 



2TT 



3.5e-34 



Protein name 



Locus Name 



Acc# 



conserved nypotnetical protein AF187 8 



pir :K6y4«4 



E69484 



Description 



137 



ORF Name 



188905 ci 17y 



Protein name 



NTID 



NT 



AA 



AAID Length Length 




Score Probability 



IT 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



Description 



NTID 



NT AA 

— — Score Probabi lity 
AAID Length Length 



[2TT 



5441 



11058 



1223" 



;.2e-20 



Locus Name 



|sp:YQEN_iiAcJ^U 



Acc# 
P54459 



HVEOTHSTICaL 40.5 KD £>R0TSl lSf iti COMJjC-RPST iN'rgR^IsflC RKCjION 



ORF Name 



NTID 



NT AA 

— — Score Proba bility 
AAID Length Length 



2:0.5Ll5.D.a...ci....2b.b... 



TTT 



5¥4T 



TTT 



T75- 



Protein name 



Locus Name 



conserved nypotnetical protein aq__!386 



pir :F70420 



Description 



Acc# 



F70420 



ORF Name 



a0..7.il.7.0.1..±2..-B.y.., 



Protein name 



NT 



AA 



NTID 



AAID 



TIT 



— — Score P robability 
Length Length 

TIZ 



72 



Locus Name 



Acc# 



Description 



138 



ORF Name 



Protein name 



NTID 



NT 



AA 



AAID Length Length 
EJTTO 



Score Probability 



59" 



Locus Name 



Acc# 



Description 



zi 



MO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



— — Score Probability 
Length Length 



FT" 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



conserved nypotnetical protein 



Description 



NT 



AA 



Length Length 



Score Probability 
i.3e-U 



Locus Name 



Acc# 



E72209 



ORF Name 



\215J£.2..±!...22.. 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



Length Length 



— Score Probability 



\T7T 



Locus Name 



Acc# 



[MO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



£F5~ 



Locus Name 



Acc# 



Description 



MO -HIT 



139 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 




227 




248 


747 




3.ie-ib 


Protein name 








Locus Name 


Acc# 










gp:AE>U722^ 


U72238 


Description 
















Anabaena PCC712U 


OREfcl, 0RFR2 




and ORFRo genes, 


complete 




sequences . 
















ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


23£3£Sl2_±l_9 


22S 


5456 


7§4 | 


2355 


14b 


2.0e-04 



Protein name 



Locus Name 



conserved hypotneticai protein AFiuiv 



bir:A69377 



Acc# 



A69377 



Description 







NT 


AA 


Score 


Probability 


ORF Name 


NTID AAID 


Length 


Length 








21B.8Ab.b±..±±..A±. 


225 5451 


131 


395 


136 


4.ie-0S 





Protein name 



Locus Name 



6 3 KDa protein 



|gp:MBU7Jbbi 



Acc# 



U73653 



Description 



Mycobacterium bovis 62 kba pro tein, 47 kDa protein ana cip* gene,compie 
cds . 



NT 



AA 



ORF Name 



NTID AAID Length Length 

— 



Score Probability 



TST" 



Protein name 



Locus Name 



Acc# 



Description 



MO-HIT 



140 



ORF Name 



124316061 11 6 



Protein name 
Description 

etcfett 



NTID 



NT AA 

^ „^ _ — . _ — , Score Probability 
AAID Length Length JL 



5453 



T3X 



Locus Name 



Acc# 



ORF Name 



24a3.43.S.3....Cl...l5.1.. 



Protein name 

Description 
MO-HIT 



NT 



AA 



NTID 



AAID Length Length 




Score Probability 



Locus Name 



Acc# 



ORF Name 



Protein name 

Description 
NO-HIT 



NT 



AA 



NTID 



AAID 



Length Length 



Score Probability 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 

— , — „ Score Probability 
Length Length dL - 



l2L44a5a2fi..±i...it I izst 



Protein name 

Description 
ttTO-HlT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



|2MM0.1.7....al...lf^. I |23F 



AAID Length Length 
B¥S7 



Score Probability 

mi — 



Protein name 



Locus Name 



hypothetical protein ;jhp0694 



E 



ir:F7l£>0l 



Acc# 



F71901 



Description 



141 



ORF Name 


vn-pT'n 
JN 1 ±U 




NT 


AA 
Length 


Score 


Probability 


24500032_£3_li>i 


236 


5458 






1501 


7.7e-lb4 | 


Protein name 








Locus 


Name 












sp : SYD_ 


BACSU 


032038 


Description 














{ASPRS) 1 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


246*4276 , 0_c3_3ll 


237 


5453 


415 


1245 


1550 


1.2e-205 



Protein name 



Locus Name 



L-tucose isomerase 



p:Afl37i6i 



Acc# 



AF137263 



Description 



Bacteroides thetaiotaomicron 30^ nbosomal protein sib -iiKeprotem, rucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds . 



ORF Name 



NTID 



— — Score Probability 
AAID Length Length 



\2±126h.b.U..±2...L± 



Protein name 



Locus Name 



Acc# 



Description 



[NO- HIT 



ORF Name 



Protein name 



NTID 



AAID 



ATP synthase fu, sununit d* 



Description 



NT 



AA 



Length Length 




1ST" 



Score Probability 

on 



Locus Name 



pir :A64bb2 



Acc# 



A64662 



142 



ORF Name 



24S$7S>43 11 12 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



ORF Name 



NT ID 



NT AA ^ , "l ■ t * j- 
T — ^, _ — _ Score Probability 
AAID Length Length JL 



253llfia2..±1...2I ...J EIT 



3TB" 



2TT 



8.9e-18 



Protein name 



Description 



Locus Name 



gp:AB024563 



Acc# 



AB024563 



Bacillus haloclurans gene tor YfrlL, YFlM, YFlN, YHDE, HM£> and ARGS, complete 
cds . 



NT 



AA 



ORF Name 



NTID 



£&l.m5,.±l..A£). I |2*J 



AAID Length Length 

szzz — 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



a£i.7.£S4H...ci...ia4 i [^4i 



Length Length 



Score Probability 
TUl 



0.0015 



Protein name 



Locus Name 



sensory transduction system regulatory 
protein slrl837 :protein slrl837 tprotein 
Slrl837 



pir:S77341 



Acc# 



S77341 



Description 



143 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 


Score 


Probability 


2S57837S_±3_1U0 


244 


5466 


67 


204 






Protein name 








Locus 


Name 


Acc# 


Description 














NO -HIT 1 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


2L.7.41Ml..±A...lil 


245 


5467 


608 


[1827 


lii 


0.005^ 



Protein name 



Description 



Locus Name 



Acc# 



Q37143 



pREfrROTEUSf frkANSLOOA SE HHiDC SUBLIMIT 



ORF Name 


NTID AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


1&$±$.:LL±±..al 


246 5468 


Z$1 


854 


296 


| 3.8e-26 



Protein name 



Locus Name 



Xylk 



lgp:£3Ulb98b 



Acc# 



U15985 



Description 



-i,4-xylanase ixynA) gene, complete 



Bacillus stearotnermopnuus encto-fceta 
cds. 



ORF Name 



NTID 



— — Score Probability 

AAID Length Length 

ll.ie-40 



fZTT 



Protein name 



Locus Name 



sp:PYkki_i4At^U 



Acc# 



P25983 



Description 
Dl^YDROOROTATii! DhiH ¥ D&0(jKN AShl ELECTRON 



144 



ORF Name 



NTID 



— — Score Probability 



AAID Length Length 



5T7TT 



TUT 



Protein name 



Locus Name 



hypothetical protein Rv28l6c 



pir :C706<J1 



Acc# 



C70691 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



ST7T" 



7¥T 



6 . 8e-b9 



Protein name 



Description 



Locus Name 



sp :TRMD_BACSU 



ACC# 



031741 



METHVL'l'kAMy t'tlkAtltl J 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 

— 



TTT 



Score Probability 
7.0e-itt 



Protein name 



Locus Name 



hypothetical protein cjTjy« 



|pir:A7Ibl9 



Acc# 



A71519 



Description 



ORF Name 



NTID 



NT — Score Probability 



AAID Length Length 



251 


5475 


452 


1355 599 



12 . 9e-b8 



Protein name 



Locus Name 



conserved nypothetical protein yqtu 



pir :Ae>yyb4 



Acc# 



A69954 



Description 



ORF Name 



Protein name 



NTID 



AAID 



T5T 



ST74~ 



— — Score Probability 
Length Length 

£3 



192 



Locus Name 



Acc# 



Description 



[NO-HIT 



145 



ORP Name 



NTID 



135432303 C2 228 



Protein name 



NT AA 

— , — , Score Probability 
AAID Length Length ^ 



hypothetical protein HP0049 



Description 



1355" 



77F" 



3.2e-77 



Locus Name 



|pir:A64526 



Acc# 



A64526 



ORF Name 



NTID 



NT AA , , . , . 
T — _ — ^, Score Probability 
AAID Length Length iL 



|3.5&3.2a6.2...t3....1Q.a I 



2448 



TTT 



2 . 9e~09 



Protein name 



Locus Name 



sp : YBJ2_EC0LI 



Acc# 



P75831 



Description 

HYPOTHETICAL ABC TRANSPORTER ATP-BINDING PROTEIN YBJZ 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length sL ~ 



Z&'S.&2ttl...G2.J23:Z I 



5477 



1.2e-79 



Protein name 



Description 



Locus Name 



sp:BI0P_BACSH 



Acc# 



P22806 



LIGASE) 



NT 



AA 



ORF Name 



NTID 



AAID 



5478 



Length Length 



Score Probability 



Protein name 



Locus Name 



Acc# 



Description 
NO-HIT 



146 



NT 



AA 



ORF Name 



NTID 



AAID 



3923465 £3 135 



[25T 



Length Length 



Score Probability 
5TD 



7.9e~49 



Protein name 



Locus Name 



amp nucleosidase 



pir :A72021 



Acc# 



A72021 



Description 









NT 


AA 


ORF Name 


NTID 


AAID 


Length 


Length 


^.aZB.3.3.^...Q^...^.&.Z 


25S 


5480 


464 


1355 





Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA 

n7 ,^. n T — T — _ Score Probability 
AAID Length Length J - 



2.5.1115£..±1.JA I 125? 



I.ie-49 



Protein name 



Locus Name 



OprM 



|gp:AB0il3§l 



Acc# 



AB011381 



Description 

Pseudomonas aeruginosa gene tor OprM, complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



3.9A213.7....r.2...5A 



Length Length 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



147 



NT 



AA 



ORF Name 



NT ID 



AAID 



3545552 ci 163 



FIST 



Length Length 
F3~ 



Score Probability 



T"JT 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length JL 



3.aa5t6AD....az...Z5.a i 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length ,£ - 



4i0.3.3.S.3....c.2L...M.I I 



T7""T 



TFT* 



3.7e-i2 



Protein name 



Locus Name 



Acc# 



sp : YBDF_ECOLI 



Description 

HYPOTHETICAL 14 . 1 KD PROTEIN IN NENB-ENTD INTERGENIC REGION 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



r744- 



l.3e-l4 



Protein name 

Description 
HYPOTHETICAL PROTEIN MJ057S 



Locus Name 



sp:Y97&_METJA 



Acc# 
Q58388 



148 



ORF Name 



NT ID 



Protein name 



NT aa 

^ _ _ — _ — ^. Score Probability 
AAID Length Length JL 



hypothetical protein mexE 



Description 



1.0e-41 



Locus Name 



pir:T30829 



Acc# 



T30829 



NT 



AA 



ORF Name 



NT ID AAID Length Length 

— 



TITT 



Score Probability 




7.0e-10 



Protein name 



Description 



Locus Name 



gp:YP±02KB 



Acc# 



AL031866 



Yersinia pestis 102 Kinases unstable region: trom 1 to 119443 . 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 
— 



Score Probability 
fZZI 



2.5e-20 



Protein name 
Description 

CHLORAMPHENICOL ACUT YLTRAKS FERAS E III, 



Locus Name 



sp : CAT3_ECOLI 



ACC# 



P00484 



NT 



AA 



ORF Name 



NTID 



AAID 



|&£3.l0.3.2...c2....23.3 1 



5490 



Length Length 




Score Probability 



T43 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



149 



ORF Name 



NT ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length * L - 



4551061 c3 254 



0.020 



Protein name 



Locus Name 



NADH dehydrogenase subunit 4L 



gp:BMMITOCH01 



Acc# 



AF110610 



Description 



Boophilus mlcroplus NADH dehydrogenase subunit 4 (ND4 J gene, partial cds; 
NADH dehydrogenase subunit 4L (ND4L) gene, completecds; tRNA-Thr and 
tRNA-Pro genes, complete sequence; and NADHdehydrogenase subunit 6 (ND6) 
gene, partial cds, mitochondrialgenes for mitochondrial products. 



ORF Name 



4S570S7 ±1 15 



Protein name 



NTID 



TVT 



NT 



AA 



AAID Length Length 

— 



Score Probability 



Locus Name 



Acc# 



Description 
NO -HIT 



ORF Name 



NTID 



NT AA 
T — T — Score Probability 
AAID Length Length 



T7T 



74F 



$.6e-06 



Protein name 

Description 
HYPOTHETICAL PROTEIN MJ0797 



Locus Name 



Acc# 



sp:Y7§V_MJBTJA | Q58207 



ORF Name 



NTID 



AAID 



NT AA 

— , — j , Score Probability 
Length Length 



&9J3J.65....L1...WA | \T77 



5494 



I.6e-07 



Protein name 



Locus Name 



conserved hypothetical protein yknz 



bir:E658S& 



Acc# 



E69858 



Description 



150 



NT 



AA 



ORF Name 



NTID 



AAID 



5084381 C3 310 



TTT 



Length Length 



Score Probability 
TTFS — 



2.4e-il8 



Protein name 



Locus Name 



FucR 



gp:AF13 72 63 



ACC# 



AF137263 



Description 



Bacteroides thetaiotaomicron 30S ribosomal protein SlS-likeprotem, tucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



5lS7$l2 tl 13 



TTT 



Length Length 




Score Probability 




Protein name 



Locus Name 



sp:Y^O§_MfiTJA 



Acc# 



Q58903 



Description 

HYPOTHETICAL ABC TRANSPORTER ATP -BINDING PROTEIN MJ1508 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length JL 



i5iaiaA2..±i...i2La i 



Protein name 



Locus Name 



amino acxd ABC transporter, ATP -binding 
protein 



pxr :H72356 



Acc# 



H72356 



Description 



ORF Name 



Protein name 



NTID 



AAID 



[T7ST 



Description 

Bactenopnage P2, complete genome. 



NT AA 

— , ^ — _ Score Probability 
Length Length a£ - 



nrTir 



FIT 



I3T 



Locus Name 



|gp:AF063097 



10.00016 



Acc# 



151 



NT 



AA 



ORF Name 



15859525 c3 299 



T77 



NTID AAID Length Length 

— 



Score Probability 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



l fi412ftI2...c2...iaft I 



Length Length 



2061 



Score Probability 
TX&> — 



;.7e-233 



Protein name 



high temperature protein HtpG 



Locus Name 
|gp:AF1762?5"~ 



Acc# 



AF176245 



Description 



Porphyromonas gxngxvalis high temperature protexn HtpG ( htpG ) gene, complete 
cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



6.21116.2..±2...5.6. I \ZT5 



Length Length 
T3T" 



Score Probability 



TUT 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



i&iz£.i.±i..xi9. ..j mu 



ttUT 



Length Length 
JUT 



Score Probability 
W^B 



Protein name 



Locus Name 



dihydrociipicolinate syntiiase 



pir:B7224S 



Acc# 
B72246 



Description 



152 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length A ~ 



1507574 ci 149 



TZ7T 



l.ie-193 



Protein name 
Description 

NfiGAflVfi REGULATOR OF GEtfSTlC COMPE^EtfCfi MfiCfe 



Locus Name 



isp:MECfl_BACaD 



Acc# 



P37571 



NT 



AA 



ORF Name 



NTID 



AAID 



985150S 13 113 



Length Length 



ITT 



Score Probability 




l.Se-OS 



Protein name 



Locus Name 



hypothetical protein 



gp:SfiL2437{S? 



Acc# 



AJ243707 



Description 



Synechococcus elongatus petB gene, petD gene ana ORFl. 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



lu9£3.0£2..±3.„.15.i I 1257 



TTT 



ITT 



6.1e-74 



Protein name 



Locus Name 



ATP syntnase Fl, suhunit alpha 



pir:P72231 



Acc# 



F72231 



Description 



ORF Name 



13.3.S6.2...t3....116.. 



Protein name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length ^ 



hypothetical protein 



TW 



1.2e-S4 



Locus Name 



tap:STE242827 



Acc# 



AJ242827 



Description 

Streptomyces tenclae atp gene and ORF2 (partial) , strain Tue901/8c. 



153 



ORF Name 



13672255 t2 $5 



Protein name 



NT ID 



AAID 



T5UT 



NT AA „ t < -i , 
— , — , Score Probability 
Length Length ± - 



Locus Name 



Acc# 



Description 
NO -HIT 



ORF Name 



NT 



AA 



NTID 



AAID 



U5fiim...c3..„3.ii I ms 



Length Length 



T3B1T 



Score Probability 
1573 



Protein name 



Locus Name 



conserved hypothetical integral membrane 
protein HP1184 



Description 



pir :H64667 



6.6e-45 



Acc# 



H64667 



ORF Name 



Protein name 



NTID 



M5.il0.0.7...±l...£ 



AAID 



15505 



NT AA 

— , — , Score Probability 
Length Length 



TT0~ 



Locus Name 



Acc# 



Description 
WO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



conserved hypothetical protein yJcnZ 



Description 



NT 



AA 



Length Length 



Score Probability 
0.0015 



TT7 



Locus Name 



pir :E6985fcS 



Acc# 



E69858 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— „ — , Score Probability 
Length Length 



404 



fTIT 



|4.9e-l7 



Locus Name 



antibiotic resistance protein homolog ywoG 



pir :B7U06b 



Acc# 
B70065 



Description 



154 



NT 



AA 



ORF Name 



NT ID 



AAID 



15104137 tl 1 



2W 



Length Length 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



ORF Name 



NT ID 



NT AA ^ ^ , _ , _ . 
— , — , Score Probability 
AAID Length Length J ~ 



X5.6.23A4.3....G3...3.2.2. I I25T 



SETT 



I.0e-78 



Protein name 



Locus Name 



Salmonella typhimurium transcriptional 



gp : STYSTMPi 



Acc# 



AF170176 



Description 



Salmonella typhimurium tragment STMF1 . 



NT 



AA 



ORF Name 



NT ID 



isaifti..±a...i2i I p55 



AAID Length Length 
15^14 — 



TUT 



Score Probability 

— 



1.8e-99 



Protein name 
Description 

PROBABLE URACIL PERMEASE (URACIL TRANSPORTER) 



Locus Name 



sp:URAA_HAEIN 



ACC# 



P45117 



ORF Name 



NT ID 



AAID 



NT AA 

Score Probability 
Lengtn Lengtn 



\16±12L82..±1.A5. I 1353 



TTTT 



3.4e-78 



Protein name 

Description 
ATP SYNTHASE ALPHA CHAIN, 



Locus Name 



Acc# 



sp :ATPA_RICPft | 0502 



88 



155 



ORF Name 



NT ID 



AAID 



NT AA 
— ■— — Score 
Length Length 



254 


5516 




252 


879 




470 





Probability 
1.4e-44 



Protein name 



Description 



Locus Name 



sp:ATPGJAAC£W 



Acc# 



P37810 



AHP £}y^lTHASfi CjAMMA CHAIN, 



ORF Name 


NT ID AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


l&3453lV__c2_264 


2$5 5517 


285 


858 


120 


2.0e-0S 



Protein name 



Locus Name 



3 1 , 5 1 -cyclic-nucleotide pnospnodiesterase, 
cpdA homolog MTH178:Icc related protein 



Description 



pir:P6Sl£>4 



Acc# 



F69104 



ORF Name 



Protein name 



NT ID 



NT AA 
— — , Score 

AAID Length Length 



\2±9M^.D....z±Jl±l I 



Locus Name 



Probability 



Acc# 



Description 



ORF Name 



Protein name 



NT ID 



AAID 



NT 



AA 



Length Length 

mi — 



Score Probability 



Locus Name 



Acc# 



Description 



156 



NT 



AA 



ORF Name 



NT ID 



23464^6 ±i lbi 



AAID Length Length 
fF&2 — 



Score Probability 
3.5e-0^ 



153 



Protein name 



Locus Name 



HelC 



|gp:LPU117u4 



Acc# 



U11704 



Description 



Legionella pneumop hila HeiC (heic) gene, complete cas . 









NT 


AA 


Score Probability 


ORF Name 


NTID 


AAID 


Length Length 


236$4£S£_rlJib 


2&§ 


5521 




225 


678 


±1$ S.5e-l4 



Protein name 



Description 



Locus Name 



sp:(2Sl_kUMAN 



ACC# 



Q08623 



G3i PROTEIN 



ORF Name 



NTID 



NT AA 
— — i S core 

AAID Length Length 



Probability 
S.0e-2b 



Protein name 



Locus Name 



transcription regulator, crp ramiiy 



pir :F722^b 



Acc# 



F72285 



Description 



NT 



ORF Name 



NTID 



JUT 



AAID Length Length 



AA 

— Score Probability 



^71 



7TT 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



157 



ORF Name 



Protein name 

Description 
[MO-HIT 



NT 



AA 



NTID 



AAID 



JUT 



Length Length 
TJT 



Score Probability 



Locus Name 



Acc# 



ORF Name 



NTID 



^±m.7.*L±3„..±io. I \juj 



Protein name 



AAID 



STZT 



NT 



AA 



Length Length 



Score Probability 



JUT 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



NTID 



AAID 



NT AA 
— — Score 
Length Length 



242Liaaa2...t2L...ai... I pro 



T7T 



Probability 
|5.2e-13 



Protein name 



Description 



Locus Name 



sp:ATPL_ANASP 



Acc# 



P12409 



ATP SYNTHASE C CHAIN, (LIPID-BINDING PROTEIN) 



ORF Name 



NTID 



AAID 



NT AA 

^ — , , — , Score Probability 
Length Length 



Z£2.5L&4.1.7....t2...5.3. 



JUT 



379 



1140 



TJU~ 



1.2e-45 



Protein name 



Locus Name 



sensory transduction system regulatory 
protein slll229 :protein slll229 .-protein 



(pir:S7B524 



Acc# 



S75524 



Description 



158 



NT 



AA 



ORF Name 



NTID 



AAID 



24335127 c3 332 



Length Length 
332" 



Score Probability 
7¥5 



1.3e-76 



Protein name 



Locus Name 



sp:YIEN_ECOLI 



Acc# 
P31473 



Description 

HY&O'MMICAL 56.4 KD PROTEItf IN AStfA-KUP iNfERGENlC RfiGtON 



ORF Name 



NTID 



TUT 



Protein name 



hypothetical protein 3hp03 36 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



TUT 



3.4e-i5 



Locus Name 



plr .-C71944 



Acc# 



C71944 



ORF Name 



NTID 



NT AA 
T — ^, T — Score Probability 
AAID Length Length ^ 



TST" 



FIT" 



0 . 049 



Protein name 



Locus Name 



nonstructural protein 



gp:AF012732 



Acc# 



AF012732 



Description 



Bovine viral diarrhea virus strain Yak nonstructural protein (pl2 5 J mRNA, 
partial cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



TUT 



5531 



Length Length 



TUT 



Score Probability 
TUB 



7.3e-23 



Protein name 

Description 
THI0&ED0XIN (TRXJ 



Locus Name 



sp:THI0_B0RB0 



Acc# 



051088 



NT 



AA 



ORF Name 



NT ID 



24414155 rl 20 



3TB" 



AAID Length Length 
^T2 — 



T5TT 



Score Probability 
131 



0.0040 



Protein name 



Locus Name 



unknown 



gp:U9£771 



Acc# 



U96771 



Description 



Prevotelia bryantii putative polygalacturonase, B-l, 4- encloglucanase, ana: 
mannanase genes, complete cds; and unknowngenes . 



ORF Name 



NT ID 



AAID 



24489452 C2 269 



irr 



FE1T 



Protein name 



long- chain- tatty-acid CoA ligase 



Description 



NT AA 
„ — , „ — J , Score Probability 
Length Length 



T^3T 



Locus Name 



bir:D7u3§6 



2.0e-^§ 



Acc# 



D70386 



NT 



AA 



ORF Name 



NT ID 



AAID 



116&±MA..±2..3A I I3T2 



5534 



Length Length 



Score Probability 
1.6e~ll 



IFF 



Protein name 



Description 



Locus Name 



sp:ATPE_CHLLI 



Acc# 



P35111 



ATP SYNTHASE EPSILON CHAIN, 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



TTT 



55^5" 



3.6e-26 



Protein name 

Description 
HULA PROTEIN 



Locus Name 



|sp:HELA_LEGPN 



Acc# 



Q48815 



160 



NT 



AA 



ORF Name 



NT ID 



AAID 



24875042 t3 147 



|5b36 



Length Length 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



ORF Name 



NT ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



'2L^aax5L7.7....t2...ast | HT5 



7TT 



B.ie-i6 



Protein name 



Description 



Locus Name 



sp:XYNB_BTJTFI 



Acc# 



P26223 



D-XYLAltf X YLAItf <M YdROLAS E B) 



NT 



AA 



ORF Name 



NTID 



AAID 



254.7.28.42...C3....3.15... 



Length Length 
F7 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



\16A&£.$l&..±l..XXl I I7T7 



Length Length 



f7F5~ 



Score Probability 



TT5 



Protein name 



Locus Name 



receptor antigen (RagAj 



gp:PGI130872 



Acc# 



AJ130872 



Description 



Porpnyromonas gingival is W50 receptor antigen (rag) locus encodings major 
immunodominant 55kDa antigen. 



161 



ORF Name 



Protein name 



Description 



NT ID 



AAID 



NT AA 

— — Score Probability 
Length Length 



TOT 



5540 



|1.7e-4ti 



Locus Name 



sp : PVkD JflcJoLl 



Acc# 



P05021 



(DK0t)ElHASfi) 



ORF Name 



2^42827 c3 342 



Protein name 



NTID 



3T5" 



5541 



probable atp- dependent neiicase 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



1057 I [TT74 



2T4~ 



Locus Name 



pir:A7l80b 



Acc# 



A71805 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



TUT 



5542 



4.8e-27 



Protein name 



Description 



Locus Name 



sp:CZCB_ALciiil> 



Acc# 



P94176 



CATION INFLUX &j¥riTJ5 M PkOTklN cJ^(J.b 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



fZTT 



S.3e-16 



Protein name 



Locus Name 



11SK outer membrane protein precursor : susc 
protein 



pir:JC602V 



Acc# 



JC6027 



Description 



162 



OPF Name NT ID AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


29973182_c2_246 322 5544 


857 


2574 1042 


3.4e-10b | 


Protein name 


Locus Name 


Acc# 


(p)ppGpp synthetase 




U86377 


Description 




Baciiius subtiixs tpipp^PP synthetase 
adeninephosphoribosyl transferase (apt) 


(relA) 
genes , 


and 

complete cds . 






ORF Name NTID AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


3l3l$l0_c2_266 323 | 5545 


492 


1479 832 


6.0e-&3 


Protein name 






Locus Name 


Acc# 



sp:VC<30_KCjOLl 



P76007 



Description 

PU T ATIVE M( + )/H( + ) UXCHANc^ik KOdu 



ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


1165M.i).^l^A2 


324 


5546 


103 312 


134 





Protein name 



Locus Name 



gp:AB01b87y 



Acc# 



AB015879 



Description 



£>orphyromonas gingivalis anaK operon genes, complete cds. 



ORF Name 


NTID AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


116.6A0.!l..±l..±5.b. 


" 325 5547 


415 


1243 


276 


S.Se-24 



Protein name 



Description 



Locus Name 



sp:ATf>6__RH0kU | 



Acc# 



P15012 



ATP SVNTHASE A CHA IN, (^JkuTUlN b) 



163 



Protein name 

Description 
|N0- HIT 



NT 



AA 



ORF Name 


NT ID 


AAID 


Length 


Length 


33240828_t2_62 


326 


5548 


885 


2658 



Score Probability 



Locus Name 



Acc# 



ORF Name 



NTID 



XXJ.B3.\i3J...XZ..J.B. 



Protein name 



conserved Hypothetical protein 



Description 



NT 



AA 



AAID Length Length 
— 



TUT 



TIT 



Score Probability 
TT7I 



0.024 



Locus Name 



pir:<372385 



Acc# 



G72385 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



TTF 



5.6e-i6 



Protein name 



Locus Name 



diacylglycerol kinase 



gp:BSU29177 



Acc# 



U29177 



Description 



Bacillus subtilis PhoH (phoH) gene, partial cds, diacylglycerolkinase (dgJc) 
gene, complete cds, and cytidine deaminase (cdd) gene, partial cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



l&riB2.B5..±l..±±& I 



5b51 



Length Length 



Score Probability 
2540 



6.1e-264 



Protein name 

Description 
ATP SYNTHASE BETA CHAIN, 



Locus Name 



sp:ATPB_BACFR 



ACC# 



P13356 



164 



ORF Name 



NTID 



— — Score Probability 
AAID Length Length 



|34i8ib6lJ:l_ly 



\555T 



T5T 



Protein name 



Locus Name 



1I5K outer membrane protein precursor : suse 
protein 



Description 



pir: JCfiOUV 



Acc# 
JC602 7 



ORF Name 



NTID 



156250M..±1...&1 1 



Protein name 



\55T 



NT 



AA 



AAID Length Length 
T5T5 



— Score Probability 



5JJT 



Locus Name 



Acc# 



Description 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



mmi.iL.iyiL 



555T 



Length Length 
T5tt 



Score Probability 
1.3e-^ 



Protein name 



Locus Name 



DNA helicase nomoiog 



gp:AJ?'lU8ii8 



Acc# 



AF108138 



Description 



Homo sapiens DtiA h elicase homoiog (frlFl) mRNA, partial cas . 



NT 



AA 



ORF Name 



NTID 



AAID 



43.5.5.2b^...cl..^ay... 



&3T 



5555- 



Length Length 
T551 



575~ 



Score Probability 



Protein name 



Locus Name 



Beta-N-Acetyiglucosamimctase 



gprABOlbibU 



Acc# 



AB015350 



Description 



gtreptomyces thermoviolac eus nagi* gene torBeta-JN-Acetyigiucosaminiaase, 
complete cds . 



165 



ORF Name 



NTID 



4454637 ti 5 



Protein name 



dTDP-glucose 4- 6 -dehydratase : protein 
slr0809 rprotein slr0809 



Description 



NT 



AA 



AAID Length Length 
5556 



TUT 



Score Probability 
TTCS — 



|6.8e-i07 



Locus Name 



pir :S75550 



Acc# 



S75550 



NT 



AA 



ORF Name 



NTID 



ASLJiiaXL±2™5i I |335 



AAID Length Length 




Score Probability 



Protein name 

Description 
NO- HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



I45sm£...ai„.iaa i 



AAID Length Length 
5^53 — 



T5T 



[I3W 



Score Probability 
1251 



^.5e-25 



Protein name 



Description 



Locus Name 



gp:ECOUW82 



Acc# 



L10328 



"W. coli; the region trom 81.5 to 84.5 minutes. 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



iB.&16M...a2...2Si. I I3T7 



3TT 



1644 



i.Se-186 



Protein name 

Description 
PRISMANE PROTEIN 



Locus Name 



|sp:PRlS_£>E5Vli 



Acc# 



P31101 



166 



ORF Name 



NTID 



Protein name 



35STT 



ATP syntnase b'U, summit b 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



5TTT 



Locus Name 



|pir:H7223l 



1.2e-Ib 



Acc# 



H72231 



NT 



AA 

— Score Probability 
Length Length 



ORF Name 



NTID 



AAID 



£27Ab.a.b,.±^..^. 



353" 



55^1" 



T53~ 



57TT 



2T5" 



|1.4e-17 



Protein name 



Locus Name 



^lPO-ATPase subunit delta 



|gp:AP0yyb22 



Acc# 



AF098522 



Description 



Lactobacillus acidophilus ur acil phospnoriJDosyi trans r erase luppjgene, 
partial cds; and FIFO-ATPase operon, complete sequence. 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— Score 
Length 


Probability 


5.5£5.b.l.±±..±l& 


340 


BB62 




1437 136 


6.4e-06 


Protein name 








Locus Name 


Acc# 










sp:YF07_M±i!TJA 


Q58902 


Description 












-HYPOTHETICAL J^koTETN MJlb07 | 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— Score 
Length 


Probability 


£g.3.u&^al^i:/.b.„ 


341 


5563 




2S§ 213 


2.4e-17 


Protein name 








Locus Name 


Acc# 



RNA-JDinding protein 



gp:ANARBPL)2 



Description 

Anabaena variabil is rbpD gene tor kNA-sinamg protein, compietecds. 



167 



NT 



AA 



ORF Name 



NTID 



£0468^1 13 140 



AAID Length Length 
5553 — 



TUT 



WIT 



Score Probability 
555 



3 .4e-64 



Protein name 



Locus Name 



3 - me t hyl - 2 - oxobut anoa t e 



gp : CGPAN 



Acc# 



X96580 



Description 



C.glutamicum panB, pane & xyiB genes. 



NT 



AA 



ORF Name 



NT ID 



AAID 



Length Length 
T552 — 



¥43 



Score Probability 
1751 



l.le-2l 



Protein name 

Description 
NITROGEN REGULATION PROTEIN NTRY, 



Locus Name 



Acc# 



Q04850 



ORF Name 



NTID 



AAID 



NT AA , , . , . 
— , — , Score Probability 
Length Length 



3.Q3iflii...ca„.m i m% 



5555" 



TIT 



Protein name 

Description 
INO-HTT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA , ^ . , . 
— ^ — , Score Probability 
Length Length 



^5" 



555T 



755" 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



168 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length JL 



3755325 t3 148 



1070 



3.5e-108 



Protein name 



Locus Name 



Acc# 



Description 



Q55336 



TJ (^ORMAfE-bEPEtffttltiT GA& T'RANSFORMYLASE J 



ORF Name 



NTID 



AAID 



NT AA o ^ u , , n . 
— , — ^ Score Probability 
Length Length 



1057752 12 175 



3TT 



2TT 



S.8e-5l 



Protein name 



Locus Name 



Acc# 



thio- specific antioxidant ( tsa) peroxidase 



pir:E72036 



E72036 



Description 



ORF Name 



NTID 



iim&a„.c2„.Ai2 i pro 



Protein name 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



rr 



2TT 



Locus Name 



Acc# 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



iiaaAifti..c2L„42a i 



Length Length 



Score Probability 
73 



0.021 



Protein name 



Locus Name 



ATP binding protein 



gp : BBATPBP 



Acc# 



X91S65 



Description 



B . burgdorferi a£>p gene. 



169 



NT 



AA 



ORF Name 



NT ID 



11S90£ cl 393 



AAID Length Length 
F£33 



Score Probability 



T£7T 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NT ID 



AAID 



TFT" 



Length Length 
FT" 



Score Probability 
0.020 



f77 



Protein name 



Locus Name 



pE66L 



gp:ASU18466 



Acc# 



U18466 



Description 



Atrican swine tever virus, complete genome. 



ORF Name 



NTID 



AAID 



NT AA 
— , — , Score 
Length Length 



X26.1t202..±l...lb.Z I 552 



TIT" 



Probability 
8 . 5e-45 



Protein name 



Locus Name 



hypothetical protein 



gp : A&AAM Y<3 



Acc# 



X58627 



Description 



A.haloplanKtis amy gene tor alpha- amylase 
1 , 4 - alpha -D-glucanglucanohydrolase . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 




Score 



Probability 
\S.6e-22 



Protein name 



Locus Name 



single stranded DNA- binding protein 



gp:SSU640d5 



Acc# 



U64095 



Description 



Shewanella sp. PT99 single stranded DNA- binding protein (ssb) gene, complete 
cds . 



170 



NT 



AA 



ORF Name 



NTID 



AAID 



1367792 t3 258 



F57F" 



Length Length 




Score Probability 

m% — 



2.3e-26 



Protein name 
Description 

HYPOTHETICAL PkOTSltf fluifiS 



Locus Name 



sp:YB65_HAEIN 



Acc# 



P44118 



NT 



AA 



ORF Name 



NTID 



13650S2 cl 320 



AAID Length Length 



5577 



1260 



Score Probability 
3.$e-75 



714 



Protein name 



Locus Name 



autoaggregat ion-mediating protein 



gp:AF091502 



Acc# 



AF091502 



Description 



Lactobacillus reuteri autoaggregat ion -mediating protexn (aggHJgene, 
complete cds . 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



, im7.ai7....a2...4fia | 



TZT 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



I35T 



Length Length 
TUT 



— Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



MO- HIT 



171 



NT 



AA 



ORF Name 



NTID 



U3314808 ±2 161 



T — _ — Score Probability 
AAID Length Length i - 



TUT 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



£5T 



Length Length 
— 



Score Probability 



380 



Protein name 

Description 
WO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



li44iaai...ai...aafi.„ .1 i^tt 



AAID Length Length 




Score Probability 



ITT 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



144M0.3.7....a3....5.2d.. 



7£T 



NTID AAID Length Length 
— 



789 



Score Probability 
T^> 



1.5e-32 



Protein name 
Description 

CYTOCHROME C BIOGENESIS PROTEIN CCSA 



Locus Name 



sp : CCSA_CYACA 



Acc# 



P31564 



172 



NT 



AA 



ORF Name 



NT ID 



14660927 ci 394 



AAID Length Length 

eftsu — 



Score Probability 
P55 



7.8e-I8 



Protein name 



Description 



Locus Name 



Acc# 



|gp:SCYDL057W 



S.cerevisiae chromosome IV reacting frame ORF YDL057w. 



NT 



AA 



ORF Name 



NTID 



AAID 



114665882 C2 461 



JET 



5555" 



Length Length 
T5T 



Score Probability 



Protein name 

Description 
iNO-MtT 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length =L 



.i4fiLi£iai...ci...iaa 1 1^4 



5W 



TO" 



TIT 



0. 00061 



Protein name 



Locus Name 



hypothetical protein 



Acc# 



AJ132945 



Description 



Yersinia enterocolitica WA 314 right arm of the high-pathogenicityisland. 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



15.6.3.25...±2...17.S. I 



■3.Se-17 



Protein name 



Locus Name 



ss-DNA binding protein 12RNP2 precursor 



gp:SY012RMP2 



Acc# 



D17359 



Description 

Synechococcus 6301 gene for ss-DNA binding protein 12RNP2, completecds . 



173 



ORF Name 



NTID 



15660937 cl 345 



Protein name 



AAID 



hypothetical protein 



Descri ption 



NT 



AA 



Length Length 



Score 



ZTT 



Probability 
13 .6e-60 



Locus Name 



pir :T33724 



Acc# 



T33724 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length Jl - 



ifiL2L3.a5„.al„.5fiS I |3T7 



7.4e-09 



Protein name 



Locus Name 



Mag44 



|gp:DEPMAG44 



Acc# 



D17682 



Description 



Dermatophagoldes tarinae mRNA tor Mag44, partial cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



1&44.7.&.7.S...C1...5&Q I PES' 



WW 



Length Length 



Score Probability 



Protein name 

Description 
MO-ttlT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



16.5.2.53.18....C2.A11.. 



Length Length 



Score Probability 



TTZ5~ 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



174 



ORF Name 



165582 ci yn 



Protein name 



Description 



NTID 



AAID 



— — Score Probability 
Length Length - 



\rnr 



TUT5 - 



TTT 



7 .8e-10 



Locus Name 



lsp:P£IM_CLoAti 



Acc# 



P33655 



DNA PRIMASE , 



ORF Name 



l6$006§7 c2 420 



Protein name 



NTID 



AAID 



TTT 



TT5T 



nypotnetical protein yycu 



Description 



NT 



AA 



Length Length 
£4(3 



TTT 



Score Probability 
1 . 2e-38 



414 



Locus Name 



pir :A700yO 



Acc# 



A70090 



ORF Name 



Protein name 



NTID 



372 



T5TT 



NT 



AA 



AAID Length Length 



Score Probability 



TTT 



TZT 



Locus Name 



Acc# 



Description 



ORF Name 



Protein name 



Description 



NTID 



TTT 



AAID 



NT 



AA 



Length Length 
222 



Score Probability 



77 



Locus Name 



Acc# 



175 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length J ~ 



c3 5u9 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



±9.0.15...±±..A1 1 tHF 



AAID Length Length 



Score Probability 



I2.ie-160 



Protein name 



Locus Name 



branching enzyme 



gp:AB026630 



Acc# 



AB026630 



Description 



Emericella nidulans gene tor branching enzyme, complete cds . 



NT 



AA 



ORF Name 



NTID 



I9.7.iai...r.3....2ai | fTTS 



AAID Length Length 
— 



1145 



Score Probability 
TT5 



3.1e-0S 



Protein name 
Description 

PORIN P PRECURSOR { OUTER MEMBRANE PROTEIN Dl) 



Locus Name 



sp : P0£P_PS£a£ 



Acc# 



P05695 



NT 



AA 



ORF Name 



NTID 



AAID 



YTTT 



— , — , Score Probability 
Length Length 

T7T~ 



Protein name 

Description 
NO -HIT 



Locus Name 



Acc# 



176 



NT 



AA 



ORF Name 



NT ID 



c2 423 



T73~ 



AAID Length Length 



Score Probability 




0.021 



Protein name 



Locus Name 



two- component sensor nisticiine kinase homoiog 
ybdK 



pir:P55747 



Acc# 



F69747 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID 



19JA0.B&1...Q1...5A1 1 nm 



Length Length 



Score Probability 



Ii.2e-ii8 



Protein name 



Locus Name 



sp:F«R_KCOLl 



Acc# 



P52067 



Description 

FOSMIDOMYCIN RESISTANCE PROTEIN 



NT 



AA 



ORF Name 



NT ID 



AAID 



I pinr 



Length Length 



Score Probability 



Protein name 

Description 
W0-H1T 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



\zq:3X4.q.q.:l..&2...&q.z 



NT ID AAID Length Length 
KOTS — 



WIT 



Score Probability 
14 .8e-l84 



Protein name 

Description 
AT P - DE PENDENT PROTEASE LA 1, 



Locus Name 



sprLONlJyiYXXA 



Acc# 



P36773 



177 



ORF Name 



120344086 £2 157 



Protein name 



NTID 



AAID 



NT AA 
— , — , Score 
Length Length 



Locus Name 



Probability 



Acc# 



Description 
WO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



5605 



NT 



AA 



Length Length 
I7TT 



Score Probability 



HIT 



Locus Name 



Acc# 



Description 

NO-HIT 



ORF Name 



Protein name 



NTID 



7&T 



AAID 



5606 



NT 



AA 



Length Length 
TTT 



Score Probability 



Locus Name 



Acc# 



Description 
INO-HTT 



ORF Name 



NTID 



|2ASA1551...cI...a25 ...J |3TC 



Protein name 



NT AA 
_ . — T — Score Probability 
AAID Length Length 



TOT" 



Locus Name 



Acc# 



Description 

no-hit 



178 



NT 



AA 



ORF Name 



NTID 



20980213 cl 392 



AAID Length Length 
— 



Score Probability 
Wl 



'0.034 



Protein name 



Description 



Locus Name 



sp:Y0R5_TTVi 



Acc# 



P19280 



HYPOTHETICAL 9.5 Ki) efeOffillJ 



NT 



AA 



ORF Name 



NTID 



2126506 cl 314 



TST 



AAID Length Length 
— 



EE7TT 



Score Probability 
S3 



O.OOO16 



Protein name 



Locus Name 



transcription regulator pnage- related nomolog 
ydcN 



pir : C69774 



Acc# 



C69774 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



2I4ft£aa7.„.aa..AftI I POT 



Length Length 



Score Probability 



F73T 



Protein name 

Description 
[NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
TUT 



Score Probability 



TTT" 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



179 



ORF Name 



NTID 



NT AA 
AAID Length Length 



21664055 c3 585 





5612 




SI 



Score Probability 




0.042 



Protein name 



Locus Name 



ATP synthase gamma chain 



Acc# 



AB027877 



Description 



Schizosaccharomyces pombe gene tor ATP synthase gamma chain, partial eels, 
clone :TA25 . 



ORF Name 



NTID 



AAID 



NT AA 

— ^ — _ Score Probability 
Length Length 



2l677i§0 c3 566 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



Score Probability 

irrm 



Protein name 



Locus Name 



estrogen receptor 



|pir:^2659S 



Acc# 



S26595 



Description 



ORF Name 



Protein name 



NTID 



J5T 



hypothetical protein slr0882 



Description 



NT 



AA 



AAID Length Length 
— 



T7T 



Score Probability 



Locus Name 



pir :S77272 



1. Oe-17 



Acc# 



S77272 



180 



NT 



AA 



ORF Name 



NTID 



2175£26S c2 419 



AAID Length Length 
— 



Score Probability 



Protein name 

Description 
iNO-HTT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



11521 



Score Probability 




7.5e-4i 



Protein name 



Description 



Locus Name 



sp : GLNA_BACCE 



Acc# 



P19064 



GLUTAMINE SYNTHETASE, (GLUTAMATE- -AMMONIA LiGASE) 



NT 



AA 



ORF Name 



NTID 



AAID 



Z2Q.a7..7.6.2...c3...A9.6. I \T5Z 



Length Length 



Score Probability 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 
^ — J , — , Score Probability 
Length Length Jl - 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



181 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length JL 



TUUT 



Protein name 



Locus Name 



p-ammobenzoate syntnase component I homolog 



Description 



pir :F64187 



^.3e-7B 



Acc# 



F64187 



ORF Name 



Protein name 



NTID 



NT AA 

— , — ^ Score Probability 
AAID Length Length ^ 



355" 



T7F" 



5TT 



0.042 



Locus Name 



sp : TGN3J&AT 



Acc# 



P19814 



Description 

TRANS-GOLGI NETWORK INTEGRAL MEMBRANE PROTEIN TGN3S PRECURSOR 



ORF Name 



Protein name 



NTID 



TuTT 



AAID 



5622 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 
KTO-HIT 



ORF Name 



Protein name 



unknown 



Description 



NT 



AA 



NTID 



AAID 



l 2M3.8.S.S.7....cl...3.a5. I FTOT 



Length Length 



Score 



FT 



Probability 
10.013 



Locus Name 



jgpTAFSTilST 



Acc# 



AF074396 



Desuitotomacuium thermoci sternum 
UDP-acetylglucosaminel-carboxyvinyl transferase (murA) gene; partial cds; 
yydA, f erredoxin (fdx) , dissimilatory sulfite reductase subunit A 
(dsrA) j dissimilatory sulfite reductase subunit B (dsrB) , and dsrD 
genes , complete cds ; and unknown gene . 



182 



in 1 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length ^ 



23472178 t3 271 



TSTT 



l.Se-122 



Protein name 



Locus Name 



Acc# 



Xylose Isomerase 



gp:RPL152472 



Description 



AJ132472 



Ruminococcus flavefaciens xylan utilization operon . 



ORF Name 



NTID 



NT AA 

m— — T — T — Score Probability 
AAID Length Length 



23397202 c5 5i3 



|4g~ 



Protein name 



Locus Name 



Acc# 



hypothetical protein F21D9.3 



pir:T21205 



T21205 



Description 



ORF Name 



NTID 



NT AA o - - i - ■ 
_ — T — . u Score Probability 
AAID Length Length 



2.3.6l3.2.13.2.„.£1...6A.. 



TUT 



WIT 



7.6e-60 



Protein name 



Locus Name 



xylulose kinase 



gp:AF001974 



Acc# 



AF001974 



Description 



Thermoanaerobacter ethanolicus putative TrkG gene, partial cds, andputative 
TrkA, xylose isomerase (xylA) and xylulose kinase (xylB) genes, complete cds. 



NT 



AA 



ORF Name 



NTID 



AAID 



2Liii4iaa...ai...ifia I 



Length Length 
3T~ 



Score Probability 





0.042 



Protein name 
Description 

HYPOTHETICAL PROTEIN MJ1213 



Locus Name 



Acc# 



sp:YC13_METJA | Q58610 



183 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



73" 



237 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



Score Probability 




1.9e-93 



Protein name 



Description 



Locus Name 



sp : TGT_BACSU 



ACC# 



032053 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 
32 



0. 013 



Protein name 



Locus Name 



Acc# 



M protein precursor 



pir:S61081 



Description 



ORF Name 



NTID 



Protein name 

Description 
NO-HIT 



AAID 



NT 



AA 



„ — J . — , Score Probability 
Length Length 

Tff3 



SO 



Locus Name 



Acc# 



184 



ORF Name 



NT ID 



NT AA 

— — , Score Probability 
AAID Length Length J ~ 



Protein name 



[¥TTT 



TJT 



IT 



0.016 



Locus Name 



Acc# 



MesF 



Description 



gp:AF143443 



AF143443 



Leuconostoc mesenterordes plasmid pHY30 MesG (mesG) gene, partialcds; and 
mesentericin BIOS (mesB) , MesH (mesH) , and MesF (mesF)genes, complete cds . 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length 



24027213 C2 460 



Protein name 



411 



5633 



TJT 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



NTID 



AAID 



NT AA o _ , _ . . 
— — , Score Probability 
Length Length 



\lAl±±16±.±X...ll I 



Protein name 



5634 



TIT 



W5T 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



242&5.8.a6....G2....4,6.9... 



Protein name 



NTID 



AAID 



fflT 



NT 



AA 



Length Length 
T5B~ 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



185 



ORF Name 



NT ID 



AAID 



NT AA 

— „ — , Score Probability 
Length Length 



243054^7 c3 5b6 



Protein name 



RTF 



Locus Name 



Acc# 



Description 



IMO-HIT 



ORF Name 



NT ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length * L * 



Protein name 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



NTID 



AAID 



NT AA 

— , — L1 Score Probability 
Length Length 



'2.&1U6AI.±1...11! | BTS 



Protein name 



T2TT 



Locus Name 



Acc# 



Description 



IMO-HIT 



ORF Name 



2.4ia.7.S..7..7....cl...3.7.i.. 



Protein name 



NTID 
[2T7 



AAID 



— , — , Score Probability 
Length Length 



ST" 



7T 



0.0075 



Locus Name 



Acc# 



Description 
SAFEINOSE OPEkON REMiESSOft 



sp : PAFk_E<^!0Ll 



P21867 



186 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length J ~ 



24406557 t2 15^ 



T5T 



|4.2e-0§ 



Protein name 



Locus Name 



protein antigen LmSTll 



|gp:LMU73S45 



Acc# 



U73845 



Description 



Leishmania ma^or protein antigen LmSTll mRNA, partial eels . 



NT 



AA 



ORF Name 



NT ID 



AAID 



244lS$l2 t3 257 



BIST 



Length Length 
1355 



TFT 



Score Probability 
S3 



0.0025 



Protein name 



Locus Name 



putative repressor protein 



gp:BA1242593 



Acc# 



AJ242593 



Description 
Bacteriophage A118 complete genome. 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



J5T 



TUTT 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



Z&5.U3.2L&2...C.3....5.3.&,. 



WIT 



AAID Length Length 
5513 



Score Probability 
73 



0.017 



Protein name 



Locus Name 



hypothetical protein MJ1664 



pir :F64507 



Acc# 



F64507 



Description 



187 



ORF Name 



NT ID 



AAID 



24633387 cl 354 



Protein name 



nypotnetical protein T27E13.6 



Description 



NT 



AA 



Length Length 



Score Probability 




Locus Name 



pir :T00580 



|4.5e-«' 



Acc# 



T00580 



ORF Name 



NT ID 



AAID 



NT AA 

— — J _. Score Probability 
Length Length ^~ 



^6L^aaia...a2L..AaQ i 



TT7" 



W7T 



Protein name 

Description 
[NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



2L£&4£&a6....ai...m 



Length Length 
TIT" 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



2&6AS.2!1..±1...29.5. I HTff 



Length Length 
TTZQ — 



419 



Score Probability 
TWo — 



1.0e-126 



Protein name 



Locus Name 



putative UDP-glucose dehydrogenase 



igp:AF15942 8 



ACC# 



AF159428 



Description 



Burkholderia pseudomaliei putative UDP-glucose dehydrogenase (udg) .putative 
ADP-heptose synthase (waaE) , and putativeADP-glycero-mannoheptose epimerase 
(gmhD) genes, complete cds. 



188 



NT 



AA 



ORF Name 



NTID 



2464S4I2 ti 23 



AAID Length Length 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



2&6.4&5.6.2....C3...3.5.5l I I3T7 



AAID Length Length 
— 



Score Probability 



WT 



2W 



Protein name 

Description 
1N0-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



Score Probability 
121 



Protein name 



Locus Name 



thiol rdisul tide interchange protein homolog 
yneN 



pir :E69891 



Description 



l.Se-07 



Acc# 



E69891 



ORF Name 



Protein name 



NTID 



a43.aifiis„±i„.aa 1 1325 



dTDP~6-deoxy-D-glucose-3 , 5 epimerase 



Description 



NT 



AA 



AAID Length Length 
— 



Score Probability 
533 



Locus Name 



gp:AFu4874a 



|2.0e-52 



Acc# 



AF048749 



Bacteroides tragilis capsular polysaccharide biosynthesis operon, complete 
sequence . 



189 



NT 



AA 



ORF Name 



NTID 



AAID 



24798568 ±2 219 



Length Length 
IUZ~ 



Score Probability 



Protein name 

Description 
(NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



24a0.ib.B.l...al...3.:/.D. 



Length Length 
7T" 



Score Probability 



HIT 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
TIT 



Score Probability 



Protein name 

Description 
[NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
TTT 



Score Probability 



Protein name 

Description 
WG-HM 



Locus Name 



Acc# 



190 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
5535 — 



TU7T 



Score Probability 




Protein name 



Locus Name 



sp:YVAA_BACSU 



Acc# 



032223 



Descri ption 

HY'POfHS'TlCAL OXlDORfiDUM-ASEi IN FH(JT)-0£>tJBD itiTfiRfiEltf lC kEGlON 



NT 



AA 



ORF Name 



c2 404 



4T? 



NTID AAID Length Length 

— 



T3T~ 



515" 



Score Probability 

m 



0.044 



Protein name 



Locus Name 



envelope glycoprotein 



bp:A*02l7:Sd 



Acc# 



AF021739 



Description 



HIV-1 isolate sing clone 45 trom the Netherlands, envelopegiycoprotein 
(env) gene, partial cds . 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



25A2B1±2.±1...2&B... 



5658 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
133" 



Score Probability 



Protein name 
Description 



Locus Name 



Acc# 



191 



ORF Name 



NTID 



25572212 cl 315 



Protein name 



AAID 



hypothetical protein yopO 



Description 



NT AA 

— ^ — ^ Score Probability 
Length Length <£ - 



TZT 



155" 



0.042 



Locus Name 



Acc# 



pzr :T12849 



NT 



AA 



ORF Name 



NTID 



AAID 



2.£k6AD.&&....ca...3.3.Q.., 



Protein name 

Description 
FFTTTT 



Length Length 



Score Probability 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 
TO 11711 



Score Probability 



Locus Name 



Acc# 



Description 
[NO- HIT 



ORF Name 



NTID 



Z6A33.2,16..„a3...AB.3. | W%1 



Protein name 



AAID 



5663 



NT AA 

— L1 , — ^ Score Probability 
Length Length 1 ~ 



81 



Locus Name 



Acc# 



Description 
INO-HTT 



NT 



AA 



ORF Name 



NTID 



AAID 



2&5.1.7....al...M6. I WZI 



Length Length 
5^ 



Score Probability 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



192 



NT 



AA 



ORF Name 



NT ID 



25601510 c2 448 



AAID Length Length 
5555 — 



Score Probability 
i.7e-05 



TUT 



Protein name 



Locus Name 



hypothetical protein MJ160 8 



pir :G64500 



Acc# 



G64500 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
— 



Score Probability 




"a.le-40 



Protein name 



Locus Name 



conserved hypothetical protein aq__1386 



pir :F70420 



Acc# 



F70420 



Description 



NT 



AA 



ORF Name 



NTID 



|ififiaii4i...ci„.4A£ | in? 



AAID Length Length 

^zn — 



1221 



Score Probability 
355 



Protein name 



Locus Name 



succinate- -CoA ligase (ADP- terming) , beta 
chain 



Description 



pir :H70439 



1.7e-85 



Acc# 



H70439 



ORF Name 



NTID 



AAID 



NT AA 

— , , — ^ Score Probability 
Length Length 



1112.6±...al..All I ETC 



5668 



480 



Protein name 

Description 
ISO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



2.7.7.a3.Q5....c3....5.&a I 1537 



Length Length 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



193 



NT 



AA 



ORF Name 



NTID 



2822161 cl 395 



AAID Length Length 
— 



Score Probability 



7JT 



2202 



Protein name 
Description 

NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



|2ftift2ii..±i...afi I Hi? 



Length Length 
7T" 



Score Probability 



TTT 



Protein name 

Description 
JN0-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 
, — , — , Score Probability 
Length Length J ~ 



ToTT 



2.1e-09 



Protein name 

Description 
HYPOTHETICAL PROTEIN HI1602 



Locus Name 



sp:YG02_HAEIN 



Acc# 



P44270 



NT 



AA 



ORF Name 



NTID 



AAID 



29.3.3.Sa25....r.2...122 1 [J^T 



5673 



Length Length 
73~ 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



194 



ORF Name 



NT ID 



AAID 



NT AA 

— ^ — _ Score Probab ility 
Length Length 



25412501 ti 79 



Protein name 



T5T 



7.5e-i4 



Locus Name 



Acc# 



Description 



|sp:LSPA__STACA 



Q59835 



PflMIDASE) (SIGNAL PElP-TlDASfi II) (SPASe! 11) 



ORF Name 



NTID 



NT AA 
— , — , Score 
AAID Length Length 



254700S1 cl 376 



Protein name 



3TT 



Probability 
0.0025 



Locus Name 



Acc# 



hypothetical protein PH0283 



Description 



bir:D7l4$3 



D71453 



ORF Name 



NTID 



AAID 



NT AA 
— — Score 
Length Length 



ajnaaftflLi-..ci-..52hL i hst 



^5" 



Probability 
Ii.2e-i09 



Protein name 



Locus Name 



cytochrome c peroxidase 



gp:AF200362 



Acc# 



AF200362 



Description 



Haemophilus ducreyi oxaloacetate decarboxylase gamma chain (oadGj gene, 
partial cds; oxaloacetate decarboxylase alpha chain (oadA) , oxaloacetate 
decarboxylase beta chain (oadB) , and alkylphosphonateuptake protein (phna) 
genes, complete cds; ccp gene, completesequence; cytochrome c peroxidase 
gene, complete cds ; and unknowncrene . 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



^77" 



T5T 



Protein name 



Locus Name 



Acc# 



Description 
NO-HIT 



195 



NT 



AA 



ORF Name 



NTID 



AAID 



51572502 ti 24 



Length Length 



Score Probability 
TTB 



Protein name 



Locus Name 



type I restriction enzyme hsdM : hypothetical 
protein H91_orf 543 : hypothetical protein 
H91 orf543 



Description 



pir:S73820 



Acc# 



S73820 



NT 



AA 



ORF Name 



NTID 



32055567 c5 507 



AAID Length Length 
5^73 1 IT2T IBSS 



Score Probability 
T5I 



'7.6e-l2 



Protein name 



Locus Name 



hypothetical protein 



bp:SSUl8S30 



Acc# 



Y18930 



Description 



SultoloJDUS soltataricus 2 81 KJd genomic DNA fragment, strain P2 . 



ORF Name 



NTID 



NT AA 

_ ^ T — _ T — ^, Score Probability 
AAID Length Length 



iaa£AJfia...ai...i5& I fss 



4.3e-89 



Protein name 



Locus Name 



succinate- -CoA iigase (ADP- forming) , alpha 
chain 



(pir:F69715> 



Acc# 



F69719 



Description 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length A ~ 



3.3.8.5t9^5....al...3.6.a I 



ITS" 



8.Se-l6 



Protein name 



Locus Name 



Acc# 



hypothetical protein TM1650 



bir:G72227 



Description 



G72227 



196 



ORF Name 



1^4017140 c3 49& 



Protein name 



NTID 



NT AA 
T — ^, x — ^, Score Probability 
AAID Length Length JL 



ST" 



Locus Name 



Acc# 



Description 
INO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



5683 



NT AA 
, — ^. — L , Score Probability 
Length Length 



TFT 



Locus Name 



1.4e-19 



Acc# 



Description 



'sp:YT2 9 JflTCTO 



P71564 



PUTATIVE OXlDCRElOUCTASE kV0$45, 



ORF Name 



NTID AAID 



2AH6A&2...C2..A1& I 



Protein name 



^4" 



NT AA 

— — Score Probability 
Length Length 



T7T 



1419 



Locus Name 



7.£e-l40 



Acc# 



Description 
ISOMERASE) 



sp:UXAC_fiCOLl 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 



Score Probability 



77T" 



Locus Name 



Acc# 



Description 

NO-HIT 



NT 



AA 



ORF Name 



NT ID 



34406502 c2 403 



AAID Length Length 



Score Probability 



75T 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 
T , T — . , Score Probability 
Length Length 



7.Se-07 



Protein name 



Description 



Locus Name 



sp:Y374_METJA 



Acc# 
Q57819 



Htt>OTHfiTlCAL PR0K1S MJ0574 



ORF Name 



NTID 



NT AA 

— , » — Score Probability 
AAID Length Length JL 



3.9e-l27 



Protein name 



Locus Name 



sp : YHCX_BACSU 



Acc# 



P54608 



Description 

HYPOTHETICAL 60.2 KD PROTEIN IN CSPB-GLPP INTEROENIC REGION 



NT 



AA 



ORF Name 



NTID 



3.6.3.3..7.562..„C3.„.55.3. 



AAID Length Length 
— 



Score Probability 
ST 



0.00020 



Protein name 



Locus Name 



regulatory protein CsgD 



gp:EC0CHRLI2 



Acc# 



AF081826 



Description 

Escherichia coii csg cluster, partial sequence. 



198 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



3650000^ ci 328 



TZ5T 



5.5e-85 



Protein name 



Locus Name 



macro! ide-ett lux determinant 



gprSIHWJbbV 



ACC# 



U83667 



Description 



Streptococcus pneumoniae mac rolide-ettlux aetermmant imetE) gene f complete 
cds. 



ORF Name 



cl 373 



Protein name 



5^ 



NT 



AA 



NTID AAID Length Length 





Score Probability 



TUT 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



|40.2lB.a2....a3.„.bLb.3... 



Protein name 



NTID 



T7TT 



AAID 



NT 



AA 



— — Score Probability 
Length Length 





Locus Name 



Acc# 



Description 



NO -HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 
355 



Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



199 



ORF Name NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Pr 


obability 


*065760_ti_63 472 " 5694 


255 768 




4 . ie-2U 


Protein name 






Locus 


Name 




Acc# 


nypotnetical protein 


pir:S75926 




S75926 


Description 














ORF Name NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


±0.11L^tl^± 473 


5695 | 


|773 ||2322 


121 




i . De-uo 


Protein name 


Locus 


Name 




Acc# 


outer membrane protein 


gp:NGU819b9 




U81959 


Description 














Neisseria gonorrhoeae outer membrane protein 


(omp8 5) gene, co 


mpieuecus . j 


ORF Name NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


llS^lL^a^lb. 474 


5696 


8S 


267 


77 




U * UIO j 


Protein name 






Locus 


Name 




Acc# 


hypothetical protein ZC4 / . i 


pir :T27b92 


T27592 


Description 














ORF Name NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


kll&M^tl^.lA 4 75 


5697 


50$ 


1530 


J1371 






Protein name 






Locus Name 




Acc# 


xylose transporter 


gp:AB009b9J 




AB009593 



Description 



Tetragenococcus halophilus rbs C, rbsb, xylR, xylA, xym ana xymgenes, 
partial and complete cds . 



200 



ORF Name 



4545012 t2 166 



Protein name 



Description 



PROTEIN) 



NT 



AA 



NTID 



AAID 



ITS" 



Length Length 
T9T 



Score Probability 

m 



0.028 



Locus Name 



sp:C 4 RP_ETOLT 



Acc# 



P03020 



ORF Name 



45532SS tl 45 



Protein name 



NTID 



ATT 



AAID 



NT AA 

— , — , Score Probability 
Length Length aL 



Locus Name 



Acc# 



Description 
NO-HIT 



ORF Name 



NTID 



4fias&as„.ca„AaA I itts 



Protein name 



AAID 



NT AA 

— , , , — 4 Score Probability 
Length Length 



Locus Name 



Acc# 



Description 

NO-HIT 



ORF Name 



Protein name 



NTID 



FT7T" 



AAID 



FTuT" 



NT AA 

— , — , Score Probability 
Length Length 



TTT 



Locus Name 



Acc# 



Description 
INO-HIT 



201 



NT 



AA 



ORF Name 



NTID 



4731400 t2 134 



AAID Length Length 

rrm — 



Score Probability 



7T" 



2T5" 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



*ftmi5..±i„.m 



Length Length 
JZZO 1 



Score Probability 
551 



1.7e-2il 



Protein name 



Locus Name 



isoleucine--tRNA ligase, lies : lsoleucyl-tRNA 
synthetase : isoleucyl -tRNA synthetase 



|pir:H70203 



Acc# 



H70203 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



A8.&3.5.3.2...cl...5..7.A 



Length Length 



Score Probability 



Protein name 

Description 
ITO^ITT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



5705 



Length Length 
— 



Score Probability 
^5 



i.3e-64 



Protein name 



Locus Name 



probable phosphoserine phosphatase 



|pir:T36772 



Acc# 



T36772 



Description 



ORF Name 



Protein name 

Description 
MO -HIT 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



Locus Name 



Acc# 



202 



NT 



AA 



ORF Name 



NT ID 



AAID 



16022037 cl 337 



rrur 



Length Length 
J5T 



Score Probability 
753 



9.6e-76 



Protein name 



Locus Name 



Acc# 



sp:YHTM_ECOLI 



Description 

HYt>OKlElT?ICAL » J to ^ROfEltf In RUSfe-PiT inTerGeNIC RSGlOtf 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



634716S c3 



ITTulT 



1^41 I 14932 



TFT 



1.3e-0S 



Protein name 



Description 



Locus Name 



gp:AB00S550 



Acc# 



AB008550 



Pseudomonas aeruginosa phage phi CTX, complete genome sequence. 



NT 



AA 



ORF Name 



NTID 



£S.3.iS.D..7...±1...7.S. I W7 



AAID Length Length 




TUT 



T5T 



Score Probability 
3.1e-08 



TT7 



Protein name 



Locus Name 



probata e ctnaK suppressor 



(pir:D7I366 



Acc# 



D71366 



Description 



ORF Name 



NTID 



ai.7.8.2.7....a3....S15. I [4¥S 



Protein name 



NT AA 

— , — , Score Probability 
AAID Length Length JL 



rRNA methylase homolog ysgA 



Description 



TFT 



5>.0e-25 



Locus Name 



(pir:G^55S4 



Acc# 



G69984 



203 



— — Score Probability 



ORF Name 



NT ID 



AAID Length Length 



T7TT 



TUJT 



2.3e-b^ 



Protein name 



Locus Name 



protein Kinase nomoiog Tm 



|gp:At ! 070b2U 



Acc# 



AF070520 



Description 

Sinorhizobium melxlofcx prote in kinase nomoiog Tni (tmj andKxoP-ii 
protein genes, complete cds; and unknown genes. 



ORF Name 



$4637 ci 36b 



Protein name 



NTID 



WW 



AAID 



ZTlT 



— — Score Probability 
Length Length 

mi 1 



AA 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



491 



AAID 



5713 



— — Score Probability 
Length Length 



"TUT 



Locus Name 



Acc# 



Description 



IN0-H1T 



ORF Name 


NTID 


AAID 


NT AA 
- — — , Score 
Length Length 


Probability 


5.3.5..7.&2:A...cl...i.Zfo. 


492 


5714 


£TT™ 1242 110 


3.3e-l4 


Protein name 






Locus Name 


Acc# 








5p:YBflH_EC0Ll 


P75742 


Description 










HYPOTHETICAL b4 


2 KD PROTEIN 


IN pHkB-NEI IN TERGENIC KUti±UJ>J 


i 



204 



NT 



AA 



ORF Name 



NT ID 



AAID 



10520312 12 45 



W5T 



Length Length 
17T 



Score Probability 



3BT 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



ORF Name 



NT ID 



AAID 



NT AA 

— , — „ Score Probability 
Length Length 



10.6.S.lS.7..7„..a3....2SI... I Wtt 



TUT 



TTF 



7.3e-07 



Protein name 



Locus Name 



hypothetical protein APE1165 



pir:H72586 



Acc# 



H72586 



Description 



ORF Name 



NTID 



AAID 



l lOA2&5£.l^tl^.9. | 



Protein name 



conserved hypothetical protein 



Description 



NT AA 

— , — , Score Probability 
Length Length 





5717 




204 | 


615 




148 





Locus Name 



tpir:C72361 



i.8e-10 



Acc# 



C72361 



ORF Name 



Protein name 

Description 
NO-HIT 



NT 



AA 



NTID 



AAID Length Length 
T5E 



Score Probability 



Locus Name 



Acc# 



ORF Name 



NT 



AA 



NTID 



l£1418..7....t3....8.Q I 1557 



AAID Length Length 



ITTlU 



5^T 



Score Probability 
31T7 



|1.3e-32 



Protein name 



Locus Name 



|gp:AB012555 



Acc# 



AB012956 



Description 

Vibrio cholerae genes tor O-antigen synthesis/ strain M04 5, complete ccts . 



205 



NT 



AA 



ORF Name 



14435841 t3 95 



NTID AAID Length Length 



Score Probability 




1.3e-64 



Protein name 



Locus Name 



rubrerytnrm 



gp:AF202316 



Acc# 



AF202316 



Description 



Mooreila thermoacetica rubrerythrin gene, complete cds . 



NT 



AA 



ORF Name 



NTID 



1443153? £1 22 



AAID Length Length 



Score Probability 
TuT 



0.012 



Protein name 



Locus Name 



comEA protein- related protein 



pir:F72301 



Acc# 



F72301 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— ^ — ^, Score Probability 
Length Length 



l&56.23.8.:7....t3....8.1 



5722 



T5B" 



Protein name 

Description 
|N0-HtT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



isiQSMi^c^isfiL 1 pr 

Protein name 



Length Length 
TIT 



Score Probability 
W7UT3 



87 



Locus Name 



nypotnetical protein M70.1 



pir:«3032 



Acc# 



T33032 



Description 



ORF Name 



NTID 



15..7.!?.iS0.1...c3....22S. I IFDT 



Protein name 

Description 
NO-HIT 



AAID 



NT AA 

— ^ _ — L1 Score Probability 
Length Length A ~ 



ITT 



Locus Name 



Acc# 



206 



NT 



AA 



ORF Name 



NTID 



t3 76 



AAID Length Length 

— 



Score Probability 
37S 



|4.6e-37 



Protein name 



Locus Name 



conserved nypotnetxcal protein 



pir :G72409 



Acc# 



G72409 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
— 



Score Probability 



7T 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



1M0.25...±1...3.D. 



AAID Length Length 
57T7 — 



Score Probability 



Protein name 
Description 

im^rTT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



aft£0.£ai5L±a...7.a i pro 



Length Length 
5TT 



Score Probability 





3.5e-16 



Protein name 



Locus Name 



sp:Y516 BOJRBU 



ACC# 



051468 



Description 

ttYPOTHE^iCAL WA/MA MET^YLT&Atf&E'E&ASfi BB6S16, 



207 



NT 



AA 



ORF Name 



NTID 



BTTT 



AAID Length Length 
fTJF? — 



Score Probability 
|i.2e-82 



WIT 



Protein name 



Locus Name 



dinycirolipoamiae " 
dehydrogenase, : 2 -oxoglutarate dehydrogenase 
complex cha-in E3:acetoin d ehydrogenase complex 



pir :14U'/y4 



Acc# 



140794 



Description 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


26$^143_c2_l^ 


608 5730 


$3 


252 | 






Protein name 








Locus 


Name 


Acc# 


Description 














NO-HIT 














ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


HlOAia^aX^l 


509 


5731 


82 


249 






Protein name 








Locus 


Name 


Acc# 


Description 
















NO-HIT 




ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


m&iiii^ii^i. 


510 


5732 


318 


957 


2§7 


" 3.4e-25 



Protein name 



Locus Name 



putative oxidorecLuctase 



gp:SCP7b 



Acc# 



AL121600 



Description 

Sbreptomyces coelicolor cosmicl F /6 . 



208 



NT 



AA 



ORF Name 



NT ID 



AAID 



22679637 tl 11 



5733 



Length Length 



Score Probability 
JZZ 



i.4e-33 



Protein name 



Locus Name 



conserved hypothetical protean ysnA 



pir:C69986 



Acc# 



C69986 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 
STH 



Score Probability 
TT7 



&.3e-05 



Protein name 



Locus Name 



outer membrane protein tolc precursor ( tolC) 
RP224 



(pir:H7i733 



Acc# 



H71733 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID 



23.SM0.0.D...±1...1.1 ...I VSTS 



Length Length 



— , Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



nfmail...a2...l5..7... I [514 



AAID Length Length 




Score Probability 
— 



3.0e-129 



Protein name 

Description 
HYPOTHETICAL 53. 



Locus Name 



sp:YGPH ECOLI 



Acc# 



P52043 



« KD PROTEIN IN SBM-FBA INTERGENIC REGION (0452) 



209 



ORF Name 



Protein name 



NT ID 



5T5~ 



57TT 



NT 



AA 



AAID Length Length 



— Score Probability 



148 



447 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NT ID 



— — score Probability 



AAID Length Length 



[5T5~ 



5775" 



15T 



fTTTT 



2.5e-i2 



Locus Name 



cnromosomai Hemolysin u 



gp:AF0tfl284 



ACC# 



AF081284 



Description 



Escherichia coli strain e!F' t675 chromosomal hemolysin D IhlyDj gene .partial 
cds; and Hpl (hpl) , Hp2 (hp2) , Hp3 <hp3) , and Hp 4 (hp4) genes, complete cds . 



ORF Name 



l4.12$:±D:z..±L„±h.., 



Protein name 



NTID 



5TT 



— — Score Probability 



AAID Length Length 



Locus Name 



Acc# 



Description 
NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



57W 



NT 



AA 



Length Length 
255 



Score Probability 



84 



Locus Name 



Acc# 



Description 



MO-HIT 



210 



NT 



AA 



ORF Name 



NT ID 



2440%62_c2_lbb 



AAID Length Length 
— 



Score 



PUT 



I3T5" 



PUT 



Probability 
|6.8e-li 



Protein name 



Locus Name 



iron-uptaKe tactor 



IgpzAFOblsyu 



Acc# 
AF051690 



Description 

Pseudomonas aeruginosa iron-uptaKe tactor ipiud) , " 
hydroxamate-typeferrisiderophore receptor (piuA) , and iron-uptake factor 
(piuB) genes, complete cds . 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


744l5875J:2_Sb 


520 


5742 


538 


1517 521 


7.2e-68 


Protein name 








Locus Name 


ACC# 

Z48540 



Description 

Pseudomonas aeruginosa atsR, atsts, atsc & atsA genes. 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


2A^^b.lJ.:Z-c2...lHU 


521 | 


5743 




1017 




i.0e-S6 


Protein name 








Locus Name 


Acc# 










sp:NADA_^VNVi 


P74578 


Description 














QqiUgLttfAtfB StfNTHlilTASfl A | 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


l±6A±$l:L±±..±tiA 




5744 


474 


1425 


417 


5.7e-35? 


Protein name 








Locus Name 


Acc# 










sp : FlICJo^RAl 1 


P17164 


Description 















211 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 

Length 


Score 


Probability 


247i396iJ:2_37 


523 




5745 


304 




915 


375 




1.3e-34 



Protein name 



Locus Name 



Acc# 



prolipoprotem diacyl glyceryl transferase 
(lgt) RP046 



|pir:F71712 



F71712 



Description 



NT 



AA 



ORF Name 



NTID 



^.5.5.U5.3.a6....tl...O; I 



AAID Length Length 
5723 — 



Score Probability 
^ 



|2.3e-i9 



Protein name 



Locus Name 



chloramphenicol acetyl transferase 



gp:AF124757 



ACC# 



AF124757 



Description 



Zymomonas mobriis losmid. clone 43D2, complete sequence. 



NT 



AA 



ORF Name 



NTID 



AAID 



assLmaa..±i...aa I bsf 



Length Length 



Score Probability 
PT 



0.0020 



Protein name 



Locus Name 



sp:EREB_ECOLI 



Acc# 



P05789 



Description 

ERYTHROMYCIN ESTERASE TYPE II, 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length " L 



2fiLifi.7.iis..±a...ja.. 



T74- 



2.0e-ll6 



Protein name 
Description 



Locus Name 



:sp:YYAP_BACSU 



Acc# 



P37518 



212 



ORF Name 



267b76J7 ti B« 



Protein name 



NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


527 


5745 






2208 


867 




l . 2e-«b 








Locus 


Name 




Acc# 



sllll80:protein sllll80 



Description 



S75806 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


A ■/ A X£t A I, A ^ 2ft 


528 


5750 


357 


1194 


355 


2.1e-32 


Protein name 








Locus Name 


Acc# 










sp:PBP_BACSU 


P39844 


Description 














' PUTATIVE PENICILLIN 


"ETHDING ProTEIN PkECUkSuk 






ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


maatt7....ca...2ia 


529 


5751 


527 


1584 


1185 


2.4e-1^0 


Protein name 








Locus Name 


Acc# 










sp:NAI>B_PSEAE 




Description 














L-ASPAkTATE OXIDASE, (OUINoLINATE SYNTHETASE 


BJ 






ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


3.3.3.5.210.:/...±l...ii 


530 


5752 


261 


7S6 


257 


"j 5.1e-22 



Protein name 



Locus Name 



Acc# 



Description 



sp:Y117_HELkY 



P56080 



HYPO T HETICAL PkuTE IN HP011V 



213 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



3457567^ ci 141 



mr 



ll.2e-164 



Protein name 



Locus Name 



sprMUTSJIAWIN 



Acc# 
P44834 



Description 
SNA MlSMA'Jffl IMPAIR PROTEIN MUTfci 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


4094l2B_t^_5l 


532 


5754 




599 




1S00 J 


83 


0.026 



Protein name 



Locus Name 



erythromycin esterase nomoiog yJDtu 



pir:A697b0 



Acc# 



A69750 



Description 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score Probability 


±2±£>A2..±J...±&± 


533 


5755 


470 1413 


5b 7 


i . ze-iui 


Protein name 






Locus Name 


Acc# 


putative protein 


gp:ATAJr>2 2 


Z99708 


Description 














Arabidopsis thaiiana DNA cnromosome 4, 


ESSA 1 


AP2 contig rragmentiNo. z. j 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score Probability 


43J.2&3.7....13....&6. 


534 


57S6 


228 bb/ 




















Protein name 








Locus Name 


Acc# 


Description 















NO-HIT 



214 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



— Score Probability 



4422762 ±2 b2 



5757 



T5W 



TTT 



8.4e-12 



Protein name 

putative giucosyl transterase 



Locus Name 



gp:AF10bllb 



Acc# 



AF105116 



Description 

gtreptococcus pneumoniae type 1 3C Cpsl9Uk icpsi^UK; gene, partialcds ; 
putative oligosaccharide repeat unit transporter (cpsl9CJ) , UDP-N-acetyl 
glucosamine-2-epimerase (cpsl9CK) , and putativeglucosyl transferase 
(cpsl9CS) genes, complete cds; andglucose-1 -phosphate thymidylyl transferase 
(cpsl9CL) gene, partialcds . 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


&M1M±...Q2...±M 


536 


575& 


54 


285 






Protein name 








Locus 


Name 


Acc# 


Description 














MO-HIT | 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


&$&$±2h.Jc2.Jz& 


537 


575$ 


§§ 


267 


87 


0.0057 



Protein name 



Description 



Locus Name 



sp:PBP4_MAiiliN 



Acc# 



P45161 



ORF Name 


NTID AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


43M^Lx±..11.4... 


' 53S 5760 


562 


155$ 


1101 


i.9e-lll 



Protein name 



Locus Name 



probable suitate transporter 



pir:A7146J 



Acc# 



A71463 



Description 



215 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



598587b ci 220 



"SJT 



ST5" 



5.9e-b0 



Protein name 



Locus Name 



terrichrome-iron receptor .i:protem 
slrl490:protein slrl490 



pir:S7445V 



Acc# 



S74457 



Description 



ORF Name 


NT ID AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


&163Ab:L±2...lA 


5762 


372 


1119 


165 


3.7e-09 



Protein name 



hypothetical protein PAB1767 



Description 



Locus Name 



pir rBVblifo 



Acc# 



B75136 



ORF Name 



Protein name 



NTID 



AAID 



5^T" 



NT 



AA 



Length Length 
TTT2 — 



Score Probability 



T7T 



Locus Name 



Acc# 



Description 



[NO -HIT 



ORF Name 



Protein name 



NT 



NTID 



AAID Length Length 



AA 

— Score Probability 



57^4" 



5.1e-2l7 



Locus Name 



putative leucyl tRNA synthetase 



|gp:M'0694¥r 



Acc# 



AF069441 



Description 



Arabidopsis thaliana BAO Tl^B lV Irom chromosome iv f near is.* civi,compie 
sequence . 



216 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 
753 



57SF 



25TT 



Score Probability 
5.5e-57 



587 



Protein name 



Locus Name 



putative glycosyl translerase 



gp:AP04S74y 



Acc# 



AF048749 



Description 



Bacteroxdes tragi iis capsular polysaccnariae biosyntnesis operon, complete 
sequence . 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



1054^7 ti 214 



^4T" 



5766 



ITT 



TUTT 



3 . Oe-104 



Protein name 



Locus Name 



superoxide aismutase 



gp : BNRSOD2 



Acc# 



D13756 



Description 

Bacteroides tra giiis bNA for superoxide dismutase, complete cds . 



ORF Name 



NTID 



— — Score Probability 
AAID Length Length 



£7£7~ 



T3T" 



Protein name 



Locus Name 



Acc# 



Description 



□ 



NO-HIT 



ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


lOmiS-M^al^A 


.... 54 6 


5768 


283 852 


803 


7.le-80 



Protein name 



Locus Name 



alpha-b-giucose-i-pnospnate 



gp:YEPA^CA 



Acc# 



L27130 



Description 



Yersinia pseudotuberculosis alpha-D-glucose-l-pnospnatecytidyiyltransterase 
(ascA) gene, complete cds. 



217 



NT 



AA 



ORF Name 



NT ID 



10837887 cl 303 



547 



AAID Length Length 
TT25 



Score Probability 
5.8e-10i 



1002 



Protein name 



Locus Name 



CDP-glucose-4, 6 -aenyaratase 



pxr :iJ4VU7U 



Acc# 



D47070 



Description 



NT 



ORF Name 



NTID 



AAID Length Length 



AA 

— Score Probability 



F77TT 



Protein name 



Description 



Locus Name 



Acc# 



K0-H1T 



ORF Name 



NTID 



AAID 



537" 



5771 



] i 



NT Score Probability 

i.8e-83 



Length Length 



T7T 



FIT 



Protein name 



Locus Name 



sp:ATOC_>!(JoLl 



Acc# 



Q06065 



Description 

£)BC AkBOX y LA& ill INHIBIT**) (ORNl THlNB! Di^Ak60X¥LASlil ANTIZYJYLE) 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


li:m2M..±SL...&& 


550 


57715 


107 


524 


152 


1.2e-i0 



Protein name 



Description 



Locus Name 



|sp:CBlk_SALTY 



Acc# 
Q05592 



CBTK PROTEIN 



218 



ORF Name 



Protein name 



NT ID 



¥5T 



5773 



NT 



AA 



AAID Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



1N0-H1T 



ORF Name 



NTID 



AAID 



Protein name 



S77T" 



— — Score Probability 
Length Length 



TTT 



Locus Name 



Acc# 



Description 



IH0-H1T 



ORF Name 



Protein name 



NTID 



AAID 



5775" 



— — Score Probability 
Length Length 



TIM- 



LOCUS Name 



Acc# 



Description 



[NO -HIT 



ORF Name 



Protein name 



ThiH 



NTID 



— — Score Probability 



AAID Length Length 



5775" 



TT5T" 



[4 .3e-ay 



Locus Name 



|gp:AFlb40b4 



Acc# 



AF154064 



Description 

Salmonella typhimurium TniH ItniH) gene, complete cas . 



219 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



14489050 ±2 180 



5S5~ 



5777 



555" 



TFT 



4 . 7e-07 



Protexn name 



Locus Name 



aspartate ammotransterase 



pir :D7b4yb 



Acc# 



D75496 



Description 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



146A£2u.7...±l...l8... 



5S5~ 



5T7F" 



1555" 



11788 



11878 



|S.fie-lS4 



Protein name 
Description 

THIAMINE BIOSVN'MiliJIg l>koTUIN TH1(J 



Locus Name 



Acc# 



sp:THlC_HAOfc!U 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



F57~ 



5779 



T7¥" 



0.021 



Protein name 



Locus Name 



conserved hypothetical protein MTH4 b y 



|pir:D6yibi 



Acc# 



D69161 



Description 



ORF Name 



Protein name 



Description 



NTID 



5S5~ 



57W 



NT 



AA 



AAID Length Length 



Score Probability 



T55" 



0.0^9 



Locus Name 



sp:GENK_li!cJoLl 



Acc# 



P02988 



PROTEIN K 



220 



ORF Name 



NT ID 



— — Score Probability 
AAID Length Length 



114867327 tJ 2ib 



TITT 



H7T 



|3.0e-i0 



Protein name 



Locus Name 



|sp:Yl^NJi!<JoLl 



Acc# 
P27850 



Description 

tiYpOl'ttK'riOiAL 54.7 KB frRO'l'Kld M UDP-UBiij iNl'KRCiKNi G REGION PRECURSOR 



NT 



AA 



ORF Name 



NT ID 



1606401b C2 38b 



AAID Length Length 
E23 



I4u" 



Score Probability 
0.0060 



Protein name 



Locus Name 



Acc# 



trbA protein 



plr :A4ybb2 



Description 



ORF Name 



NTID 



— — Score Probability 



AAID Length Length 



l&4Mi..cl...JLUli.. 



T5T 



5.0e-47 



Protein name 



Locus Name 



conserved hypothetical protein HPUib^ 



pir :B64b40 



Acc# 



B64540 



Description 



ORF Name 



Protein name 



Description 



NTID 



5784 



NT 



AA 



AAID Length Length 



Score Probability 



7W 



11337 



2.Se-144 



Locus Name 



'spiPCRAJAAO^T 



Acc# 



P56255 



A?fr-DfifrfiNSSNl' HKLH aSe! PC 4 RA, 



221 



NT 



AA 



ORF Name 



NT ID 



196087b C2 4Ui 



EST 



AAID Length Length 
715 



TT 



Score Probability 
TTJJU72 



FT 



Protein name 



Locus Name 



nypotnetical protein MJ16UB 



pir:GS4500 



Acc# 



G64500 



Description 



ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


ia&aat:Att„.ai„.ia7. 


564 


5786 




442 






128 


5.0e-0b 



Protein name 



Locus Name 



unknown 



|gp:AP1448Vy 



Acc# 



AF144879 



Description 

Leptospira interrogans rtJD locus, complete sequence. 



ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


t^lOM^al^Al 


.... 565 


5787 


451 


1356 


1278 


3.3e-1^0 



Protein name 



Locus Name 



CDP-4-keto-6-deoxy-D-glucose-3-aenydratase 



Acc# 



AJ251713 



Description 

Yersinia pestis strain EV 7 6 hemH gene (partial) ana u-antigen geneciuster 
for ddhD gene, ddhA gene, ddhB pseudogene, ddhC gene, prtgene, wbyH gene, 
wzx gene, wbyl pseudogene, wbyJ gene, wzypseudogene, wbyK gene, gmd 
pseudogene, fcl pseudogene, manC gene,wbyL gene, manB gene, wzz gene and gsk 
gene (partial) . . . 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



TUT 



JUT 



TPT 



2.1e-34 



Protein name 



Locus Name 



nypotnetical protein jnpuuy4 



pTrTWTTTTT 



Acc# 



E71975 



Description 



222 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



20087751 cA 4^9 



3 . 3e-146 



Protein name 



Locus Name 



putative UDP-GicNAc : uncle caprenylpnospnate 



gp:AP04ri74y 



Acc# 



AF048749 



Description 

Sacteroides tragilis capsular polysaccharide fciosyntnesis operon, complete 
sequence . 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


2052030:2_c3_462 


56§ 


5750 


4dS 


1488 


112 


5.£e-0£ 



Protein name 



Locus Name 



immunoreactive 5UKD antigen PGb3 



bp:APl7S720 



Acc# 



AF175720 



Description 



Porphyromonas gingivaiis strain WbO immunoreactive bOKD antigenPGbi gene, 
complete cds . 



ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


ZQS.ab.UlZ^.tl^l 




5791 


SOI 2406 


181 


9.9e-33 



Protein name 



Locus Name 



terncnrome-iron receptor 3:protexn 
slrl490 :protein slrl490 



|pir:S744bV 



Acc# 
S74457 



Description 



ORF Name 



Protein name 



NTID 



AAID 



570 



NT 



AA 



Length Length 




Score Probability 



57 



Locus Name 



Acc# 



Description 



NO -HIT 



223 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



215040b ci 308 



!T7T~ 



SUIT 



i.ie-20 



Protein name 



Locus Name 



UDP-glucose-4-epimerase/aTDP-giucose-4, b 



gp:AP04tJ74y 



Acc# 



AF048749 



Description 



Bacteroides fragi iis capsular polysaccharide biosynthesis operon, complete 
sequence . 



ORF Name 



NTID 



NT AA 

— — Score Pro bability 
AAID Length Length 



5211475b 11 7 



572 



Protein name 



precorr m- 6 Y methyiase : protein 
S110099 :protein sll0099 



Description 



478 



1437 



353" 



|l.8e-3S 



Locus Name 



Acc# 



S76697 



ORF Name 



Protein name 



NTID 



573 



AAID 



TTWT 



NT 



AA 



Length Length 
— 



Score Probability 



Locus Name 



Acc# 



Description 



ORF Name 



2LM3.15.0....cl...ii4.. 



Protein name 



NTID 



TTT 



AAID 



TT^~ 



— — Score Probability 
Length Length 



TUT 



ITT 



Locus Name 



Acc# 



Description 



224 



NT 



AA 



ORF Name 



2351432 t2 17b 



575 



NT ID AAID Length Length 

mi — 



Score Probability 



5797 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



NT 



AA 



ORF Name 



NT ID AAID Length Length 



Score Probability 



\22.5.6XLL'L..al..AHK 



2A0 



77T 



JUT 



i.4e-33 



Protein name 



Locus Name 



putative glycosyl transterase 



gp:AF07i0at> 



Acc# 



AF071085 



Description 



Snterococcus rae calis strain Ot^lkg polysaccnarzLae mosyntnetic genecluster, 
partial sequence. 



ORF Name 



Protein name 



NT 



AA 



NTID AAID Length Length 



Score Probability 



F7W 



S3" 



BUT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



Description 



NTID AAID 



— — Score Probability 
Length Length 

1230 



Locus Name 



: sp:HXK3_kUMAN 



Acc# 



P52790 



H E XOKINA^E T YPR ill, (UK 111) 



225 



ORF Name 



NT ID 



Protein name 



orrJb 



Description 



NT 



AA 



AAID Length Length 



Score Probability 
0.0054 



Locus Name 



pir:T417^ 



Acc# 



T41782 



ORF Name 



NTID 



NTT AA 

— — Score Probability 
AAID Length Length 



211bA6M....aZ...A5& I pro 



333" 



1320 



ti.8e-ii 



Protein name 



Locus Name 



conserved hypothetical protein yKnz. 



Description 



pir :E6ybb« 



Acc# 



E69858 



ORF Name 



Protein name 



Description 



NT 



AA 



I53T 



NTID AAID Length Length 

mm — 



Score Probability 



133" 



Locus Name 



Acc# 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



SWT 



5804 



Length Length 
TT5 



Score Probability 



153" 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



5HT 



|4.2e-S2 



Locus Name 



Acc# 



sp : THlciJiKJoLl 



226 



NT 



AA 



ORF Name 



NTID 



23957ttili ti 2bV 



AAID Length Length 




TUT 



Score Probability 

o.ooii 



Protein name 



Locus Name 



chaperone GrpE type '1 



gp:AFUy«bifo 



Acc# 



AF098636 



Description 

tficotiana tabacum chaperone GrpE type 2 (GrpE 2 ) mKNA, nuclear 
mitochondrial protein, complete cds . 



geneencoding 



ORF Name 



NTID 



NT AA 
— — , Score 
AAID Length Length 



24023442: cl 311 



BBS" 



Probability 
2 . 9e-60 



Protein name 



Description 



Locus Name 



sp:Vt?AR_HA(JsU 



Acc# 



P96593 



HYPOTHETICAL 4b. 


7 KD PROTEIN 


IN MUTT-GSIB INTUkcilWlU 


REGION 






ORF Name 


NTID 


NT 

AAID Length 


AA 
Length 


Score 


Probability 


lA0AS^b:l..±-2...±±l 


586 


5808 95 


288 


82 




0 . UU18 


Protein name 








Locus Name 




Acc# 


unknown protein 


gp:3CCXV106K 


X95258 


Description 
















S.cerevisiae 10 


6kbp t ragmen t 


trom chromosome 


XV. 










ORF Name 


NTID 


NT 

AAID Length 


AA 
Length 


Score 


Probability 




587 


S§6$ 448 


1347 






l . 6e-8^ 


Protein name 


Locus Name 




ACC# 



Ma+/H+- exchanging protein :Na+/H+ antiporter I |pir : 



JX0360 



Description 



NT 



AA 



ORF Name 



NT ID 



24239006 ti 2b9 



AAIP Length Length 



TuTT" 



Score Probability 
0.0053 



ITU" 



Protein name 



Description 



Locus Name 



gp : EOOttHUUX 



Acc# 



L19083 



Escherichia coli khsfi genetic element; detective RnsE core protein, complete 
cds; complete 0RF-E2; H-rpt subelement; complete ORF-H . 



ORF Name 



124303127 t2 175 



Protein name 



NTID 



AAID 



NT AA 

— — Score Pr obability 
Length Length 



14T 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probabil ity 
Length Length 



2^26.B.l...al..A0A I 



WIT 



l .ie-120 



Protein name 



Description 



Locus Name 



Acc# 



sp : SYFB JiCOLl 



TRNA LIGASE BETA CHAIN) (PHEk^) 



NT 



AA 



ORF Name 



NTID 



AAID 



2M10.7.B.0...±1....7.1 1 [53T 



Length Length 



Score Probability 



T5T 



Protein name 



Locus Name 



Acc# 



Description 



MO-HIT 



228 



ORF Name 



24412512 t2 54 



Protein name 



Description 



NT 



AA 



NT ID 



AAID Length Length 
£73 



Score Probability 



Locus Name 



Acc# 



1N0-H1T 



ORF Name 



24SM:/.li..±A..^UU.. 



Protein name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



hypotnetical protein MTH6 71 



Description 



72T" 



1.3e-25 



Locus Name 



[pxr:D691yy 



Acc# 



D69189 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



li.7e-30 



Protein name 



Locus Name 



Acc# 



sp:YLYi^_BAC^U 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



1806 



Score Probability 
2.5e-61 



Protein name 



Locus Name 



precomn-3 metnylase 



pir :A64497 



ACC# 



A64497 



Description 



229 



NT 



AA 



ORF Name 



NTID 



124642311 c2 ±92 



AAID Length Length 



Score Probability 
|2.ie-i8 



TZ1 



Protein name 



Locus Name 



unknown 



gp:AF04«74y 



Acc# 



AF048749 



Description 

kacteroides Iragilis capsular polysaccharide biosyntnesis operon, complete 
sequence . 



ORF Name 



24f54§«3 cl 322 



Protein name 



— — , Score Probability 
NTID AAID Length Length 



F5T" 



T3T 



411 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



NT AA n „ , , , , . . 
— — Score Probability 
AAID Length Length 



coJoalamm .biosyntnesis protein N 



Description 



l32£ I m$T 



Locus Name 



pir:C6904^ 



■1.8e-ll5 



Acc# 



C69048 



ORF Name 



NTID 



NT AA 

— — , Score Probab ility 
AAID Length Length 



2sai)Ab.:/.b....ci...4a.2 


559 


5821 


173 


522 


106 















Protein name 



Locus Name 



hypotnetical protein AF0456 



Description 



3.Ie-05 



Acc# 



H69306 



230 



ORF Name 



NT ID 



NT AA 

— — , Score Proba bility 
AAID Length Length 



Protein name 



EOT" 



Locus Name 



Acc# 



Description 



[NO-MIT 



ORF Name 



NT ID 



AAID 



NT AA 

— — Score Pr obability 
Length Length 



Protein name 



1512 



Locus Name 



|3.7e-67 



Acc# 



proJoaJDle membrane protein 60 847 



Description 



pir:G54821> 



G64822 



ORF Name 



NT ID 



NT 



AA 



AAID Length Length 



Score Probability 



2&£.M5.1il...cL..3.i4 1 



Protein name 



T7W 



2.5e-i96 



Locus Name 



Acc# 



Description 



sp:LEPAJiA(j£W 



P37949 



<3 , T&-fifltt>llSf<3 PROTam L ill PA 



ORF Name 



NT ID 



NT 



AA 



AAID Length Length 



Score Probability 



Protein name 



717 



5T5" 



l.ie-49 



Locus Name 



Acc# 



MPT-syntnase sulturylase 



Description 



bp : SYMdrfOKB 



Y16560 



Synechococcus PCC7 942 moeB gene. 



231 



NT 



AA 



ORF Name 



NT ID 



AAID 



Length Length 
ITS" 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



TTTT 



Score Probability 
|3.4e-85 



Protein name 



Description 



Locus Name 



gp:AF0^^6 



Acc# 



AF025396 



Vibrio anguiiiarum rtb region, partial sequence. 



NT 



AA 



ORF Name 



NTID 



AAID 



3.0.S22M6...±2...12.Z.. 



Length Length 



7WT 



Score Probability 
5.3e-07 



Protein name 



Description 



Locus Name 



sp:?Etfl__SA(JsU 



Acc# 



P25053 



REGULATORY PROTEIN MINI 



NT 



ORF Name 



NTID 



AAID 



ll±L5M.b..±'L.h. 



Length Length 



AA 

— , Score 



W2T 



Probability 
0.0018 



Protein name 



Locus Name 



hypotnetical protein MTH6 70 



pTrTCFSTST" 



Acc# 



C69189 



Description 



232 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



332S9500 c2 ^ 



TUT 



i.7e-142 



Protein name 



Locus Name 



glucose- 1-pnosphate tnymiayi transterase 



gp:AP04a749 



Acc# 



AF048749 



Description 

Bacteroides tragiiis capsular polysaccharide mosyntnesis operon, complete 
sequence . 



ORF Name 



NTID 



Protein name 



RNA methyitransterase homolog yetA 



Description 



NT 



AA 



AAID Length Length 
T3T7 



478 



Score Probability 
i.ie-69 



7TT7 



Locus Name 



ipir:E69793 



Acc# 



E69793 



ORF Name 



Protein name 



NTID 



MD.M.6.^..±1...9. I FTS 



NT AA 

— — Score Probability 
AAID Length Length 



Locus Name 



Acc# 



Description 





ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


telllttL^al^AQl 


6ii 


5S33 


314 945 


223 


|4.7e-i« 



Protein name 



Locus Name 



ADP-L-glycero-D-manno-heptose-6-epimerase ~j jpjr :G70TTu" 



Acc# 
G70330 



Description 



233 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 
EST 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 





ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


lllBA6.1±.±±J±lb. 


613 




205 


618 


219 


±Ae-2& 



Protein name 



Locus Name 



sp7Tm F ET3YfTYT" 



Acc# 



P72965 



Description 

^OPHOSfrHORVlASfi) (TMg-PPAfciS) (MIaM^ E- PHOSPHATE SlfNTkAsjE) 





ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


3L£2iAaa£...c2...asii 


614 


SS36 


297 


894 


182 


6.8e-l3 



Protein name 



Locus Name 



glucosyl transferase 



|gp:£MUS2844 



ACC# 



U52844 



Description 



Serratia marcescens putative glycosyltransterase , 
putativeglycosyltransf erase, putative heptosyllll transferase 
(waaQ) , 3-deoxy-manno-octulosonic acid transferase (waaA) , 
glucosyltransf erase (waaE) , and KdtB (kdtB) genes, complete cds ; and 
Fpg(fpg) gene, partial cds. _ . — 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


telA&A&&...szl...Z&t 


615 




5S37 


119 


360 


93 


0.00025 



Protein name 



Locus Name 



unknown 



Acc# 



AF007381 



Description 



Plavobacterium jonnsoniae gnamg motility protein iglOAj gene , complete 
cds ; and unknown genes . 



234 



ORF Name 



NT ID 



— — , Score Probab ility 
AAID Length Length 



I35T" 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
TUT 



Score 



357 



Probability 
|2.fle-07 



Protein name 



Description 



Locus Name 



gp : STYSTMt'i 



Acc# 



AF170176 



Salmonella typmmunum tragment STMFl . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
[5MT5 — 



Score 



Probability 
4.0e-2§l 



Protein name 



Description 



Locus Name 



|sp:£6t)l£JJLOSY 



Acc# 



P22983 



DIKINA^) 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 

ron — 



Score 



11415 



Probability 
9.3e-60 



Protein name 



Locus Name 



precorrin-3 metnylase 



gp:BMAJ75& 



Acc# 



AJ000758 



Description 

Bacillus megaterium I6k£> genomic sequence, cobaiamm operon. 



235 



ORF Name 



NTID 



KIT AA 

— — Score P robability 
AAID Length Length 



155542 c3 4fci0 



IT55" 



|6.7e-B4 



Protein name 



Locus Name 



dTDP-6-deoxy-D-glucose-i, b epimerase 



bp:AJ?'04tt74y 



Acc# 



AF048749 



Description 



Bacteroides tragilis capsular poiysaccnariae Diosyntnesis operon, complete 
sequence . 



ORF Name 



NTID 



40530 ±2 144 



Protein name 



conserved hypothetical protein 



Description 



NT 



AA 



AAID Length Length 



14u~ 



— ^ Score Probability 
|3.2e-36 



391 



Locus Name 



bir:C7g2S6 



Acc# 



C75256 



ORF Name 



Protein name 



Description 



NTID 



— — Score Probability 
AAID Length Length 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



l^3.M4b.b....ci..Ab.i.. 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



TIT 



TJTT 



flTT 



3.5e-l9 



Protein name 
Description 

nitrckjRN regulation protein MTkb, 



Locus Name 



sp:NTR&_R*iOC 4 A 



Acc# 



P09431 



236 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



4S88828 c3 buv 



35" 



87 



0.0012 



Protein name 



Locus Name 



unknown 



gp:AP007i8l 



Acc# 



AF007381 



Description 



Piavobactenum johnsomae gl iding motility protein igioA) gene, complete 
cds ; and unknown genes . 



ORF Name 



4881412 i± 72 



Protein name 



535 



NT 



AA 



NTID AAID Length Length 

W£2 — 



Score Probability 



T5T 



Locus Name 



Acc# 



Description 



[MO -HIT 



ORF Name 



Protein name 



NTID 



— — Score Probability 



AAID Length Length 



5848 



Tu3" 



0 . 016 



Locus Name 



Na+/H+- exchanging protein siiub89 : wa+/H+ 
antiporter :Na+/H+ antiporter 



E 



ir:S74414 



Acc# 



S74414 



Description 



ORF Name 


NTID AAID 


NT 
Length 


AA 
Length 


Score 


Pr 


obability 


|48.8.2L.7.£B...±i....2Lb.y. 


627 5849 


511 


936 


1UJ 


0.00^0 



Protein name 



Locus Name 



growtn-associatea protein 



gpTYEFGAP" 



Acc# 



L27645 



Description 

Brachydanio rerio growtn-associatea protein, complete cas. 



237 



NT 



AA 



ORF Name 



NT ID 



14884635 c3 487 



AAID Length Length 
TOSS — 



Score Probability 

uoi — 



|1.7e-37 



Protein name 



Locus Name 



unJcnown 



|gp:AF14437$ 



Acc# 



AF144879 



Description 

Leptospira interrogans rtb locus, complete sequence. 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



|4§847l2 c2 401 



15851 





254 


765 




555 



Protein name 



Locus Name 



exodeoxyribonuc lease 



pir:B«l26 



Acc# 



B69126 



Description 



ORF Name 



Protein name 

Description 
iNO-HM 



NTID 



AAID 



NT AA 
_ — J _. _ — Ll Score Probability 
Length Length J ~ 



630 



ITOTO" 



T53" 



TOT" 



Locus Name 



Acc# 



ORF Name 



Protein name 



RcsC 



Description 



NT 



AA 



NTID 



5.QA15J....aZ...3J.3. 



AAID Length Length 
— 



2871 



Score Probability 
TO3 



3.0e-34 



Locus Name 



gp:AF07l2l5 



Acc# 



AF071215 



Proteus mirabilis regulator ot swarming behavior precursor (rsbA) and RcsB 
[rcsB) genes, complete cds; and RcsC (rcsC) gene, partialcds. 



238 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



5854 



73T" 



T773~ 



Protein name 



Locus Name 



carboxynorsperraiame decarboxylase : protein 
S110873 rprotein s!10873 



Description 



bir:a7V2faB 



1.8e-i06 



Acc# 



S77268 



ORF Name 



Protein name 



CbiD protein 



Description 



NT 



AA 



NT ID 



AAID Length Length 



Score Probability 



1573" 



7777" 



7¥¥" 



T777 - 



12. 8e-^ 



Locus Name 



|gp:BMAJ7b8 



Acc# 



AJ000758 



Bacillus megater ium l6kb genomic sequence, comiamm operon. 



ORF Name 



Protein name 



NT ID 



I53T" 



7777 



NT 



AAID Length Length 



AA 

— Score Probability 



77" 



Locus Name 



Acc# 



Description 



ORF Name 



Protein name 



NT ID 



777" 



777T 



hypotnetical protein 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



T77T - 



Locus Name 



pir :£226l4 



1.4e-b^ 



Acc# 



S22614 



239 



ORF Name 



NTID 



— — Score Probability 
AAID Length Length 



7087642 cl iUb 



Protein name 



Description 



Locus Name 



Acc# 



NO -HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
STB 



Score Probability 
8.8e-66 



Protein name 



Description 



Locus Name 



Acc# 



P53579 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



— Score Probability 



5860 



TuTT 



l.le-ll 



Protein name 



Locus Name 



conserved hypothetical protein MThli^bi 



pir :F6y03b 



Acc# 



F69035 



Description 



NT 



AA 



ORF Name 



NTID 



\&&&5±hLjR±zAkiX I 



AAID Length Length 
F^T — 



Score Probability 
(TUB 



10.014 



Protein name 



Locus Name 



IsprYBJZ^ECOLl 



Acc# 



P75831 



Description 

HYPOTHET I CAL AJ^cJ TRANS POk'i 'ER ATP- BINDING PUotein ybjz, 



240 



NT 



AA 



ORF Name 



NTID 



AAID 



11757880 c2 81 



Length Length 



Score Probability 
Tu15 



3.2e-06 



Protein name 



Locus Name 



hypothetical protein PH1670 



Description 



pir :F71047 



Acc# 



F71047 



ORF Name 



Protein name 

Description 
MO-HIT 



NT 



AA 



NTID 



AAID 



Length Length 



Score Probability 



Locus Name 



Acc# 



ORF Name 



14.7.2&413...±1...& 



Protein name 

Description 
NO-HIT 



NT 



AA 



NTID 



AAID 



15854 



Length Length 
^5 



Score Probability 



288 



Locus Name 



Acc# 



ORF Name 



NT 



AA 



NTID 



AAID 



15l8.18.:7.7.7....c3....9.5. 



Length Length 
7JT 



Score Probability 
ITS 



l.le-12 



Protein name 



Locus Name 



serine-ricn protein 



|pir:T2550S 



Acc# 



T39903 



Description 



ORF Name 



Protein name 



SatC 



NTID 



NT AA 

„ , ^ — . — _ Score Probability 
AAID Length Length 



[&4T" 



24u~ 



72T 



1219 



Locus Name 



|gp:AFii£25i 



Description 

Bacteroides rragilis Joatl operon, complete sequence. 



S.$e-l34 



Acc# 



AF116251 



241 



ORF Name 



'209688 ±3 38 



Protein name 
Description 

ffrcmrr 



NT 



AA 



NT ID 



AAID Length Length 
"53 



Score Probability 



'5867 



Locus Name 



Acc# 



ORF Name 



2.2Q2.7...±3....3.1... 



Protein name 
Descri ption 



NTID 



AAID 



NT AA 

— , , — . Score Probability 
Length Length aL - 



3.7e-67 



Locus Name 



sp:GCP_HAEIN 



Acc# 



P43764 



ORF Name 



Protein name 



BatB 



Description 



NTID 



NT AA „ n , , . - . ^ 
— , — , Score Probability 
AAID Length Length aL 



^4T 



5869 



TuT4" 



3.1e-l02 



Locus Name 



IgpiA^ll^Sl 



Acc# 



AF116251 



Bacteroides tragilis bat I operon, complete sequence. 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
TIT 



Score Probability 




3.6e-76 



Protein name 
Description 

CELL MVlSlOtf SftOtBlH FtSY 



Locus Name 



sp : FTSYJIAEIN 



Acc# 



P44870 



242 



NT 



AA 



ORF Name 



NT ID 



AMD 



mrr 



Length Length 



Score Probability 
T5S 



1.2e-ir 



Protein name 



Description 



Locus Name 



sp: Y531__MKTJA 



Acc# 



Q57951 



ORF Name 



NTID 



AAID 



23834376 c2 84 



Protein name 



Hypothetical protein APE1982 



Description 



NT 



AA 



Length Length 
TZT 



Score Probability 
TT1 



|1.0e-07 



Locus Name 



fointmSOO 



Acc# 



H72500 



NT 



AA 



ORF Name 



NTID 



2425.5.7.D.D...±1...5... I I55T 



AAID Length Length 
5873 



Score Probability 
3TT7£ — 



TTTTT 



Protein name 



Locus Name 



BatD 



gp:AFl±S25i 



Acc# 



AF116251 



Description 



Bacteroides tragilis bat I operon, complete sequence. 



ORF Name 



NTID 



NT AA 

^ ^ — Ll — L7 Score Probability 
AAID Length Length JL 



2L^D..7.5.3.7....f2....2Ll I (£52 



51T7T" 



1ST 



TTT 



1.5e-13 



Protein name 



Locus Name 



rxbosomal protein L28 



pir:E64104 



Acc# 



E64104 



Descri ption 



243 



ORF Name 



24415903 £1 7 







NT 


AA 


NTID 


AAID 


Length 


Length 


653 


5875 


279 


§40 



Score Probability 

irssi — 



|4.0e-141 



Protein name 



Locus Name 



BatE 



gp:AF11625i 



Acc# 



AF116251 



Description 

Bacteroides Iragilis batl operon, complete sequence. 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length ^ 



124823552 tl 2 



TUT 



2.0e-ll 



Protein name 
Description 

CHLOROPLAST BOS RIBOSOMAL PROTEIN L33 



Locus Name 



sp:te33_ODOSl 



Acc# 



P49565 



NT 



AA 



ORF Name 



NTID 



AAID 







Length Length 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



|2.7.2l212..±1...3.1 



AAID Length Length 
— 



Score Probability 
2TJI 



|1.6e-i2 



Protein name 



Locus Name 



antigen 3 32 



pir : JN0292 



Acc# 



JN0292 



Description 



244 



NT 



AA 



ORF Name 



NTID 



3145055 ci 40 



AAID Length Length 
— 



5879 



Score Probability 

wjm — 



1770" 



Protein name 



Locus Name 



DNA gyrase A subumt 



gp:AB017712 



Acc# 



AB017712 



Description 



Bacteroides tragilis gyrA gene tor DNA gyrase A subunit, completecds . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
— 



75T 



Score Probability 
H555 



!2.$e-176 



Protein name 



Locus Name 



BatA 



bp:AFll«5l 



Acc# 



AF116251 



Description 



Bacteroides tragilis bat I operon, complete sequence. 



NT 



AA 



ORF Name 



NTID AAID Length Length 

— 



F7TT 



Score Probability 
&m 



5.7e-36 



Protein name 



Locus Name 



conserved Hypothetical protein BB0175 



pir:G70il>i 



Acc# 



G70121 



Description 



NT 



AA 



ORF Name 



3A2L6.6.9.6.a...£3..„3.Z.. 



NTID AAID Length Length 




Score Probability 
532 



il.3e-$4 



Protein name 



Locus Name 



conserved Hypothetical protein aq_84 9 



k>ir:E70373 



Acc# 



E70373 



Description 



245 



NT 



AA 



ORF Name 



NTID 



134554375 ti 1 



AAID Length Length 
— 



Score Probability 

wi — 



l.ie-58 



Protein name 



Locus Name 



hypothetical protein 



pir :S76561 



Acc# 



S76561 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length ^~ 



5884 



TUUT 



WIT 



S.Se-82 



Protein name 



Locus Name 



probable moxR protein 



pir:B70874 



Acc# 



B70874 



Description 



NT 



AA 



ORF Name 



NTID 



4.9.6.15a.7..„G3....a7. 



AAID Length Length 

^wz — 



1513" 



Score Probability 
6.9e-0$ 



164 



Protein name 



Locus Name 



conserved hypothetical protein aq_854 



pir;B70374 



Acc# 



B70374 



Description 



ORF Name 



Protein name 



NTID 



AAID 



Description 
DNA-BIMDING PROTEIN HU 



NT AA „ . ■ _ « 
— , — , Score Probability 
Length Length * £ - 



Locus Name 



sp:DBH__THEMA 



|4.2e-ll 



Acc# 



P36206 



246 



ORF Name 



NT ID 



NT AA 
T — _ — ^. Score Probability 
AAID Length Length JL 



7072675 13 36 



Protein name 



1.4e-67 



Locus Name 



Acc# 



BatB 



Description 



|gp:AFll<^l 



AF116251 



Bacteroides tragilis batl operon, complete sequence. 



ORF Name 



Protein name 



NT ID 



AAID 



5888 



NT 



AA 



Length Length 
7T" 



2TT 



Score Probability 



Locus Name 



Acc# 



Description 

mrmr 



ORF Name 



NT ID 



NT AA 
— — T — ^, Score Probability 
AAID Length Length ^ 



XlXl$.6&l.±l..&l I WTT 



Protein name 



lipase-like protein 



Description 



BT5" 



1551 



3T5~ 



Locus Name 



pir : A64706 



Acc# 



A64706 



ORF Name 



NTID 



12£2flHfi2L±3i...fi2 1 1555 



Protein name 



AAID 



5890 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 
NO-HIT 



ORF Name 



NTID 



AAID 



iaai45flfl...ci-.iafi i 



5831 



Protein name 



hypothetical protein BB053 0 



Description 



NT AA „ , i_ * i • *_ 
— ^ T — Ll Score Probability 
Length Length aL 



2.7e-16 



Locus Name 



pir:A70l66 



Acc# 



A70166 



247 



ORF Name 



17086686 t3 $S 



Protein name 



NT ID 



AAID 



NT 



AA 



Length Length 

mi — 



Score Probability 



Locus Name 



Acc# 



Description 
MO-HIT 



ORF Name 



NT ID 



AAID 



&£.5.8L£3.£.3....£l...ia | F7T 



Protein name 



hypothetical protein jhpl3 80 



Description 



NT AA 

— , — , Score Probability 
Length Length 



11242 



2^" 



1.8e-16 



Locus Name 



pir:(371815 



Acc# 



G71815 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



2±$±B5.2..&2..±±6. | WTZ 



|2.7e-05 



Protein name 



Locus Name 



cytochrome Jd 



gp:AF017516 



Acc# 



AF017516 



Description 



Bombus pascuorum cytochrome b (cytb) gene, mitochondrial geneencoding 
mitochondrial protein, partial cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
FTJIT 



Score Probability 
II . 8e-106 



Protein name 



Locus Name 



Acc# 



Description 
COBVRIC ACID SYNTHASE 



spiCBlf^SALfY 1 



Q05597 



248 



ORF Name 



NT ID 



i2 71 



Protein name 



hypothetical protein jhpl3 79 



Description 



NT 



AA 



AAID Length Length 

ssss — 



Score Probability 



Locus Name 



pir:F71815 



\2.9e-21 



Acc# 



F71815 



NT 



AA 



ORF Name 



243Ll.7.aafiL...tZ...5L3. I 



NT ID AAID 
pT7 



Length Length 



Score Probability 
B3S 



Protein name 



Locus Name 



nicotinate -nucleotide- -drmethylbenz imidazole 
phosphoribosyl transferase 



brr:A75577 



Description 



B.5e-4i 



Acc# 



A75577 



NT 



AA 



ORF Name 



NTID 



AAID 



2^3A5.16.7....tl..A.. 



Length Length 
F2I 



Score Probability 
ITS 



Protein name 



Locus Name 



cobinamide kinase / cobinamide phosphate 
guanylyl transferase 



pir :S52220 



Description 



Acc# 



NT 



AA 



ORF Name 



NTID 



24417.212...ca.„iai ...J FT7 



AAID Length Length 
— 



1W 



Score Probability 
1775 



5.7e-126 



Protein name 



Locus Name 



proline --tRNA ligase, pros : prolyl -tRNA 
synthetase : prolyl -tRNA synthetase 



pir :A70150 



Acc# 



A70150 



Description 



249 



NT 



AA 



ORF Name 



NT ID 



AAID 



24541303 c2 166 



Length Length 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



NT 



AA 



ORF Name 



NT ID 



AAID 



2Afc.6.kuU5...c2...15.6. 1 



Length Length 



Score Probability 



1335" 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NT ID 



2&8.22.2.13....L2L...6.& 



AAID Length Length 
— 



TUUT 



Score Probability 
|3.8e-6S 



Protein name 



Locus Name 



immunoreactive 36 JcDa antigen PG14 



lgp:AFI457S8 



Acc# 



AF145798 



Description 



Porphyromonas gingivaiis strain W50 immunoreactive 36 kDa antigenPG14 gene, 
complete cds . 



NT 



AA 



ORF Name 



24.8.22L6.8.8....12...6.6. 



rax 



NT ID AAID Length Length 

S3U3 — 



4X1 



Score Probability 
|7.3e-07 



114 



Protein name 



Locus Name 



hypothetical protein 



bir:S76776 



Acc# 



S76776 



Description 



250 



m 



NT 



AA 



ORF Name 



NTID 



25401437 ci 115 



AAID Length Length 

— 



Score Probability 

\rm — 



|2.0e-i3 



Protein name 



Description 



Locus Name 



sp:YJJP_HAEIN 



Acc# 



P44520 



HYPOTHETICAL PROTEIN SlOlOfi 



NT 



AA 



ORF Name 



NTID 



AAID 



c2 145 



Length Length 



MIT 



Score Probability 




5.3e-20 



Protein name 



Description 



Locus Name 



sp:YJJP_ECOLI 



Acc# 



P39402 



HYPOTHETICAL 30.5 KD PROTEIN IN DNAT-BGLJ INTERGENIC REGION (F277) 



NT 



AA 



ORF Name 



NTID 



3.16.5„7.US.Q...c2....1ii I [EM 



AAID Length Length 
^ 



T7¥T 



Score Probability 
TTUE — 



5.5e-112 



Protein name 



Locus Name 



Acc# 



sp:YIDE_ECOLI 



Description 

HYPOTHETICAL 58 . $ KI) PROTEltf IS GLVC-ISPB INTERGEN1C REGION (0R2A) 



NT 



AA 



ORF Name 



NTID 



3.2.2L2La^2....CJL...17.2L 



AAID Length Length 




Score Probability 
^22 



1.2e-24 



Protein name 



Locus Name 



conserved hypothetical protein yvqK 



pir :D70046 



Acc# 



D70046 



Description 



251 



ORF Name 



NT ID 



^64^11 rl 5 



Protein name 



probable phosphoglycerate mutase 



Description 



NT 



AA 



AAID Length Length 

E^rrs — 



7ST 



Score Probability 
T5S 



|3.7e-ii 



Locus Name 



pir :B75539 



ACC# 



B75539 



ORF Name 



NT ID 



AAID 



NT AA , , 

Length Length Probability 



Mi3.5.aaa...c3....iao. i ff7 



Protein name 
Description 

CUBD PROTEIN 



5909 



|1.3e-46 



Locus Name 



sp : COBD_PSEDE 



ACC# 
P21634 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID 



Length Length 



Score Probability 
31 



0.0025 



Locus Name 



be t a - 1 r opomyo s m 



pir :S23470 



Acc# 



S23470 



Description 



ORF Name 



NTID 



AAID 



NT AA 

t — t — ^ Score 
Length Length 



Protein name 



Locus Name 



tricorn protease 



gp:TAU7i*B50 



Description 



Probability 
I.3e-8S 



Acc# 



U72850 



Tnermoplasma acidophilum GTP-£>incting protein and tricorn protease (Tkl) 
genes, complete cds . 



ORF Name 



NT ID 



_. — _ — ^, Score Probability 
AAID Length Length x 



14807062 ci 115 



IS912 



Protein name 



coJoyrmic acid a,c-diamide synthase 



Description 



TTTT 



Locus Name 



pir :A75619 



\2.1e-64 



Acc# 



A75619 



NT 



AA 



ORF Name 



NTID 



AAID Length Length Probability 
5913 



OS" 



1.3e-37 



Protein name 



Locus Name 



two component sensor 



lgp:AF030352 



Acc# 



AF030352 



Description 



Pseudomonas aeruginosa two component sensor ( lemA) gene , partialcds . 



NT 



AA 



ORF Name 



NTID 



!*116.b.aA...cl...lIl ...I [532 



AAID Length Length 




TZT 



WTTT 



Score Probability 

— 



3 . 8e-26 



Protein name 



Locus Name 



CobD 



gp:STU90625 



Acc# 



U90625 



Description 



salmonella typmmurium alpha- noazoie- 5 1 -phosphate phospatase CobC(cobC) 
gene, partial cds and putative aminotransferase CobD (cobD)gene / complete 
cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



Score Probability 



JIT 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



253 



ORF Name 



5131255 12 54 



Protein name 



NT ID 



coJoalamin synthase 



Description 



NT 



AA 



AAID Length Length 




75T 



Score Probability 
£FB 



3.5e-23 



Locus Name 



pir:H75576 



Acc# 



H75576 



ORF Name 



Protein name 



NT ID 



NT 



AA 



AAID Length Length 

mm — 



Score Probability 



ST 



Locus Name 



Acc# 



Description 
[NO-HIT 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length J - 



lQDA21Z6....t3....11Z 



T7F" 



TUT 



|4.3e-09 



Protein name 



Locus Name 



hypothetical protein 



|gp:SSU18530 



Acc# 



Y18930 



Description 



Sulfolobus soitataricus 281 kb genomic DNA fragment, strain P2 , 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



TUT 



Protein name 

Description 
PTPUTT 



Locus Name 



Acc# 



254 



ORF Name 



Protein name 



NT ID 



AAID 



NT 



AA 



Length Length 
■53 



Score Probability 



Locus Name 



Acc# 



Description 



INO-HIT 



ORF Name 



13.S9.5.0..7....G2....2.8.2.. 



Protein name 



Description 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



^7T 



T3W 



TU7T 



i.7e-108 



Locus Name 



Acc# 



sp:HI^X_k!CoLl 



HISTIMnOl D^HYDkOGfitfASfi, (MDri) 



ORF Name 



Protexn name 



NTID 



AAID 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



INC) -HIT 



ORF Name 



NTID 



m5££i£..±a...iia t [tttt 



Protein name 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



£1T 



Locus Name 



Acc# 



Description 



NO -HIT 



255 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length J ~ 



1WT 



5924 



I5ii 



Protein name 



Locus Name 



histidme Kinase 



gp:AFI1444^ 



Acc# 



AF114442 



Description 



Nostoc punctirorme nistidine kinase (hepK) gene, complete cds. 



ORF Name 



14£30063 c3 ^56 



Protein name 



Description 



NTID 



TuT" 



NT AA n , , ■ _ i 
— — Score Probability 
AAID Length Length d ~ 



1170 



4 . 9e-88 



Locus Name 



sp:fliS7JlAEl*I 



Acc# 



P44327 



NT 



AA 



ORF Name 



NTID 



r7uT" 



AAID Length Length 




Score Probability 
T7T7 — 



l.ie-282 



Protein name 



Locus Name 



B12- dependent 



|gp:ECOUW89 



Acc# 



U00006 



Description 



E. coli cnromosomal region rrom 89.2 to y^.8 minutes. 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



TTTT 



2.§e-l2$ 



Protein name 
Description 

T RANSCRIPTION TERMINATION FACTOR RHO 



Locus Name 



sp:RHO_PSEPL 



Acc# 



P52155 



256 



OPT? Mamp NTID AAID 


NT 
Length 


AA 
Length 


Score 


Probability 




380 1143 


113 


0.0067 


Protein name 




Locus Name 


Acc# 






gp:PFMAL3M 




Description 












Plasmodium talciparum MAL3P2, complete 


sequence . 




















NT 
Length 


AA 
Length 


Score 


Probability 


l$$5452_c3_340 707 552$ 


133 402 




0.0025 


Protein name 




Locus Name 


Acc# 






gp : SVCPtTRl 1 


L36958 


Description 












Synechocystis sp. (clone pSYN4ll) giycmamiae riDonucieociaecransionnyiase 
(purT) , Orfl34 and dnaA genes, complete cds , photosystem II reaction center 
protein D2 (psbD) gene, 5' end. 














ORF Name NTID AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


2LttiiacLt2L„±a.„isi 70§ 5550 


642 1929 


1069 


l.le-123 


Protein name 




Locus Name 


Acc# 


nypotneticai protein Rv2438c 


pxr :D70680 


D70680 



Description 



ORF Name 



NTID 



NT AA 

— — Score Pro bability 
AAID Length Length 



5931 



F7TT 



TUT 



Protein name 



Locus Name 



0.0091 



Acc# 



nypotneticai protein 



|gp:Ai^l09l 



Description 



AF021091 



Helicobacter pylori hypothetical protein (HP03 95) , nypotheticalprotem 
(HP0394) , chemotaxis protein CheV (cheV) , bifunctionalchemotaxis protein 
CheF (cheF) , chemotaxis protein CheW (cheW) , andadhes in -thiol peroxidase 
TagD (tagD) genes, complete cds; andsuperoxide dismutase SodB (sodB) gene, 
partial cds . 



257 



ORF Name 



NTID 



NT AA 

— — Score Proba bility 
AAID Length Length 



21612M> ti IbU 



Protein name 



7Tu" 



T5T 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



7TX" 



AAID 



^TI- 



NT 



AA 



Length Length 



— Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



Protein name 



TTT 



Locus Name 



1.5e-13 



Acc# 



Description 



sp : PLEC__OAU(Jk 



P37894 



NON-MOT I L E AND PHAS E -RE ST ANCE PkOTKlU, 



ORF Name 



NTID 



— — Score Prob ability 
AAID Length Length 



\12$£216:±...g2..:Xx& I 1713" 



15533" 



Protein name 



Locus Name 



Acc# 



Description 



MO-HIT 



258 



ORF Name 



23477187 tl b4 



Protein name 



BrkB 



Description 



NTID 



7TT 



NT AA 

— — , Score Prob ability 
AA1D Length Length 



TFT 



398 



5.9e-37 



Locus Name 



bir:I40^8 



Acc# 



140328 



ORF Name 



Protein name 



NTID 



ITT 



NT 



AA 



AAID Length Length 

m — 



Score Probability 



W5~ 



Locus Name 



Acc# 



Description 



NO -HIT 



ORF Name 



Protein name 



NTID 



7TT 



AAID 



NT AA 

— — Score Probability 
Length Length 



Locus Name 



Acc# 



Description 



INO-HIT 



ORF Name 



Protein name 



NTID 



[7TT 



AAID 



T9JT 



NT AA 

— — Score Pr obability 
Length Length 

TO 



FT 



0.025 



Locus Name 



Acc# 



hypothetical protein PH0161 



Description 



tpir:G7l23y 



G71237 



ORF Name 



NTID 



\2&25.$A21...c.l...l&l. I ITTF 



Protein name 



MT AA 

— — Score Proba bility 
AAID Length Length 



15940 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NT ID 



— — Score Probability 
AAID Length Length 



124335943 ci 23b 



ITT 



Protein name 



Locus Name 



conserved hypothetical protein mth««4 



Description 



ie.0e-12 



Acc# 



B69218 



ORF Name 



Protein name 



NTID 



NT 



AAID Length Length 



AA 

— Score Probability 





5942 


93 


2§2 



Locus Name 



Acc# 



Description 



K0-H1T 



ORF Name 



Protein name 



NTID 



AAID 



TIT 



NT 



AA 



Length Length 

wn — 



Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



|24S.410.S.0....cl...2.i4.. 



Protein name 



NTID 



AAID 



TIT 



NT 



AA 



— — Sc ore Probability 

Length Length 

m — 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



723 



5945 



NT 



— — Score Probability 
Length Length 

TU£"S 



Locus Name 



Acc# 



Description 



NO-HIT 



260 



NT 



AA 



ORF Name 



NT ID 



AAID 



2464853*$ ci 2ii 



Length Length 
[TST 



Score Probability 

mi — 



|7.4e-^0 



Protein name 



Description 



Locus Name 



sp:V746_Mt!TJA 



Acc# 
Q58156 



HYPOTHETICAL PRO'l'KI N MJU746 



ORF Name 



24650^02 rl lb 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 
TT71 



— Score Probability 



72T 



Locus Name 



Acc# 



Description 



KfO-HlT 



ORF Name 



Protein name 



NT 



NTID 



AAID Length Length 



AA 

— Score Probability 



1.0e-144 



Locus Name 



sodium/prolme symporter (proline permease) I ipir :C6yiib 



Acc# 



C69115 



Description 



ORF Name 



NTID 



AAID 



NT AA 
— — , Sc ore 
Length Length 



V7TT 



TZUT 



Protein name 



Description 



Locus Name 



Probability 



Acc# 



IMO-ttll 4 



261 



ORF Name 



NT ID 



NT — Score Probability 



AAID Length Length 



24806567 cl 22l 



F7TT 



i.2e-21 



Protein name 



Description 



Locus Name 



sp:D3BD_HAKlJsJ 



Acc# 



P44919 



ORF Name 



.24833386 c2 321 



Protein name 



NT ID 



AAID 



— — Score Probability 
Length Length 

T&n — 



34T 



Locus Name 



Acc# 



Description 



ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


SL5.M.7.iJ.7...±i....lb.y. 


730 


5552 


258 777 


85 


0.0093 



Protein name 



Locus Name 



0RF12ii hypothetical protein 



E 



p:A?00821U 



Acc# 



AF008210 



Description 



Buchnera aphidicola genomi c fragment containing icnaperone HspbO) groKL, 
biosynthesis initiating protein (dnaA) , ATP operon (atpCDGAHFEB) , and 
putative chromosome replication protein (gidA) genes, complete cds ; and 
termination factor Rho (rho) gene, partialcds. 



ORF Name 


NTID AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


2S.B53A2^al^i'±± 


731 5953 


64 


195 


92 


" 0.00016 



Protein name 



Locus Name 



hypothetical protein ssrivbb 



tainS'M'm 



Acc# 



S74779 



Description 



ORF Name 



NTID 



NT AA 
_ _ — _ — Score Pro bability 
AAID Length Length JL 



26220277 cl 252 



TST 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA 
*r — ^ — ^, Score Probability 
AAID Length Length JL 



ITIJ2B" 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA 
T — _ T — _ Score Probability 
AAID Length Length 



1734 



5956 116 


351 




263 





i.2e-22 



Protein name 



Locus Name 



sp:YHAI_ECOLI 



Acc# 



P42622 



Description 

HYPOTHETICAL 12 . K KD PROTEIN IN EXUR-TDOC INTERGENIC REGION 



NT 



AA 



ORF Name 



NTID 



AAID 



Iifi^lfil2...cl..i2i I 1775" 



Length Length 



Score Probability 



l4.6e-57 



Protein name 
Description 

PHOSPHATE TRANSAMINASE ) 



Locus Name 



Sp:HIS8_CANMA 



Acc# 



P56099 



263 



ORF Name 



2S2838V cl 2 lb 



Protein name 



NTID 



NT 



AA 



AAID Length Length 
553 



Score Probability 



Locus Name 



Acc# 



Description 



IN0-H1Y 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probabil itv 


mi4D.a.U...cI...2Al 


737 


5959 


724 2175 141/ 


6.1e-14b 


Protein name 










Locus Name 


Acc# 












sp:DCP_EC0LI 




Description 














frfifrTrDVL-DlPEMlDASfl £>CP, 


(DlPEPTIDYL CARBOXY PEPTIDASE) 




ORF Name 


NTID 


AAID 




NT 
Length 


AA 

— Score 
Length 


Probability 


3.&^6j5^..±1...4u 


73§ 


5960 




245 738 |b/i 


2.7e-55 


Protein name 










Locus Name 


Acc# 


uridine Kinase uak 


pir:dby728 


G69728 


Description 


ORF Name 


NTID 


AAID 




NT 
Length 


AA 

— „ Score 
Length 


Probability 


llll&kti&^tl^rib. 




15561 


1 


464 1395 4/3 


6.6e-4b 


Protein name 




Locus Name 


Acc# 


unknown 


gp:A^086638 


AF086638 



Description 



Pseudomonas pubida (^umA precu rsor (cumA) and (JumB icumtsj genes, compieue 
cds; and unknown genes. 



264 



ORF Name 



Protein name 



NT ID 



— — Score Probability 



AAID Length Length 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NT ID 



AAID 



yfft 



— — Score Probability 
Length Length 





TIT 



WW 



2.4e-40 



Locus Name 



lsp:YHHW_±i!<JoLl 



Acc# 



P46852 



Description 

HYPOTHETICAL 'lb . 3 KB PftOlfeiiM id flftlTk-CjCaT iN TBkc^NlU region 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


3A^5.a5J...±2L...l2a 


742 


$$64 


493 


1482 


352 





Protein name 



damage -inducible protein pabo^4J 



Description 



Locus Name 



pir:A75l5l 



Acc# 



A75151 



ORF Name 



3.6.D.5.6.S.li)....aL2.b.l.. 



Protein name 



NTID 



NT 



AA 



AAID Length Length 




Score Probability 



Locus Name 



Acc# 



Description 



PtfO-HtT 



ORF Name 



Protein name 



NTID 



— — Score Probability 
AAID Length Length 



muz' 



ITTT 



[TOT 



0.00014 



Locus Name 



hypothetical prote in £V2Rv.C)6 Se2El9.ua 



pir :f^48l9 



Acc# 



T34819 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



3312781 12 9S 



Length Length 
JIT 



TUTT 



Score Probability 
1T7 



1.8e-37 



Protein name 



Locus Name 



Acc# 



hypothetical protein F19D11 . 16 : hypothetical 
protein F14M4 . 29 hypothetical protein F14M4.29 



pir :T02689 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 




1054 t BTBF 



Score Probability 
J7S 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



pir : JC6027 



Description 



4.3e-45 



Acc# 



JC6027 



ORF Name 



NTID 



NT AA 

_ _ _ . _ — — Score Probability 
AAID Length Length s - 



a.aiiaii..±i...iAi I [ttt 



1.4e-202 



Protein name 
Description 

CH0KDR0-6-^Lt'ATASE REGULATORY PROTEIN 



Locus Name 



sp:<MJRjJAOW 



Acc# 



Q02550 



ORF Name 



NTID 



NT AA 

^ T — _ T — _ Score Probability 
AAID Length Length ^ 



IM&5.5.2...cl..il7. I |7OT 



5970 



[T7T 



[2.6e-34 



Protein name 



Locus Name 



sp:Y120_METTH 



Acc# 



026223 



Description 

PttovrlvE tfA£)H DfiHtfDR0(3fiKrASE/lJAf)lt»)H NlT£6RE£>tJOTAS£, 



266 



ORF Name 



NTID 



4022512 cl 209 



Protein name 



NT AA , , . n ■ 
— , — , Score Probability 
AAID Length Length ^ 



ferredoxin {tax- 3) iiomolog 



Description 



2.7e-C^ 



Locus Name 



pir :C69294 



Acc# 



C69294 



NT 



AA 



ORF Name 



NTID 



Ii0£9.m...c3....18.7. I 



AAID Length Length 
5972 



7ST" 



Score Probability 
[TT7 



1.5e-14 



Protein name 



Locus Name 



leader peptidase Lep 



gp:AF188620 



Acc# 



AF188620 



Description 



Bordetella pertussis lep operon, complete sequence. 



NT 



AA 



ORF Name 



NTID 



[7517 



AAID Length Length 
^71 — 



Score Probability 
9.4e-117 



Protein name 



Locus Name 



sp:SR54_BACSU 



Acc# 



P37105 



Description 

SIGNAL RECOGNITION PARTICLE PROTEIN (FIFTY- FOUR H0M0L0C) 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
[S5T 



Score Probability 
ll.le-17 



Protein name 



Locus Name 



hypothetical protein PAB1763 



pir :D7bl37 



Acc# 



D75137 



Description 



267 



ORF Name 



NT ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



14589092 cl 2V2 



75T 



i.2e-20 



Protein name 



Locus Name 



Acc# 



terric uptaKe regulator homolog 



AF095596 



Description 



Staphylococcus aureus strain ISP3 ferric uptake regulator homolog ( furB) 
gene, complete cds. 



ORF Name 



Protein name 



NT ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



1.4e-S7 



Locus Name 



Acc# 



synthase III 



Description 



pir :F70394 



F70394 



ORF Name 



NT ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



Protein name 



7^" 



5977 



TST 



Locus Name 



i.7e-62 



Acc# 



Description 



|sp:HI3iJ5ALTY 



P00499 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



4&7.&5£i..cl...2lfi I 1755 



STTT 



Protein name 



T5T 



4.9e-33 



Locus Name 



Acc# 



Description 



sp : SMPB_BACSU 



032230 



SMALL PROTEIN B HOMOLOG 



268 



ORF Name 



NT ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



4960S12 13 153 



FT7T 



Protein name 



Description 



Locus Name 



sp:THIO_BORBU 



Acc# 



051088 



THIOREDOXIN (TRX) 



ORF Name 



Sl?5§7S c2 320 



Protein name 



NT ID 



AAID 



NT 



AA 



Length Length 
T5T 



Score Probability 



Locus Name 



Acc# 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



SillSSL2L..±i...lSJl I 



Length Length 



Score Probability 
— 



Protein name 



Locus Name 



raw starch digesting amylase precursor 



gp:A?0676S3 



Acc# 



AF067653 



Description 



Cytophaga sp. raw starch digesting amylase precursor, gene , complete cds . 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



|5„7S52.7..,±1..152.. I TTUU 



5982 



1 [¥T7 



|2.0e-20 



Protein name 



Locus Name 



thioredoxm-like protein 



gp:ATAC01071S 



Acc# 



AC010718 



Description 



Arabidopsis thaliana chromosome I BAC F28016 genomic sequence , complete 
sequence . 



269 



NT 



AA 



ORF Name 



NTID 



AAID 



5983 



Length Length 
ITT 



Score Probability 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



i£lA2ai2L.ci.„2£a I P7B3 



Protein name 

Description 
ETCFHTT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



fi25.0.aai...cl...2d.7. I |7CT 



AAID Length Length 



11757 



Score Probability 
ITS 



Protein name 



Locus Name 



conserved Hypothetical protein BB0195 



pir:C70124 



Description 



|4.5e-05 



Acc# 



C70124 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length aL 



S.44213..7....c3....3.£2l I [7S¥ 



Protein name 
Description 

HYPOTHETICAL 11.1 KD PROTlSltf IN RPOX S 1 REGION 



Locus Name 



sp:YE>E>X_STRCO 



1.7e-07 



Acc# 



P37977 



270 



ORF Name 



829S92 c2 503 



Protein name 

Description 
NO-HIT 



NT 



AA 



NTID 



AAID 



Length Length 
7T5 



Score Probability 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



AAID 



15988 



NT 



AA 



Length Length 



Score Probability 



1864 



Locus Name 



Acc# 



Description 
NO-HIT 



ORF Name 



a.7.a3.a.7....ci...2.48.. 



Protein name 

Description 
NO-HIT 



NT 



AA 



NTID 



AAID 



ITT 



'5989 



Length Length 
T3TT 



Score Probability 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



AAID 



hypothetical protein PH16 7 0 



Description 



NT AA 

— , — , Score Probability 
Length Length 



TTT 



7TT 



Locus Name 



pir:P71047 



i.Se-10 



Acc# 



F71047 



ORF Name 



NTID 



11.7.2.1DAD....tl...42. I [757 



Protein name 

Description 
NO-HIT 



AAID 



NT AA , , , , . 
— , , — , Score Probability 
Length Length sC 



7T 



Locus Name 



Acc# 



271 



NT 



AA 



ORF Name 



NT ID 



1256885 t3 183 



77TT 



AAID Length Length 

^wi — 



Score Probability 
7.6e-48 



Protein name 



Locus Name 



Man26A 



:AP1264Vi 



Acc# 



AF126471 



Description 



Cellulomonas t±mi Man26A (man26AJ gene, complete ccis. 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



127732S5 c2 



TTT 



TTT 



5.7e-3$ 



Protein name 



Locus Name 



conserved nypothetical protein 



pir:B723£>l 



Acc# 



B72391 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



imS.S.3.5....al...2il I 1777 



Protein name 



Locus Name 



Acc# 



Aryisuitatase precursor (EC 3.1.6.1; 



gp:D9073I 



Description 



E.coli genomic DNA, Kohara clone #280(33.7-34.1 mm.). 



NT 



AA 



ORF Name 



NTID AAID Length Length 



Score Probability 



!imi43.1...al...£l5u I [777 



1 rros — I \TZT 



1.4e-0S 



Protein name 



Locus Name 



Acc# 



TRK system potassium uptake protein (trJcA) 



gp;U32745 



Description 

Haemophilus mtluenzae Rd section 60 ol 163 ot the complete genome. 



272 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



14651512 ti 5 



ITT 



TWT 



I.2e-77 



Protein name 



Locus Name 



Acc# 



lsp:YASG_B(»LI 



Description 

HYPOTHETICAL SYtat>6R?E& IIS/ PfiRR-ARfltf HiTfifefifilSfrC ftEGtON 



ORF Name 



1472£062 cl 203 



Protein name 



NTID 



AAID 



NT 



Length Length 



AA 

— Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length ^~ 



l££M3.£.l.±3....liu I [TTS 



5998 



BUT 



l. 3e-4l 



Protein name 



Locus Name 



dimethyl amine corrinoicl protein MtbC 



gp:AFl02^23 



Acc# 



AF102623 



Description 



Methanosarcma barken dimethyl amine cornnoid protein MtiDC 

(mtbC) , trimethylamine methyltransf erase MttB (mttB) , trimethylaminecorrinoid 
protein MttC (mttC) , putative transmembrane protein MttP (mttP) , and 
dimethylamine methyltransf erase MtbBl (mtbBl) genes , complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



2.Q3.2.5.25.2....C2...3.DJ. 



VTTT 





5555 




637 


1514 1050 



5.7e-i25 



Protein name 

Description 
NADH-PLASTOQUINONE 0X1D0REDUCTASE CHAIN 5, 



Locus Name 



Acc# 



P31971 



273 



ORF Name 



NT ID 



2117177 12 71 



TIT 



Protein name 



encto-l, 4~beta-mannosidase 



Description 



NT 



AA 



AAID Length Length 



TIT 



Score Probability 
|i.5e-3i 



354 



Locus Name 



pir:D72278 



Acc# 



D72278 



NT 



AA 



ORF Name 



NTID 



2.16.b.^6.i?.a...tl...6. I UT5 



AAID Length Length 
FTOT — 



TT5T 



Score Probability 
|6.0e-iS 



Protein name 



Locus Name 



renin- binding protein-related protein : protein 
slr!975 rprotein slrl975 



lr ;S75649 



Acc# 



S75649 



Description 



ORF Name 



NTID 



iiai5111.±l...Z I 



Protein name 



NT AA 

_ — _ — T — _ Score Probability 
AAID Length Length 



Locus Name 



Acc# 



Description 

HT^TrrT 



ORF Name 



1121t).0.b.l.±1...10. 



Protein name 



Man25A 



NTID 



NT AA n . . • ■ 
— , — , Score Probability 
AAID Length Length 



781 



T7T 



4.5e-44 



Locus Name 



gp:AP12647I 



Acc# 



AF126471 



Description 

Cellulomonas timi Man26A (man26A) gene, complete cds. 



274 



ORF Name 



NTID 



NT AA 
AAID Length Length 



&52550S17 c2 317 



1782 





6004 




398 



[IT5T 



Score Probability 
55 



0.035 



Protein name 



Locus Name 



Acc# 



endo-beta-l , 3-glucanase precursor 



gp:AF013169 



Description 



Pyrococcus turiosus beta-glucosidase tcelB) gene, complete cds;adh-lam 
operon, complete sequence; biotin ligase BirA homolog (birA) gene, complete 
cds; and 2-phosphoglycerate kinase (pgk)gene, partial cds . 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



TUT 



4 . 8e-12 



Protein name 



Locus Name 



conserved hypothetical protein SC9C7.14C 



pir :T3596b 



Acc# 



T35965 



Description 



ORF Name 



NT ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



2.126.0.10.1.±1..±1& I (75¥ 



TTTT 



Protein name 



Locus Name 



conserved hypothetical protein 



pir:B7227S 



Acc# 



B72278 



Description 



ORF Name 



Protein name 



NTID 



AAID 



6007 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 
[NO-HIT 



275 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length. Length aL 



EST 



Protein name 



Locus Name 



NADH dehydrogenase (ubiquinone) , I chain I 
RP795 



pir :h!7164U 



Acc# 



E71640 



Description 



ORF Name 



NT ID 



\2&A92±11..±±...2 



7FT 



Protein name 



probable secreted glucosidase 



Description 



NT 



AA 



AAID Length Length 
— 



1075 I mrz 



Score Probability 
1.4e-07 



TE1 



Locus Name 



E 



ir:T3S164 



Acc# 



T35164 



NT 



AA 



ORF Name 



NT ID 



l&tell&l....cl...ll.& I V7WB 



AAID Length Length 



Score Probability 



TZST 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



i4.6A&o.s.i...ax...20.$. i rm 



AAID Length Length 

CTn — 



Score Probability 
T57 



2.4e-26 



Protein name 



Locus Name 



Acc# 



alpha- 1, 3/4- tucosidase precursor 



gp:SSU39394 



U39394 



Description 

Streptomyces sp. alpha- l , 3/4- tucosidase precursor gene, completecds . 



276 



NT 



AA 



ORF Name 



NT ID 



24645437 c3 384 



AAID Length Length 
— 



SuT" 



2706 



Score Probability 
175 



3.8e-08 



Protein name 



Locus Name 



115K outer membrane protein precursor :SusC 
protein 



pir:JC6u27 



Acc# 



JC6027 



Description 



ORF Name 



NT ID 



2«&aiaa..±i...iaa.. 



Protein name 



probable glycosyl hydrolase 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



7TT 



Locus Name 



bir:T36467 



ll.ie-49 



Acc# 



T36467 



ORF Name 



NTID 



AAID 



NT AA 
— — Score 
Length Length 



iS4aaifia...ci...aia 1 1752 



Probability 
5.7e-55 



Protein name 



Locus Name 



Acc# 



sp:Ntr0H_fiC0Ll 



Description 

OXIDOREDUCTASE CHAIN 8) (NU08) 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



23.8XI6.5.6....C.3....3.6.& I 



W7T 



|4.4e-87 



Protein name 



Locus Name 



Acc# 



sp:TRKH_EC0Ll 



Description 

TftK SYSTEM POTASS luM UPTAKE PROfEltf fftKH 



277 



ORF Name 



26230265 t2 67 



Protein name 

Description 
[NO-HIT 



NT 



AA 



NT ID 



AAID Length Length 
— 



Score Probability 



TOTS 



Locus Name 



Acc# 



ORF Name 



NT 



AA 



NT ID 



26.3.6.Q7.1.7....t3....18.3. I [75"F 



[STTT7 



AAID Length Length 
7TT 



Score Probability 
35? 



i.9e-3i 



Protein name 



Locus Name 



phosphogiycolate phosphatase (gph) homo log 



pir :C70184 



Acc# 



C70184 



Description 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



1W 



719 



Protein name 



Locus Name 



1.9e-73 



Acc# 



NADH dehydrogenase {ubiquinone) , chain 
4.2:protein slrl291 :protein slrl291 



|pir:S74£§7 



Description 



S74687 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length jL 



i&ia4&ii...ci...m i 



6di$ 




531 


l£$£ 




738 





l4.7e-S5 



Protein name 



Locus Name 



Acc# 



NADH dehydrogenase (ubiquinone) , I chain 
nuoD2 



pir :D70413 



D70413 



Description 



278 



NT 



AA 



ORF Name 



NT ID 



26587708 ±2 



AAID Length Length 
— 



TFTT" 



Score Probability 




3.8e-i3 



Protein name 



Locus Name 



unknown 



gp:U5677i 



Acc# 



U96771 



Description 



Prevotella bryantii putative polygalacturonase, B-l, 4 -endoglucanase, and 
mannanase genes , complete cds; and unknowngenes . 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



265^4137 12 75 



16021 



1011 



Protein name 



Locus Name 



methyl cobamxde :CoM metJayltransterase isozyme 



EpTEFuTTTTT 



Acc# 



AF013713 



Description 



Metnanosarcrna JoarJceri methylcobamide : CoM methyl transt erase isozymeA 
(mtbA) , monomethylamine corrinoid protein (mtmC) , 

monomethylaminemethyltransf erase (mtmB) , putative monomethylamine permease 
(mtmP) / and unknown genes, complete cds. 



NT 



AA 



ORF Name 



NTID 



2S.S.M.7.I2....C2....10.3. I 



AAID Length Length 
TUTS 



Score Probability 
775 



1.3e-71 



Protein name 

Description 
NAM-PIASTOQUINONE OXIDOPEDUOTASE CHAIN 2, 



Locus Name 



sp:NU2C_SYNY3 



Acc# 



P72714 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



l$A$15£±...a2...1()A I fSTTT 



1.3e-18 



Protein name 



Locus Name 



sp:NU30_ANTP0 



Acc# 



Q31792 



Description 

NADH-PLASTOOUINONE OXIDOREOTCTASE CHAIN 3, CHL0R0PLA5T, 



279 



NT 



AA 



ORF Name 



NTID 



3177670S c3 371 



AAID Length Length 
— 



Score Probability 
T53 



4.7e-12 



Protein name 



Locus Name 



NADH dehydrogenase (ubiquinone) , I chain nuoB 



pir :C70413 



Acc# 



C70413 



Description 



NT 



AA 



ORF Name 



NTID 



3.25.3.2.8.3.8....£2L...&Ct I 



AAID Length Length 



Score Probability 



5TS" 



Protein name 
Description 

NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



\11B.15.19A..±2..JA 1 [SuT 



NTID AAID Length Length 

16026 



Protein name 



hypothetical protein 



Description 



Score Probability 
S3 



0.045 



Locus Name 



pir:C72397' 



Acc# 



C72397 



ORF Name 



NTID 



AAID 



NT AA , , . , . 
, — ± , — L1 Score Probability 
Length Length i - 



3.6.13.Z6.R6.„.a3....3.6.<L I [3u3 



Protein name 

Description 
BJO-HIT 



114 



345" 



Locus Name 



Acc# 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



3.6.3.6.a26.2....GZ...3.Q5i I VSUE 





6028 




172 


519 




204 





2 . le-16 



Locus Name 



NADH dehydrogenase ( ubiquinone) , I chain J 



pir:C71S3S> 



Acc# 
C71839 



Description 



280 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 

— 



Score Probability 
7 . 6e-4l 



TU5 



Protein name 



Locus Name 



sensory transduction histidme Kinase 
slr2098 :protein slr2098 :protein slr2098 



>ir:S75130 



Description 



Acc# 



S75130 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



Score Probability 
TT5 



|1.4e-28 



Protein name 



Locus Name 



NADH dehydrogenase I , subunit nuoB 



gp : ECNUOO 



Acc# 



X68301 



Description 

E.coli DNA sequence of nuo operon. 



NT 



AA 



ORF Name 



NTID 



AAID 



4145.9.D..7....ci...20.2 1 



Length Length 
TTT7T 



Score Probability 
7T3 



2.4e-8i 



Protein name 



Locus Name 



receptor antigen (Rag A) 



|gp:PGI130872 



Acc# 



AJ130872 



Description 



Porphyromonas gingival is W50 receptor 
immunodominant 55kDa antigen. 



antigen (rag) locus encodings, major 



NT 



AA 



ORF Name 



NTID 



AAID 



STTT 



Length Length 



12085 



Score Probability 
6 . 3e-18 



T7T 



Protein name 



Locus Name 



Sipl protein 



|pir:S27762 



Acc# 



S27762 



Description 



281 



NT 



AA 



ORF Name 



NT ID 



4566876 C2 285 



3TT 



AAID Length Length 



1464 



Score Probability 

m — 



2.5e-3S 



Protein name 



Locus Name 



sp:YIDJ_ECOLI 



Acc# 



P31447 



Description 

riYPOTOtftlCAL 5^.3 Kb P&MfilN ltt ElMRD-flLvG iNtElRflfiNlC kE<31(5N 



NT 



AA 



ORF Name 



NTID 



AAID 



4575513 c3 369 



IBT2" 



— , — , Score Probability 
Length Length < 

T2F 



purr 



Protein name 



Description 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



5llluaSL.c2L.i£i£ I WH 



SUIT 



Length Length 



TUT 



Score Probability 
1231 



2.Se-ld 



Protein name 



Description 



Locus Name 



Acc# 



Q00244 



NADH- PLASTOQUINONE OXIDOREDUCTASE CHAIN 4L, 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



5a&0iii2Jta._iafi i pr^ 



6036 



5TT 



|4.9e-46 



Protein name 



Locus Name 



utilizing regulatory protein tutC 



gp:TTU57900 



Acc# 



U57900 



Description 



Thauera aromatica utilizing regulatory protein tutC (tutC) utilizing 
regulatory protein tutB (tutB) , putative DNA bindingprotein TutBi (tutBl) , 
and putative protein kinase TutCl (tutCl) genes , complete cds . 



282 



NT 



AA 



ORF Name 



NT ID 



6444137 t3 129 



AAID Length Length 
[£TTT7 — 



Score Probability 
T3% 



2.8e-06 



Protein name 



Locus Name 



CmuC protein 



fep:MSMli3i7 



Acc# 



AJ011317 



Description 



Methylobacterium sp. CM4, cobD, metF, cmuB, cmuC, partial cobC anclcobQ, 
genes and genes encoding 0rf219 and 0rf361. 



NT 



AA 



ORF Name 



NTID 



707415S tl 1 



AAID Length Length 



Score Probability 

im: — 



l .4e-14 



Protein name 



Locus Name 



unKnown 



|gp:U9£77l 



Acc# 



U96771 



Description 



Prevotella bryantiz putative polygaiacturonase, B-l, 4-endoglucanase, and 
mannanase genes, complete cds; and unknowngenes . 



NT 



AA 



ORF Name 



TZZDilESZcXTZSIZZZZl [3T7 



NTID AAID Length Length 




Score Probability 
— 



1.6e-139 



Protein name 



Locus Name 



sp:DXS_HAEIN 



Acc# 



P45205 



Description 

1 - DEOXYXYLtrLOSE - 5 - MOS^HATS SYNTHASE (t)X£ SYNfiiASE} 



ORF Name 



NTID 



AAID 



NT AA 
T — L1 T — J _ 1 Score Probability 
Length Length 



SL7.2L167....C1...12SL 



SIT 



3 . Oe-43 



Protein name 



Locus Name 



|sp:EXUT 



Acc# 



P42609 



Description 
HEXURONATE TRANSPORTER 



283 



ORF Name 



NTID 



NT AA 

,, TT ^ T — ^ T — ^, Score Probability 
AAID Length Length 



$954305 rl 3 



3.8e-I45 



Protein name 



Locus Name 



beta-xylo-glucosidase 



|gp:TBZ5S27$ 



Acc# 



Z56279 



Description 



T.brockii cglF, cgiG, xglS and cglT genes. 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 
— 



Score Probability 



Locus Name 



Acc# 



Description 
NO-HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



anasii..±i„fi I 



AAID Length Length 



7T" 



Score Probability 
[53 



0.017 



Locus Name 



■sp:<3p3a_flMWA 



Acc# 



Q95152 



GLYCOPROTEIN 38 PRECURSOR (GP38) (MUCIN-TY^E MEMBRANE PROTEIN GP40) 



ORF Name 



NTID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



lflaiJl.7.3.Qa...tl.„21 1 



T7T 



Protein name 



Locus Name 



Acc# 



Description 
INO-HIT 



284 



ORF Name 



101S9501 t3 191 



Protein name 



NTID 



WIT 



AAID 



6045 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 
ETCPHTT 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



iaai4aaa..±i...4& i 



l.2e-6l 



Protein name 



Locus Name 



Acc# 



IsprDINP ECOLI 



Description 
DNA-DAMAGE- INDUCIBLE PROTEIN P 



ORF Name 



NTID 



AAID 



l0.i4&fiLl£...£l.„a6. I [32^ 



Protein name 



hypothetical protein APE245 7 



Description 



NT AA 

— , — , Score Probability 
Length Length 



ST 



TTT 



TUT 



1.4e-05 



Locus Name 



Acc# 



H72476 



ORF Name 



NTID 



AAID 



Iftim3....Gi...A22 1 [52^ 



Protein name 
Description 

PRE&ROTEIM TEtMSLOCASE SECA OTBUNIT 



NT AA 

— „ — , Score Probability 
Length Length 



1110 I nrm 



'3.2e-184 



Locus Name 



|sp:SE!CA_ftI10CA 



Acc# 



P52966 



285 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 2 ~ 



11147938 ±2 90 



WIT 



Protein name 



115K outer membrane protean precursor : SusC 
protein 



Description 



9.2e-25 



Locus Name 



pir : JC6 027 



Acc# 



JC6027 



NT 



AA 



ORF Name 



NTID 



AAID 



11.7.15.S.7...C1...26.1 1 m$ 



Length Length 



11050 



Score Probability 

mi — 



5.8e-46 



Protein name 



Locus Name 



sp:APBE_HASIN 



Acc# 



P44550 



Description 

THIAMINE BIOSYNTHESIS LIPOPROTEIN APBE PRECURSOR 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length J ~ 



iiaiiia2L..±2...ai I ires 



2^" 



|i.3e-40 



Protein name 



Description 



Locus Name 



sp : STS_RAT 



ACC# 



P15589 



SULFATE SULFOHYDROLASE) (ARYLSULFATASE C) (ASC) 



NT 



AA 



ORF Name 



NTID 



i2aaia2...ca...i£a i 



AAID Length Length 
T52? — 



Score Probability 
T9£ 



9.6e-37 



Protein name 
Description 

(PSEUDOURIDYLATE SYNTHASE) (URACIL HYDROLYASE) 



Locus Name 



sp : RLUAJEC0L1 



Acc# 



P39219 



286 



ORF Name 



128775 tl 51 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 



Score Probability 



5HT" 



Locus Name 



Acc# 



Description 



ORF Name 



NTID 



AAID 



12aa.7.£xfi3L...cl...idl I [537 



6 054 



Protein name 



oxidorecluctase , short chain 
dehydrogenase/reductase family 



Description 



NT AA 

— , — , Score Probability 
Length Length 



507 


S24 




488 





1.7e-46 



Locus Name 



bir:E72427 



Acc# 



E72427 



ORF Name 



Protein name 



NTID 



NT AA 
T — ^ T — _ Score Probability 
AAID Length Length -L 



TTTT 



TOST" 



Locus Name 



2.1e-72 



Acc# 



sp:YFCC_E00Ll 



Description 

fltttfttiEflCAL KD PRMSltf Itf PfA-FOLX lNT?ER<3EKflC REGION 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



Protein name 



Locus Name 



Acc# 



Description 
WO-SM 



287 



NT 



AA 



ORF Name 



NTID 



AAID 



14551502 t3 177 



Length Length 



Score Probability 



Protein name 

Description 
NO -HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



\±±lH19£2...al...ll± I 



Length Length 



Score Probability 



11151 



Protein name 

Description 
(NO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



U7.23.7.£i...c2...15.0. J I3T7 



16059 



i.le-79 



Protein name 



Locus Name 



type III DNA modification enzyme 
(methyl transferase) 



pir:F71810 



Acc# 



F71810 



Description 



NT 



AA 



ORF Name 



14&7.£5.7.B...±I...17. I IBTff 



NTID AAID Length Length 



Score Probability 

cms — 



1.2e-27 



Protein name 



Locus Name 



Acc# 



probable beta-glycosyltransferase trsC 



pir:S51262 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



6061 



TUT 



Protein name 
Description 

urcmrT 



Locus Name 



Acc# 



288 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length ^ 



156642 C3 469 



UT 



TUT 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



lSan2L.„t2...1£i3t I 



AAID Length Length 



Score Probability 
2.$e-2& 



Protein name 



Locus Name 



Acc# 



sp:YDAO_ECOLI 



Description 

HWOTfifi'TlCAL 55.5 KD £>£OTEi™ IN DfiPA-lNTft iNtERflfetflC ftfeGSOtf 



ORF Name 



NTID 



NT AA 
T — ^ — ^ Score Pro babili ty 
AAID Length Length JL 



^4T" 



6064 



Protein name 

Description 
IMO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



l£ft2ft&ii.±l.„£l ...J [MT 



aTTF 



l.6e-90 



Protein name 



Locus Name 



Acc# 



GTP-bxncLLng protein 



gp:AF019407 



AF019407 



Description 

Caulobacter crescentus GTP-binamg protein (cgtA) gene, completecas . 



289 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



16529461 t2 112 



ITT" 



TulT 



3.2e-06 



Protein name 



Locus Name 



Hypothetical protein PH03 6 0 



plr :E71143 



Acc# 



E71143 



Description 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length dL 



Protein name 



hypothetical protein 



Description 



1296 



T72T 



|2.3e-177 



Locus Name 



pir : JQ1020 



Acc# 
JQ102 0 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



6068 



481 



TT45- 



2.6e-l62 



Protein name 



Locus Name 



unknown 



|gp:AF04S?4$ 



Acc# 



AF04 8749 



Description 



Bacteroides tragilis capsular polysaccharide biosynthesis operon, complete 
sequence . 



NT 



AA 



ORF Name 



NTID 



AAID 



— , Score Probability 
Length Length 



|2.0e-29 



Protein name 

Description 
HYPOTHETICAL £>kOT£ltf MTSl§l2 



Locus Name 



|sp:YI12 METTH 



Acc# 
027840 



290 



ORF Name 



NT ID 



NT AA 
T — . -t x — _ Score Probability 
AAID Length Length z - 



15757152 12 119 



848 



Me-111 



Protein name 



Locus Name 



nucleotide sugar epimerase 



gp:AF05S755 



Acc# 



AF059755 



Description 

Vibrio vulniticus nucleotide sugar epimerase gene, complete cds. 



NT 



AA 



ORF Name 



l$$2l§7 13 2l2 



S4^ 



NT ID AAID Length Length 

5u7l — 



355" 



Score Probability 
TW3 



8.7e~l5 



Protein name 



Locus Name 



iumQ protein .-protein slr!213 :protein slr!213 



pir:S77548 



ACC# 



S77548 



Description 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length A ~ 



2D.0.5.D.4Q2..±3....253. I IS5TJ 



TIT 



ll,8e-07 



Protein name 



Locus Name 



phosphopyruvate hydratase 



pir:C75251 



Acc# 



C75251 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



851 




£073 




315 


960 


1557 



2 . 3e-170 



Protein name 



Locus Name 



putatxve UDP-GicNAc :undecaprenylpnosphate 



gp:AF048749 



ACC# 



AF048749 



Description 



Bacteroides Iragilis capsular polysaccharide biosynthesis operon, complete 
sequence . 



291 



NT 



AA 



ORF Name 



2037502 tl 17 



352 



NTID AAID Length Length 

6074 



Protein name 



conserved hypothetical protein 



Description 



77F 



Score Probability 
2^5 



Locus Name 



pir :D72320 



Acc# 



D72320 



NT 



AA 



ORF Name 



NTID 



2.a.73A6.Z5i.„t3....ZQS.... I 1553" 



AAID Length Length 
TIT 



ZTT 



Score Probability 
TZ5 



i.3e-18 



Protein name 



Locus Name 



hypothetical protein 



gp:S5U18«0 



Acc# 



Y18930 



Description 



Suitolobus soltataricus 2 81 kb genomic DNA fragment, strain P2 . 



NT 



AA 



ORF Name 



NTID AAID Length Length 



Score Probability 



|20..7.S.4A27...±!...m ...J [53¥ 



TFT 



TUTT 



TZTT 



5.7e-174 



Protein name 



Locus Name 



UDP-glucose-4-epimerase/ctTDP-glucose-4 / 6 



gp:AF048743 



Acc# 



AF048749 



Description 



Bacteroides tragilis capsular polysaccharide biosynthesis operon, complete 
sequence . 



NT 



AA 



ORF Name 



2L1151D....tl...5i7. I 155? 



NTID AAID Length Length 




I 11029 



Score Probability 
TU1 



2.6e-27 



Protein name 



Locus Name 



Acc# 



activator protein 



gp:AF047527 



AF047527 



Description 

Pseudomonas tluorescens activator protein (mtlR) gene, completecds . 



292 



NT 



AA 



ORF Name 



NTID 



AAID 



21640887 12 117 



Length Length 



Score Probability 
TF7 



1.7e-08 



Protein name 



Locus Name 



Acc# 



hypothetical protein 7.17 



bir:D47677 



Description 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length ^~ 



T5T 



8.2e-7S 



Protein name 



Locus Name 



thiophene and turan oxidation protein 



pir:C70375 



Acc# 
C70375 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 
S45 



Score Probability 
TTIS — 



3.7e-ll5 



Protein name 



Locus Name 



putative methyl transferase 



gp:AF04 8 74 9 



Acc# 



AF048749 



Description 



Bacteroid.es tragilis capsular polysaccharide biosynthesis operon, complete 
sequence . 



ORF Name 



NT AA 

— — , Score Probability 
NTID AAID Length Length JL 



£25Au23..7....r.:U.4 



6081 



522 1569 




304 





i.ie-45 



Protein name 
Description 

SULFATE StJLFOH^DROLASE!) (arYLSDLfataSS C) (aSC) 



Locus Name 



|sp:STS_HUMAH 



Acc# 



P08842 



293 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score 



3T 



1ST" 



Probability 
TTTTJn 



Protein name 



Description 



Locus Name 



sp:3P&C_XENLA 



Acc# 



P36378 



(OSTEONECTIN) (BASEMENT MEMBRANE PROTEIN BM-40) 



ORF Name 



NTID 



Protein name 



phosphopyruvate Hydra t as e 



Description 



NT 



AA 



AAID Length Length 
6083 



Score Probability 
§.7e-0§ 



07 



Locus Name 



toir:C«2Sl 



Acc# 



C75251 



ORF Name 



Protein name 

Description 
INO-HIT 



NT 



AA 



NTID 



AAID Length Length 

— 



Score Probability 



Locus Name 



Acc# 



ORF Name 



Protein name 

Description 
IN0-H1T 



NT 



AA 



NTID 



23L£L427.Sfl„.c2„.aaSt I IffBT 



AAID Length Length 

— 



Score Probability 



ST" 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



TTT 



3.2e-61 



Protein name 



Locus Name 



do licnol -phosphate mannosyl trans t erase 



pir :G70463 



Acc# 



G70463 



Description 



294 



NT 



AA 



ORF Name 



NTID 



24064142 t2 



AAID Length Length 
— 



7W 



Score Probability 




Protein name 



Locus Name 



-hypothetical protein ywnB 



pir :E7Q063 



Accft 



E70063 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



24ilA142...c2...m 



Length Length 
TST 1 [57? 



Score Probability 



Protein name 

Description 
IHO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



\2.lll9£.U...c2...2A0. I 1557 



Length Length 



TTT 



Score Probability 




1.9e-24 



Protein name 



Locus Name 



hypothetical protein yisX 



pir:G69835 



Acc# 



G69838 



Description 



ORF Name 



NTID 



Z424.5.43.Z...C2...&23. | 155^ 



Protein name 

Description 
INO-HIT 



AAID 



6090 



NT AA 

— , — , Score Probability 
Length Length ' L - 



W5T 



Locus Name 



Acc# 



295 



ORF Name 



NT ID 



— — Score Probability 



AAID Length Length 



24257IS7 12 132 



9.0e-il2 



Protein name 



Locus Name 



putative carboxybiotm decarooxylase su&unit 
of 



|gp:MkUB795TT 



Acc# 



U87980 



Description 

Malonomonas rubra putative I S-element gene, partial cas, anamaionace 
decarboxylase gene cluster (madY, madZ, madG, madB, madA,madE, made, madD, 
madH, madK, madF, madL, madM, madN) genes , complete cds . 



ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


2440i507_ci__2yy 


§70 


£092 


510 1533 


2702 


" 4.2e-2Sl 



Protein name 



Locus Name 



unknown 



gp:AP04874y 



Acc# 



AF048749 



Description 

Bacteroides tragilis capsular p olysaccharide mosyntnesis operon, compieue 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


liiiim^ci^ 


871 


6053 


642 


1929 


110 


" 0.0037 



Protein name 



Locus Name 



sp:Y01iWJ'lYL!LE 



Acc# 



Q49757 



Description 
H^OTHKTlCAL 31.1 K D PROTEIN B19i7_F2_iS 



296 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



124473817 12 106 



15094 





431 


1296 


457 





3.3e-43 



Protein name 



Locus Name 



putative hemolysin 



gp:AF0513!56 



Acc# 



AF051356 



Description 



Streptococcus mutans YtqB lytqB) gene, partxal cds; ABC transporter (abcX) , 
putative permease (perM) , putative hemolysin (hlyX) pyruvate -formate lyase 
activating enzyme (pf 1C) , D-alanine-D-alanylcarrier protein ligase (dltA) , 
integral membrane protein (dltB) , D-alanyl carrier protein (dltC) , 
extramembranal protein (dltD) , andputative exopolyphosphatase . (ppxl) genes, 



NT 



AA 



ORF Name 



|2.4.4jaS3il..jc2..J.4S J [273 



NTID AAID Length Length 

wuvs — 



Score Probability 
0.00014 



113 



Protein name 



immunogenic 7 5 JcDa protein PG4 



Locus Name 
|gp:AF145800 



Acc# 



AF145800 



Description 



Porpnyromonas gingival is strain W50 immunogenic 7 5 kDa protein PG4gene, 
complete cds. 



NT 



AA 



ORF Name 



12.4.«13.Q5...±2...1J..6 J [HT4 



NTID AAID Length Length 
I6TT35 1 1153 1 1552 



Score Probability 




l.Oe-SS 



Protein name 



Locus Name 



unknown 



gp:AF048749 



Acc# 



AF048749 



Description 



Bacteroides tragilis capsular polysaccharide biosynthesis operon, complete 
sequence . 



NT 



AA 



ORF Name 



2&hA2J.63...Cl...2.15 



WT5 



NTID AAID Length Length 

— 



Score Probability 
|2.3e-S7 



[6F5 



Protein name 



Locus Name 



|sp:RIBB_E0OLI 



Acc# 



P24199 



Description 

3 / 4-£)lHVDR0^-5-Bai , AJ^0NH 4-MOSPfiMB SYNTHASE (DHBP SV^TKASfi) 



297 



ORF Name 



124651515 12 51 



Protein name 

Description 
MO-HIT 



NT 



AA 



NTID 



AAID 



TT7TT 



Length Length 



Score Probability 



TIT 



Locus Name 



Acc# 



ORF Name 



NT 



AA 



NTID 



AAID 



2.&b.&&D.aD....z3...A3J. I [377 



Length Length 



WIT 



Score Probability 




|4.1e-16 



Protein name 



Locus Name 



probable uridine phosphoryiase APE2105 



Description 



pir:D725i£ 



Acc# 



D72516 



NT 



AA 



ORF Name 



NTID 



AAID 



] 218M6£.1.±1..±9.5. I |57ff 



Protein name 

Description 
NO-HIT 



Length Length 



Score Probability 



TTFT 



Locus Name 



Acc# 



ORF Name 



Protein name 
Description 

NO-HIT 



NTID 



AAID 



NT AA o ^ _ _._ . 
„ — , — _ Score Probability 
Length Length Jl ~ 



15101 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length i - 



a£mm„.ca„.45a I [smr 



TUT 



1.4e-17 



Protein name 



Locus Name 



hypothetical protein S111671 



bir:S74655 



Acc# 



S74655 



Description 



298 



NT 



AA 



ORF Name 



NTID 



26261313 tl 58 



AAID Length Length 

mm — 



Score Probability 



T5T 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



Iifi444&a2...ci„.141 1 1555 



Length Length 



Score Probability 
T5S5 — 



4.6e-163 



Protein name 



Description 



Locus Name 



sp:ENO_J!TAAU 



Acc# 



069174 



GLYCEikAffi MYDfeO-LYASE) {LAMlKfM BlJtfMttG £>feOTfil]Sf} 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



55T 



T5TT 



P4T 



Protein name 



Locus Name 



putative hypoxanthme guanine 



gp:AF048749 



ACC# 



AF048749 



Description 



Bacteroides Iragilis capsular polysaccharide biosynthesis operon, complete 
sequence . 



NT 



AA 



ORF Name 



NTID 



AAID 



Z533Ab.8.b....C2....3Ab. I 155? 



Length Length 



Score Probability 



WTT 



Protein name 

Description 
NO -HIT 



Locus Name 



Acc# 



299 



ORF Name 



26595257 i'i 180 





885 




6107 




175 


528 



Protein name 

Description 
[NO-HIT 



Locus Name 



Acc# 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length aL 



2.b&6AD.X2....a2.„A0A„ I 



1068 



TIT" 



0.0032 



Protein name 



Locus Name 



gamma response I protein 



:ATH13i708 



Acc# 



AJ131708 



Description 
Arabidopsis thaliana gr I gene, exons 1-3. 



ORF Name 



NTID 



NT AA 
T — _ . — Score Probability 
AAID Length Length i - 



l&3£±0&±..al...l6± I IB57 



1017 I (31352 



l. oe-H9 



Protein name 



Locus Name 



restriction endonuciease 



jgp 



Acc# 



AF060119 



Description 



Pasteurella haemolytica methyltranslerase (mod) and restrictionendonuclease 
(res) genes, complete cds. 



NT 



AA 



ORF Name 



Z6.8.3.6.6.8.Q....£2L...ia2 1 IffTO 



NTID AAID Length Length 

prro — 



Score Probability 
— 



8.9e-i2i 



Protein name 



Locus Name 



immunoreactive 4 7 kd antigen PG12 0 



gp:AF144640 



Acc# 



AF144640 



Description 



Porphyromonas gingival is strain W50 immunoreactive 4 7 JcD antigenPG12 0 gene, 
complete cds . 



300 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



275125 12 ill 



TUT 



|2.7e-06 



Protein name 



Locus Name 



hypothetical protein Kv233ic 



|pir:F7070b 



Acc# 



F70705 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



T7TT 



3TT 



Protein name 



Description 



Locus Name 



sp:YFIH_HAElN 



Acc# 



P44552 



HYPO T HETICAL PROTE IN HI 01 7 b 



NT 



ORF Name 



NTID 



AAID Length Length 



AA 

— Score Probability 



15115 



7.9e-07 



Protein name 



Locus Name 



HADH dehydrogenase mciquinone) , cnain z 



pir:Tllily 



Acc# 



T11319 



Description 



ORF Name 



£S.MJ.lb.b...±^ll&.. 



Protein name 



Description 



NTID 



NT 



AA 



AAID Length Length 
TT5I — 



Score Probability 
6.9e-:i4 



7£3 



Locus Name 



sp : C&teAJsACM 



Acc# 



P19579 



301 



ORF Name 


NT ID 


±±t\XD 


NT 

Lon rri~ Vi 


AA 
TiPncrth 


Score 


Probability 


30084588_12_jL27 


893 


5115 


50 


183 






Protein name 








Locus 


Name 


Acc# 


Description 














NO-HIT | 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


l&ll±b.l...al..£12 


894 


5116 J 


524 


|±57£> 


572 


2.1e-5b 


Protein name 








Locus 


Name 


ACC# 



alkaline pnospnatase 



gp:SSPl>HuA2 



Description 

Synechococcus iP<J<J7 S42 phoV gene tor alKaime pnospnatase. 



ORF Name 



NTID 



— — Score Probability 
AAID Length Length 



imiaaii...ci...i^. 



5¥T 



15 .Se-li> 



Protein name 



Locus Name 



DMA polymerase III, aipna suDumt 



pir :C72360 



Acc# 



C72360 



Description 









NT 


— , Score 


Probability 


ORF Name 


NTID 


AAID 


Length 


Length 






3.2.40.6.:/.b...±3..-i8.2 


855 


5118 


135 


408 lib 


7.2e-07 


Protein name 








Locus Name 




Acc# 



protein-export membrane protein 



bir:E!7l8i7 



Description 



ORF Name 



NTID 



Protein name 



NT 



AA 



AAID Length Length 
— 



Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



302 



ORF Name 



NT ID 



AAID 



33357811 ti 45 



3W 



Protein name 



ftistidine Kinase sensor protein 



Description 



NT AA 

— . „. — , Score Probability 
Length Length d ~ 



TT5" 



Locus Name 



pir:D70328 



0. 00042 



ACC# 



D70328 



ORF Name 



NT ID 



AAID 



NT AA 

— ^ — ^ Score Probability 
Length Length ai - 



3.3A8.aOAl...tZ...lia.. 



5W 



Protein name 



Locus Name 



|sp:TPMN_XEWLA 



Acc# 



Q01174 



Description 
I TROPOMYOSIN ALPHA CHAIN, N0N MUSCLE 



NT 



AA 



ORF Name 



NTID 



AAID 



3.3.s.aiaa.7...x3...xaz i 



Length Length 
151 



252 



Score Probability 
0.042 



Protein name 

Description 
HYPOTHETICAL PROTEIN HI 104 5 



Locus Name 



Acc# 



sp:YA49_HAEIN 



NT 



AA 



ORF Name 



NTID 



3A18.9.3.8.5....11...3.a 



AAID Length Length 
PT21 — 



irr 



Score Probability 
[514 



3.0e-49 



Protein name 



Locus Name 



gp:BCY1113S 



Acc# 



Y11138 



Description 

B.cereus DNA tor ORF1, ORF2 and ORF3 (24 02 Jop) 



303 



NT 



AA 



ORF Name 



NT ID 



AMD 



154407193 tl 47 



Length Length 



Score Probability 



Protein name 
Description 

no-hit 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
[TOT 



Score Probability 
T53 



1.4e-14 



Protein name 



Locus Name 



glycosyl translerase PAB0772 



pir :B75096 



Acc# 



B75096 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
£3 



Score Probability 



T5T 



Protein name 

Description 
IWO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 

wm — 



Score Probability 



Protein name 

Description 
IWO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



MLS.2.1&1..±2...±Q.& I 



AAID Length Length 



TFZW 



Score Probability 
57u 



|S.0e-51 



Protein name 



Locus Name 



pyrrdoxai pnospnate JDiosyntnetic protein PcbcA 



pir :H70373 



Acc# 



H70373 



Description 



304 



ORF Name 



346664W ti 224 



Protein name 



NT ID 



NT 



AA 



AAID Length Length 
— 



Score Probability 



TTT 



Locus Name 



Acc# 



Description 



NO-HIT | 


ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 
ry 1 ^ is 


3.6AlA£.i:/..±3....M9. 




6130 


123 


372 


221 


13 . 3e- io 


Protein name 


Locus Name 


ACC# 


hypotneticai protein 


pir :H7fe47i 


H75473 


Description 
















ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


3.5.^0.0.^11^3.^. .. 




6131 


440 


1323 


1205 


l.Se-122 


Protein name 










Locus Name 


Acc# 


putative tJDP-glucose deiaydrogenase 


gp:AF1^9428 


AF159428 


Description 




Surkhoideria pseudomallei putative UDP-giucose aenydrogenase (udg> , putative 
ADP-heptose synthase (waaE) , and putativeADP-glycero-mannoheptose epimerase 
(gmhD) genes, complete cds . 




ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


3.9.miR^c^3.3.4 


910 


6132 


699 


2100 


3614 


0.0 


Protein name 


Locus Name 


ACC# 



receptor 



gp:AP04S74y 



AF048749 



Description 

Bacberoides tragii is capsular poiysaccnariae mosyntnesis operon, complete 
sequence . 



305 



ORF Name 



NT ID 



39S0900 ti Ism 



SIT" 



Protein name 



probable galactosyitransterase rrsu 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



11179 



T7T 



|1.3e-31 



Locus Name 



Acc# 



bir:^Sl26i 



ORF Name 



im6.5..7.^:/....ci...^y... 



Protein name 



NTID 



NT 



AA 



AAID Length Length 
733 



Score Probability 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



\±0.$A5.±l...a±..A±b. 



NTID 



— — Score Probability 
AAID Length Length 



3TT 



|i.8e-i06 



Protein name 



Locus Name 



sp : YQFA__BACSU 



Acc# 
P54466 



Description 



ORF Name 



4IIS.&aflL..±a...l.7.5 



Protein name 



NTID 



NT 



AA 



AAID Length Length 
7TS 



Score Probability 



Locus Name 



Acc# 



Description 



INO-MIT 



ORF Name 



Protein name 



NTID 



AAID 



wrrr 



hypothetical protein jnpl456 



Description 



— — Score Probability 
Length Length 



ITT 



198 



i.2e-15 



Locus Name 



pir:C7l^C>6 



Acc# 



C71806 



306 



ORF Name 


NT ID 


A 7\ TPl 

AA1JJ 


NT 


AA 


Score 


Probability 


4147280_c2_374 


516 


6138 


67 


204 






Protein name 








Locus 


Name 


Acc# 


Description 














NO-HIT | 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


45l5.u1u..±3....19A 


517 


6135 


385 


1158 


642 





Protein name 



Locus Name 



WbpU 



|gp :AF0 JS9TT 



ACC# 



AF035937 



Description 



Pseudomonas aeruginosa str ain tAl'S OS RpsA (rpsA) gene, partiaicds ; 
Ihf-Beta, Wzz (wzz), and Wzx (wzx) genes, complete cds; andwbp gene cluster 
for 0-antigen biosynthesis, complete sequence. 



ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


4mi^.±i....lfo.Z 


918 


6140 


473 


1422 


143 


1.6e-0S 



Protein name 



Locus Name 



unknown 



|gp;U$677r 



Acc# 



U96771 



Description 



Prevoteila bryan tii putative polygalacturonase, b- 1 , 4-enaoglucanase , ana 
mannanase genes, complete cds; and unknowngenes . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 

— 



Score Probability 



Protein name 



Locus Name 



Acc# 



Description 



[NO-HIT 



307 



ORF Name 



NT ID 



— — Score Probability 
AAID Length Length 



14331300 c2 jyu 



WW 



l.le-Ob 



Protein name 



Locus Name 



hypothetical protein 



|gp 



:S3U1^30 



Acc# 



Y18930 



Description 

Sultolobus soltatancus 281 Kn genomic una tragment, strain P2 . 



ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


440$462_c2__403 


521 


5143 


506 




1521 


531 


| 1.2e-61 



Protein name 



Locus Name 



conserved hypothetical protein aq^iibb 



pir:P704l8 



Acc# 



F70418 



Description 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


±6.SA±b^.±^.±&l 


$22 


5144 


717 


2154 


122 


0.0007^ 



Protein name 



Locus Name 



putative pept idyl -prolyl cis»trans isomerase I i gp . ASAJ^iib 



Acc# 
AJ002316 



Description 



ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


m5.2L5.:L±2...iu4 


923 


5145 


427 1284 


S$ 


0.011 



Protein name 



Locus Name 



membrane protein 



gp:PPUVl824b 



Acc# 



Y18245 



Description 



Pseudomonas putida todX, tod F, todcJl, tocttja, todB, toaA, codb, 
todl, todH, todS, todT genes. 



todE, todci, 



308 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



4S04632 c3 476 



f!rnr 



TTTF 



|2.3e-liJ 



Protein name 



Locus Name 



unknown 



bp:AP04aV4y 



Acc# 



AF048749 



Description 



Sacteroides iragilis capsu iar polysaccnaricte siosyntnesis oper on, complete 
sequence . 



ORF Name 


NTID 


NT 

AAID Length 


AA 
Length 


Score 


Probability 


5llO700J:l_35 


925 


5147 485 |1458 


780 


1.9e-r/ 


Protein name 








Locus Name 


Acc# 


O-antigen repeat unit transporter wzx 


gp:A?172324 


AF172324 


Description 




Escherichia coir GalF IgalF) gene, partial cds ; o-antigen repeatunit 
transporter Wzx (wzx) , WbnA (wbnA) , O-antigen polymerase Wzy(wzy) , WbnB 
(wbnB) , WbnC (wbnC) , WbnD (wbnD) , WbnE (wbnE) , UDP-Glc-4-epimerase GalE 
(galE) , 6-phosphogluconate dehydrogenaseGnd (gnd) , UDP-Glc- 6 -dehydrogenase 
Ucrd (ucrd) . and WbnF (wbnF)qenes, complete cds; and chain lenqth determinant 




ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


&2.7&1&1..±2...12A 




£143 


192 |579 


335 


3 .4e~4ti 


Protein name 








Locus Name 


Acc# 










gp:A£0l7bO8 


AB017508 


Description 














Bacillus halodurans C-I2b genomic una, 


32 kb iragment, compietecas . | 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


saiiass^j-^A 




5145 


157 474 


514 


7.£e-60 


Protein name 








Locus Name 


Acc# 


unknown 


|gp:AF04*W4y 


AF048749 



Description 

Sacteroides trag ilis capsular polysaccnaride siosyntnesis operon, complete 
sequence . 



309 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



WIS 



Protein name 



sensory transduction mstiaine Kinase 
slr2098 rprotein slr2098 :protein slr2098 



Description 



Locus Name 



birrSVbiJU 



6.4e-45 



Acc# 



S75130 



ORF Name 



Protein name 



NTID 



— — Score Probability 
AAID Length Length 



5151 



OS- 



LO cus Name 



Acc# 



Description 



NO -HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID 



Length Length 
— 



Score Probability 
6.5e-^ 



Locus Name 



putative alpna-glucosidase 



gp:AAC 4 2b2lbl 



Acc# 



AJ252161 



Description 



Alicyclobacillus acidocald arius maltose/maltoctextrine transportgene region 
(malEFGR genes, cdaA gene and glcA gene) . 



ORF Name 



NTID 



— — Score Probability 
AAID Length Length 



11050 



|I.4e-SJ 



Protein name 



Locus Name 



spiYtfdBJikJoLl 



Acc# 



P36979 



Description 

HYPOTHETICAL 43.1 KB l>kOTKiN IN MM-ciCP E IMTJJkcjjaTic kwivn 



310 



ORF Name 



Protein name 



— — Score Probability 
NT ID AAID Length Length 



6154 



75" 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



6.s.5Ar/.fc.z...c:i..A:/.u.. 



Protein name 



— — Score Probability 
NTID AAID Length Length 



6155 



2UT 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



NT 



AA 



53T 



NTID AAID Length Length 





Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID AAID Length Length 

rai 



Score Probability 



[5T5T 



Locus Name 



Acc# 



Description 



ORF Name 



Protein name 



— — Score Probability 
NTID AAID Length Length 



535" 



FITS 1 153 I fl^ 



Locus Name 



Acc# 



Description 



311 



NT 



AA 



ORF Name 



NT ID 



$813 c3 47:1 



AAID Length Length 

nm — 



Score Probability 



55" 



Protein name 



Description 



Locus Name 



Acc# 



[NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



16160 



Length Length 
— 



Score Probability 
5.7e-55 



SOT 



Protein name 



Locus Name 



conserved Hypothetical protein yjcg* 



pir :D6yybb 



Acc# 



D69856 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



TTTT 



7T 



Protein name 



Locus Name 



unknown 



gp:U^6V71 



Acc# 



U96771 



Description 



frrevoteiia bryan tii putative polygalacturonase, B-l, 4-enaoglucanase, ana 
mannanase genes, complete cds; and unknowngenes . 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Pr 


obability 


±0:th:±lbA±2...±'2. 


940 


6162 


131 


356 


224 


i.6e-18 



Protein name 



Locus Name 



TgA Pc receptor-like protein A428L 



bir:T17931 



ACC# 



T17931 



Description 



312 



ORF Name 



Protein name 



NT ID 



NT 



AA 



AAID Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NT ID 



AAID 



"NTT AA 

— — Score Probability 
Length Length 



ITT 



|2.2e-06 



Protein name 



Description 



Locus Name 



sprVlk^JitlkliW 



Acc# 



P13225 



VlkULflNChi KKCUbON T ftANSCRlFriONAL ACTIVATOR 



ORF Name 



NT ID 



— — Score Probability 
AAID Length Length 



Protein name 



TTT 



l.5e-l8 



Locus Name 



Acc# 



hypothetical protein Fi4Fy.b 



Description 



pTrTTTTTTl" 



T33774 



ORF Name 



Protein name 



NTID 



hypothetical protein 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



1296 



frrzr 



2.3e-177 



Locus Name 



pir: JQ1020 



Acc# 
JQ1020 



313 



NT 



AA 



ORF Name 



NT ID 



|225S012a rl 11 



AAID Length Length 
3T~ 



252 



— ^ Score Probability 

o.o3i 



Protein name 



Locus Name 



Acc# 



P36378 



Description 

(OSrtiO^'riN) (ON) (BASEMENT ME MBRANE PRUTfllN feM-40) 



NT 



AA 



ORF Name 



NT ID 



23463691 Ci 111 



AAID Length Length 
F4u 



Score Probability 



T7T 



Protein name 



Description 



Locus Name 



Acc# 



PSTO-HIT 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 




Score 


Probability 


115.X$.16.±..±1...$.2 


$4 7 














Protein name 








Locus 


Name 


Acc# 


Description 
















NO-HIT 1 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


2Liaa^Q2...ci...a2 


|$4$ 


6170 


711 


2136 




"48$ 


§ . £e-44 


Protein name 








Locus 


Name 


Acc# 



receptor antigen (RagA) 



Description 



Porphyromonas gmg ivalis W50 receptor antigen (rag; locus encodinga major 
immunodominant 55kDa antigen. 



314 



ORF Name 



24640675 c2 99 



Protein name 



NTID 



NT AA 
_ — ^, T — Sc ore Probability 
AMD Length Length z - 



ZT7T 



Locus Name 



Acctt 



Description 
[NO-HIT 



ORF Name 



NTID 



i^4au&3Lua.„t2...AQ I 



Protein name 



NT 



AA 



tvtvxt^ r — Score Probability 
AAID Length Length ^ 



Locus Name 



Acc# 



Description 

FrcmTT 



ORF Name 



NTID 



\1520£Ml...cL..8.1... I [TOT 



Protein name 



NT AA n ^ , , , _ , t 
T — ^, _ — Score Probability 
AAID Length Length JL 



FT7T 



receptor antigen (RagA) 



Description 



Locus Name 



gp:F>GT130S72 



2.3e-17 



Acc# 



AJ130872 



Porphyromonas gingivalis W50 receptor antigen { rag) locus encodinga ma^or 
immunodominant 55kDa antigen. 



ORF Name 



NT AA 

^ mTT . -r — ^, — ^ Score Probability 
NTID AAID Length Length JL 



2L63L6QfiL3L&...tl...a I 



6174 



TOT" 



6 .ue-l7 



Protein name 



Locus Name 



Acc# 



gp:AHU56832 



U56832 



Description 



Aeromonas nycLrophila FK5 06 binding protein (tkpA) gene, completecds m 3 . 9 
kb fragment. 



315 



NT 



AA 



ORF Name 



NT ID 



281331^ ci 11U 



VST 



AAID Length Length 
— 



Score Probability 
1.3e-48 



Protein name 



Description 



Locus Name 



[sp:YHAM_E(JUljl ' 



Acc# 



P42626 



HYPOTHETICAL 19.4 KD frROTKlM iM fiXUk-'l'D CC iNTbikGtEtil 0 REGION (F188) 



ORF Name 


NT ID AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


341?67V_cl_8S 


$54 6176 


354 


1065 


171 


3 -3e-12 



Protein name 



Locus Name 



KIAAU879 protein 



gp :ABU2Ubyb 



ACC# 



AB020686 



Description 



Homo sapiens mRNA tor KlAAQBvy protein, complete cas. 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



— Score Probability 



3..7.3.2bi..±l...lU.. 



ZTTT 



FT" 



5T" 



0.020 



Protein name 



Description 



Locus Name 



gp : AFSCK 



Acc# 



X70080 



A. tranciscana Scr gene (homologue ot urosopmia Sex comns reduced) . 



ORF Name 



' — — Score Probability 
NTID AAID Length Length 



Protein name 



Description 



Locus Name 



Acc# 



316 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



4742&i^ ri b4 



6179 



T75~ 



TITT 



3W 



7.2e~bO 



Protein name 



Locus Name 



hypotneticai protein 



|gp:M i mj27i5" 



Acc# 



AJ132745 



Description 

Arabidopsis thaliana hypothetical protein, clone EMuya^y . 



ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


48045S2_c2_l(>3 




£lS0 


452 |l35S 


155 


2.6e-ll | 



Protein name 



Locus Name 



Acc# 



putative outer membrane porm 



Description 

Vibrio cholerae glutamyl tkNA synthet ase (glbX) gene, partial cas /putative 
outer membrane porin (ompA) , unknown protein, vibriobactinreceptor precursor 
(viuA) , and ViuB protein (viuB) genes, completecds; and VibF (vibF) gene, 
partial cds . 



ORF Name 


NTID AAID 


NT AA 
Length Length 


Score 


Probability 


&&15m.ix...a±..M. 


555 6181 


153 582 


ISO 


|7.4e-14 



Protein name 



Locus Name 



RNA polymerase sigma tactor sigz-iiKe protein |gp : AFi37263 
Description 



Acc# 



AF137263 



tucose 



Bacteroides thet aiotaomicron nbosomal protein S16-liKeprotem, 

gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds . 



ORF Name 


NTID AAID 


NT AA 
Length Length 


Score 


Probability 


±&A10±2^a±^.b. 


$60 6182 


577 1131 




1.4e-19 


Protein name 




Locus Name 


Acc# 






|gp:AF0^^424 


AF083424 


Description 










Atelme Herpesvirus 


3 complete genome. 









317 



ORF Name NT ID 


AAJLD 


NT 




AA 
Length 


Score 


Probability 


535IJ>07_t2_3^ 961 


6183 


378 


J-.L-J / 






Protein name 








Locus Name 


Acc# 


Description 












■ 1 


NO-HIT 












1 


ORF Name jniijj 




NT 
Length 


AA 
Length 


Score 


Probability 


saaim...ci...iCL7. 962 


5184 


352 


1059 


147 


1 . 3e-07 


Protein name 








Locus Name 


Acc# 


transmembrane sensor 


gp:AFC 


51691 


AF051691 


Description 




frseudomonas aeruginosa stress factor A (pstAj , ecf sigma tactor itiulj , 
transmembrane sensor (fiuR) , and hydroxamate-typef errisiderophore receptor 
(fiuA) genes, complete cds . 




ORF Name NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


aitti&£-.c:i...iaa 963 i 


618$ 


824 


2475 




1.2e-l3 


Protein name 


Locus Name 


Acc# 


serine/ threonine protein Kinase related 




pir:H690b4 


H69064 


protein 














Description 














ORF Name NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


l&£A0Ai....c£...l3.9. 964 


6l86 




^94 


125 


0.0003O 


Protein name 








Locus Name 


Acc# 


115K outer membrane protein precursor: 


SusC 




pir: JC6027 


JC6027 


protein 















Description 



318 



ORF Name 



10742^ cl 106 



Protein name 



NT ID 



965 



NT 



AA 



AAID Length Length 
ffSB 



Score Probability 



35" 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



Locus Name 



sp:YFIC_BA<JtJU 



Acc# 



P54719 



HtffrOMmcjAL ABC 4 ^KA^SfrOR'rKR AT^-felNDlNCj PRCyi'EItJ 2 Iti ^LVB(J 3 ' REGION 



ORF Name 



Protein name 



NTID AAID Length Length 



— — Score Probability 



ST" 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



— — Score Probability 
AAID Length Length 



ABC transporter, ATP -binding protein 



Description 



11545 



8 . le-127 



Locus Name 



bir:E7^yb 



Acc# 



E72396 



319 



ORF Name 



NT ID 



— — Score Probability 
AAID Length Length 



cJ 144 



WITT 



ll.Oe-bb 



Protein name 



Locus Name 



Isp : SBCbJWotJA 



Acc# 
068033 



Description 

EXONttCLEASS SBCD HOmOLOG 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



— Score Probability 



24651^7 c2 138 



1143 



Protein name 



Locus Name 



tibronecton type III 



|gp : HtfMFHJA" 



Acc# 



M12549 



Description 



Human tibronecti n gene type III homology unit corresponding to 
thecell -binding domain, exons 6 and 7. 



NT 



ORF Name 



NTID 



AAID Length Length 



AA 

— Score Probability 



2i££Auub....cJ....lib... 



TTT 



12531 



i.5e-^4 



Protein name 



Locus Name 



probable exonuciease, 



pir :T034bi> 



Acc# 



T03465 



Description 



ORF Name 



Protein name 



NTID 



TTT 



16134 



NT 



AA 



AAID Length Length 



Score Probability 



98 



TTT 



Locus Name 



Acc# 



Description 



[NO-HIT 



320 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


273442_c2_ilV 


973 




363 


1092 180 


2 . 7e-ll 


Protein name 


Locus Name 


Acc# 


cation ettiux system 


(CZCB- 


like) 






Jpir:C7041b 


C70415 


Description 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


23.47.11^.^1^6. 


574 


6196 


19b 






Protein name 










Locus Name 


Acc# 


Description 














NO-HIT | 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— Score 
Length 


Probability 


iaMttiLi^ta^a 


97S 


6197 


345 


1038 


-| 2.4e-24 


Protein name 


Locus Name 


Acc# 



hypothetical protein TM16 93 



pir:G72223 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
TT73 



33TT 



Score Probability 
|4.2e-27 



JUT 



Protein name 



Locus Name 



probable phospnoesterase, yKun; 



bir:B6986b 



Acc# 



B69865 



Description 



321 



ORF Name 



NTID 



— — Score P robability 
AAID Length Length 



I34173431_tl_b 



mr 



EOT 



IIST 



|3.6e-14 



Protein name 



Locus Name 



Acc# 



"SlgJT 



bp:AFllbiJ4 



Description 



Pseudomonas tluore scens PpsA (ppsA) gene, partial cas; Estx (est*; ,Menu 
(menG), CmaX (cmaX) , CrfX (crfX) , CmpX (cmpX) , SigX (sigX) ,OprF (oprF) , and 
CobA (cobA) genes, complete cds; and unknown gene. 



ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


3466l30l_cl_l02 


978 


6200 


1083 


3252 | 


3^4 


g.3e-53 



Protein name 



Locus Name 



acrirlavine resistance protein iacrB) nomoiog j BTrTDTUTTT 



Acc# 



D70117 



Description 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


3.9.3.S.21b....a2...iia 


313 


5201 


550 


1653 


384 


7.4e-33 



Protein name 



Locus Name 



cation ettlux (AcrB/Acru/Acrf tamiiyj 



pir:P70368 



Acc# 



F70368 



Description 



NT 



ORF Name 



NTID 



±l$A6A2.±±..:.l± 



380 



AAID Length Length 



AA 

— Score Probability 



T5T 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



322 



ORF Name 



NT ID 



AAID 



NT AA 
— , — , Score 
Length Length 



4805286 cl 99 



SIT 



Probability 
|2.7e-50 



Protein name 



Locus Name 



acritlavine resistance protein (acrB) homo log 



pir :D70117 



Acc# 



D70117 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID 



5iasm...ai...iai i 



Length Length 
TUT 



TT5T 



Score Probability 
TTT5 



0.0047 



Protein name 

Description 
HYPOTHETICAL PROTEIN HI1340 



Locus Name 



sp:YD40_HAEIN 



Acc# 



P44165 



NT 



AA 



ORF Name 



NT ID 



6&5.3.A3.S....C1...115. I m? 



AAID Length Length 

rzus — 



Score Probability 



Protein name 

Description 
I&O-HIT 



Locus Name 



Acc# 



ORF Name 



NT ID 



AAID 



NT AA 

— L1 ^ — ^ Score Probability 
Length Length 



I1U233.8L7..7....C.2...25.U 1 J5tt 



1ST 



EuTJT 



|i.8e-207 



Protein name 



Locus Name 



putative epimerase/ dehydratase 



gp:AFl2Sl£4 



Acc# 



AF125164 



Description 



Bacteroid.es tragilis 638R polysaccharide B (PS B2) biosynthesrslocus , 
complete sequence; and unknown genes. 



323 



ORF Name 



Protein name 



NTID 



NT 



AA 



AAID Length Length 
£uT 



Score Probability 



Locus Name 



Acc# 



Description 



[NO- HIT 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



T5T 



5 . Oe-23 



Protein name 



Locus Name 



hypothetical protein RV2731 



|pir:B70bO6 



Acc# 



B70506 



Description 



ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


lCk&LUM±±^JliX 


587 


6209 


113 342 


125 





Protein name 



Locus Name 



Acc# 



HipA protein. 



gp:DyOVy4 



Description 

fi.coli genomic DdA , Kohara clone #^03 (34 . i -34 . 6 mm.) 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



10.^.Mi:Z...aA...2.y.3.. 



ITT" 



tutt 



1742 



|2.2e-17^ 



Protein name 



Locus Name 



Acc# 



putative epimerase/aenyctratase 



IgpTAFT^TST" 



AF125164 



Description 

Bacteroides iragilis fejtik polysaccharide (PS B'A) rnosyntnesisiocus , 
complete sequence; and unknown genes. 



324 



NT 



AA 



ORF Name 



NTID 



11023432 Cl 20b 



555" 



AAID Length Length 
1215 — 



\£7TT 



[1T1" 



Score Probability 
|2.7e-21b 



Protein name 



Locus Name 



Acc# 



putative glycosyltranslerase 



|gp:AF12bi64 



AF125164 



Description 

feacteroides rragiiis p olysaccharide B (PS B2) mosyntnesislocus, 

complete sequence; and unknown genes. 



ORF Name 



1168551 ti 61 



Protein name 



NTID 



— — Score Probability 
AAID Length Length 



550" 



Locus Name 



Acc# 



Description 



NO-HIT | 


ORF Name NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


l.2xBj&±l..±l...xi.l yyi 




162 489 


53 


■| 0.031 


Protein name 


Locus Name 


Acc# 


cell cycle progression restoration a 


protein 


gp:AP011794 


AF011794 


Description 




Homo sapiens cell cycle progression 


restoration 8 pre 


tern [ u fk. o ; m±<±\iii , 




complete cds . 














ORF Name NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


llS20.6AB..„cl..2&a 992 


6214 


61 lob 






Protein name 






Locus Name 


Acc# 



Description 



(NO-HIT 



NT 



AA 



ORF Name 



13804187 tl 47 



NT ID AAID Length Length 





SB" 



Score Probability 
10.0018 



Protein name 



Locus Name 



nypotneticai protein 



|gp:MTH^4ibbb 



Acc# 



AJ243656 



Description 



Methanobactenum thermoautotropnicum enr»A, B, c, D f a, F, G, H, I, J, K, l, 
M, N, O, P, Q, & ORFS 1,2 & 3. 



ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


l42S06i7_t3_l47 




5215 




1154 






Protein name 








Locus 


Name 


Acc# 


Description 














NO-HIT | 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


l^£8A£a..±l...ll& 


$55 


6217 


121 


555 


100 


|2 .2e-0b 



Protein name 



Locus Name 



hypothetical protein TMliiU 



pir:F72zlb/ 



Acc# 



F72267 



Description 









NT 


AA 

— Score 


Probability 


ORF Name 


NTID 


AAID 


Length 


Length 


i427.b.^b.Z..±A...lb.Z 


555 


5218 


551 


2045 1133 


7.6e-lib 


Protein name 








Locus Name 


Acc# 



(p)ppGpp syntnetase 



lgp:BSU86377 



Description 



Bacillus subtilis (p)ppGpp syntnetase (relA) ana 
adeninephosphoribosyltransf erase (apt) genes, complete cds . 



326 



ORF Name 



146483S0 ti 18 



Protein name 



Locus Name 



Acc# 



Description 
NO -HIT 



ORF Name 



NT ID 



AAID 



NT AA 

— _ — _ Score Probab ility 
Length Length 



ii^&aaaa2„.t3L.„ii^ i |55ff 



T2T" 



95" 



0.00057 



Protein name 



Locus Name 



hypothetical protein PFB0225C 



pir :E71620 



Acc# 



E71620 



Description 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length L 



Protein name 



ybeB protein homolog iojap : protein 
slrl886 .-protein slrl886 



Description 



|4.3e-18 



Locus Name 



foir:S77145 



Acc# 



S77145 



ORF Name 



NT ID 



i4AasLifii-.±i...iiaL I w 



Protein name 



AAID 



NT 



AA 



Length Length 



Score Probability 



TIT 



Locus Name 



Acc# 



Description 

etcfett 



NT 



AA 



ORF Name 



NTID 



15705575 cl 135 



AAID Length Length 

— 



7ST 



TIuT" 



Score Probability 
7^5 



|2.4e-79 



Protein name 

Description 
HYPOTHETICAL 36.3 K£> PkOTfilKf CY277.1S 



Locus Name 



sp:YS18_MYCTU 



Acc# 



P71777 



NT 



AA 



ORF Name 



NTID 



15730675 cl 133 



AAID Length Length 

— 



Score Probability 
1552 



|4.0e-86 



Protein name 



Locus Name 



pnospnonopyruvate decarboxylase, f om2 



pir :S60212 



Acc# 



S60212 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



19.5.3.M&2...C.3....27.8. 



Length Length 



Score Probability 
^ 



7.2e-87 



Protein name 



Locus Name 



Acc# 



'sp:YBDG_ECOLl 



Description 

HYPOTHETICAL 46,6 KD PROTEIN IN PHEP-NFNB INTERGENIC REGION 



ORF Name 



NTID 



AAID 



NT AA 

, Score Probability 
Length Length 



1004 



7TF" 



l.2e-70 



Protein name 

Description 
SOJ PROTEIN 



Locus Name 



sp : SOJJBACSU 



Acc# 



P37522 



328 



NT 



AA 



ORF Name 



NT ID 



11005 



AAID Length Length 
WZD — 



[sir 



Score Probability 
PTu~55 



|I.le-106 



Protein name 



Locus Name 



Acc# 



putative unaecaprenyi-pnospnate 



bp:AFl2blfo4 



AF125164 



Description 

Bacteroxdes tragxiis 6^Sk po ly saccharide B (PS B2> JDiosyntnesxsiocus, 
complete sequence; and unknown genes 



ORF Name 


NT ID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


22<S§964^_c2J>49 


1006 


622$ 




355 


106S | 465 


4 . 7e-44 



Protein name 



Locus Name 



Acc# 



putative giycosyltransrerase 



gp:AFl25l64 



AF125164 



Description 

Bacteroxdes rragilis 6l8k polysaccharide 5 (\>ti tiu) biosyntnesisiocus , 
complete sequence; and unknown genes. 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


23.4^.0.a.:/-b....c3....2B^. 


1007 


622$ 


508 


1527 155 


" |3.4e-12 


Protein name 








Locus Name 


Acc# 



putative tuppase 



|gp:AFl2blb4 



Description 

Bacteroxdes rragilis fe^BR polysacchar ide B (PS B2 ) siosyntftesisiocus , 
complete sequence; and unknown genes. 



ORF Name 



NTID 



— — Score Probability 

AAID Length Length 



aiiaiii^c^ii& I 



145" 



Protein name 



Locus Name 



Acc# 



Description 
[NO-HIT 



329 



ORF Name 



23554555 t3 142 



Protein name 



NT ID 



NT 



AA 



AAID Length Length 



Score Probability 





6231 




254 


765 



Locus Name 



Acc# 



Description 
INO-UIT 



ORF Name 



NT ID 



iiiaasn2Jta»ii5. I \tutu 



Protein name 



NT 



AA 



AAID Length Length 
6232 



Score Probability 



TUT 



Locus Name 



Acc# 



Description 
NO- HIT 



ORF Name 



NTID 



AAID 



NT AA 

— A , , — L1 Score Probability 
Length Length * L 



Z3.6.3.^2.5iZ...ai...iaz I lion 



£233 



TTJ" 



11002 



6.7e-il 



Protein name 



Locus Name 



dolichol-P-glucose synthetase homolog 



pir:E69322 



Acc# 



E69322 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— _ — , Score Probability 
Length Length 



TUTT 



Protein name 



phosphoenolpyruvate phosphomutase FOMi 



Description 



TT7F" 



ll.4e-140 



Locus Name 



Acc# 



pir:S60205 



NT 



AA 



ORF Name 



NTID 



2.3.S.2.aS.2....c2....Z16. | 11013 



AAID Length Length 



purr 



Score Probability 
£173 



1.7e-37 



Protein name 



Locus Name 



hypothetical protein 



pir:S75344 



Acc# 



S76344 



Description 



330 



NT 



AA 



ORF Name 



NT ID 



24017687 ti ii>2 



TuTT" 



AAID Length Length 
3^5 



Score Probability 
5.4e-27 



Protein name 



Description 



Locus Name 



Acc# 



spiCD^AJlAHlN 



SYNTHASE) 



NT 



AA 



ORF Name 



NT ID 



AAID 



I24226SS7 C2 241 



TuT3" 



Length Length 



Score Probability 
l.le-26 



TuT 



Protein name 



Locus Name 



activator protein 



gp:AF04Vb27 



Acc# 



AF047527 



Description 



Pseudomonas tluorescens activator protein (mtiK) gene, compietecas . 



ORF Name 



NT ID 



NT AA 

— — Score Pro bability 
AAID Length Length 



lA19£±lb..±l..±A& I [TuTF 



TUT 



Protein name 



Description 



Locus Name 



Acc# 



NO -HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



TUTT 



Length Length 




Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



331 



ORF Name 



NTID 



— — Score Probability 
AAID Length Length 



124641921, cl 2Ui 



Protein name 



Locus Name 



3.8e-06 



Acc# 



galactosyltransterase homoiog 



Description 



pir:GS94bb 



G69465 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



1014 


6241 


342 


1029 


179 



Protein name 



Locus Name 



|2.0e-il 



Acc# 



capsular polysaccharide biosynthsis protein ipir :F7U44i 



Description 



F70441 



ORF Name 



Protein name 



NTID 



H026 



NT 



AA 



AAID Length Length 



Score Probability 



174" 



T75~ 



wr 



Locus Name 



0.011 



Acc# 



proJDaJDle membrane protein 
YOL019W: hypothetical protein 02 313 



Description 



|pir:S66701 



S66701 



ORF Name 



NTID 



TuTT- 



Protein name 



AAID 



— — Score Probability 

Length Length 



mr 



TIT 



|3.3e-i8 



Locus Name 



Acc# 



Description 



sp:V266_ARC!J? , U 



029973 



HYPOTHETICAL PkoTEIN AF026 6 



332 



NT 



AA 



ORF Name 



NTID 



126567176 c2 217 



TU7T 



AAID Length Length 
T5Z 



Score Probability 



VIM 



I25T 



Protexn name 



Description 



Locus Name 



Acc# 



[NO-HIT 



ORF Name 



NTID 



NT AA 
— — Score 
AAID Length Length 



TU7T 



3W 



JUT 



Probability 
|6.7e-5U 



Protein name 



Description 



Locus Name 



Acc# 



P31857 



HYPOTHETICAL 32.4 Kb PROtEiiU Iti ^iDB -tJrtCI INtergenic kkuiujn 



NT 



AA 



ORF Name 



NTID 



TU24" 



AAID Length Length 
STu 



Score Probability 
l.7e-l7 



Protein name 



Description 



Locus Name 



sp:«65_UAfc!lM 



Acc# 



P44033 



HYPOTHETICAL l>koTliiI N H106bb 



NT 



AA 



ORF Name 



NTID 



AAID 



116.16.1±±.±1..£± 



Length Length 




Score Probability 



[3W 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



333 



NT 



AA 



ORF Name 



NTID 



3337£$06 c3 250 



AAID Length Length 



Score Probability 
5.Se-32 



Protein name 



Locus Name 



LicDi 



gp:AP106biy 



Acc# 



AF106539 



Description 



Streptococcus pneumoniae Licftl (licDl) and LicD2 {lxcD2) genes , complete 
cds; and unknown gene. 



ORF Name 



NTID 



NT AA 

— — Score Proba bility 
AAID Length Length 



33406567 ±2 82 



1027 


624$ 925 


277$ 


129 





Protein name 



Locus Name 



II 5K outer membrane protein precursor : SusC 
protein 



pir: JC^02 7 



Description 



6.1e-0S 



Acc# 



JC6027 



ORF Name 



NTID 



NT AA 

— — Score Pro bability 
AAID Length Length 



6250 



TuT7~ 



1.2e-3i 



Protein name 



Locus Name 



putative alconol dehydrogenase 



gp:C2A3§2 



Acc# 



AL078635 



Description 
Amycolatopsis orientaiis cosmid pczA3b2. 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



3.5L3.3.6.S..7.6....aI...iy.bL.. 



TOT 



i.4e-2ii 



Protein name 



Locus Name 



Acc# 



putative epimerase 



gp:AP12blb4 



AF125164 



Description 

Bacteroides tragilis 638k polysaccharide B [v& B2) biosynthesisiocus, 
complete sequence; and unknown genes. 



334 



NT 



AA 



ORF Name 



NT ID 



AAID 



35401627 ci 288 



6252 



Length Length 
\Z2Z 



141 



Score Probability 
l . ye-40 



TIT 



Protein name 



Locus Name 



WcgF 



gp:AP12blb4 



Acc# 



AF125164 



Description 



feacteroides trag ilis 63$R polysaccnaride B (PS B2) Mosynthesislocus, 
complete sequence; and unknown genes. 



ORF Name 


NT ID 


AAID 


NT AA 
— - — Score 
Length Length 


Probability 


36362675_cl_207 


1031 


5253 


197 


594 317 


2.2e-2S 


Protein name 








Locus Name 


Acc# 










gp:A£s008SbO 


AB008550 


Description 












Pseuaomonas aeruginosa pnage pnx cxx, 


complete genome sequence 




ORF Name 


NT ID 


AAID 


NT AA 
— — Score 
Length Length 


Probability 


3.5.1i0.2S„..aA...2B.7. 


1032 


6254 


166 


501 181 


5 . 8e-l4 



Protein name 



Locus Name 



unknown 



gp:AF12bl64 



Acc# 



AF125164 



Description 



Bacteroides tragi lis 638k polysaccharide B (PS B'Z) siosyntnesisiocus, 
complete sequence; and unknown genes. 



NT 

ORF Name NTID AAID Length 


AA 

— , Score 
Length 


Probability 


3.M3.15.3....C2...^S. 1033 6255 296 8 


91 1278 


3.3e-l30 


Protein name 


Locus Name 


Acc# 


glucose -l-pnospnate tnymxdyltransterase 


gp:AFl25164 


AF125164 



Description 



Bacteroides tragilis 63&k polysaccharide B (PS B2) JDiosynthesislocus , 
complete sequence; and unknown genes. 



335 



NT 



AA 



ORF Name 



NT ID 



3555062 c3 2yy 



AAID Length Length 
£S5~ 



6256 



7W 



— Score Probability 
4.6e-52 



sir 



Protein name 



Locus Name 



unknown 



|gp:AF12Sl64 



Acc# 



AF125164 



Description 



feacteroides tragiiis 63§£ polysaccnande B (PS B2) biosynthesislocus, 
complete sequence; and unknown genes. 



ORF Name 



NT ID 



NT AA 

— — , Score P robability 
AAID Length Length 



13551300 c3 2hS 



ll.le-37 



Protein name 



Locus Name 



stationary pnase survival protein SurK 



bir:A70372 



Acc# 



A70372 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



5255 



XST" 



^5" 



0.00012 



Protein name 



Locus Name 



unknown 



gp:AF04S745 



Acc# 



AF048749 



Description 



Bacteroides tragiiis capsular polysaccharide Joiosyntnesis operon, complete 
sequence . 





ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


±±1E.15.S..±±..±5± 


1037 


5255 


680 


2043 


1355 


4.3e-145 



Protein name 



Locus Name 



FtsMi> 



|gp:AB0233iO 



Acc# 



AB023310 



Description 

Cyanidioschyzon merolae gene tor FtsH2, complete eels . 



336 



KJss.c In ct U It; 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score Probability 




4304812_c2_24& 


1038 


6260 


140 423 


532 


3 . ve-bi 




Protein name 








Locus 


Name 


Acc# 




WcgG 


gp:AFl«164 


AF125164 




Description 
















Bacteroides tragiiis 638k polysaccharide B {VS B2) mosynthesisiocus, 




complete sequence; and unknown genes. 




























ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score Probability 




480355$_c3_297 


103$ 


626± 


198 597 


996 


z . be-iuu 




Protein name 








Locus 


Name 


ACC# 





putative acetyltransterase 



gp:Afl25l£4 



AF125164 



Description 

Bacteroides tragiiis 638£ polysaccnanae B {PS 32} JDiosyntftesisiocus , 
complete sequence; and unknown genes. 



NT 



AA 



ORF Name 



i4a7Am.„ci...ia2L | itmtt 



NT ID AAID Length Length 

wz&i — 



I5T5" 



Score Probability 
10.006^ 



195 



Protein name 



Locus Name 



|gp:YPiu2KB 



Acc# 
AL031866 



Description 

Yersinia pestis 102 kbases unstable region: rrom l to 119443. 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



TMT 



1T5T 



l.Se-26 



Protein name 



Locus Name 



N-acetylglucosammyl trans t erase 



EpTAMTTI^- 



Acc# 



AB017355 



Description 



Streptococcus agalactiae DNA, cps (capsular polysaccharide ) genes, partial 
and complete cds . 



337 



NT 



AA 



ORF Name 



NT ID 



14857256 ci 18J 



TMT 



AAIP Length Length 



Score Probability 
|7.4e-il7 



Protein name 



Locus Name 



Acc# 



X-His cLipeptidase, :arainoacyinistiame 
dipeptidase raminopeptidase 
D:heta-a lanyl-histidine 



pir : JUOJOO 



Description 



NT 



AA 



ORF Name 



NTID 



AAIP Length Length 



Score Probability 
|6.8e-20 



157 



Protein name 



Locus Name 



hypotnetical protein 



pir :E?23lO 



Acc# 



E72310 



Description 



ORF Name 



NTID 



NT AA 

— — , Score Pro bability 
AAIP Length Length 



5iafiAii».ciJia;£ | |iQ44 | 



[TuW 



H7T" 



|5.4e-ii 



Protein name 



Locus Name 



capsular polys accnande mosyntnesis nomoiog 
yveQ 



pir :F700J6 



ACC# 
F70036 



Description 



ORF Name 



Protein name 



NTID 



5757 



hypothetical protein APE2014 



Description 



NT 



AA 



AAIP Length Length 



Score Probability 
3.0e-26 



237 



Locus Name 



pir :H72504 



Acc# 



H72504 



338 



ORF Name 



NT ID 



±1 4b 



IMS- 



Protein name 



probable membrane -bound lytic murein 
transglycosylase D (dniR) 



Description 



NT 



AA 



AAID Length Length 



Score Probability 









440 


1323 




375 





Locus Name 



bxr:H7iaOI 



i.3e-3f 



Acc# 



H71301 



ORF Name 



6.D3.7.8.U1...G3....2.7.&.... 



Protein name 



NTID 



NT AA 

— — Score Probabi lity 
AAID Length Length 



TTT 



TOT" 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



9.6e-37 



Locus Name 



Acc# 



P43038 



blMSTHYLTRAtf SP3RA3E ) 



NT 



AA 



ORF Name 



NTID 



1045 



AAID Length Length 



1415 



Score Probability 
3.0e-65 



Protein name 



Locus Name 



Ykok 



gp:AB013374 



ACC# 



AB013374 



Description 



Bacillus halodurans C-125 mamX, yjctA, yKoK ana yvtK genes, partiaiana 
complete cds * 



339 



ORF Name 



NT ID 



NT AA 

— — , Score Probability 
AAID Length Length 



6853387 cl 138 



JET 



|4.8e-106 



Protein name 



Locus Name 



PC Z A3 61 . b 



gp:AOPCkIA36l 



Acc# 



AJ223998 



Description 



Amycolatopsis orientalis cosmid PCZA36 l . 



ORF Name 



S00Sl2 c2 23£ 



Protein name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



TTTTT 



TTT 



Tl4tr 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



S.2L^Q5.X...Cl..,ZU6.., 



Protein name 



Locus Name 



putative amino t rans t erase 



gp:AF125164 



ACC# 



AF125164 



Description 



Bacteroides tragilis polysaccnarxde b IPS B2) Piosyntnesisiocus, 

complete sequence; and unknown genes. 



ORF Name 



NTID 



NT AA 

— — , Score Probabi lity 
AAID Length Length 



Tu3T 



TTT 



i.3e-06 



Protein name 



Locus Name 



unknown 



gp:AF068902 



ACC# 



AF068902 



Description 



Streptococcus pneumoniae D-glutamic acid adding enzyme MurD 
(murD) ,undecaprenyl-PP-MurNAc-pentapeptide-UDPGlcNAc GlcNAc 
transferase (murG) , cell division protein DivIB 
(divIB) ,orotidine-5 ' -decarboxylase PyrF (pyrF) , and 

orotatephosphoribosyltransf erase PyrE (pyrE) genes, complete cds ; andunknown 



340 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 

pnn — 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



VTTT 



Length Length 
T7~ 



Score Probability 



TFT 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
1 



Score Probability 
|6.3e-06 



Protein name 



Locus Name 



receptor antigen (RagA) 



gp:E><3IiiOSV2 



Acc# 



AJ130872 



Description 



Porphyromonas gingivaiis WbO receptor antigen (rag) locus encodmga major 
immunodominant 55kDa antigen. 



NT 



AA 



ORF Name 



NTID 



TTCT 



AAID Length Length 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



341 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



Length Length 



Score Probability 



Locus Name 



Acc# 



[NO-HIT 



ORF Name 



l£4S.£2S.7...±l...b.. 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



Length Length 
T5T 



Score Probability 



¥56 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



I24fi47.aia..±i...bi 



Protein name 



NT 



AA 



NTID 



TUSTT 



AAID Length Length 



Score Probability 



tttt 



2 .2e-06 



Locus Name 



aramoyl - pent apep tide carkoxypeptiaase 



pir:T34747 



Acc# 



T34747 



Description 



ORF Name 



NT AA 

— — Score Probabil ity 
NTID AAID Length Length 



2AfifiL£lU„±3L..-3LL 



TUFT 



6 .3e-14 



Protein name 



Locus Name 



slow myosin neavy chain 2 



gp;GGU85(m 



Acc# 



U85023 



Description 

Gallus gallus slow myosin heavy chain 'A (SM2) mRNA, partial cas . 



342 



ORF Name 



4100885 ti 2b 



Protein name 



NTID 



TUZT 



NT 



AA 



AAID Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



nypotnetical protein ^iip00b2 



Description 



NT 



AA 



AAID Length Length 



Score Probability 
10.00017 



TTF" 



Locus Name 



pir :F7iy80 



Acc# 



F71980 



ORF Name 



49.Q0.2l5.2..±I...1 



Protein name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— — Score Probabil ity 
Length Length 



Locus Name 



Acc# 



Description 



NO-HIT 



343 



NT J^J-i. 

— , — , Score Probability 

AAID Length Length dL 

— 



AA 



ORF Name 



NTID 



107554^7 ti ii 



[TUFT 



I25TT 



|l.Se-24 



Protein name 



Description 



Locus Name 



Acc# 



P42179 



5KD O^ElRON JftAUSCRl^lOKfAL REGULATOR 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



1175211 ti lib 



55T 



4.6e-53 



Protein name 



Locus Name 



inner membrane ABC transporter 



gp:AF2l382i 



Acc# 



AF213822 



Description 



Zymomonas mobilis strain ZM4 tosmia clone 42B3, complete sequence. 



ORF Name 



Protein name 



NTID 



TUFT 



NT AA 

— — Score Probability 
AAID Length Length 



6290 



Locus Name 



Acc# 



Description 



ORF Name 



Protein name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



6291 



TWTT 



W7T 



|4.4e-87 



Locus Name 



gp:YP1021U* 



Acc# 



AL031866 



Description 

Yersinia pestis 102 kJoases unstable region: rrom 1 to 119443. 



344 



ORF Name 



NTID 



NT AA 

— — , Score Probab ility 
AAID Length Length 



15657687 ti iJ 



1070 



373 



|2.6e-34 



Protein name 



Locus Name 



lsp:YBDMJi!00Ll 



Acc# 



P77174 



Description 

HYPOTHETICAL K£> PROTSlti IN OstA-d SbG mtEk^tcJ Rhl^iui^ 





ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


I5$003l7_il_5 


1071 


62S3 




1014 


3lS 


l.6e-27 


Protein name 








Locus 


Name 


Acc# 



NrpB 



jgp: £1^464 88 



U46488 



Description 



Proteus mirabilis NrpS (nrp^) gene, partial cds, Nrpu mrpU} , JNirpT uirpTj , 
NrpA (nrpA) , NrpB (nrpB) , NrpG (nrpG) and IrpP (irpP) genes, complete cds. 



NT 



AA 



ORF Name 



NTID 



TU7T 



AAID Length Length 
TT7~ 



Witt 



1ST 



— ^ Score Probability 
|4.0e-0& 



TT5~ 



Protein name 



Locus Name 



6 OKDa protein 



gp:AB004bbO 



Acc# 



AB004560 



Description 

£>orphyromonas gmgivalis DNA tor 60kDa protein, complete cas. 



ORF Name 



NTID 



— — Score Probability 
AAID Length Length 



\126.SJ.0.b2,...a±..^A I [HTTT 



WTT 



Protein name 



Locus Name 



Acc# 



Description 



[NO -HIT 



345 



ORF Name 



2292S450 tl ib 



Protein name 



NTID 



NT 



AA 



AAID Length Length 



— Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



TUTS~ 



'WITT 



1.2e-by 



Protein name 



Description 



Locus Name 



sp:YBDN_fc!COLl 



Acc# 



P77216 



tttf^OfHETiClAL 47.6 KB PR6Tgtti Iti gSfA-DSBCi tNffiRtiEMlC RflGlOM 



ORF Name 



NTID 



— — Score Probability 



AAID Length Length 



0.0043 



Protein name 



Locus Name 



MHC class II alpna cliain 



Acc# 



AF091557 



Description 



Aulonocara hansbaenschi MHC class li alpha cnam MHC-Auna-DAAl 
mRNA ( MHC - Auha - DAA1 * 0 1 allele), complete cds . 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



TTJ7T 



TZT 



Protein name 



Locus Name 



Acc# 



Description 



INC-HTT 



NT 



AA 



ORF Name 



NT ID 



2449207& ti 1 



TUTS" 



AAID Length Length 
523 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



KO-HIT 



NT 



AA 



ORF Name 



NT ID 



HTT7T 



AAID Length Length 
[5TuT — 



Score 



Probability 
1.7e-07 



Protein name 



Locus Name 



pobR regulator 



gp:PSEY18527 



Acc# 



Y18527 



Description 



Pseudomonas sp. pobA, pobR, pcaQ, pcaH and pcaG genes. 



NT 



AA 



ORF Name 



infi&i&is..±a...ia I crass 



NTID AAID Length Length 

— 



Score 



[214" 



[S4T 



Probability 
|4.7e-l2 



Protein name 



Description 



Locus Name 



|gp = 



:LIINLC 



Acc# 



Y07639 



L.ivanovii 255 rRNA, 55 rRN A, tRNA-Asn, tRNA-Thr, QRJ? 1 z, miD, andmic 
genes . 



NT 



ORF Name 



NTID 



AAID 



3.3.3.^S.:L..cl...5.2 1 



iOSi 



Length Length 
233" 



AA 

— , Score 



Probability 
|i.5e-05 



Protein name 

Description 
TOgRMORECjtrLAl'ORY PROTEIN LOR J? ' 



Locus Name 



Acc# 



P28808 



347 



ORF Name 



NT ID 



NT AA 

— — Score Proba bility 
AAID Length Length 



35650462 13 4i 



TTT 



|4.8e-23 



Protein name 



Locus Name 



6 0kDa protein 



gp:AB004b60 



Acc# 



AB004560 



Description 



Porphyromonas gingivalis DNA tor 6UKDa protein, complete cas . 



ORF Name 



.4065160 izl 14 



Protein name 



NTID 



NT 



AA 



AAID Length Length 
1553 



Score Probability 



197 



Locus Name 



Acc# 



Description 



NT 



AA 



ORF Name 



NTID 



I4asisafi...ca...aa 



AAID Length Length 

jzv — 



Score Probability 
0.6026 



^1 



Protein name 



Locus Name 



lipase precursor 



gp:AF053006 



Acc# 



AF053006 



Description 



Staphylococcus epictermiciis lipase precursor tgehl) gene, compietecds . 



ORF Name 



NTID 



AAID 



NT AA 

— — Score P robability 
Length Length 



195AA62.±±..± 



1085 



7.3e-23 



Protein name 



Description 



Locus Name 



sp : TCMJ^Tk^A 



Acc# 



P39887 



(eic 2.1.1.-) 



348 



ORF Name 



NT ID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



5250317 c2 80 



0.042 



Protein name 



Locus Name 



pqqG protein 



bir:B55ba7 



Acc# 



B55527 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



7.uS.5Sl0.2..±1...2B.. 



TOST" 



5305 



7TT" 



0.00S8 



Protein name 



Locus Name 



hypothetical protein MTH1102 



pir:P69013 



Acc# 



F69013 



Description 



ORF Name 



NTID 



NT AA 

— — Score P robability 
AAID Length Length 



3.3.&3.20.2...G3....1U1.. 



TTT 



l.2e-32 



Protein name 



Locus Name 



sensory transduction nistidme Kinase 
S110474 :protein s!10474 :protein sll0474 



pir:£766b0 



Acc# 



S76650 



Description 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


117^M17....c2...ia 


10S9 


6311 


484 






121 


2.8e-06 



















Protein name 



unknown 



Description 



Locus Name 



|gp:tJS677l 



Acc# 



U96771 



Prevotella bryantii putative polygalacturonase , B- 1 , 4 - endoglucanase r ana 
mannanase genes, complete cds; and unknowngenes . 



349 



NT 



AA 



ORF Name 



NT ID 



16601B26 cl 3tf 



AAID Length Length 
TF7T) — 



Score Probability 
7.4e-9i 



Protein name 



Locus Name 



receptor antigen (RagA) 



Acc# 



AJ130872 



Description 

E>orphyromonas gingivaiis WSO receptor antigen (rag) locus encodinga major 
immunodominant 55kDa antigen. 



ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


205C)7S37_c2_42 


1091 


6313 


542 


1629 


13$ 


3.6e-07 



Protein name 



Locus Name 



unknown 



gp:W$6771 



Acc# 



U96771 



Description 



Prevotella bryantu putative polygalacturonase, B-l, 4- endoglucanase, ana 
mannanase genes, complete cds; and unknowngenes . 



ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


l±63.B±±lUt3^2.Z 


1042 


6314 


420 1263 


101 


0.024 


Protein name 






Locus Name 


Acc# 


hypothetical protein ytaP 




| pir:B69<*«lJ 


| B69988 


Description 


ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


ll&SlBll^l^ 


1093 


6315 


543 1632 


147 


5.0e-07 



Protein name 



Locus Name 



unknown 



gp:05677i 



Acc# 



U96771 



Description 



Prevotella bryantii putative polygalacturonase, B-l, 4-endoglucanase, ana 
mannanase genes, complete cds; and unknowngenes. 



350 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Liengun 


Score 


Probability 


35351583_±3_34 




5315 


71 


216 






Protein name 








Locus 


Name 


Acc# 


Description 
















NO -HIT 


















ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


iminm..2& 


1095 | 


6317 


443 


1332 


520 


6.9e-50 















Protein name 



Locus Name 



Acc# 



nypotneticai protein PAB13 71 



Description 



[pa.r:C!7b0b4 



C75064 



ORF Name 



NTID 



41445.15.^2.^17. 



Protein name 



[TOST 



NT AA 

— — Score Probability 
AAID Length Length 



FTTH" 



P7TT 



[2TT 



Locus Name 



Acc# 



Description 



MO -HIT 



ORF Name 



7.8.1$.3.2...g1...:^.... 



Protein name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



TTTSTT 



2.7e~85 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



pir:JC6U2/ 



Acc# 



JC6027 



Description 



NT 



AA 



ORF Name 



NT ID 



22697711 c3 y 



1W 



AAID Length Length 




JIT 



Score Probability 
9.2e-6S 



PIT 



Protein name 



Locus Name 



neuraminidase precursor 



gp : BNkNAbJAtW 



Acc# 



D28493 



Description 



Bacteroides iragilis nanH gene tor neuraminidase, complete cds . 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



iii$2£50 c2 139 



TUTT 



ITT 



2.2e-£0 



Protein name 



Locus Name 



TruB 



gp:APl69967 



Acc# 



AF169967 



Description 



F lavobacterium j ohnsomae LeuS (ieuSJ gene, partial cds; and F]oi^t]Ol^ , 
FtsX (ftsX), Fjol3 (fjol3), BacA (bacA) , and TruB (truB)genes, complete cds . 



ORF Name 



NTID 



"NTT AA 

— — Score Pr obability 
AAID Length Length 



liaS6.5.0J....cl...l^U., 



1100 



£122 



WIT 



TTT 



,i.3e-06 



Protein name 



Description 



Locus Name 



sp:^PAJ*OkkU 



ACC# 



P50069 



RlfiO^CLfciASE! P PKO'fEllN CJOMPO NflKfT, (PKOTBflfl Ob) (RNASE PJ 



ORF Name 



NTID 



— — Score Probability 
AAID Length Length 



1101 



73 



Protein name 



Locus Name 



Acc# 



Description 



352 



NT 



AA 



ORF Name 



NTID 



1102 



AAID Length Length 
TT££ 1 



Score Probability 
0. 00018 



P77 



Protein name 



Locus Name 



sensory transduction system regulatory 
protein slrl837 iprotein slrl837 iprotein 
glrt837 



E 



ir:S7VJ4i 



ACC# 



S77341 



Description 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


iSl0d37V_cl_HSJ 


|ll03 




254 


765 


iOS 




U . UUli 


Protein name 










Locus Name 




Acc# 












sp:tt£M4_SCiM>0 




P87214 


Description 


















(UROiii^) 1 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


2D.1.7.6.a7.a...cA...lb.fo. 


1104 


6326 


280 


843 


106 




U . UUUb j 


Protein name 


Locus Name 




Acc# 


ATPase summit 6 


gp:TCU40^S 




U40265 


Description 


















Trypanosoma cruzi ATPase subumt b mRNA, com 


plete cas. 






ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


2D.i.7.7.b....ad„.15.3. 


1105 


6327 


304 


915 


466 






Protein name 










Locus Name 




Acc# 


FtSX 


gp:Atfl6£967 




AF169967 



Description 



Piavobacterium johnsoniae Leu£ (ieuS) gene, partial cas; and F3012 (H|oi2 j , 
FtsX (ftsX) , Fjol3 (fjol3), BacA (bacA) , and TruB (truB)genes, complete cds . 



353 



ORF Name 



2I160iV cl iuV 



Protein name 



NTID 



TTTJF 



NT 



AA 



AAID Length Length 



Score Probability 



fZTT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



AAID 



NT AA 
— , — , S core 
Length Length 



IXTTTT 



WTT 



11314 



Probability 
i.4e-110 



Protein name 



Description 



Locus Name 



sp:METK_HAKiW 



ACC# 



P43762 



ADSNOSYLTRANS^RASJE !) (ADOMKT SYNTJlKTAS-bi) 



ORF Name 



NTID 



AAID 



NT AA 
— — Score 
Length Length 



TTulT 



TU7T 



Probability 
|2.fie-lOS 



Protein name 



Description 



Locus Name 



sp : SYYj&At&'r 



Acc# 



P00952 



ORF Name 



NTID 



NT AA 
— — , Score 
AAID Length Length 



2144.7.0.il...ci...lU.y... 



HTUT 



PIT 



Protein name 



Locus Name 



Probability 



Acc# 



Description 
(NO-HIT 



354 



ORF Name 



23452160 tl iu 



Protein name 



NT ID 



TTTTT 



NT 



AA 



AAID Length Length 
[153 



Score Probability 



Locus Name 



Acc# 



Description 



ORF Name 



Protein name 



NTID 



ZTTT 



NT 



AA 



AAID Length Length 



Score Probability 



3T" 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



TTTT 



AAID 



— — Score Probability 
Length Length 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protem name 



NTID 



— — Score Probability 



AAID Length Length 



TTTT 



i.5e-09 



Locus Name 



oxidorectuctase, snort chain 
dehydrogenase/reductase family 



pir :A7i>3yb 



Acc# 



A72395 



Description 



ORF Name 



26605287 Cl 114 



Protein name 



Description 



(BC 2.1.1.66) 



NT 



AA 



NT ID 



AAID 



TTTT 



Length Length 



FZFT 



Score Probability 
|3.6e-37 



Locus Name 



Acc# 



sp:BACA_BCOLI 



ORF Name 



125305427 tl 14 



Protein name 



NT ID 



AAID 



1115 



hypothetical protein A635R 



Description 



NT 



AA 



Length Length 
FT - 



Score Probability 
715 



0.033 



Locus Name 



pir:Tl§l37 



Acc# 
T18137 



NT 



AA 



ORF Name 



NT ID 



iiAaaiaia...aa...i5i„ I [tits 



AAID Length Length 
— 



Score Probability 



FF" 



Protein name 
Description 

INO-Mrr 



Locus Name 



Acc# 



ORF Name 



NT ID 



NT AA , , . , . 
_ _ _ — ^, x — Score Probability 
AAID Length Length ^~ 



UTTT 



FFFF" 



FTT 



11584 



l¥T7~ 



3.8e-42 



Protein name 



Locus Name 



cnolme suitatase 



gp:RMU3 994 0 



Acc# 



U39940 



Description 

Smorhizobium meliloti bet operon, complete sequence. 



356 



ORF Name 



54251537 12 70 







NT 




AA 


NT ID 


AAID 


Length 


Length 


|iil8 


6340 


90 




273 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 
— — Score 
Length Length 



^5.7.3.1^16....cl..aa5. I 11119 



fzunr 



Probability 
2.4e-35 



Protein name 



Locus Name 



putative secreted. £>eta-gaiactos:i_dase 



gp:SCF8l 



Acc# 



AL133171 



Description 
Streptomyces coelicolor cosmid F81. 



NT 



AA 



ORF Name 



NTID 



AAID 



TT7TT 



Length Length 



Score 



T7T 



Probability 
|2.5e-l3 



Protein name 



Locus Name 



TJoTT 



pp 



Acc# 



AF169967 



Description 



Flavobacterium johnsoniae LeuS (leuS) gene, partial cds; and 
FtsX (f tsX) , Fjol3 (fjo!3), BacA (bacA) , and TruB (truB)genes, 



V^oll (tjoI2) , 
complete cds. 



NT 



AA 



ORF Name 
3.6.&3.7.&23....c3....:L48... 



NTID 



AAID 



TTTT 



Length Length 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



357 



ORF Name 



3937750 Cl 110 



Protein name 

Description 
NO-HIT 



NT 



AA 



NTID 



TUT 



AAID Length Length 
5333 — 



Score Probability 



33T 



TuTT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 
, — L1 — ^ Score Probability 
Length Length J - 



19A2D.B.2...z2..±±& .....I ITT23 



6345 



35T 



75T 



i.4e-74 



Protein name 



Locus Name 



S-acLenosylmethionine tRNA ribosyltransf erase 



Description 



pir :A72360 



Acc# 



A72360 



NT 



AA 



ORF Name 



NTID 



I41U21Z7....C3....1&.2... 



AAID Length Length 
S31S — 



1WT 



Score Probability 
735 



I.Ie-72 



Protein name 



Description 



Locus Name 



sp:KDUI_ERWCH 



Acc# 



Q05529 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



ldli3.0.D.5....c3....15.8. I 



6347 



T3T" 



i.6e-Ii 



Protein name 



Locus Name 



UI0454 



gpTAFTTTW 



Acc# 



AF174390 



Description 

Haemophilus mtluenzae strain Rd KW2 0 HI 04 54 gene, partial cds. 



358 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



418S43a ci IbV 



IT 



9.5e-15 



Protein name 



Locus Name 



conserved nypotnetical protein 



pir:ti722bl 



Acc# 



G72251 



Description 









NT 


AA 


Score 


ORF Name 


NTID 


AAID 


Length 


Length 


4iaaaa:i...ci...iitt 


1127 


6349 | 


201 


|606 


374 



12.1e-34 



Protein name 



conserved hypothetical protein yvab 



Description 



Locus Name 



[pir:D70U3J 



Acc# 



D70033 



ORF Name 



Protein name 



NTID 



TITS' 



NT 



AA 

— Score Probability 
AAID Length Length 



TFZW 



Locus Name 



Acc# 



Description 



IN6-H11' 



ORF Name 



Protein name 



NTID 



TT7T 



hypothetical protein c0624 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



3297 



JUT 



Locus Name 



pir:S73091 



l.le-3* 



Acc# 



S73091 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



TT3TT 



1773 



i.7e-09 



Locus Name 



Acc# 



response regulator 



gp:££>AJ6398 



AJ006398 



Description 

Streptococcus pneu moniae rrOS and hkQ9 genes; two component systemic. 



359 



ORF Name 



4876blb c2 li4 



Protein name 



NT ID 



AAID 



TTTT 



NT 



AA 



Length Length 
TT3I — 



Score Probability 



T7*T 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



TTTT 



— — Score Probability 
Length Length 

TuTS — 



355 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



TTTT" 



NT 



AA 



Length Length 
— 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



Length Length 




Score Probability 



Locus Name 



Acc# 



NO-Ull 1 



ORF Name 



Protein name 



NT 



NTID 



AAID 



Length Length 



AA 

— Score Probability 



Locus Name 



Acc# 



Description 



ItiO-HlT 



360 



ORF Name 



NT ID 



Protein name 



635b 



hypothetical protein PH0283 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



TUT 



|4.7e-06 



Locus Name 



ir:D714b3 



jpir: 



ACC# 



D71453 



ORF Name 



Protein name 



NTID 



TTT7" 



NT 



AA 



AAID Length Length 
TT3I — 



Score Probability 



T7T 



Locus Name 



Acc# 



Description 



IN0-H1T 



ORF Name 



Protein name 



NTID 



TTTS" 



— — Score Probability 
AAID Length Length 



53" 



Locus Name 



Acc# 



Description 



(NO-HIT 



ORF Name 



Protein name 



NTID 



— — Score Probability 
AAID Length Length 



11135 



|2.5e-29 



Locus Name 



Acc# 



|sp:UW>K_i>uk<Jl 



083019 



Description 

(tiPPK) (6-#Yt)ft0XYM£frHYL-7, fl -DlHVDROPTEiftlti ft YR0PHI6& PkOKINAS E J IFFFJfi.) 



361 



ORF Name 



NT ID 



7515541 cl lii 



Protein name 



ubiquinone/ menaquinone £>iosyntnesis 
methyl transf erase- related protein 



Description 



NT 



AA 



AAID Length Length 
— 



3TT 



Score Probability 



Locus Name 
toir;F72262 



12 . 7e-07 



Acc# 



F72262 



ORF Name 



10.9.7.S.5.3.0...±3....5.4...... 



Protein name 

Description 
INO-HIT 



NT 



AA 



NT ID 



AAID 



Length Length 
T7 - 



Score Probability 



Locus Name 



Acc# 



ORF Name 



Protein name 

Description 
INO-HIT 



NT ID 



NT AA 
T — ^ T — ^, Score Probability 
AAID Length Length JL 



142L5.7..7.fiL2L..±3....££ I 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— L1 T — ^ Score Probability 
Length Length £ - 



TT£T 



SIT 



0.020 



Locus Name 



WW domain binding protein 5 



|gp:MMU92454 



Acc# 



U92454 



Description 

Mus muscuius WW domain binding protein 5 mRNA, partial cds . 



362 



ORF Name 



NTID 



— — score Probability 



2084768 ti b7 



AAID Length Length 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



ORF Name 



NT ID 



AAID Length Length 



AA 

— Score Probability 



Protein name 



Locus Name 



conserved Hypothetical protein yiD^ 



bir:H6W74 



Acc# 



H69874 



Description 



ORF Name 



NT ID 



— — Score Probability 

AAID Length Length 



6368 



TuTT 



HT7T 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



TTTT 



TU7T 



±.2e~ J A J 2± 



Protein name 



Locus Name 



hypothetical protein mexF 



pir :T30&J0 



Acc# 



T30830 



Description 



NT 



AA 



ORF Name 



NTID 



\2±6A1&D.Q..±2..±L I [1143 



AAID Length Length 



[ST70 



TrTT 



Score Probability 
[2.1e-bb 



WT2 



Protein name 



Locus Name 



sp:YAEV_E<Jt>Ll 



Acc# 
Q47679 



Description 

HYPOTHETICAL 2£.V KB PROTEIN I N bMAg-^M HA INTEkCjENIC kEdloN 



363 



ORF Name 



NT ID 



NT AA 

, , ^ — , — Score Probability 
AAID Length Length x 



25975307 tl 27 



TIT 



Protein name 

Description 
[NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NT ID 



AAID 



\2&&&.ti.l&&..±±..±6. I ITT^T 



\ZTTT 



Length Length 
T25" 



Score Probability 



T7W 



Protein name 

Description 
IN^TXTT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



2.6..7.5.6.5.5.1...Gl...0.Q^ I 11151 



IST7T" 



Length Length 



Score 



Probability 
3.5e-46 



Protein name 



Description 



Locus Name 



sp:A0RE_E00LI 



Acc# 



P24180 



ACPIELAVIN RESISTANCE PROTEIN E PRECURSOR. (EMVC PROTEIN) 



NT 



AA 



ORF Name 



NTID 



aaaftii&2„.c2».iaa i iris? 



AAID Length Length 

zm — 



Score Probability 
— 



2.3e-l6£ 



Protein name 



Locus Name 



transcription-repair coupling factor 



gp:AF023181 



ACC# 
AF023181 



Description 



Listeria monocytogenes transcript ion- repair coupling tactor (mtdL) , low 
temperature requirement B protein (ltrB) , and DivIC homolog (divL) genes, 
complete cds . 



364 



NT 



ORF Name 



NT ID 



AAID Length Length 



AA 

— Score Probability 



51375817 t2 44 



1153 



Protein name 



TIT 



155" 



Locus Name 



0.042 



Acc# 



conserved hypotneticai protein afoiss 



Description 



E 



ir:D6^7i 



D69273 



ORF Name 



NT ID 



NT 



AA 



AAID Length Length 



Score Probability 



Protein name 



TTTT 



Locus Name 



7.8e-42 



Acc# 



Description 



sp:NAGA_VlliCJk 



032445 



DEACETYLAiW) 



ORF Name 



Protein name 



NTID 



ST7T 



NT — Score Probability 



AAID Length Length 



Locus Name 



I.2e-27 



Acc# 



Jiypotneticai protein 



Description 



G75263 



ORF Name 



Protein name 



NTID 



NT — score Probability 



AAID Length Length 



460 



Locus Name 



4.ie-42 



Acc# 



dihydroorotase (pyre) PABH4y 



Description 



|pir:(J7b027 



C75027 



365 



ORF Name 


NT 

NTID AAID Length 


AA 

— , Score 
Length 


Probability 


34430317_t2_38 


TT57 6375 262 78y 304 


5.4e-27 | 


Protein name 




Locus Name 


Acc# 


protein -tyrosine phospnatase 


gp:AB0^8bJo 


AB028630 


Description 




Clostridium periringens hyp2 7, bacH 7 ptp, cpa 


genes lornypotneticai 




protein, bacterial hemoglobin, protein- tyros inephosphatase, 2' 
nucleotide 2 ' -phosphodiesterase, partial and complete cds . 


, 3 1 -cuclic 




ORF Name 


NT 

NTID AAID Length 


AA 

— Score 
Length 


Probability 


4554?53_t2_45 


115$ 6380 161 48b 211 


| 3.8e-l7 



Protein name 



Locus Name 



sp:Y0^„BAC!sU 



Acc# 



P54486 



Description 

HYPO T HETICAL 17. J KB PROTEIN I N 0C0A -50DA INTkkgkjnuj k^IuN 





ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 




45.7.0.Ml..±1...2b. 


1155 




5381 


253 


752 


531 


4.7e-bl | 



Protein name 



Locus Name 



putative glycosyl transterase. 



|gp:Sf!6D7 



Acc# 



AL133213 



Description 

Streptomyces coelicolor cosmid 6D7 . 



ORF Name 



NTID 



AAID 



NT AA 
— — , Score 
Length Length 



TTFTT 



Probability 
7.4e-4l 



Protein name 



Description 



Locus Name 



sp:NA(^S_BA<JSU 



ACC# 



035000 



PHOSPHATE DEAMINASE) — (GNPDA) (GLCN6P deaminase; 



366 



NT 



AA 



ORF Name 



48760y0 Cl 82 



NTID AAID Length Length 

— 



TT5T 



Score Probability 
TT2 



0.00012 



Protein name 



Description 



Locus Name 



sp:MFD_BACSU 



Acc# 



P37474 



TRANSCRIPT ION- RE PAIR COUPLING FACTOR (TRCF) 



ORF Name 



NTID 



AAID 



NT AA 

— _ — - Score Probability 
Length Length 



4S76300 cl SS 



FT" 



Protein name 

Description 
iNO-filT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



.7.15.7.5.7.&...H3L...6L.7. I 11163 



Length Length 



Score Probability 

— 



l.le-SS 



Protein name 



Locus Name 



conserved Hypothetical protein 



pir :C1229l 



Acc# 



C72391 



Description 



ORF Name 



NTID 



NT AA 

^ ^ — . — ^_ Score Probability 
AAID Length Length i - 



, ftaai7.fi..±i...2a i ittst 



Protein name 



F7IT 



1.9e-18 



Locus Name 



Acc# 



sp:METH_HUMAN 



Description 

(METHIONINE SYNTHASE, VITAMIN-M2 DEPENDENT) (MS) 



367 



ORF Name 



10437558 c3 133 



Protein name 
Description 

NO-HIT 



NTID 



AAID 



NT AA 

— , — „ Score Probability 
Length Length 



ITT 



TOT 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



AAID 



lQ&4.7.aSS...t2...1.7. 



transcription regulator, crp tamily 



Description 



NT 



AA 



Length Length 
1ST 



T5T 



Score Probability 



5.7e-0S 



Locus Name 



bir:F722S5 



Acc# 



F72285 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



U0.S.£2£...c2...128. I IIT57 



3TT 



1.3e-87 



Protein name 
Description 

tOTATiVfi AMINOTkANSFSMSE B, 



Locus Name 



sp:PATli_BACiJU 



Acc# 



Q08432 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



1MM5.3A.±1...S. I [TTSF 



i.2e-08 



Protein name 



Locus Name 



outer membrane assembly protein (asmA) RP347 



BTrTFTT^r 



Acc# 



E71691 



Description 



368 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length J ~ 



14648577 Cl 93 



TTZT 



TTUT 



T7T 



i.le-10 



Protein name 



Locus Name 



Acc# 



transmembrane sensor 



|gp:AF051691 



AF051691 



Description 



Pseudomonas aeruginosa stress tactor A (pstA) , ECF sigma tactor {£iui; , 
transmembrane sensor (f iuR) , and hydroxamate-typef errisiderophore receptor 
(fiuA) genes, complete cds . 



ORF Name 



NTID 



AAID 



14$7S£35 c3 155 



1170 



6392 



Protein name 



NT AA 

— — , Score Probability 
Length Length 



5TT 



mr 



Locus Name 



1.4e-58 



Acc# 



conserved hypothetical protein ytqA 



Description 



pir:D69999 



D69999 



ORF Name 



2.0.a&&9.3..7....ia...ll.. 



Protein name 



NTID 



TT7T" 



NT AA 

— , — , Score Probability 
AAID Length Length 



ITT 



Locus Name 



Acc# 



lipoic acid synthase 



Description 



pir :A75480 



A75480 



ORF Name 



NTID 



22.i.7.g.lil...a3....14S. I \TTT2 



Protein name 



AAID 



NT 



AA 



— , — , Score Probability 
Length Length 





Locus Name 



Acc# 



Description 
IMC- HIT 



369 



NT 



AA 



ORF Name 



NTID 



22705153 c3 132 



TT7T" 



AAID Length Length 
— 



JET 



Score Probability 
2 . 2e-28 



TTT 



Protein name 



Locus Name 



GldB 



|gp:AF158372 



Acc# 



AF158372 



Description 



Flavobacterium johnsoniae hypothetical protein gene, partial cds,-GlciB 
(gldB) , GldC (gldC) , and hypothetical protein genes, completecds ; and 
hypothetical protein gene, partial cds. 



NT 



AA 



ORF Name 



NTID 



AAID 



123620910 C2 111 



TTVT 



Length Length 
33 



Score Probability 



733" 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA 

„^ TT , T — ^, T — Score Probability 
AAID Length Length x - 



i&ii±i6.$..±i..&& I ittt^ 



6397 



904 



2712 



^Te~T3" 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



pir : JC6027 



Acc# 



JC6027 



Description 



ORF Name 



Protein name 



NTID 



NT 



AA 



AAID 



Length Length 



ITT 



Score Probability 
6.060§5 



Locus Name 



hypothetical protein yvqF 



plr :G70045 



Description 



Acc# 



G70045 



370 



ORF Name 



NTID 



NT AA 
— , — , Score 
AAID Length Length 



24695187 c2 110 



TT7T 



\T7T~ 



Probability 
|1.5e-100 



Protein name 



Description 



Locus Name 



sp:NAGB_BOP>BU 



Acc# 



030564 



PHOSPHATE DEAMINASE) (GNPDA) (GLCN6P DfiAMI&ASEl) 



ORF Name 



NTID 



NT AA 
— — Score 
AAID Length Length 



2314055 C2 99 



11178 



Probability 
5.4e-27 



Protein name 



Locus Name 



enoyl-acyl carrier protein reductase 



bir:H75530 



Acc# 



H75330 



Description 



ORF Name 



NTID 



NT AA 
_ — ^ „ — , Score Probability 
AAID Length Length JL 



TT73T 



6401 



T5T 



S.2e-I0 



Protein name 



Locus Name 



hypothetical protein APE2345 



pir :F72462 



ACC# 



F72462 



Description 



NT 



AA 



ORF Name 



'3.QA2D.26.X...C3....X&3... 



NTID AAID Length Length 

— 



TTSu" 



TTTTT 



Score Probability 




|2.5e-54 



Protein name 



Locus Name 



O-acetylJaomoserine sulrhydrylase 



pir:D7'2324 



Acc# 



D72324 



Description 



ORF Name 



3.25.6.6.4;2....C.2....27.. 



Protein name 

Description 
NO -HIT 



NTID 



AAID 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



371 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
T7T 



Score Probability 



Protein name 
Description 

(NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



!3^40.6.5.1.7...±1...S 



TTST 



AAID Length Length 
— 



£T3~ 



Score Probability 
2 . 8e-40 



Protein name 



Locus Name 



Acc# 



igp:3C5745 



Description 

S.cerevisiae chromosome XIII cosmid 974 5, 



NT 



AA 



ORF Name 



NTID 



AAID 



3.9A47.11...£3l...5£. I 11184 



Length Length 




Score Probability 




|4.£e-37 



Protein name 



Locus Name 



probable translation factor yciO 



pir :F64874 



ACC# 



F64874 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



iaA£aa<L±a...aa i ittto 



6407 



TTT" 



7.5e-07 



Protein name 



Locus Name 



maturation protexn pPM32 



Acc# 



AF166485 



Description 

Glycine max maturation protein pPM32 (PM32; mRNA, complete cds. 



372 



ORF Name 



NTID 



NT AA , , . . . 
T — , i , — L1 Score Probability 
AAID Length Length =L 



4007687 ti 10 



TZTW 



|4.3e-200" 



Protein name 



Locus Name 



DPP TV 



gp:AB008194 



Acc# 



AB008194 



Description 



Porphyromonas gingival is gene for DPP IV, complete cds . 



NT 



AA 



ORF Name 



14113037 c3 15$ 



NTID AAID Length Length 

6409 



TX37 



281 



Score Probability 
T33 



$.?e-07 



Protein name 



Locus Name 



two -component response regulator 
lytT- involved 



[pir:B^$6S5 



Acc# 



B69655 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
^TI3 



TTUT 



Score Probability 
TFL 



|2.6e-50 



Protein name 



Locus Name 



hypothetical protein b2 7l0 



(pir:B6505i 



Acc# 



B65051 



Description 



ORF Name 



NTID 



b:jJ.ZB.S3...XX..Al. 



TTFT 



Protein name 



NT AA 

, — _ — _ Score Probability 
AAID Length Length ^ 



5411 



conserved hypothetical protein ykrA 



Description 



T5T 



7T7T 



Locus Name 



pir:C69862 



Acc# 



C69862 



373 



ORF Name 



NTID 



AAID 



NT AA 

— ^, , — Ll Score Probability 
Length Length ^ 



969812 C3 144 



1190 



6412 



TOT 



TFT" 



|3.7e-12 



Protein name 



Locus Name 



RNA polymerase ECF-type sigma £ actor homolog 
yhdM 



pir;C69826 



Acc# 



C69826 



Description 



NT 



AA 



ORF Name 



NTID 



5.7.SiO.S...c5L...lD.O. I [TTST 



AAID Length Length 

— 



TIF" 



Score Probability 
333 



i.0e-30 



Protein name 



Locus Name 



sam- dependent methytransxerase 



pir :C72086 



Acc# 



C72086 



Description 



NT 



AA 



ORF Name 



NTID 



ifiiaiififii...ci...i<ia I firs? 



AAID Length Length 
— 



TTTT 



Score Probability 
l.^e-M 



322 



Protein name 

Description 
PRIMOSOMAL PROTEIN N' (REPLICATION FACTOR YJ 



Locus Name 



Acc# 



sp : PRi A^BACStJ 



NT 



AA 



ORF Name 



NTID 



i£).3.3.M^:z..±3....i3.a I iroi 



AAID Length Length 
^£TE — 



TITTT" 



Score Probability 
TT2 



5 . 8e-16 



Protein name 



Locus Name 



Hypothetical protein MJ074 9 



pir :E64393 



Acc# 



E64393 



Description 



374 



ORF Name 



NTID 



NT AA 

_ _ _ __ _ — _ — Score Probability 

AMD Length Length 



11767812 tl 20 



6416 



i.5e-24 



Protein name 



Locus Name 



two- component response regulator 
lytT- involved 



pir :B63655 



Acc# 



B69655 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 

mi — 



Score Probability 
T3T 



3 . 5e-0£ 



Protein name 



Locus Name 



sp:YGEK_ECOLI 



Acc# 



Q46791 



Description 

HYPOTHETICAL TRANSCRIPT TONAL REGULATOR IN KDU1-LYSS INTERGENIC REGION 



NT 



AA 



ORF Name 



NTID AAID Length Length 
S¥T3 



Score Probability 

vm — 



2.4e-"08" 



Protein name 



Locus Name 



hypothetical protexn 



pir:C72325 



Acc# 



C72325 



Description 



ORF Name 



NTID 



AAID 



NT AA 
Length Length 



— , Score Probability 



TTTT 



Protein name 



Description 



Locus Name 



Acc# 



(NO-HIT 



375 



ORF Name 



14252182 tl 45 



Protein name 



resolvase 



Description 



NTID 



TTW 



NT 



AA 



AAID Length Length 



2X2" 



Score Probability 
TT2 



2.6e-lS 



Locus Name 



pir :S38652 



Acc# 



S38652 



ORF Name 



Protein name 

Description 
[NO-HIT 



NTID 



AAID 



NT AA 
_ — _ — _ Score Probability 
Length Length 



Locus Name 



Acc# 



ORF Name 



Protein name 

Description 
JNO-HIT 



NT 



AA 



NTID 



AAID 



11200 



— , — , Score Probability 
Length Length i - 



Locus Name 



Acc# 



ORF Name 



NT 



AA 



NTID 



AAID 



Ua2a4£2...d..£aS I I33UT 



Length Length 



Score Probability 



T5T 



Protein name 
Description 

NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



Score Probability 



Protein name 

Description 
INU^HTT 



Locus Name 



Acc# 



376 



ORF Name 



NT ID 



16832885 ci 170 



Protein name 



Hypothetical protein 



Description 



NT 



AA 



AAID Length Length 



IZ9ST 



Score Probability 

nrm — 



Locus Name 



[pir.-JQlOSTT 



2 .36-177 



ACC# 



JQ102 0 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



a.a&i4Q5a...Gi...i-&2 i 11204 



Protein name 

Description 
[NO-HIT 



498 



Locus Name 



Acc# 



ORF Name 



is7.mftfiL±2...a7... 



Protein name 
Description 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length A - 



£3" 



[53 



0 . 045 



Locus Name 



sp : 3E>D2_CAEEL 



Acc# 
Q21767 



ORF Name 



NT 



AA 



NTID 



AAID 



±93.12A11..±1...±26. I 



Length Length 



Score Probability 



Protein name 

Description 
MO-lilt 



Locus Name 



Acc# 



377 



ORF Name 



20328267 Cl 164 



Protein name 

Description 
MO-HIT 



NT AA 

XTmTr , ^ ^ x — _ _ — _ Score Probab ility 
NT ID AAID Length Length JL 



TZUT 



pur 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



conserved Hypothetical protein 



Description 



NT 



AA 



AAID Length Length 
— 



\TWT 



Score Probability 




Locus Name 



pir:E72312 



3.1e-15 



Acc# 



E72312 



NT 



AA 



ORF Name 



NTID 



AAID 



lDAllt>35....Ci2....1±± I JTSTJ^ 



T — , — ^ Score Probability 
Length Length ^ 

i|77yB 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



ORF Name 



NTID 



AAID 



11210 



Length Length 



AA 

— . , Score Probability 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



2.D..7.^.3.L6.a...cZ...ZlSL I II2TT 



Length Length 



TIT 



Score Probability 
ITS 



Protein name 



Locus Name 



conserved hypothetical protein HP0713 



pir :A6460y 



Acc# 



A64609 



Description 



378 



ORF Name 



NTID 



20976426 t3 114 



TUT 



Protein name 



AAID 



asparaginase nomolog yccC 



Description 



NT AA o ^ ^ _ . . . fc 
— , — , Score Probability 
Length Length ^ 



3.5e-07 



Locus Name 



pir:F6S>7S4 



Acc# 



F69754 



NT 



AA 



ORF Name 



NTID 



2.-LbA1325...X-L...l. I 11213 



AAID Length Length 

zzzs — 



TT5TT 



Score Probability 
— 



3.1e-I25 



Protein name 

Description 
ANAEROBIC C4-DICARB0XYLATE TRANSPORTER DCUB 



Locus Name 



!sp:DCUB_HAEIN 



Acc# 



P44855 



NT 



AA 



ORF Name 



NTID AAID Length Length 

£TTS — 



TIT 



Score Probability 
o . 0070 



Protein name 



Locus Name 



putative transmembrane etflux protein. 



|gp:S0E51 



Acc# 



AL132973 



Description 



Streptomyces coelicolor cosmid F91. 



NT 



AA 



ORF Name 



iiafiflLiia...c2...iia I 1121s 



NTID AAID Length Length 

w%m — 



UST 



Score Probability 
S4 



0.031 



Protein name 



Locus Name 



sp : SPRCJCENLA 



Acc# 



P36378 



Description 

(OSTEONECTIN) (BASEMENT MEMBRANE PROTEIN BM-4 0) 



379 



ORF Name 



NT ID 



NT AA 

^ ^ _ — _ — _ Score Probability 
AAID Length Length 



23617157 c3 



4.5e-23 



Protein name 



Locus Name 



lsp:YJV7_YHAS , r 



Acc# 



P40893 



Description 

HYPOTHETICAL 22.0 Kfi PftOtEllN IN HX*11-HX*8 iNTfiftflENlC ftfiSlOUf 



ORF Name 



25651252 c2 23$ 



Protein name 



NTID 



1217 



AAID 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 

toro-titt? 



ORF Name 



|ii£aiJ££A...cl...l.7.4 



Protein name 



NTID 



AAID 



hypothetical protein MJ1618 



Description 



NT 



AA 



Length Length 



T7T 



Score Probability 
5.0e-0S 



Locus Name 



pir :A64502 



Acc# 



A64502 



ORF Name 



NTID 



TTTT 



Protein name 



probable integrase/recombinase 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



ITT" 



fTTTT 



3.6e-05 



Locus Name 



pir :B71194 



Acc# 



B71194 



380 



NT 



AA 



ORF Name 



NT ID 



AAID 



24005637 2bb 



TTKT 



Length Length 

fm — 



Score Probability 



7G 



67 



0.02b 



Protein name 



Description 



Locus Name 



Acc# 
Q80910 



RfiGuLATCkY PROTEIN ti'Z 



ORF Name 



NT ID 



AAID 



— — Score Probability 
Length Length 



240660b6 ti ibb 



TTZT 



YTW 



5.5e-oy 



Protein name 



Description 



Locus Name 



sp : CEba_baCAM 



Acc# 



P23939 



BAMHI CONTROL ULKMUNT 



ORF Name 



Protein name 



NT ID 



TTZT 



NT 



AA 



AAID Length Length 



Score Probability 



S7T 



Locus Name 



Acc# 



Description 



KO-HIT 



ORF Name 



2±lB:±lxL±L.Al). 



Protein name 



NT ID 



AAID 



TTZT 



NT 



AA 



Length Length 
T5T~ 



Score Probability 



Locus Name 



Acc# 



Description 



(NO- HIT 



381 



ORF Name 



24354017 t3 153 



Protein name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length JL 



TTZF 



ST 



Locus Name 



Acc# 



Description 
MO-HIT 



ORF Name 



Protein name 



arylesterase 



Description 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



11225 



T5ZT 



Locus Name 



gp:AF044S83 



Acc# 



AF044683 



Agrobacterium radiobacter putative dihydrolipoamideS-acetyltransterase 
(dla) gene, partial cds; arylesterase (ada)gene, complete cds; and putative 
dihydrolipoamide dehydrogenase (dlh) gene, partial cds. 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 
7T" 



Score Probability 



TIT 



Locus Name 



Acc# 



Description 
NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



1227 



6449 



NT AA 

— , — , Score Probability 
Length Length 



1194 



Locus Name 



Acc# 



Description 
NO -HIT 



382 



ORF Name 


NT 

JM i ID t\t\±Lt J-iciiy L.J.J. 




AA 
Length 


Score 


Probability 


24882203_tl_2y 


6450 VI " 


lay 










Protein name 






Locus Name 






Acc# 


Description 
















NO-HIT 














i 


ORF Name 


NT 

NTID AAID Length 


AA 
Length 


Score 


Probability 


2Aaa2ai^ct™m 


1229 6451 17b 


537 


348 








Protein name 






Locus Name 






Acc# 


adaptive response regulatory protein 


gp:AF04783y 




AF047839 


Description 


















£seudoaiteromohas sp . S3 putative giucosyl hydrolase piecuisoi diiadadptiv^ 
response regulatory protein (ada) genes, complete cds . 




ORF Name 


NT 

NTID AAID Length 


AA 
Length 


Score 


Pr 


obability 


^Al£A0....a£...2A2 


1230 ^452 2 00 


603 


352 






4.4e-32 


Protein name 






Locus Name 






Acc# 


unknown 


gp:AF00bO34 






AF006034 


Description 


















Clostridium pasteurianum 1 , 3 -propanediol aenyarogenase tctfiaij 
cds . 


gene, compxet-e 




ORF Name 


NT 

NTID AAID Length 


AA 
Length 


Score 


Probability 


2B.5.1&Al£.^t2^.b. 


1231 6453 78 


2.5 1 











Protein name Locus Name 



Description 
[NO-HIT 



383 



ORF Name 



NTID 



NT AA 
_ — _ _ — _ Score Probability 
AAID Length Length J - 



25514555 t3 147 



15454 



86 



0.010 



Protein name 



Locus Name 



probable serine -threonine -protein kinase 



pir :T41341 



Acc# 
T41341 



Description 



ORF Name 



NTID 



AAID 



TTIT 



Protein name 



hypothetical protein MTH84 7 



Description 



NT 



AA 



Length Length 
TTT 



Score Probability 
|3.9e-08 



T2T 



Locus Name 



pir:A59213 



Acc# 



A69213 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
[S4SS — 



TTFT 



Score Probability 

— 



3.0e-24 



Protein name 



Description 



Locus Name 



Acc# 



sp : P&tkjBkCSU 



PRIMOSOMAL PROTEIN M' (REPLICATION FACTOR Y) 



ORF Name 



NTID 



AAID 



NT AA 
Length Length 



Score Probability 



TIT 



Protein name 
Description 

NO -HIT 



Locus Name 



Acc# 



384 



NT 



AA 



ORF Name 



NT ID AAID Length Length 

[£T5I5 — 



TIT 



Score Probability 
5T2 



|2.0e-91 



Protein name 

Description 
aM1d0Hy£>£0La£S 11) (L-A£rtA£E 11) (COLaSpaSe) 



Locus Name 



Acc# 



sp:ASG2_EC0LI | P00805 



NT 



AA 



ORF Name 



NT ID 



26460937 £2 102 



AAID Length Length 
ST^3 — 



1179 



Score Probability 
2.6e-S7 



Protein name 



Locus Name 



mannose-l-phospnate guanylyltranst erase 



Description 



jpir:ri?2363 



Acc# 



H72303 



ORF Name 



NT ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



6460 



3.0e-56 



Protein name 

Description 
(EC 2.3.1.-) 



Locus Name 



sp:YJVS_YEAST 



Acc# 



P40892 



ORF Name 



NT ID 



NT AA 
_ — T — Score Probability 
AAID Length Length JL 



2L&5aaia2L...tl...2L2L I 11239 



6461 



4S2" 



T3W 



3.3e-Sl 



Protein name 



Locus Name 



oxidoreductase , aldo/iceto reductase tamily 



pir:E72254 



Acc# 



E72284 



Description 



385 



NT 



AA 



ORF Name 



NTID 



AAID 



26601062 c3 259 



Length Length 
"51 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



i29.3.2aso.i„±i...i3.i I imr 



Length Length 



Score Probability 



TUT 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 
— — Score 
Length Length 



lllllS£.2...a2...2l&. I (1237 



Protein name 
Description 

NO-HIT 



Locus Name 



Probability 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



mas.o.2...c3.„.2.ai i kzu 



IT 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



aaa452L55...ci...iaa | rraw 



Length Length 



Score Probability 
TO3 



0.014 



Protein name 



Locus Name 



hypothetical protein 2 



pir :S49113 



Acc# 



S49113 



Description 



386 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length ^ 



34078300 ±3 152 



Protein name 



Locus Name 



integrase 



gp;BEU75371 



Acc# 



U75371 



Description 



BacteroicLes tragilis transposon Tn4555 TnpA (tnpAJ , integrase (int) , TnpC 
(tnpC) , excisionase (xis) , mobilization protein (mobA),and beta- lactamase 
(cfxA) genes ; complete cds; and unknown genes. 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



34104127 t3 141 



11246 



3.0e-26 



Protein name 



Locus Name 



sp : lft<3A_VlBCfi 



Acc# 



P27772 



Description 

T RON - RE GUL ATED OUTER MEMBRANE VIRULENCE PROTEIN PRECURSOR 



ORF Name 



3.42&0.9.11..±3....15.a., 



Protein name 



NTID 



AAID 



— , — , Score Probability 
Length Length 



TZTT 



\64&9 



Locus Name 



Acc# 



Description 
[NO-HIT 



ORF Name 



isii4aaa„±2...fi5 



Protein name 



NTID 



AAID 



16470 



NT 



AA 



— , — , Score Probability 
Length Length 

7TE~ 



Locus Name 



Acc# 



Description 

NO-HIT 



387 



NT 



AA 



ORF Name 



NTID 



35704786 t2 92 



AAID Length Length 

mn — 



Score Probability 
2.5e-29 



TIE 



Protein name 



Locus Name 



integrase IntNl 



gp:BUU51Si7 



Acc# 



U51917 



Description 



Bacteroides unitormis insertion element NBU1 fragment, mtegraselntNl gene, 
complete cds . 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length JL 



3§3SSl-7 c2 24^7 



T2W 



H4T 



TTTT 



l.5e-l34 



Protein name 



Locus Name 



aspartate ammonia- lyase 



gp:WSAJ2933 



Acc# 



AJ002933 



Description 

Wolineila succinogenes aspA, dcuA genes and partial ansA gene. 



NT 



AA 



ORF Name 



NTID 



AAID 



£T7T 



Length Length 



Score Probability 
TSu 



|2.9e-il 



Protein name 



Locus Name 



AlgZ 



|gp:PAU52431 



Acc# 



U52431 



Description 



Pseudomonas aeruginosa AlgR- cognate sensor AlgZ (algZ) gene, complete cds. 



NT 



AA 



ORF Name 



£Q.aQ.9.5.3....£2....8.7. i 13352 



NTID AAID Length Length 

[ST74 



Score Probability 
3.1e~0S 



147 



Protein name 



Locus Name 



transcription regulator 



gp;AF008220 



Acc# 



AF008220 



Description 
Bacillus subtilis rrnB-dnaB genomic region. 



388 



NT 



AA 



ORF Name 



NT ID 



AAID 



4072187 CI 172 



FT7T 



Length Length 
TTZZ — 



Score Probability 
1153 



9.3e-95 



Protein name 



Description 



Locus Name 



sp:DX3_BA<^SU 



Acc# 



P54523 



PftOSABLH l-MOmVLULOSH-S-fHOS&SAtE! SYNTHASE (DXf SYNTHASE) 



ORF Name 



423162 cl 207 



Protein name 



NTID 



AAID 



16476 



NT AA 

— , — , Score Probability 
Length Length 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



l £345.3.3..7....t3...110... 



Protein name 



Description 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length *~ 



6477 



1206 



L4T 



3.Se-0$ 



Locus Name 



gp:ECASPA 



Acc# 



X02307 



~E~! coli aspA gene tor aspartase (L-aspartate ammonia- lyase) (EC4 .3.1.1) 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



2421 



T7T 



i.0e-08 



Protein name 



Locus Name 



R27-2 protein 



pir :T302y6 



Acc# 



T30296 



Description 



389 



NT 



AA 



ORF Name 



NTID 



AAID 



14960312 tl 46 



TZ5T 



Length Length 



T57T 



Score Probability 
TZ1 



7.2e-16 



Protein name 



Locus Name 



putative integrase 



gp:BA124239:J 



Acc# 



AJ242593 



Description 
Bacteriophage A118 compiete genome. 



ORF Name 



575712 tl 42 



Protein name 

Description 
NO-fll* 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



Locus Name 



Acc# 



ORF Name 



Protein name 

Description 
NO-HIT 



NT 



AA 



NTID 



AAID 



Length Length 
£P7 



Score Probability 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length <£ - 



l.6e-20 



Locus Name 



hypothetical protexn sir2078 



pxr:S77566 



Acc# 



S77566 



Description 



390 



ORF Name 



5800466 ci 152 



Protein name 

Description 
[NO-HIT 



NT ID 



AMD 



NT AA 

— , — , Score Probability 
Length Length -L - 



FT 



Locus Name 



Acc# 



ORF Name 



NT ID 



AAID 



s&iafl2L±2...aa i wi&i 



Protein name 



probable pretoldm subunit APE1440 



Description 



NT 



AA 



Length Length 
55? 



Score Probability 

m 



Locus Name 



pir :G72622 



0.00034 



Acc# 



G72622 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



WFF 



Protein name 



Locus Name 



putative transmembrane protein Wzc 



gp:AF104512 



Acc# 



AF104912 



Description 



Escherichia coli K30 capsule biosynthesis cluster, partialsequence . 



NT 



AA 



ORF Name 



NTID 



1264 



AAID Length Length 

Tin - 



Score Probability 



JUT 



75 



0.034 



Protein name 



Locus Name 



nuclear tactor kappa-B2 



gp:HSTJ2flfllS 



Acc# 



V2Q816 



Description 

Human nuclear tactor kappa -B2 (NF-KB2 J gene, partial ccts . 



391 



NT 



AA 



ORF Name 
133501 ±2 2§ 



NT ID 



AAID 



6487 



Length Length 



T7TT 



Score Probability 
TTOS — 



9.0e-144 



Protein name 

Description 
(GLNRS) 



Locus Name 



Acc# 



sp : SY0_ECOLI 



NT 



AA 



ORF Name 



NTID 



AAID 



1J50033 t'l 11 



Length Length 
TIT 



Score Probability 



Protein name 

Description 
MO-HlT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



T7ZT 



6489 




172 


51$ 


470 





1.4e-44 



Protein name 
Description 

PROBABLE THIOL PEROXIDASE, 



Locus Name 



sp:TPX_MYOTtJ 



Acc# 



P95282 



NT 



AA 



ORF Name 



NTID 



iI8.5.9..7.D.2...±3....5.4.. 



AAID Length Length 
ttJQ 



T73T 



Score Probability 
T3T 



3.8e-ll 



Protein name 



Locus Name 



transposase 



gp:AF038S66 



Acc# 



AF038866 



Description 



Bacteroides fragilxs transposon Tn552 0 transposase (bipHJ andmoJDiiization 
protein BmpH (bmpH) genes, complete cds . 



392 



ORF Name 



NT ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



13945437 c3 



TZZT 



86 



0.00077 



Protein name 



Locus Name 



sp:DBH_THEMA 



Acc# 



P36206 



Description 
DlIA-fil3!«)Il*G PROTEIN m 



ORF Name 



15738937 tl 24 



Protein name 



NTID 



TT7TT 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



TUT 



Locus Name 



Acc# 



Description 



ORF Name 



NTID 



AAID 



lfi4QSfiai.±l„.5i I P37T 



Protein name 



hypothetical protein Rvl624c 



Description 



NT 



AA 



Length Length 
TF? 



Score Probability 
i.9e-0§ 



TT7 



Locus Name 



pir :F70558 



Acc# 



F70558 



ORF Name 



NTID 



AAID 



NT AA 
— , — , Score 
Length Length 



TZTF 



raw 



T7T 



Probability 
l.ie-10 



Protein name 



Locus Name 



conserved hypothetical protein MTH72 



pir :B69196 



Acc# 



B69196 



Description 



393 



ORF Name 



24070786 c2 47 



Protein name 



Description 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



TTTT 



7ZT 



i.7e-32 



Locus Name 



sp:YQGH_BACSU 



Acc# 



P46339 



ORF Name 



c3 100 



Protein name 



NT ID 



TTTT" 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



[42T" 



11373" 



Locus Name 



Acc# 



Description 
IWO-Hfr 



ORF Name 



NT ID 



as442iaa..±i...is t 



Protein name 



AAID 



NT 



AA 



Length Length 



Score Probability 



hocus Name 



Acc# 



Description 

NO-HIT 



ORF Name 



NTID 



AAID 



aaSfil3.£6....tl...2Q I |T37? 



£4W 



Protein name 



hypothetical protein 



Description 



NT 



AA 



Length Length 
T5T 



WZT 



Score Probability 
l.3e-07 



TTT 



Locus Name 



pir :T28682 



ACC# 



T28682 



394 



NT 



AA 



ORF Name 



NTID 



34179077 ii 16 



1277 



AAID Length Length 




7WT 



Score Probability 
T7B 



1.2e-10 



Protein name 



Locus Name 



|sp:EPSA_BtmSO 



Acc# 



Q45407 



Description 

3£>S 1 &OLYSACCMAftlf)E EXPORT OtJTER MEMBRANE! P&OtSIN fiPSA £>Rfi«JRS0R 



ORF Name 



NTID 



NT AA 
_ * — , i , — ^ Score Probability 
AAID Length Length JL 



36131937 cl 78 



1278 



REITS" 



TIT 



Protein name 



Locus Name 



phosphate -binding protein PstS 



fpirTHFSuTT 



Acc# 



H69097 



Description 



NT 



AA 



ORF Name 



NTID 



1279 



AAID Length Length 
ttUI — 



TTTT 



Score Probability 
2 . 7e-64 



656 



Protein name 



Locus Name 



GumD protein 



pir :S67820 



Acc# 



S67820 



Description 



NT 



AA 



ORF Name 



NTID 



±lk±0A2...al.A$. I 



AAID Length Length 
— 



Score Probability 

iras — 



I3.6e-21 



Protein name 



Locus Name 



hypothetical protein [ repA 5 ' region) 



pir :S30120 



Acc# 



S30120 



Description 



ORF Name 



Protein name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



PIT 



TOT" 



DedA family protein 



Locus Name 
[pir:B75253 



Description 



|2.6e-27 



Acc# 



B75253 



395 



NT 



AA 



ORF Name 



NTID 



AAID 



TZ$T 



Length Length 
T5T 



Score Probability 




|2.1e-23 



Protein name 



Locus Name 



N-acetylmuramoyl-L-alanine amidase homolog 



pir :G64126 



Acc# 



G64126 



Description 



ORF Name 



NTID 



AAID 



7.mifi2L...tl...4 



1283 



Protein name 



phosphate -binding protein PstS 



Description 



NT AA , , . n ■ 
— _ „ — Ll Score Probability 
Length Length ^ 



7^" 



|3.1e-38 



Locus Name 



pir :H69097 



Acc# 



H69097 



ORF Name 



Protein name 

Description 
INO-MiT 



NTID 



NT AA 

,, TT ^ T — _ — Score Probability 
AAID Length Length 



Il284 



X47 



444 



Locus Name 



Acc# 



ORF Name 



NT 



AA 



NTID 



1&1U3X7.6....C.3....114.. I 



AAID Length Length 

ran — 



Score Probability 




12 . 5e-34 



Protein name 



Locus Name 



gp;PGU6 02 08 



Acc# 



U60208 



Description 

Porphyromonas gingival is ortl, ort2 and ort3 genes, complete cds . 



396 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length JL 



179752 c2 94 



1286 



TZUW 



9 . 2e-62 



Protein name 



Description 



Locus Name 



Acc# 



sp:VBD6_EC0LI 



iffPOtSfitlCAL 46.6 KC PROTEIN Itf PH2£-NFNB ItfxElRGENlC REGION 



NT 



AA 



ORF Name 



22053367 c2 92 



NTID AAID Length Length 



1863 



Score Probability 
S7B 



9.5e-£7 



Protein name 



Locus Name 



alpha- 1, 3/4-lucosictase precursor 



gp:£Stf3M54 



Acc# 



U39394 



Description 



Streptomyces sp. alpha- 1, 3/4- tucosidase precursor gene, compieteccts . 



NT 



AA 



ORF Name 



2!ftJiS5...cl...m I 



NTID AAID Length Length 
[^TS 



Score Probability 
0.0027 



Protein name 



Locus Name 



Acc# 



sp:YEHT_ECOLI 



Description 

ttY^OT'&E'rlCAL 27.9 KD £>R0tEiN IN MOLR-BGLX iaTCEkSStflC MiGiOtf 



NT 



AA 



ORF Name 



NTID 



AAID 



ff2F9" 



Length Length 
$1 



Score Probability 



Protein name 
Description 

pc^irrr 



Locus Name 



Acc# 



397 



NT 



AA 



ORF Name 



NT ID 



24406550 c2 99 



TZTT 



AAID Length Length 
^ — 



TUT 



%3U~ 



Score Probability 
TUZ 



1.0e-05 



Protein name 



Description 



Locus Name 



gp:GGU25741 



Acc# 



U25741 



Group G streptococcus strain g6 emmL gene # partial cds . 



ORF Name 



25428436 C2 105 



Protein name 



NTID 



AAID 



T75T 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 
NO-HIT 



ORF Name 



Protein name 



NTID 



NT AA 
T — T — _ Score Probability 
AAID Length Length dL 



TTTT 



FST4" 



probable extracellular nuclease 



Description 



TUTT 



TUT 



Locus Name 



0.035 



Acc# 



D75625 



ORF Name 



NTID 



NT AA 

— _ — Score Probability 
AAID Length Length • £ - 



2£iiaaia...aa...iii I 11233 



^T5" 



Protein name 



silent surface layer protein 



Description 



WT 



TTZT 



TUT 



'0.0058 



Locus Name 



(gpT^FUT^FT 



Acc# 



AF079365 



Lactobacillus crispatus silent surtace layer protein (cbsBJ gene, partial 
cds . 



398 



ORF Name 



NTID 



AAID 



NT AA 
T ~ v T — . , Score Probability 
Length Length 



265878 13 62 



T2W 



6516 



Protein name 



Locus Name 



0.043 



Acc# 



MAR binding tilament-UKe protein l:MFPl 
protein 



pir:T07lll 



Description 



T07111 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



2520 



TTT 



0.00012 



Protein name 

Description 
FERRIC ENTEROS ACT IN RECEPTOR PRECURSOR 



Locus Name 



sp : PFEA_PSEAE 



Acc# 



Q05098 



NT 



AA 



ORF Name 



NTID 



AAID 



3.D.19.2.0.a£L.cl....7.&., 



11296 



Length Length 



Score 



1701 



Probability 
|7.7e-83 



Protein name 
Description 

BETA- GALACTOS IDASE , ( LACTASE ) 



Locus Name 



Acc# 



|sp:BGAL_THEMA 



NT 



AA 



ORF Name 



NTID 



iiia£ia.i...ci...ni I 



AAID Length Length 



Score Probability 
73B 



T7^e r 7T' 



Protein name 



Locus Name 



Acc# 



DNA-directed DNA polymerase, III chain 
dnaX:DNA polymerase III (gamma and tau 
subunits) tinaX 



pir :S13786 



Description 



399 



ORF Name 



34570437 t2 47 



Protein name 

Description 
(PfiMlDASfl D) 



NT 



AA 



NTID 



AAID 



"Z5TT 



Length Length 
TWT 



TFTT 



Score Probability 
|fi.5e-iifi 



1142 



Locus Name 



|sp:PEPE>_EC!C>LI 



Acc# 



P15288 



ORF Name 



35990807 Cl 79 



Protein name 



NTID 



AAID 



1299 



transaldolase- related protein 



Description 



NT AA 

— , — , Score Probability 
Length Length 



TTT 



wnr 



1.6e-59 



Locus Name 



pir:G723$4 



Acc# 



G72394 



NT 



AA 



ORF Name 



NTID 



AAID 



~g%7T 



Length Length 
TIT" 



Score Probability 
|2.0e-05 



144 



Protein name 



Description 



Locus Name 



|gp:APU72238 



Acc# 



U72238 



Anabaena PCC7120 0E>F£i, , > 

sequences . 



ORFR4 , and ORFR5 genes , complete 



ORF Name 



NTID 



AAID 



NT AA 

— „ — , Score Probability 
Length Length 



A4iaia...ca...a.flLfi I mar 



^UT 



WTT 



2.6e-S>6 



Protein name 
Description 

BETA- (3ALACT0S IDAS E , ( LACTASE ) 



Locus Name 



|sp:BGAL_BACME 



Acc# 



052847 



400 



ORF Name 



4951651 12 35 



Protein name 



ITT 



333" 



Locus Name 



Acc# 



Description 
MO-HIT 



ORF Name 



NTID 



li5iiaaaa„±i..,i<A I u^rn 



Protein name 



AAID 



probable proteinase PAB196 0 



Description 



NT AA , , . . . 
— , ^ — A , Score Probability 
Length Length A - 



TUT 



'5.ie-15 



Locus Name 



[pir:A75179 



Acc# 



A75179 



ORF Name 



Protein name 

Description 
NO-HIT 



NT 



AA 



NTID 



TJUF 



T — ^, — ^, Score Probability 
AAID Length Length JL 



£7T 



Locus Name 



Acc# 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID Length Length 
^ST7 — 



274 



1825 



Score Probability 




0.012 



Locus Name 



gp:ATAC012563 



Acc# 



AC012563 



Arabidopsis tnaiiana cnromosome I BAC T23K23 genomic sequence, complete 
sequence . 



401 



NT 



AA 



ORF Name 



NT ID 



AAID 



6528 



Length Length 



Score Probability 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NT ID 



AAID 



6529 



— , — , Score Probability 
Length Length 



Protein name 



Description 



Locus Name 



Acc# 



[NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



3.3.45.fc9.&2...£1...2 



113 A3 



Length Length 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



1N0-HIT 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



|iD.aSA3..7...±l...l.. 



TJUT 



TT1T 



0.00062 



Protein name 
Description 

HYPOTHETICAL PROTEIN MJ0066 



Locus Name 



sp:Y066_METJA 



Acc# 



Q60377 



402 



NT 



AA 



ORF Name 



NTID 



AAID 



6S37782 rl 4 



rmr 



Length Length 



Score Probability 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



aftiau2i„±i.„a I irnr 



Length Length 
TT5" 



Score Probability 



1W 



Protein name 

Description 
JNO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



iaa4i5tfe3L...ci...a.2La i 11312 



T5T 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



TJTT 



2¥T 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



ii5flaaafiL...at...iia 1 pn 



Length Length 



Score Probability 



Protein name 
Description 

m-uif 



Locus Name 



Acc# 



403 



qrf Name v*±j.±j 


AAID 


NT AA 
Length Length 


Score 


Probability 




138606S:i_c:M.y4 1315 


6537 


488 


|1467 


1040 








Protein name 






Locus Name 




ACC# 

AJ249201 




cell dxvision protein 






'"] gp:PAL_49_Ul 






Description 
















"f>revotella albensis itsQ (partial) , rtsA ana 


L ttsz genes ana 








ORF-fts (partial) . 
















ORF Name NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 

1 T7> 7TT 1 ■*> A 




142Sim2_cl_l.il 1316 | 




489 


1470 


1275 








Protein name 






Locus Name 




ACC# 
Q51831 










sp:Mttft<i_PORCi± 






Description 














I 


ACETYLMJRANOYL-h -ALANINE synthutas*;; 












| 


ORF Name NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 

-I VT 7TZ T=\ 


1 


l„5.Mlb.„...a2...1b.4 1317 


6539 


254 


765 


341 






I 


Protein name 






Locus Name 




ACC# 




| FtsQ 






I gp:A£004555 




AB004555 




Description 
















£>orphyromonas gingivalis genes ror Ftsy, tts*, Ftsz, 


complete 


CC 


IT. 




ORF Name NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 




±6A6£.±...c-<L..±te. 1318 


6546 


669 


2010 


5534 




u . u 




Protein name 






Locus Name 




ACC# 

1 7\Dm 





DMA gyrase ts subunit 



Description 

Bacteroides tragiixs gyrb gene t or UUA gyrase a sucunit, compxeuecas . 



404 



NT 



AA 



ORF Name 



NTID 



AAID 



16593937 Cl 127 



TTTT 



Length Length 
T35" 



Score Probability 

m% — 



6.7e-36 



Protein name 



Description 



Locus Name 



sp:VLAO_BAC^U 



Acc# 



007639 



HYPOTHETICAL 43.7 KD PkOTSltf itf KffRfi-PVCA llrftfiftSfi^lC REGION 



NT 



AA 



ORF Name 



NTID 



AAID 



16808437 c3 192 



Length Length 



WW 



Score Probability 

\m — 



Protein name 



Locus Name 



UDP-N-acetylmuramoylalanine-D-glutamate 
ligase 



ETrnrTMTT 



De script ion 



€.9e-lfi 



Acc# 



H70477 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



|I7.7.iB.l...cl...I3.7. I 



rex- 



0.047 



Protein name 



Locus Name 



OrtslOc 



IgpT^m^TT 



Acc# 



U42227 



Description 



Saccnaromyces cerevisiae replicative mitocnondrial DNA polymerasecatalytic 
subunit (MIP1) gene, nuclear gene encoding mitochondrialprotein, partial 
cds, and putative 10-f ormyl-tetrahydrof olatebinding protein (FTB1) gene, 
complete cds . 



ORF Name 



NTID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



liaaaaaii-ai-iasL I \cm 



6544 



TFT" 



2.1e-25 



Protein name 



Locus Name 



hypothetical protein 1 



pir :S70830 



Acc# 



S70830 



Description 



405 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 
„ — , — ^, Score Probability 
Length Length * L 



Locus Name 



Acc# 



Description 
MO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



auaflia...c3L„.ia3L i \tth 



Length Length 



Score 



Probability 
7.6e-60 



Protein name 



Locus Name 



unknown 



gp:EPU94707 



Acc# 



U94707 



Description 



Enterococcus taecalis strain 
yllB, yllC, yllD, pbpC, mraY, 
complete cds. 



A24836 ceii wall/ceil division genecluster, 
murD, murG, divlB, ftsA andftsZ genes, 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length JL - 



fb4T 



T55" 



|2.5e-l3 



Protein name 



Locus Name 



Acc# 



sp:YGY4_HAL5Q 



P21562 



Description 

HYPOTHETICAL £0.2 KT> PROTEIN IN THE 5 ' REGION OP GYRA AMD GYRB (ORE 4) 



ORF Name 



NTID 



AAID 



NT AA 
— — Score 
Length Length 



m&14fifl...c2...I&6. I [1325 



TTZT 



Probability 
1.4e-13i 



Protein name 



Locus Name 



cell division protein 



gp:PAL245201 



Acc# 



AJ249201 



Description 



Prevotella albensis rtsQ (partial) , ttsA and ftsZ genes and 
ORF-fts (partial) . 



406 



NT 



AA 



ORF Name 



NTID 



234S8S37 ±3 100 



1327 



AAID Length Length 
6549 



Score Probability 



T7T 



Protein name 

Description 
GTCPETT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



2.3.6A6.9A2...C.1...12.6. I 11328 



6550 



Length Length 
TIT 



WT 



Score Probability 




8.9e-34 



Protein name 

Description 
ADDING 



Locus Name 



Acc# 



|sp:MCT>D_BAC5U | 



NT 



AA 



ORF Name 



NTID 



AAID 



\2A.0.11±ll...al..±lZ I 



Length Length 
— 



Score Probability 




|3.9e-72 



Protein name 



Locus Name 



hypothetical protein 



bir:S7£S27 



Acc# 



S76527 



Description 









NT 


AA 


ORF Name 


NTID 


AAID 


Length 


Length 


2£4.13.8.7.5....C.2...16.:7. 


1 li3t> 




61 


185 



Score Probability 



Protein name 

Description 
MO-HM 



Locus Name 



Acc# 



407 



ORF Name 



24414077 ti 53 



Protein name 

Description 
WO -HIT 



NT 



AA 



NTID 



AAID 



TJJT 



^TT 



— , — L , Score Probability 
Length Length ^ 

TUT 



Locus Name 



Acc# 



ORF Name 



2LA5£4a...£3L.7.5 



Protein name 



NTID 



AAID 



T5TF 



conserved Hypothetical protein 



Description 



NT AA 

— „ — , Score Probability 
Length Length sU 



WIT 



155- 



Locus Name 



pir:H754S0 



|4.$e-09 



Acc# 



H75460 



NT 



AA 



ORF Name 



NTID 



AAID 



££fiL5a5.7..7....£2L...&a... 



Length Length 
TIT 



Score Probability 



TZT 



Protein name 
Description 

NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



2Z&1216±.±2..Al I tlJTZ 



Length Length 



Score Probability 



Protein name 

Description 
KO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



TJJT 



15557 



Length Length 
TIT 



Score Probability 



Protein name 

Description 
INO-HTT 



Locus Name 



Acc# 



408 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



3TT 



7.5e-lii4 



Protein name 



Locus Name 



Hemolysin A 



gp:PMU27b8 7 



Acc# 



U27587 



Description 

Prevotella melaninogenica Hemolysin a 



(piiyA) gene, complete cas. 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



'2$3lSia c3 lay 



TUT 



W5T 



2.6e-8y 



Protein name 



Locus Name 



UDP-MurNac-tripeptiae syntnetase 



|pir:fi704b0 



Acc# 



E70450 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



3.0.16.6.417....ci....ly.l.. 



1328 



Length Length 




Score Probability 
i.5e-06 



7F 



Protein name 



Locus Name 



phospho-n-acetylmuramoyl-pentapeptiae- 
transferase (mraYl) RP595 



P ir:lil7i6b4 



Acc# 



E71664 



Description 



ORF Name 



3.I5.3.Mb.Z..±l...iU 



Protein name 



NTID 



— — score Probability 



AAID Length Length 



1339 



5561 



Locus Name 



conserved hypothetical protein aq_«b4 



pir :B70^74 



Description 



Acc# 



B70374 



409 



ORF Name 



3166057 c2 163 



Protein name 



Description 



(EC 2.4.1.-) 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length dL - 



1170 



l.Oe-62 



Locus Name 



Acc# 



|sp:MUSG_BACSU 



ORF Name 



,33398557 ±3 102 



Protein name 



NTID 



1341 



AAID 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 
iKfO-MT 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



3.3..7.a.7.22..7....X.1...3.2 



T34T 



W5T 



Protein name 
Description 

(DUTPASE) (DUTP PYkO PHOSPHATASE) 



Locus Name 



sp:bOT_A0UA£ 



Acc# 



066592 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



3.3.2&6.D.3.8....11...3l5... 



TJZT 



TZT 



TTTT 



[TFT" 



9.8e-I5 



Protein name 



Locus Name 



putative TonB- dependent outer membrane 
receptor 



gp:AF04S749 



Acc# 



AF048749 



Description 



Bacteroides tragilis capsular polysaccharide biosynthesis operon, complete 
sequence . 



410 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length ^ 



34260912 Cl 122 



T5T 



IT 



Protein name 



Locus Name 



Acc# 



Hypothetical protein 2 



pir:14075y 



Description 



ORF Name 



NTID 



AAID 



11545 



Protein name 



probable RNA polymerase Sigma tactor 



Description 



NT AA 
T — , T — ^ Score Probability 
Length Length ^ 









19§ 



|4.8e-i9 



Locus Name 



pir :T4201b 



Acc# 



T42015 



NT 



AA 



ORF Name 



NTID 



aa£iaii.±i...ii rrns 



AAID Length Length 

6568 



Score Probability 




2.7e-14 



Protein name 

Description 
Mus musculus P4(21)n mRNA, partial cds. 



Locus Name 



gp:AB02S§5§ 



Acc# 



AB028868 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
— 



Score Probability 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



411 



NT 



AA 



ORF Name 



NTID 



I4079668 ±2 66 



AAID Length Length 

ff^s — 



Score Probability 
TT53 



Protein name 



Locus Name 



RING tinge r protein 



gp:AF036255 



Acc# 



AF036255 



Description 



Rattus norvegicus RING finger protein mRNA, complete cds. 



NT 



AA 



ORF Name 



4174013 tl 1 



NTID AAID Length Length 
SFTl 



T5T 



Score Probability 
<S.7e-03 



TUT 



Protein name 



Locus Name 



RecO 



Acc# 



U17037 



Description 



Haemophilus intluenzae opacity associated proteins OapA and OapB toapA and 
oapB) genes, complete cds, and DNA recombination andrepair protein (recO) 
gene, partial cds. 



ORF Name 



NTID 



NT AA ^ _ , , . _ . . 
— — Score Probability 
AAID Length Length i 



ia.7.5.812L...a3....ia6. I 



TTT 



0.0015 



Protein name 



Locus Name 



Acc# 



DNA-bmding protein HB : DNA- binding protein 
HU: DNA -binding protein II 



pir :S00015 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



4asmm..fis I itjst 



357" 



TU5T 



Protein name 



Locus Name 



Acc# 



Description 



MO-HIT 



412 



ORF Name 



NTID 



AAID 



NT AA 
— — Score 
Length Length 



511726$ ci 121 



T33T 



S37T" 



TS75 — \vm 



TTT 



Probability 
1.5e-i3 



Protein name 



Locus Name 



sp:YABB_ECOLT 



Acc# 



P22186 



Description 

HYPOTHETICAL 17.4 KD PkOTEltf lirf fRt»-MSL itirgfeSeNiC RESIOltf (OrPCJ 



NT 



AA 



ORF Name 



NTID 



AAID 



5S>$4067 ci 123 



T333" 



F375 - 



Length Length 
7U3" 



[2TTT 



Score Probability 
a.le-34 



337 



Protein name 

Description 
BINDING PROTEIN) 



Locus Name 



Acc# 



sp : SP5D_BACSU "[ Q03524 



NT 



AA 



ORF Name 



NTID 



£fl7ifiai..ci...ia5 i [i33¥ 



AAID Length Length 



6576 



TT3T" 



Score Probability 
313 



■8.1e-51 



Protein name 
Description 

( tJbP - MU&sfAC - KrtTAPfi £>T1 Dfi PHOS £H<M?RAKfS i^ERaSe ) 



Locus Name 



sp : M£AY_B0RBU 



Acc# 



Q44776 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



&fi.4aaaa...a2„.t&a 1 11333- 





6577 




85 


^70 




153 





5.4e-Il 



Protein name 



Locus Name 



probable ribosomal protein S20 rpsT 



pir:G70684 



Acc# 



G70684 



Description 



413 



NT 



AA 



ORF Name 



NTID 



AAID 



675S437 ti 36 



Length Length 



BITT 



Score Probability 
TBS 



Protein name 



Locus Name 



probable sultoiipid biosynthesis protein SqdA 



Description 



pir:A42380 



7.0e-13 



Acc# 



A42380 



NT 



AA 



ORF Name 



NTID 



AAID 



XD.3.£&1&Q....£;L...3. I 11357 



Length Length 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 
Length Length 



— , Score Probability 



luM27.S3..±3....46. I JTJ^ 



TUT 



Protein name 

Description 
IHO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



lQ.7Afn.22....Cl...SL5L 



AAID Length Length 
ttZI — 



846 



Score Probability 
153 



§.ie-0§ 



Protein name 



Locus Name 



potassium channel alpha subunit Kv2 . 2 



gp:XLU20342 



Acc# 



U20342 



Description 



Xenopus laevis potassium channel alpha subunit Kv2.2 (XShabl2) mRNA, 
complete cds . 



414 



NT 



AA 



ORF Name 



NTID 



12518961 C2 100 



AAID Length Length 
— 



Score Probability 
TZI 



Protein name 



Locus Name 



probable protoporpnyrmogen oxidase (hemK) 
RP847 



pir:G71646 



Acc# 



G71646 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
T5T 



Score Probability 
MS 



6.1e-42 



Protein name 



Locus Name 



conserved hypothetical protein MTH700 



pir:ES9193 



Acc# 



E69193 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— . — , Score Probability 
Length Length ^ 



14S.D.S.SaO...±3.....1£ I 11357 



6584 



^5" 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



15.a3.5.<U.6....c3....l5.£ I 



Length Length 



5TT 



Score Probability 

tz§ — 



6.1e-lS 



Protein name 



Locus Name 



hypothetical protein yitL 



pir :E69840 



Acc# 



E69840 



Description 



ORF Name 



NTID 



16M2517....C.3....L3A 1 [OF¥ 



Protein name 

Description 
NO-HIT 



AAID 



NT 



AA 



Length Length 
TO 



Score Probability 



2TT 



Locus Name 



Acc# 



415 



ORF Name 



NT ID 



AAID 



NT AA 
Length Length 



19953510 c2 102 



W5T 



TT7T" 



Probability 
|4.ie-fii 



Protein name 



Locus Name 



argminosuccinate lyase 



pzr :D70419 



Acc# 



D70419 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



2L3.6..73A5.5....C1...B.D. , 



6588 



Length Length 



Score Probability 
TZZ 



3.9e-0S 



Protein name 
Description 

REGULATORY PROTEIN RECX 



Locus Name 



sp ;£ECX_PSEAE 



Acc# 
P37860 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



2A[A2.&.S.6.2....£3...3.3. 



7.2e-55 



Protein name 



Description 



Locus Name 



Acc# 



sp;ASSY_METJA "J Q60174 



L1GASE) 



NT 



AA 



ORF Name 



NTID 



AAID 



in>ir 



Length Length 



Score Probability 



TOT 



Protein name 
Description 

NO-HIT 



Locus Name 



Acc# 



416 



NT 



ORF Name 



NTID 



AAID Length Length 



AA 

— , Score 



24353376 tl b 



T3ZT 



5TT 



TTTT 



Probability 
|2.6e-130 



Protein name 



Description 



Locus Name 



gp:AB024y46 



Acc# 



AB024946 



Escherichia coli plasmid pBl7l una, complete sequence. 



— — Score Probability 
Length Length 





AA 



ORF Name 



NTID 



AAID 



25663952 tl 2 



1370 



T5TT 



ll.de-24 



Protein name 



Description 



Locus Name 



|sp:MTGA_ACjlOA 



Acc# 



024849 



(EC 2.4.2.-) ( MoNuPTJN CT loNAL TGAi^) 



NT 



AA 



ORF Name 



NTID 



AAID 



TT7T" 



Length Length 




Score Probability 



TIT 



Protein name 



Locus Name 



Acc# 



Description 



(MO-HIT 



ORF Name 



2£3.&M&:A..±2...44.. 



Protein name 



NTID 



AAID 



TTTT 



ribollavm-speciric deaminase 



Description 



— — Score Probability 
Length Length 



1.7e-b3 



Locus Name 



bir;GV22UV 



Acc# 



G72207 



417 



ORF Name 



NT ID 



— — Score Probability 



TTTT 



AAID Length Length 
757 



5575 



752 



TT3 



0.0016 



Protein name 



Description 



Locus Name 



sp:HEXA>LAL>l 



Acc# 



Q17127 



ftEl^tAMfeiRUsT PRECURSOR 



ORF Name 



5556717b ci JLJ1 



Protein name 



NT ID 



TT74- 



AAID 



55S5" 



NT 



AA 



Length Length 



— Score Probability 



T5T 



Locus Name 



Acc# 



Description 



[NO-Hrr 



ORF Name 



Protein name 



NT 



NT ID 



AAID Length Length 



AA 

— Score Probability 



[TT75" 



F57T 



545" 



Locus Name 



N- acetyl - gamma - glut amy i-pnospnate reauctase, I ipir :F6ybU« 



Description 



Acc# 



F69508 



ORF Name 



Protein name 



NT 



AA 



NT ID 



AAID 



TT75" 



55W 



Length Length 
527 



Score Probability 



TIT 



Locus Name 



Acc# 



Description 



418 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



|40950b0 ti bl 



1377 



73T 



Score Probability 
i.6e-:14 



T7S 



Protein name 



Locus Name 



pyrroixne-5-carJDOxyiate reductase 



lgp:^AJl073y 



Acc# 



AJ010739 



Description 



Clostridium sticklandu proc gene ana b' tlanKing region. 



NT 



AA 



ORF Name 



NTID 



AAID 



4577005 C^ 130 



TTTW 



— — , Score 
Length Length 

^3 



Probability 
4.1e-52 



Protein name 



Description 



Locus Name 



sp:P^Bl_BAdsU 



ACC# 



P25972 



OROTATE yHO^PiriOklBOSYLTkAN^^ 'liJkA^ii!, (uPkT) (u^kTA^E) 



— — Score Probability 



AA 



ORF Name 



NTID 



TTTT 



AAID Length Length 
TTTT 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



[4S.0.i0.b.i..±^...3.i.. 



Length Length 




Score Probability 



1UT 



Protein name 



Description 



Locus Name 



Acc# 



[NO-HIT 



419 



NT 



AA 



ORF Name 



NT ID 



4804813 ±2 3b 



AAID Length Length 

mi 



XTTT 



— ^ Score Probability 
S.Se-66 



670 



Protein name 



Locus Name 



|sp:AkGD_bAL l ^U 



Acc# 



P36839 



Description 
ACElfVLOkNX^HlNh! AMINOTRANSFER ASE , (AOUA'I) 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


5ll07l2_c2_114 


1382 


6604 


413 


1242 


338 


1.3e-37 



Protein name 



Locus Name 



sensory transduction iiistiame Kinase 
slr2104 rprotein slr2104 :protein slr2104 



bir:S7S13fe 



Acc# 



S75136 



Description 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


siaia2Ji...ci...ai 


1383 


6605 


659 


1980 


108 


0.033 



Protein name 



Locus Name 



hypothetical protein 1'lOMlo . iu 



pir :T047 72 



Acc# 



T04772 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



5.2.7Mb.u..±Z...3.3. 



TT3¥" 



POTT 



KIT 



TOT 



|i.ie-21 



Protein name 



Locus Name 



arginme repressor 



gp:BtiAJiU9b4 



Acc# 



AJ010954 



Description 

Bacillus stearothermophilus argK gene and partial recJM gene. 



420 



ORF Name 



NT ID 



— — Score Probability 



AAID Length Length 



5270302 ri ii 



6601 



Protein name 



Locus Name 



5.0e-127 



Acc# 



acetyl-CoA synthetase relatea protein 


pir:Fbyiy3 


F69193 


Description 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— Score 
Length 


Probability 


$.9A&ti.l.±^..b.± 


1386 


660B 


334 100b 273 


1 . Oe-23 


Protein name 




Locus Name 


Acc# 


probable malate aenyarogenase, ^-Ketoacid 


pir :S7£'/3b 


S75735 


dehydrogenase : protein sl±08 91 

^^W^rnrrpna.QP TirnhPl D sll0891 


: 2-Ketoacia 






Description 












ORF Name 


NTID 


AAID 


NT 
Length 


— , Score 
Length 


Probability 


±03Ab.±21...t2..±0A 


1387 


6609 


760 2283 410 


- 7.8e-40 


Protein name 


Locus Name 


ACCff 


115K outer membrane protein 


precursor 


susc 


pir:JUe>027 


JC6027 


protein 












Description 












ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


10.5^.0.0.l 5 .^..±l...l^i. 


1388 


6616 


114 


345 
















Protein name 








Locus Name 


Acc# 


Description 













NO-HIT 



421 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



10662877 ci 202 



55TT" 



TIT 



i.6e-lfc 



Protein name 



Locus Name 



putative transposase 



gp:AJ? , 0(j742y 



Acc# 



AF007429 



Description 

Haemophilus paragallmarum IS-like putative transposase gene, complete cas. 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


l072594:>_c3_342 


l3$0 


6612 


60 


183 






Protein name 








Locus 


Name 


Acc# 


Description 
















ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


±0&X$.h3±...a±..:zVA 




6613 


138 


417 


170 


5.5e-l3 


Protein name 








Locus 


Name 


Acc# 










sp : MTGA 




P44890 


Description 














(El 4 ^.4.2l-) {MYOFUNCTIONAL TUA^) | 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


llOlZMA^al^b. 


. 1342 


6614 


333 


1002 


1634 


6.2e-168 


Protein name 








Locus 


Name 


Acc# 



mobilization protein a 



gp:AF11^42 



Description 

feacteroides tragiiis mobiliz ation protein B imoJDBj gene, compietecas , 



422 



ORF Name 



11S32332 tl U 



Protein name 



NTID 



NT 



AA 



AAID Length Length 
555 



Score Probability 



TIT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


iiaaiiii...ci.„i4iA 


13^4 


6616 


2SS 


|86 7 


113 





Protein name 



Locus Name 



transmembrane sensor 



gp:AF060iyJ 



Acc# 



AF060193 



Description 



Pseudomonas aeruginosa pigACDK operon, complete sequence ; Hypothec i 
(pigB) gene, complete cds . 



cal Pigb 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



— Score Probability 



T4T 



S.le-li 



Protein name 



Locus Name 



coliagen-UKe protein 



gp:BTU67y2l 



Acc# 



U67921 



Description 

Bacillus thurxngxensis plasmid p'l'X14-l, MOB, kkl>, ana collagen- HKeprocem 
genes , complete sequence . 



NT 



ORF Name 



NTID AAID Length Length 

T5T 



AA 

— Score Probability 



Protein name 



Locus Name 



Acc# 



Description 



INO-H1T 



423 









NT 


AA 


Score 


Probability 


ORF Name 


NT ID 


AAID 


Length 


Length 








13071^4^_ci_2i^ 


1397 


S519 


466 


1401 


357 




7 .5e-37 



Protein name 



conserved Hypothetical protein 



Description 



Locus Name 



pTrTF7^nr 



Acc# 



H72331 



ORF Name 



Protein name 



NTID 



— — Score Probability 



AAID Length Length 
[2T7 



1W 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 


NTID AAID 


NT 
Length 


AA 

— Score 
Length 


Probability 


ll&M&ll^tl^&b. 


1595 6621 


401 1206 lby 


" i.0e-05 | 


Protein name 




Locus Name 


Acc# 


transposase 




AF038866 


Description 

i ^„4-~^ ■ •f-^^mi-io frangnnooTi Tn 5 n fransposase (bipH) andmoJoilizat ion 





protein BmpH (bmpH) genes, complete cds . 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


14£6.3.^.1...cl...m 




1400 


6622 


193 




5S1> 


19^ 


7.2e-16 



Protein name 



Locus Name 



RNA polymerase sigma tactor sigZ-UKe protein [gpTAFTTT^T 



Acc# 



AF137263 



Description 

Bacberoicles thebaiotaomicr on 30^ ribosomai protein SI6-liKeprotem, rucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds . 



424 



ORF Name 



14589067 tJ Ibu 



Protein name 



NTID 



NT 



AA 



AAID Length Length 
232 



Score Probability 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



795" 



0.00021 



Locus Name 



sp : LCk^YBJRPK 



Acc# 



P28808 



Description 



ORF Name 



NTID 



AAID 



Protein name 



hypotnetical protein aq_2087 



Description 



— — Score Probability 
Length Length 



T2T 



TTT 



3T 



Locus Name 



pir:H7047d 



10.029 



Acc# 



H70478 



ORF Name 



Protein name 



NTID 



AAID 



6626 



— — S core Probability 
Length Length 



TUT 



Locus Name 



Acc# 



Description 



425 



NT 



AA 



ORF Name 



NTID 



156597bd ti bl 



T1UF" 



AAID Length Length 
02 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



ORF Name NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


±Z63±&3A^alJ±9A 1406 


6628 


428 1287 


ji9i 


3.9e-12 


Protein name 






Locus Name 


Acc# 


transposase 


gp:At'0:itt866 


AF038866 


Description 




Bacteroides tragiixs transposon Tnbb2U 
protein BmpH (bmpH) genes, complete cds 


transposase (bipH) anamoniiizacion 




ORF Name NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


lSll&CLSl.±l..ia 1407 


6629 


523 1572 


2044 | 


2.2e-2ll 


Protein name 






Locus Name 


Acc# 








Sp:TRA2_BAUFK 


Q45119 


Description 












"TEEKTSTOSASE FOR INtiEkl'loN SEQUENCE ELEMENT 1S21-L1KW 






ORF Name NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


±5.8.19M±...a±...lti.l). 140^ ™ 


6620 


478 1437 


469 


2 .2e-44 



Protein name 
Description 

f>feOfOt>OftPilVRftlO(SliM 6tft)ASK f (PPO) 



Locus Name 



|sp:W>0XJ4mA 



Acc# 



P56601 



426 



ORF Name 



16491593 c2 27y 



Protein name 



NTID 



— — Score Probability 
AAID Length Length 



TFT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



6632 



NT 



AA 



AAID Length Length 

pzi — 



Score Probability 



Locus Name 



Acc# 



Description 



NO -HIT 



ORF Name 



Protem name 



NTID 



AAID 



NT 



AA 



Length Length 




Score Probability 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



6634 



hypothetical 26. 8K protein 



Description 



NT 



AA 



Length Length 

— 



Score Probability 
0.00024 



Locus Name 



bir:Jc^J22 



Acc# 



JC2322 



ORF Name 



|21g.&12a0.^a^j.b.O... 



Protein name 



NTID 



AAID 



— — Score Probability 
Length Length 

T5M 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 


]M I ID 


7\ 7\ m 


NT 

T Dnrrt" Vi 
■Uciiy Oil 


AA 
Length 




Score 


Probability 


2245^7__c3_i47 


1414 


6636 


187 


564 








Protein name 








Locus 


Name 


Acc# 


Description 
















MO-HIT | 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


226.$±h.b^...z±..:lL2 


1415 


6637 


434 


1305 




140 


2.7e-06 



Protein name 



Locus Name 



immunoreactive b3 KU antigen vuizj 



|gp:AP144641 



Acc# 



AF144641 



Description 



Porphyromonas gm givaiis strain W5Q immunoreactive 53 KB antigenPGl23 gene, 
complete cds . 



ORF Name 


NTID AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


226&2b£lL.alJl±± 


::. 1416 


524 


$75 445 


6.1e-42 



Protein name 



Description 



Locus Name 



lsp:HTM_^Tk(i<J 



Acc# 



030795 



PUTATIVE HEAT ^HocJR 


PROTEIN 


HTPX 










ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


22&lLb&±Jt2J$£ 


1417 


6639 


133 402 


97 


0.0001B 



Protein name 



Locus Name 



MbpB 



IgpiWUaSVlfo 



Acc# 



U25716 



Description 



Bacteroides rragilis mobiliz ation protein MbpA (mJjpAj , MbpB (miDpBjanct MJDpc 
(mbpC) genes, complete cds. 



428 



NT 



AA 



ORF Name 



NT ID 



AAID 



22933438 c3 397 



6640 



Length Length 
TIT 



Score Probability 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NT ID 



AAID 



122aaa4afiL.t1.-ii I rrar^ 



6641 



Length Length 



Score Probability 



TIT 



Protein name 

Description 
IWO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



z3A3.5..7.aa...t2...i3.a i rrzzu 



KMT 



Length Length 



Score Probability 



HUT 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



23.baXZ6.5L...£.3....1ASl I 11421 



Length Length 



Score Probability 



Protein name 

Description 
iNO-HlT 



Locus Name 



Acc# 



429 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



2.1e-06 



Protein name 



Locus Name 



immunoreactive 53 KD antigen yulAi 



gp:Al< , i44b41 



Acc# 



AF144641 



Description 

Porpnyromonas gingivalis strain W5U 
complete cds . 



immunoreactive 53 KD antigenPG123 gene, 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 




Score 


Probability 


23£7«10_ 


_c3_345 1423 


6645 


197 


554 




345 


2.4e-3l 


Protein 


name 






Locus 


Name 


Acc# 



putative acetyltransterase 



gprSCFl 



AL117322 



Description 

Streptomyces coeiicoior cosmid Fi. 



ORF Name 



NTID 



AAID 



l±&l bbhlJ±J± l l !| 

Protein name 



5546 



NT AA 

— — Score Probability 
Length Length 





TO 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



24.2S^6.i:A...ci....i.3-b.. 



Protein name 



unknown 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



|3.ie-37 



Locus Name 



Acc# 



gp:AP07S3i7 



AF079317 



Description 

Sphingomonas aromaticivorans piasmia pNLl, complete piasmidsequence . 



430 



ORF Name 



NT ID 



NT 



AA 



AAID Length Length 



Score Probability 



Protein name 



6648 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



NT — score Probability 



AAID Length Length 



TSTT 



WTT 



5.ue-4ti 



Locus Name 



immunogenic 'Z3 JcDa lipoprotein pgj 



bp:APi41>7yy 



Acc# 



AF145799 



Description 



frorphyromonas gmgivalis strai n WbO immunogenic 23 KDa lipoproteins gene, 
complete cds . 



ORF Name 



Protein name 



— — Score Probability 
NTID AAID Length Length 



Locus Name 



Acc# 



Description 



[MO-HIT 



ORF Name 



Protein name 



vrlE protein 



Description 



NTID 



— — Score Probability 



AAID Length Length 



TAT 



9 . 5e-08 



Locus Name 



pir :Tl/ib4 



Acc# 



T17384 



431 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



24642137 tl 62 



T7T" 



2.Se-ii 



Protein name 



Locus Name 



Acc# 



putative outer membrane ponn 



Description 

Vibrio cholerae glutamyl tKNA synthetase igitxj gene, partial cas /putative 
outer membrane porin (ompA) , unknown protein, vibriobactinreceptor precursor 
(viuA) , and ViuB protein (viuB) genes, completecds ; and VibF (vibF) gene, 
partial cds. 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


246422l2J:2_llO 


1431 


6653 


301 


906 


631 


1.2e-6l 



Protein name 



Description 



Locus Name 



|sp:YbyHJ*AtJiJU 



Acc# 



031448 



HYPOTHETICAL 33 


8 KD PROTEIN IN CiLl"! 1 - 


-PURT INTERS 1(J 


REGION 




ORF Name 




NTID 


AAID 




NT 
Length 


AA 
Length 


Score 


Probability 


1±£A1B.16..±1J1& 


1432 


5654 


77 


234 






Protein name 












Locus Name 


Acc# 


Description 


















NO- HIT | 


ORF Name 




NTID 


AAID 




NT 
Length 


AA 
Length 


Score 


Probability 






1433 


6655 




186 


561 


593 


1.3e-57 



Protein name 



Locus Name 



Acc# 



mobilization protein A 



|gp:AF11^41 



AF118241 



Description 

Bacteroides tragil is mobilization protein a (mooAj gene, compietecas. 



432 



ORF Name 



NT ID 



AAID 



KPT AA 

— — Score Probability 
Length Length 



24726bW ci Jbb 



55"5F" 



T5T 



|i.0e-30 



Protein name 



Description 



Locus Name 



|sp:MTGAJ±!(JoLl 



Acc# 



P46022 



(2C 2.4.2.-) — (MONOPUNCTlO^AL tGA^fcl) 



ORF Name 



NT ID AAID 



2480*4:46 cl 2L0 



HATS 1 |5557 



Protein name 



hypothetical protein MTHB4 7 



Description 



NT 



AA 



Length Length 
\ZTS 



I2TJT" 



Score Probability 
|1.9e-06 



Locus Name 



E 



TrTK^TT 



Acc# 



A69213 



ORF Name 



Protein name 



NT 



AA 



NTID AAID Length Length 




Score Probability 



153" 



vrrr 



Locus Name 



Acc# 



Description 



ORF Name 



Protein name 



NTID 



AAID 



TZTT 



55ST 



hypotneticai protein yaaT 



Description 



— — Score Probability 
Length Length 



TUT 



ITT 



TIT 



1.7e-07 



Locus Name 



pir:C69770 



Acc# 



C69770 



433 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



255110W cl 2S2 



6660 



|2.8e-i7 



Protein name 



Locus Name 



LemA 



[gp:LMU66i^r 



Acc# 



U66186 



Description 



Listeria monocyto genes LemA (lemA) gene, complete cas, ana Lem£(iemB) gene, 
partial cds. 



ORF Name 



Protein name 



NTID 



NT 



AA 



AAID Length Length 

on 



Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



1440 



AAID 



— — Score Probability 
Length Length 



IT 



TFT 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



TS4T 



hypothetical protein HU2Fuy.J 



Description 



NT 



AA 



Length Length 
^4 



Score Probability 



X2F 



Locus Name 



Acc# 



T33369 



434 



NT 



AA 



ORF Name 



NT ID 



AAID 



12742942 t2 70 



Length Length 
TT7 



Score Probability 



Protein name 



Locus Name 



Acc# 



Description 
NO-HIT 



NT 



AA 



ORF Name 



NT ID 



AAID 



23.7.1MSa„±2...S.3. I 11337 



Length Length 



Score Probability 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



\ n 



NT 



AA 



ORF Name 



NT ID 



AAID 



3.xy.Z3.^3.s....c;z...zai 



1444 



6666 



Length Length 



Score Probability 



Protein name 



Locus Name 



Acc# 



Description 
JNO-HIT 



ORF Name 


NTID AAID 


NT 
Length 




AA 

— , Score 
Length 


Probability 




. 1445 6667 


171 


1516 237 


£>.Se-20 | 


Protein name 










Locus Name 


Acc# 


putative ECF sigma 


tactor RpoEl 








gp:AF049107 


AF049107 



Description 



Myxococcus xantnus response regulator FrzZ (trzzj gene, partialccts; alanine 
dehydrogenase (aldA) , putative ECF sigma factor RpoEl (rpoEl) , and response 
regulator homolog (frzS) genes , complete cds;and unknown genes. 



435 



ORF Name 



Protein name 



NTID 



11446 



NT 



AAID Length Length 



AA 

— Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NT 



NTID 



AAID Length Length 



AA 

— Score Probability 



T5T 



8.2e-7^ 



Locus Name 



sp:HEMN_AOUMi! 



Acc# 



067886 



Description 
OXYGEN- INflEiPiWDiiW'l eOPkO^UKPH yftlMOCjKN 11 



ORF Name 



Protein name 



NTID 



1448 



— — Score Probability 
AAID Length Length 



6670 



IT 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



1449 





425 


1281 




154 





6.$e-06 



Locus Name 



probable carooxy- terminal proteinase, Dl 



pir :T0by7b 



Acc# 



T05975 



Description 



NT 



AA 



ORF Name 



NTID 



25302 li b2 



AAID Length Length 

vn — 



Score Probability 



^77 



TUT 



Protein name 



Description 



Locus Name 



Acc# 



[NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



^7T 



Length Length 
— 



Score Probability 

o.oobb 



Protein name 



Locus Name 



integrase 



gp:HlVUby223 



Acc# 



U69223 



Description 

flIV-1 strain CMR'2 1$ from Cameroon integrase (poij gene, partialcas, 



ORF Name 


NT 

NTID AAID Length 


AA 

— , Score 
Length 


Probability 




... 1452 ££?4 


£7£ 111 


6.4e-06 


Protein name 




Locus Name 


Acc# 



137 protein 



pir :£>C4110 



PC4110 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



\A±±±05h...a±..J.±l 



Length Length 
T2¥5 



Score Probability 
S.Oe-24 



2^ 



Protein name 



Locus Name 



nypotnetical protein 



gp:AF , 14^^bl 



Acc# 



AF149851 



Description 

Pseudomonas sp. KC hypothetical proteins, metnaiiotnionein-liKeprotein, 
MoeB-like protein, putative proteins, hypotheticalprotein, putative 
oxidoreductase, and putative AMP ligase (entE) genes, complete cds; and 
putative receptor gene, partial cds. 



437 



NT 



AA 



ORF Name 



NT ID 



14720187 t'l yy 



MET 



AAID Length Length 
B52 



Score Probability 
|7.ie-96 



Protein name 



Locus Name 



spiISTliJiAcMi 



Acc# 
Q45120 



Description 

ItiSfifttlCW SfiQUElNCjiil t£2i-hliCK frtlTAl'lViil Af ^-BINDING PKUTEUN 



ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


462275l_t2_101 


1455 


| 


554 


17S5 


570 


1.8e-48 



Protein name 



Locus Name 



oxaloacetate decarboxylase , sutounit aipna 
(oadA) homolog 



pir 



Acc# 



C69406 



Description 



NT 



ORF Name 



NTID 



AAID Length Length 



AA 

— Score Probability 



6678 



Protein name 



Description 



Locus Name 



Acc# 



N0-H11 1 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



— Score Probability 



\±81te.±b..±L..b± 



TTT 



14 . 6e-14 



Protein name 



Locus Name 



collagen 



[gp:AB00^9^3 



Acc# 



AB008933 



Description 

Hydra vulgaris HT2 mKJMA lor collagen, partial cas. 



438 



ORF Name 



517715V t'A 8ti 



Protein name 



NT AA 

— — Score Probability 
NT ID AAID Length Length 



TI"53~ 



T2T 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID AAID Length Length 
fcZZI 1 153 I 



Score Probability 



Locus Name 



Acc# 



Description 



[WO -HIT 



ORF Name 



Protein name 



NTID 



1460 



AAID 



6£82 



— — Score Probability 
Length Length 



Locus Name 



Acc# 



Description 



NO -HIT 



ORF Name 



Protein name 



NTID 



AAID 



1461 



hypothetical protein 



Description 



— — Score Probability 
Length Length 



Locus Name 



|pir:B72i08 



4.8e-ly 



Acc# 



B72308 



ORF Name 



Protein name 



NTID 



1462 



AAID 



6684 



NT 



Length Length 



AA 

— Score Probability 



Locus Name 



Acc# 



Description 



[NO-HIT 



439 



ORF Name 



5025010 c3 333 



Protein name 



NTID 



16685 



NT 



AA 



— Score Pr obability 
AAID Length Length 



7T 



TIT 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



6686 



NT 



AA 



Length Length 
T5TI — 



Score Probability 



Locus Name 



Acc# 



Description 



NO -HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT 



Length Length 
T7S 



AA 

— Score Probability 



TIT 



Locus Name 



Acc# 



Description 
NO-HIT 



ORF Name 



\l&5.112.±L..bA 



Protein name 



NT 



AA 



NTID AAID Length Length 



Score Probability 



11455 



I55S8 



|2.7e-20 



Locus Name 



|sp:l l k!OAJil<JoLl 



Acc# 
P13036 



Description 

IRON (111) DIClTkATJa TRAUriJb>ok T PkOTHIN PECA kk^iJUk^uk 



440 



ORF Name 



NTID 



AAID 



15805290 cl 33 



Protein name 



glycine -ricn protein tclone wiu-1) 



Description 



— — Score Probability 
Length Length 



Locus Name 



foir:Si4<rtJ2 



!2.8e-0b 



Acc# 



S14982 



ORF Name 



13.6.5.<ib.<n...al...3.Z.. 



Protein name 



NTID 



AAID 



6690 



NT 



AA 



Length Length 
TTS2 



Score Probability 



TST" 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NT 



AA 



NTID 



afiatmi-ci^jLu 



AAID Length Length 



879 



Score Probability 
|5.5e-0b 



130 



Protein name 



Locus Name 



membrane glycoprotein 



gp:DBb7JJ 



Acc# 



D88733 



Description 

fiquine herpesvirus 1 tMK for membrane glycoprotein, complete cas. 



ORF Name 



NTID 



— — Score Probability 
AAID Length Length 



\2&A2b:±lb....z2..M. I [I*7Tr 



Protein name 



Description 



Locus Name 



Acc# 



IMC- Hi T 



441 



NT 



AA 



ORF Name 



NT ID 



|34£<d630^ c3 43 



[XT7T 



AAID Length Length 
— 



HT5B" 



Score Probability 
I3T3 



|1.3e-^0 



Protein name 

immunoreactive 53 KD antigen vuias 



Locus Name 



lgp:AF144641 



Acc# 



AF144641 



Description 

frorphyromonas gmgivalis s train WbO immunoreactive S3 JcD antigenPGi^i gene, 
complete cds . 



NT 



AA 



ORF Name 



807033 ci 29 



NTID AAID Length Length 

"3T3 



Score Probability 



T47? 



^4" 



urnr 



Protein name 



Description 



Locus Name 



Acc# 



[NO -HIT 



ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


Hfllfili9.1...c2L..AU 


1473 




66$$ 


344 1035 


452 


l . le-42 



Protein name 



Description 



Locus Name 



sp: YF23_hak±jn 



Acc# 



P44243 



HYPOTHETICAL i>kuTEI N HI1!^3 



NT 



ORF Name 



NTID 



AAID 



±43A83.bA.±2...±l 



Length Length 
EST 



AA 

— Score Probability 



27 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



442 



ORF Name 



NT ID 



— — Score Probability 



14650012 iJ> 'lb 



XT75" 



AAID Length Length 



XTT 



"73 



Protein name 



Locus Name 



glucosidase 11 JDeta-sununit 



[gp:AV06feUTT" 



Acc# 



AF066061 



Description 



Mus musculus gluc osidase tl beta-subunrt gene, alternatrveiysplrcea 
products, partial cds . 



ORF Name 



15535^61 cl 28 



Protein name 



NTID AAID Length Length 

33T 



XT7S 



— — Score Probability 



xxr 



Locus Name 



Acc# 



Description 



MO -HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



T47T 



Length Length 



Score Probability 
|6.8e-4i 



[¥54 



Locus Name 



Isp : CliHjJLol>E 



Acc# 
P54965 



NT 



AA 



ORF Name 



NTID 



[X1T75~ 



AAID Length Length 
1X53 



Score Probability 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



443 



ORF Name 



Protein name 



NTID 



TT7T 



NT 



AA 



AAID Length Length 
F7UI — 



Score Probability 



141 



WIT 



Locus Name 



Acc# 



Description 



K0-H1T 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



T3W 



6702 



T7T 



TUT 



4.0e-3& 



Locus Name 



sp:S0J_BA<J^U 



Acc# 



P37522 



SOJ SfcOTUiN 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



FTuT" 



Locus Name 



hypothetical protein F20D1O.23U 



pir :T0b6^b 



ACC# 



T05638 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



FTuT" 



522 



T5W 



TT5~ 



2.8e-10 



Protein name 



Locus Name 



endo-xylanase nomoiog PCZAibi.14 



pir :T17480 



Acc# 



T17480 



Description 



ORF Name 



Protein name 



NTID 



— — Score Probability 
AAID Length Length 



\5M3±&L±1^1. I 



F7T 



Locus Name 



Acc# 



Description 



NO-HIT 



444 



NT 



AA 



ORF Name 



NTID 



6838437 ±1 i 



[1484 



AAID Length Length 
— 



Score Probability 
0.0024 



Protein name 



Locus Name 



outer membrane protein 



gp:BNRoMWi 



Acc# 



L77614 



Description 



Bacteroides thetaiotaomicron outer membrane protein ^susD) gene, complete 
cds . 



ORF Name 



10547256 r2 b 



Protein name 



NTID AAID Length Length 
— 



T4^ 



— — score Probability 



246 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



NTID 



1486 



nypotnetical protein aq_ioi8 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



1ST 



Wo" 



Locus Name 



pir:H7038V 



7.5e-05 



Acc# 



H70387 



ORF Name 



2I1.7.B.41..±l...l 



Protein name 



NTID 



AAID 



— — Score Probability 
Length Length 



ITT 



Locus Name 



Acc# 



Description 



NO-HIT 



445 



ORF Name 



NTID 



— — Score P robability 
AAID Length Length 



6710 



Protein name 



surlace exclusion protein sepl precursor 



Description 



HIT 



Locus Name 



10.023 



Acc# 



S72375 



ORF Name 



ai4im:L.ci.„iu.. 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 



Score Probability 



73T 



Locus Name 



Acc# 



Description 



NO -HIT 



ORF Name 



Protein name 



Iructanase 



Description 



NTID 



^7T2~ 



NT 



AA 



AAID Length Length 



Score Probability 



TT7F" 



Locus Name 



jpir:A3byiS 



fl.3e-14i 



ACC# 



A36915 



ORF Name 



|8.5LaiZ...cL...ll.. 



Protein name 



Description 



NT 



NTID 



AAID Length Length 



AA 

— Score Probability 



7^T 



2T7T 



S.5e-23l 



Locus Name 



|gp:BNRSCjRL 



ACC# 



M83774 



Bacteroides tragilis levanase [scrL) gene, complete cas . 



ORF Name 



NTID 



— — Score Probability 
AAID Length Length 



11452 



16714 



flTT 



EST" 



Protein name 



Locus Name 



Acc# 



Description 



MO-Hl'l' 



446 



ORF Name 



NTID 



NT AA ^ , n . , . 
T — T — _ Score Probab ility 
AAID Length Length 



15735882 13 5 



T1~9T 



TZUT 



Protein name 



Locus Name 



renin -binding protein-related protein :proteln 
slr!975 :protein slr!975 



pir:S7B649 



Acc# 



S75649 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



26.B.125.6.6....t3...A 



Length Length 
|73 



Score Probability 



TTT 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



\1±!B±..±1..£. ! 11*35 



Length Length 



Score Probability 
2Su 



S.7e-23 



Protein name 



hexuronate transporter nomolog yjmG 



Locus Name 
jpir:A69353 



Acc# 



A69853 



Description 



ORF Name 



Protein name 



NT 



AA 



NTID AAID Length Length 

— 



143" 



Score Probability 
TZZ 



Locus Name 



N-acetylneurammate lyase 



gp : CPMANA 



Description 



i.4e-0? 



Acc# 



Y12876 



C.pertringens gene encoding N-acetylneurammate lyase and twopartial open 
reading frames. 



447 



ORF Name 



NT ID 



AAID 



NT AA 

— . — ^, Score Probability 
Length Length A - 



5117337 t2 2 



F7T3" 



|4.4e-iS 



Protein name 

Description 
HYPOTHETICAL PrOTeJM H10227 



Locus Name 



ppT 



:!fflCH HAE1N 



Acc# 



P44583 



NT 



AA 



ORF Name 



NTID 



781932 13 7 



1498 



AAID Length Length 
[F7ZI3 — 



Score Probability 
F¥3 



3 .ie-53 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



Description 



pir : JC6027 



Acc# 



JC602 7 



ORF Name 



NTID 



LlZD.a^2L...a3....3.L | 11499 



Protein name 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



7T 



TIT 



Locus Name 



Acc# 



Description 
feto-HlT 



ORF Name 



IMD.23.MZL..C.3....3.2.. 



Protein name 



NTID 



AAID 



6722 



metaJoolite transporter nomolog ytnA 



Description 



NT AA 

— , — , Score Probability 
Length Length <L - 



i.5e-6i 



Locus Name 



pir :D69814 



Acc# 



D69814 



448 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
J51 1 



6723 



Score Probability 
5.ie-i4 



Protein name 



Locus Name 



alpha-N-acetylgiucosaminiaase 



gp:NTAltJ2Uy 



Acc# 



Y18209 



Description 

Nicotiana tabacum mRlsfA tor aipna-N-acetylglucosaminidase . 



ORF Name 


NTID AAID 




NT 
Length 


AA 
Length 


Score 


Probability 


2«8l53:S_c2_26 


1S02 1 6«4 




202 609 


327 


3.5e-^9 


Protein name 


Locus Name 


Acc# 


probable cationic ammo acia transporter 


pir:T34694 


T34694 


Description 














ORF Name 


NTID AAID 




NT 
Length 


AA 
Length 


Score 


Probability 





1503 672b 




432 1299 


195 


"J 1.9e-12 


Protein name 


Locus Name 


Acc# 


j lmmunoreactive 52KD 


antigen PG41 






gp:At'i7bVlb 


AF175716 


Description 




Porphyromonas gingivalis strain Wbo 
complete cds. 


immunoreactive b2KL> anuigen^x ywie, 




ORF Name 


NTID AAID 




NT 
Length 


AA 
Length 


Score 


Probability 


lO.S.lBA&.l^X.-X.^L 


1504 6126 




446 


1341 


616 





Protein name 



Description 



Locus Name 



sp:ANAti_HUMAU 



Acc# 



P54802 



GLUCO^AMlMbA^) (NAcj) 



449 



NT 



AA 



ORF Name 



NTID 



10580062 c2 yi 



AAID Length Length 
5C3 



6727 



Score Probability 
8 . Oe-24 



279 



Protein name 



Locus Name 



6 0KDa protein 



gp:AB004b60 



Acc# 



AB004560 



Description 



Porphyromonas gingivalis DNA tor 60JcDa protein, complete cas . 



NT 



AA 



ORF Name 



NTID 



13175950 c2 yi 



AAID Length Length 
£T5 



Score Probability 



7T 



Protein name 



Description 



Locus Name 



Acc# 



NT 



ORF Name 



NTID 



AAID Length Length 



AA 

— Score Probability 



|13.S.^$.0.0.0....c2...:/.B. 



1507 



i.6e-i09 



Protein name 



Locus Name 



I15R outer membrane protein precursor : buse! 
protein 



bir:JC!6U27 



Acc# 



JC6027 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 

mi — 



Score Probability 



6730 



£1T 



Protein name 



Description 



Locus Name 



Acc# 



450 



ORF Name 



NT ID 



— — Score Probability 



AAID Length Length 



1464780* ±2 20 



6731 



VST 



4 . 3e-Ub 



Protein name 



Locus Name 



Acc# 



|sp:YDlJ^_iil(JuLl 



P77402 



Description 

gVPOTflE^lClAL I'kANSCjRlPflOriAb KEfltttA'l'OK. IN AROD-PPS INTlilkGlKisllO KfcXJlud 



ORF Name 



114660852 cl 74 



Protein name 



NTID 



— — Score Probability 
AAID Length Length 



W7TT 



[2UT 



WIT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name NTID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


1$&1$M:Z.±±...±& 1511 


6733 


96 291 /b 


0.064^ | 


Protein name 




Locus Name 


Acc# 


hypothetical protein cU4UUb 


pir:SVb372 


S75372 


Description 










ORF Name NTID 


AAID 


NT 
Length 


AA 

— Score 
Length 


Probability 




20.a0.7.m...c3....1i).2 1512 


6734 


^El 1404 


I.7e-12 





Protein name 



Locus Name 



transposase 



|gp:AP03B8bb 



Acc# 



AF038866 



Description 



Bacteroides tragilis trans poson Tnb520 transposase iDipH) anamobilization 
protein BmpH (bmpH) genes, complete cds . 



451 



ORF Name 



22683287 rJ 



Protein name 



NTID 



NT 



AA 



AAID Length Length 
TSFT — 



Score Probability 



Locus Name 



Acc# 



Description 



IN0-H1T 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 
— 



Score Probability 



Locus Name 



Acc# 



Description 



IN0-H1T 



ORF Name 



2.i8.3.5.I6^..±l...li... 



Protein name 



NT 



AA 



NTID 



AAID 



6737 



Length Length 
TFS 



Score Probability 
0.029 



47 



Locus Name 



hypotnetical protein tub/. 2 



pir :T2482b 



Acc# 



T24826 



Description 



ORF Name 



NTID 



2k.7.y.iMl...c3....M 



Protein name 



hypothetical protein C33G8.2 



Description 



— — score Probability 



AAID Length Length 



73T 



5 . 7e-3y 



Locus Name 



pir:T^41^7 



Acc# 



T34137 



ORF Name 



Protein name 



NTID 



— — Score Probability 
AAID Length Length 



|3.3.1.?.28.11..±3...A.7. I IT5T7 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 


IN J- J-U 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


345104i&_ci_63 


1518 


5740 


259 780 


424 




|_l . ue- ,5 y 


Protein name 








Locus Name 




Acc# 


hypothetical protein F36HI2.3 






pir:T3J4bV 




T33457 


Description 
















ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


lSli&Ril±..clJl& 


1515 


5741 


512 1539 


202 






Protein name 


Locus Name 




Acc# 


unknown 


gp:U96771 




U96771 


Description 


















Prevotelia sryantu putative 
mannanase genes, complete cds; 


polygalacturonase, tf-l, 4- 
and unknowngenes . 


enaogiucanase , emu. 




ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


lS19&^±..a2Jlb. 


. J1B20 


5742 


278 837 


450 






Protein name 








Locus Name 




Acc# 


hypothetical protein CiiGb.^: 


pir:T:U13V 




T34137 


Description 
















ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


165:i±ti.^a2^A 


. 1521 


5745 


450 


1255 


545 







Protein name Locus Name 



sp:YBDN_U<JoLl 1 P77216 

Description 

HYPOT^'llCAL 47.8 KB PRO'lKlN IN CdTA-US B<5 INTHlKCjENIC RKG1UJN 



453 



ORF Name 



33717b c'A 77 



Protein name 



NT ID 



T5ZT 



— — Score Probability 
AAID Length Length 



T75~ 



Locus Name 



Acc# 



Description 



(NO-HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NT ID 



AAID Length Length 



Score Probability 



7^" 



2358 



i.3e-0a 



Locus Name 



sp:PVDA_ytlky*l 



Acc# 



P46360 



PESTlcilN kfiCfclkTOR pkECUksok ( Iftfrc!) (iPkbb) 



ORF Name 



Protein name 



NTID 



— — Score Probability 
AAID Length Length 



I [1^4 



IT3BT 



HIT 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



saaam^ci™kbL 



Protein name 



NTID 



— — Score Probability 
AAID Length Length 



16747 



[T7T 



153?" 



Locus Name 



|sp:YBDMJM)Ll 



Acc# 
P77174 



Description 

HYPOTHETICAL 2 J . b> KT> PROTEIN IN (JriTA-D SMcj INTEkGENic Ktxiiud 



454 



ORF Name 



NTID 



AAID 



Probability 



64850i>i> c2 yy 



Protein name 



1526 



6748 



Locus Name 



Acc# 



Description 



NO -HIT 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



ifti2a.7...±i...aa | 



1527 



i.5e-4b 



Protein name 



Locus Name 



immunoreactive blKD antigen PUb2 



IgpiAPlVbViy 



Acc# 



AF175719 



Description 



igenPG52 gene, 



Porphyromonas gingivaiis strain WbU immunoreactive biKJJ ant 
complete cds . 



ORF Name 



H5..-?.6.b.b.2....ci....lB.y... 



Protein name 



NT 



NTID AAID Length Length 



AA 

— Score Probability 



6750 



FT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



AAID 



l2.3.Mlia..±1...2 



— — Score Probability 
Length Length 





3 . le-13 



Protein name 



Description 



Locus Name 



sp:BGALjm±iWT 



Acc# 



P77989 



BETA- GALACT05 IDA^ , ( LAtJTAriE ) 



455 



ORF Name 



NT ID 



— — Score Probability 
AAID Length Length 



|13704bb2 ci l2y 



ji.5>e-143 



Protein name 



Locus Name 



Acc# 
083351 



Description 

6-M0SPhO(JLucJONAT4 D£liiYt)kOC^NAS S, DE(JARBUXYLAT1JNU , 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


l375S53o__c3_190 


1551 


5753 


135 


411 






Protein name 








Locus 


Name 


Acc# 


Description 














NO-HlT 1 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


iaamai^„.ai...iafiL 


1532 


5754 


74 


225 


77 


" 0.0055 



Protein name 



Locus Name 



putative signal transduction protein uarA 



gp:AFl73844 



Acc# 



AF173844 



Description 

Mycobacterium smegmatis garA- containing gene cluster, paruiai sequence - 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



TZJT 



7.2e-32 



Protein name 



Locus Name 



cytochrome a oxidase suJDunit n 



gp:AF00150i 



Acc# 



AF001503 



Description 

Salmonella typhi murium cytochrome cl oxidase susumt I icydAj an 
d oxidase subunit II (cydB) genes, complete cds . 



dcytocnrome 



456 



ORF Name 



NT ID 



NT AA 

— — Score P robability 
AAID Length Length 



1444627 cl 132 



55" 



0.035 



Protein name 



Locus Name 



riJDOsomal protein Sb 



gp-.UtW'Ub 



Acc# 



U87145 



Description 



toxoplasma gondii cnioroplast, complete genome. 



ORF Name 


NTID AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


l644l305J:3_103 


1535 6757 


236 


711 


244 


1.2e-20 



Protein name 



Locus Name 



hypothetical protein £2381 



pir :k£30l2 



Acc# 



B65012 



Description 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


16.5.1.7.i^b...±1...10.1 


..... 1536 


6758 


450 


13b3 


717 


9.2e-7i 



Protein name 



Locus Name 



nypotnetical protein 



|pir:iJ76y4b 



Acc# 



S76946 



Description 



ORF Name 



Protein name 



p20-CG<4M> 



NTID 



— — Score Probability 
AAID Length Length 



liMm...al...ilO. I 



10,0078 



Locus Name 



gp : HSCUCiBf 



Acc# 
AJ000258 



Description 

Homo sapiens trinucleotide re peat S-d(C(M)n-3ds binding proteinp^-^F. 



457 



ORF Name 



NTID 



NT AA 
_ — ^, _ — _ Score Probability 
AAID Length Length L 



19687836 f3 87 



1290 



6 .4e-95 



Protein name 

Description 
HY^OTiffiriCAL SftOTfilM HI1590 



Locus Name 



:VCAJ HAEIN 



Acc# 



P45262 



NT 



AA 



ORF Name 



NTID 



AAID 



2068766 cl 143 



Length Length 



157$ 



Score Probability 
3.9e-lll 



1098 



Protein name 



Description 



Locus Name 



sp : flVDA_A20Vl 



Acc# 
Q09049 



CYTOCHROME D UBI0UINOL OXIDASE SUBUNIT I , 



NT 



AA 



ORF Name 



NTID 



AAID 



2Lft7.a5azft...c2...iaa i [x^ro 



6762 



Length Length 



Score Probability 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length JL 



\220A111..±2..JA, 



11341 



Protein name 



Locus Name 



RumB (R3 91) 



gp:XXU136^3 



Acc# 



U13633 



Description 

incJ plasmid R3 91 rumA(R3 91) and rumB(R3 91) genes, complete ccLs. 



458 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



22273212 ci lil 



1ST 



TTTe^ZT 



Protein name 



Locus Name 



urea transport protein 



[gpi&E'ibVbTT 



Acc# 



AF167577 



Description 

Actinobaciilus pleuropn eumoniae transcriptional regulator iapuR)gene, 
partial cds; and putative periplasmic binding protein (cbiK) , putative 
cytoplasmic membrane protein (cbiL) , cobalt membranetransport protein 
homolog (cbiM) , cobalt membrane transport proteinhomolog (cbiQ) , cobalt 
transport ATP-bindina protein homoloa (cbiO) , and urea transport protein 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


225^2^2^1^ 1543 


6765 


354 


iOSO 


184 


2.2e-12 


Protein name 








Locus Name 


Acc# 


molyPdate metabolism 


regulator 




"] pir:B64yVy 


B64979 


Description 














ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 




22A4X3X)Jb™±2...J&2 


1544 


6766 


258 


777 


651 


9.le-64 




Protein name 








Locus Name 


Acc# 


1 ABC transporter, ATP-binding protein 




1 pir:H72i8b 


H72385 


Description 














ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— Score 
Length 


Probability 


2Z$&$M.Q.J.1^2& 


1545 


6767 


105 


jjis 







Protein name 



Locus Name 



Acc# 



Description 



[NO-iilT 



459 



NT 



AA 



ORF Name 



NTID 



AMD Length Length 



Score Probability 



24100i>6b cJ 1«8 



TOT 



1.5e-m 



Protein name 



Locus Name 



|sp:G6Pli_At!TAU 



Acc# 



P77809 



Description 

<3LUC0SK-fo-PHQa^HAl'B l-Dlilk*DkOaBtiASh!, (GbPD) 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
2Tu 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



\lA6Al±M...a±^± I 



F77TT 



JU7T 



0.0017 



Protein name 



hypothetical protein MTHib/ 



Locus Name 
pir :A6yi4b 



Acc# 



A69146 



Description 



ORF Name 



Protein name 



NTID 



2.5.S.1.7.0.13...±l...l 



putative secreted joeta-gaiactosidase 



Description 

Streptomyces coelicolor cosmicL F8I. 



NT 



AA 



AAID Length Length 



Score Probability 



WW 



T5T 



Locus Name 



gprSCFSl 



3 . Be-3b 



Acc# 



AL133171 



460 



ORF Name 



25667675 C3 205 



Protein name 



NT ID 



NT 



AA 



AAID Length Length 

rm — 



Score Probability 



ITT 



Locus Name 



Acc# 



Description 
[NO-HIT 



NT 



AA 



ORF Name 



NT ID 



AAID 



Z^.7,aai6....t3....1QQ.. 



TE3T" 



TT7T 



Length Length 
WIT 



11256 



Score Probability 
2.5e-23 



[326 



Protein name 



Locus Name 



probable membrane protein b08 78 



pir :F64826 



Acc# 



F64826 



Description 



ORF Name 



NTID 



NT AA 
_ — _ _ — Score Probability 
AAID Length Length 



3.17.6.25.3.7....11...2;!.. 



irrr 



2.0e-17 



Protein name 



Description 



Locus Name 



Acc# 



sp:YEfflJ_E<»LI 



ORF Name 



NTID 



NT AA 

— , — t Score Probability 
AAID Length Length JL 



I3.iaaa3.aa...c2...i5.st i 



8.6e-36 



Protein name 



Locus Name 



probable glucose- 6 -pnospnate 1-d.enycLrogenase 



pir:<?7l3l$ 



Acc# 



C71319 



Description 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



TZZT 



Locus Name 



Acc# 



Description 
NO-HIT 



461 



ORF Name 



343826«7 cJ lyj 



Protein name 



NTID 



F777 



NT 



AA 



AAID Length Length 
— 



Score Probability 



KIT 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



NTID 



NT 



AA 



AAID Length Length 
¥77 



Score Probability 



T5B" 



Locus Name 



Acc# 



Description 



[NO -HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



iI0.!S.M...cl...m 



1557 



9.4e-08 



Locus Name 



conserved hypotnetical protein AF0444 



|pir:D6yiub 



Acc# 



D69305 



Description 



ORF Name 



NTID 



AAID 



— — S core Probability 
Length Length 



15bl 



4 . 3e-56 



Protein name 



Locus Name 



probable glutamate/ aspartate transporter 



tair:<5Vi:i09 



Acc# 



G71309 



Description 



ORF Name 



Protein name 



ftumA(&3dl) 



NTID 



— — Score Probability 
AAID Length Length 



H555" 



[TT5" 



[4^TT 



|Tu4~ 



|£.4e-2? 



Locus Name 



IgprXXUl J 6 33 



Acc# 
U13633 



Description 

IncJ plasmid runA(R3frL) and rumBlR3^ij genes, complete cas . 



462 



ORF Name 



Protein name 



Description 



NTID 



AAID 



— — Score P robability 
Length Length 



I3T5" 



9b0 



Locus Name 



|1.7e-SJ 



Acc# 
Q59516 



fcfiDuC'l'ASE) (liPR-A) 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



72677S7 cl 133 



Protein name 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



— — Score Probability 
AAID Length Length 



miox±:L.a±...x&b. ..J IT557 



6784 



Locus Name 



|5.1e-0$ 



Acc# 



Description 



|gp:A£sOl6260 



Agrobactenum tume taciens piasmrd pTi-SAKUKA, complete sequence. 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



3.9.220.b.7....a3....ZUb.., 



6785 



TZVT 



[5.0e-26 



Protein name 



Locus Name 



Acc# 



coproporpnynnogen oxidase, III , 
oxygen- independent hemN 



pir:Bbyb4U 



B69640 



Description 



463 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



13869003 t3 21 



1564 



535" 



1608 



or 



0.0010 



Protein name 



Locus Name 



glycoprotein Vp2 6 0-l±Jce protein A18L 



pir :T17508 



Acc# 



T17508 



Description 



NT 



AA 



ORF Name 



NTID 



23.&6.;U.8.1...a2..3.8..... 



T5S5" 



AAID Length Length 

r7F7 — 



T225" 



Score Probability 
7T7 



6.ie-74 



Protein name 



Locus Name 



metabolite transport protein homolog ywtG 



Description 



pir :E70070 



Acc# 



E70070 



NT 



AA 



ORF Name 



NTID 



AAID 



2.S.63.23.43....C1...2.3. I [X55F 



Length Length 
TS4~ 



Score Probability 



555" 



Protein name 
Description 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



&6.16aOaO..±±..£. I 11557 



6789 



Length Length 



Score Probability 



Protein name 

Description 
IMO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA , ^ . n . 
— , — , Score Probability 
Length Length *~ 



3.3.10.D.2.6.D....X.3....Z3.. 



T5SF" 



5TJJ" 



1512 



124" 



0.0002$ 



Protein name 



Locus Name 



STARP antigen 



gp : PFSTARP 



Acc# 



Z26314 



Description 
P * falciparum gene tor STARP antigen. 



464 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length ^ 



751405 c3 44 



T557" 



7TT 



7TT 



rrr 



0.026 



Protein name 



Description 



Locus Name 



|sp:ATPS_ACACA 



Acc# 



Q37385 



At£> Synthase a CtiAitf, {teOKitf 5) 



ORF Name 



9862501 C3 41 



Protein name 



NTID 



157TT 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



TUT 



TFT 



Locus Name 



Acc# 



Description 
NO-HIT 



ORF Name 



NTID 



NT AA 
T — _ T — Score Probability 
AAID Length Length 



\t&6£A0.$±.±l..J. I ITS7T 



S43~ 



Protein name 



Description 



Locus Name 



gp : pGpSaaGen 1 



Acc# 



X95938 



P.gingivalis rnnB & pgaA genes & oris 15 0, 197, 2 02 & 199. 



NT 



AA 



ORF Name 



3.4£Q.6.5.12L.£1...4; I 



NTID AAID Length Length 

6794 



[3TT" 



Score Probability 



|4.5e-55 



Protein name 



Locus Name 



2 , 3 -.bisphospnoglycerate- independent 



gp:AF120090 



Acc# 



AF120090 



Description 



Bacillus megaterium 2, 3-JDisphosphoglycerate-independentphosphoglycerate 
mutase (pgm) gene, complete cds . 



NT 



AA 



ORF Name 



NT ID 



36155311 cl 9 



T57T 



AAID Length Length 
S7^S — 



TIT" 



Score Probability 




V.8e-42 



Protein name 



Locus Name 



pro.ba.ble transport protein 



pir:A75272 



Acc# 



A75272 



Description 



NT 



AA 



ORF Name 



NT ID 



l3.6.d3.D.1.7.5...±1...5i I 



TE7¥" 



AAID Length Length 
— 



Score Probability 



T3T 



Protein name 

Description 
[NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
1ST 



Score Probability 




|4.7e-3S 



Protein name 



Locus Name 



putative large secreted protein 



Igp : SCF12 



Acc# 



AL117669 



Description 

Streptomyces coeiicolor cosmid F12 . 



NT 



AA 



ORF Name 



NTID 



1£11.7.1S2l.£2l.5.. 



u37F" 



AAID Length Length 




FT 



Score Probability 
75 



0.042 



Protein name 



Locus Name 



Acc# 



IgpTWMM^W 



Description 

Plasmodium talciparum MAL3P7 , complete sequence. 



466 



ORF Name 



24343756 cl 7 



Protein name 

Description 
MO-HIT 





1577 




67M 




62 


185 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



4a&Qfi5Q„.tl...3L I 



ct6 02 hypothetical protein 



Description 



NT 



AA 



AAID Length Length 



Score Probability 
75 



Locus Name 
jpir:F72036 



Acc# 



F72036 



ORF Name 



|iaS.5.fl.7.6.3....c3....3.3.S.. 



Protein name 

Description 
IHO-HTT 



NTID 



AAID 



NT AA 
„ — , — , Score Probability 
Length Length 



TTFT" 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



NT AA 
_ — _ — _ Score Probability 
AAID Length Length ' L 



6802 



75T 



T5TT 



3.0e-14 



Locus Name 



putative TonB - dependent outer membrane 
receptor 



gp:AF048749 



Acc# 



AF048749 



Description 



Bacteroides rragilis capsular polysaccnande biosynthesis operon, complete 
sequence . 



467 



ORF Name 


NTID 


7\ 7\ in 
AA1JJ 


NT 

j-ieny uii 


AA 


Score 


Probability 


12ii707fe_ci_i!ll 




6S03 




204 






Protein name 








Locus 


Name 


Acc# 


Description 














NO-HIT | 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


125Miu2..±i...l5i 


1582 


6804 


954 


286S 


249 


4.1e-17 















Protein name 



Locus Name 



putative mstidine protein Kinase 



|gp:H!Ua;ib64 



Acc# 



U82564 



Description 



-like protein 
(hoxJ) gene, 



hydrogenase-like protein small subumt moxB) gene, nyctrogenase 
large subunit (hoxC) gene, and putative histidine protein kinase 
complete cds,and nickel permease (hoxN) gene, partial cds . 



ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


LltebA&^al^Al 


lS§3 


6§0S 


343 1032 




361 


4.$e-33 



Protein name 



Locus Name 



capsular polysaccnande oiosyntnesis iiomolog 
yveT 



bir:A70O37 



Acc# 



A70037 



Description 



ORF Name 



NTID 



AAID 



Protein name 



NT 



Length Length 
B3T 



AA 

— Score Probability 



75" 



Locus Name 



Acc# 



Description 



NO-HIT 



468 



ORF Name 



1290933 tJ lib 



NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


1585 


6807 


143 


432 


165 




2 . ye-12 



Protein name 



Locus Name 



Acc# 



hypothetical protein siribbl 


pir :S77uyv 




S77097 


Description 










ORF Name 


NT 

NTID AAID Length 


AA 

— , Score 
Length 


Probability 


lmiM^m... 


T586 6808 bib ibbl 207 




1 . be-iJ5 


Protein name 


Locus Name 




ACC# 


putative tlippase 


|gp:AP125164 


AF125164 


Description 












Bacteroides fragiiis 638k polysaccharide B (PS B2) siosyntnesisiocus , 
complete sequence; and unknown genes. 




ORF Name 


NT 

NTID AAID Length 


AA 

— Score 
Length 


Probability 


±l010±M...alJia.2 


1587 6809 ISO 453 i/s 






Protein name 




Locus Name 




ACC# 


hypothetical protein l 


pir :S2867B 




S28678 


Description 










ORF Name 


NT 

NTID AAID Length 


AA 

— , Score 
Length 


Probability 


±ll£AAl...cx±Jl&i 


1588 6810 354 1U65 byy 






Protein name 




Locus Name 




Acc# 


mannose - i -pnospnat e 


guanyiyltranst erase 


pir:H723ui 




H72303 



Description 



469 



NT 



AA 



ORF Name 



1456713b cl 2Ul 



NTID AAID Length Length 

— 



[753 1 FITTu" 



Score Probability 
10.00040 



TIE 



Protein name 



Locus Name 



immunoreactxve 43KU antigen PG3Z 



bp:Afi7b714 



Acc# 



AF175714 



Description 

Porphyromonas gmg ivaiis strain WbO immunoreactive 43KD antigenPG32 gene, 
complete cds 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 




J1590 


6S12 | 


5S3 


\11S2 


13$ 


2.5e-06 



Protexn name 



Locus Name 



hypothetical protei n ^AcJi'/cib ■ isc 



|pir:T378bl 



Acc# 



T37851 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



|7.6e-ia 



Protein name 



Locus Name 



hypothetical protein siribbb 



|pir:£J77uyJ 



ACC# 



S77093 



Description 



ORF Name 

i5&m&tt..±l...a.. 



Protein name 



Description 



NTID 



NT 



AA 



AAID Length Length 



— Score Probability 



Locus Name 



sp:CAPDj^TAAU 



Acc# 



P39853 



CAPt) PROTEIN 



470 



ORF Name 




"NTTTTl 
IN JL J-U 


AAID 


NT 
Length 


AA 

— Score Probability 
Length 


15797007_c2_274 








383 


1152 322 b.be-zy 


Protein name 












Locus Name Acc# 


CpslK 


|gp:AF15b&04 AF155804 


Description 




Streptococcus 
(cps2F) , CpslG 
genes, complete 


suis strain 6bb5 CpslE (cpslEJ 
(cpslG) , CpslH (cpslH) , CpslI 
cds; and CpslK (cpslK) gene, 


gene, partial cas ; ups2f 
(cpsll) , andCpslJ (cpslJ) 
partialcds . 




ORF Name 




NTID 


AAID 


NT 

Length 


AA 

— Score Probability 
Length 


l5S22Su7_tl_2 




1594 


£§l£ 


549 


16bU 


Protein name 












Locus Name Acc# 


Description 














NO -HIT 














ORF Name 




NTID 


AAID 


NT 
Length 


AA 

— Score Probability 
Length 


iiiaai..±i...n 


1595 


6$l7 


67 


T04 49 U.UJ/ 



Protein name 



Locus Name 



probable fttfA- directed DNA polymerase, : reverse 
transcriptase 



pir :S2UUlb 



Acc# 
S20016 



Description 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 
T471 



Score Probability 



Locus Name 



Acc# 



Description 



471 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



1597 



155" 



1360 



5.2e-E4 



Protein name 



Locus Name 



iolylpolyglutamate synthase/ dinyarotoiat e 
synthase 



|pxr:liVi4411 



Acc# 



D72411 



Description 



NT 



ORF Name 



NTID 



nismziciim: 



AAID Length Length 
£F7 



AA 

— Score Probability 



6820 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



l±6.5±b.^L.al..;±l& 



Length Length 
TuT5 — 



Score Probability 
|2.2e-0b 



128 



Protein name 



Locus Name 



hypothetical protein RPi^Jtf 



pir:DVlbyu 



Acc# 



D71690 



Description 



NT 



AA 



ORF Name 



NTID 



|23A5.5.0L:/.:/....ci...j.4.y... 



AAID Length Length 



Score Probability 



421 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



472 



ORF Name 



NT ID 



NT AA 

— — Score P robability 
AAID Length Length 



7ST 



|2.$e-2« 



Protein name 



Locus Name 



putative tIDP-Jsr-acetyl-D-mannosamine 
transferase 



gp:^J>U0y23y 



Acc# 



U09239 



Description 

Streptococcus pneumoniae type 19F capsular polysacchandeJDiosyntnesis 
operon, (cpsl9f ABCDEFGHI JKLMNO) genes, complete cds,and aliA gene, partial 
cds . 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


23632S02_i3_l46 


1602 


6S24 | 


270 813 


412 


l.Se-38 


Protein name 








Locus Name 


Acc# 










gp:AB00&550 


AB008550 


Description 














Pseudomonas aeruginosa pnage pni uta, 


complete genome 


sequence 




ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


21TJA^.±l..±b.l 


1603 


6825 


64 19b 


219 


I.3e-17 


Protein name 








Locus Name 


Acc# 


putative aminotransferase 


gp:A£ , 12bib4 


j AF125164 


Description 




Sacte'roides tragiiis polysaccharide B (PS B'2) oiosyntnesisiocus , 
complete sequence; and unknown genes. 




ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


2^ftiasLii..±i-ii4 


1604 


6S26 


251 756 


375 


1.6e-34 



Protein name 



Locus Name 



sp:YACO_BACJSU 



ACC# 



Q06753 



Description 

HYPO T HETICAL TkUA/l^IA MLi ' l ' ky LTRMJaFiiikASB VA^U, 



473 



ORF Name 



24412547 ti 20 



Protein name 



NT ID 



NT 



AA 



AAID Length Length 
IS3T 



Score Probability 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



NT ID 



AAID 



— — Score Probability 
Length Length 



F7S~ 



10. 04^ 



Protein name 



Description 



Locus Name 



sp:Y2iSJ4ETJA 



Acc# 
Q57687 



NT 



ORF Name 



NTID 



AAID Length Length 



— Score Probability 



T5T 



10.026 



Protein name 



Description 



Locus Name 



|sp:fLM_HAOfcW 



Acc# 



P39740 



F LAGULLAk PkOTUlM 1?'L1T 



ORF Name 



NTID 



NT — Score Probability 



AAID Length Length 



sir 



Protein name 



Locus Name 



Acc# 



Description 



MO-HIT 



474 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



TSUT 



AAID Length Length 
TUT 



Score Probability 



Locus Name 



Acc# 



[NO-HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID AAID Length Length 

7£5 



Score Probability 



TZTU 



TIT- 



LOCUS Name 



Acc# 



MO-HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID AAID Length Length 

TIT 



Score Probability 



TUTT 



TTTT 



Locus Name 



Acc# 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID AAID Length Length 





Score Probability 



15834 



Locus Name 



Acc# 



INO-HIT 



475 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



I24694ittv ci lyy 



Protein name 



Locus Name 



lacunin 



gp:AP0VSlbl 



Acc# 



AF078161 



Description 

Manduca sexta lacunin mKNA, complete cas . 



ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


24S5400^_t2_l31 


1614 




403 1212 


§42 


5.2e-&4 



Protein name 



Locus Name 



pantothenate metabolism tiavoprotem arp 
homolog ylol : probable aspartate 
1 -dficarb nyvl ase activase 



pir:£>6y878 



ACC# 



D69878 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
\T3Z% 



Score Probability 



455 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



\is.B.mi^...±i...ii). I rcrre 



NTID AAID Length Length 

— 



HUT 



Score Probability 
mi 



|4.0e-86 



Protein name 



Locus Name 



methylmalonyl-UoA decarboxylase, beta-suDunit |gp : pmaj zUTE 



Acc# 
AJ002015 



Description 

Propionigenium mod estum mmdD, mmd£, mmclB genes and partial mmOAgene. 



476 



ORF Name 



NTID 



— — Score Probability 



256254^ ci 190 



TSTT 



AAID Length Length 
TTTJ 



IW 



0.0069 



Protein name 



Locus Name 



Acc# 



transmembrane protein 



gp:Y3m i M 



L11895 



Description 

Saccharomyces cerevisiae putative transmembrane protein tPTMl) gene, 
complete cds . 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


P r obab i 1 i ty 
l. — T^TTfT 


262l0302JtlJLO 


l6l8 


6840 393 1182 






Protein name 








Locus Name 


Acc# 


sensory transduction system reguiauory 
protein slrl983 :protein slrl983 :protein 




pir:S?5664 


S75664 
























Description 














ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


2.5.16.6.112....C2...2B.:/ 


1619 


6841 


159 


480 






Protein name 








Locus Name 


Acc# 


Description 














NO-HIT 1 


ORF Name 


NTID 


AAID 


NT 
Length 


— Score 
Length 


Probability 


26A6MD.U^12^8.i 


1620 


6842 


134 


4175 




0.0067 


Protein name 








Locus Name 


Acc# 



positive regulator tor virulence ractors 



[gp : CLOOkVl 



D14877 



Description 

Clostridium perrr ingens virft gene tor positive regulator rorviruience 
factors, complete cds. 



477 



NT 



AA 



ORF Name 



NT ID 



— Score Probability 
AAID Length Length 



TZTT 



TZ1T 



Protein name 



Locus Name 



Acc# 



nypotneticai protein 


AF0417 






pir:A6y3U2 


A69302 


Description 


ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


26A&mi..±3...1M 


1622 


6844 


193 buz 






Protein name 








Locus Name 


Acc# 


Description 














NO-HIT | 


ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


16&±b&V±...a±..±&± 


1623 


6845 


190 573 


213 


|2.4e-17 



Protein name 



Locus Name 



unknown 



gp:AF04874y 



Acc# 



AF048749 



Description 



feacteroides iragilis capsula r polysaccharide biosyntnesis oper on, complete 
sequence . 



ORF Name 



NT ID 



— — score Probability 



AAID Length Length 



\11L11±\L±'2...&* 



1624 



TF5TT 



|3.3e-39 



Protein name 



Locus Name 



2 1 , 3 1 -cuclic nucleotide 2 ' -p nospnodiesterase j |gp : ABU2863U 
Description 



Acc# 



AB028630 



Clostridium periri ngens hyp27, bacH, ptp, cpd genes rornypotneticai 
protein, bacterial hemoglobin, protein- tyros inephosphatase, 2', 3 ' -cuclic 
nucleotide 2 1 -phosphodiesterase, partial and complete cds . 



478 



ORF Name 



NTID 



N'T AA 

— — Score Probability 
AAID Length Length 



12548255 ±2 124 



[75" 



li.le-Ob 



Protein name 



Locus Name 



GlyA 



gp:AF13649S 



Acc# 



AF136495 



Description 

Campylobacter iarx CriyA (glyA) gene, partial cas 



ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


2«l557J:l_4 


1626 


6848 


258 


111 


217 


8 .Se-l8 



Protein name 



probable DNA pol III epsiion cnain 



Description 



Locus Name 



tpir:B71b36 



Acc# 



B71536 





ORF Name NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


« l$5£/i9±L...c.±..JAh>.. 1527 


5845 


368 1107 




1.4e-53 


?!= Protein name 






Locus Name 


Acc# 




galactosyl transferase 


gp:aWJ2iyuu4 


AJ239004 




Description 




Streptococcus pneumoniae type 8 capsular gene 


cluster . 




ORF Name NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


p 1628 


6850 


378 1137 






Protein name 






Locus Name 


Acc# 


Description 













MO-HIT 



479 



ORF Name 



Protein name 



NT ID 



1629 



— — Score Probability 
AAIP Length Length 



DNA repair protexn 



Description 



r 2W 



Locus Name 



pir:A7SJyi 



|2.7e-39 



Acc# 



A75391 



NT 



AA 



ORF Name 



NTID 



AAIP Length Length 



752 



Score Probability 
\1.6e-U 



Protein name 



Locus Name 



Acc# 



lsp:kECNJi(JoLl 



Description 

DNA Rl^AlR PkOTJbiiN RkJCN (^C OMBINATION MOTRIN M) 



ORF Name 



NTID 



AAID 



NT AA 

_ — Score 

Length Length 





6853 




151 


456 




128 





Probability 
!2.4e-08 



Protein name 



Locus Name 



Acc# 



P52620 



Description 
DNA POLYMERASE ill, kUTA cJkAI N, (kkAt>MJT) 



ORF Name 



NTID 



NT AA 

— Score 
AAID Length Length 



a3l2ilS2L.„al...iaa I 



Protein name 



Locus Name 



Probability 



Acc# 



Description 



[NO-HIT 



480 



ORF Name 



NT ID 



— — Score Probability 
AAID Length Length 



33788S&2 cl 2i2 



HIT 



Protein name 



conserved ttypotftetical protein aq_2 74 



Description 



Locus Name 



pir:OV0^2b 



i.2e-:J6 



Acc# 



C70325 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


3.ia.7.0.ill^al^ly.l 


1634 


6S56 


347 1044 


132 


7.0e-06 


Protein name 








Locus Name 


Acc# 


transmembrane prote 


m 






gp:^AJ698b 


AJ006986 


Description 


Streptococcus pneumoniae type 


33F DNA, 


capsular gene 


cluster. 




ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


ISJ&AlL^tl^Ll 


1635 


6857 


316 951 




1.3e-57 | 


Protein name 




Locus Name 


Acc# 


TOP-N- acetyl enoipyruvoylgiucosamme reductase 


gp:BPE^J08 


AJ238308 



Description 

Bordetella pertu ssis partial gene tor putative tnioesterase, tKJMA-uiy , 
dapB, omlA genes and partial fur gene. 



murB, 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 




Score Probability 



Protein name 



Locus Name 



Acc# 



Description 



[NO-HIT 



481 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



3$43§02 ±2 66 



402 



4.0e-3i 



Protein name 



Locus Name 



T^gF- 



|gp:AP0S?bb78 



Acc# 



AF095578 



Description 



Salmonella typhimun um (yjgF) gene, complete cas; ana unKnowngene. 



— — Score Probability 
NTID AAID Length Length 

mzv — 



AA 



ORF Name 



|3S>446§7 ±1 143 



EST 



1Z2T 



13 . 3e-lB 



Protein name 



Locus Name 



hypothetical protein AFU4I7 



|pir:A69302 



Acc# 
A69302 



Description 



ORF Name 



NTID 



— — Score Probability 
AAID Length Length 



\10&±0M.±±J1& I 



Protein name 



probable integral membrane protein 



Description 



IBT 



10.0014 



Locus Name 



|pir:TJ70b0 



Acc# 
T37050 



ORF Name 



Protein name 



NTID 



1640 



NT 



AA 

— Score Probability 
AAID Length Length 



117$ 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



serine acetyitransterase 



Description 



NT 



AA 



— — , Score 
Length Length 

H5T 



Probability 
|7.2e-16 



Locus Name 



tair:<SV2i4y 



Acc# 
G72349 



482 



ORF Name 



NTID 



4145057 c± 441 



Protein name 



NT 



AA 



AAID Length Length 



Score Probability 



3.7e-ly 



Locus Name 



Acc# 



serine acetyitransterase 



Description 



bir:a72J4y 



G72349 



ORF Name 



NTID 



NT 



AA 



AAID Length Length 



— Score Probability 



Protein name 



TZTT 



Locus Name 



4.6e-0S 



Acc# 



Description 



spTTTST^YNYT 



P74442 



HYPOTHET I CAL WD-kE PEAT PkOTEIN ^LkUl4i 



ORF Name 



NTID 



— — Score Probability 
AAID Length Length 



Protein name 



Locus Name 



Acc# 



Description 



KfO-HlT 



ORF Name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



|f!7.10^O.2..±l...i:Z 



Protein name 



1645 



6567 



WTT 



12442 



Locus Name 



5 . 2e-74 



Acc# 



S p:BACA_BACiLl 



068006 



Description 



483 



ORF Name 



4773261 cl lyb 



Protein name 



Description 



NT 



AA 



NT ID 



AAID Length Length 



Score Probability 



2TT 



r7iT" 



0.017 



Locus Name 



sprYJBUJiJt'oLl 



Acc# 



P32689 



PRECURSOR 



ORF Name 



4&0216S c3 406 



Protein name 



NTID 



TFT7" 



AAID 



NT 



AA 



Length Length 




Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



4&a£iaa...ci...:LjauL. 



Protein name 



NTID 



proJoaoie iipopolysaccnariae 
N-acetylglucosaminyltransf erase, rfbU 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



Locus Name 



pir :F64bUU 



1 .4e-09 



Acc# 



F64500 



ORF Name 



Protein name 



NTID 



— — Score Probability 

AAID Length Length 



phnP protein tpnnP) nomolog 



Description 



Locus Name 



pir :D7UI6b 



|2.6e-4J 



Acc# 



D70166 



484 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



4954380__l2_9O 


1650 


6872 


620 


1853 


357 















Protein name 



Locus Name 



oxaloacetate decarboxylase, subunit alpha 
(oadA) homolog 



Description 



pir:C69406 



4 .4e-47 



ACC# 



C69406 



NT 



AA 



ORF Name 



NTID 



H55T 



AAID Length Length 
— 



Score Probability 



1355" 



798 



lAe-26 



Protein name 



Description 



Locus Name 



sp:DWB_P£^!PU 



Acc# 



P13455 



DNA POLYMERASE 1 11, BETA CHAIN, 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



1ST 



Score Probability 
8.3e-2i 



Protein name 



Locus Name 



putative nistidine protein Kinase 



|gp:REU8^b64 



Acc# 



U82564 



Description 



-iiKe protein 
(hoxJ) gene, 



hydrogenase-like protein small sufcunit inoxB) gene, nydrogenase 
large subunit (hoxC) gene, and putative histidine protein kinase 
complete cds,and nickel permease (hoxN) gene, partial cds. 





ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


52L&aa5i...d...m 


1652 


687b 


440 


1323 


108 


2.8e-0!> 



Protein name 



Description 



Locus Name 



sp : FER_METBA 



Acc# 



P00202 



485 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



c2 27b 



1654 



l.Oe-41 



Protein name 



Locus Name 



ss-1, 4 -galactosyl transterase 



|gp:^PC^^141! 



Acc# 



X85787 



Description 

S.pneumonxae cpsl4 locus, 



ORF Name 



604§452 rl J 2l 



Protein name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



75" 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



NADH dehydrogenase t ubiquinone) , , 3 9 Kua 
subunit homo log 



Description 



Il2l8 



Locus Name 



bir:H69478 



;1 . 5e-07 



Acc# 



H69478 



ORF Name 



Protein name 



NTID 



KTT AA 

— — Score Probability 
AAID Length Length 



hypothetical protein S110744 



Description 



JUT 



924 



Locus Name 



pir:S77079 



Acc# 



S77079 



486 



ORF Name 



5742762 ri ^ 



Protein name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



6881 



NT 



AA 



Length Length 




Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



16883 



T7TT 



TTTF" 



i.0e-30 



Locus Name 



capsular polysaccharide blosyntnesis nomolog 
yveT 



pir:A700J7 



Acc# 



A70037 



Description 



487 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



465 



1398 



T7T 



Protein name 



Locus Name 



phosphate starvation inducible protein 
homolog ylaK 



Description 



pTrTK£W7T 



|i.7e-76 



Acc# 



A69873 



ORF Name 



Protein name 



NTID 



azmai^t^ m | 



— — Score Pro bability 
AAID Length Length 



16885 



P7TT 



Locus Name 



Acc# 



Description 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 



— Score Probability 



TFF" 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



16887 



nbonuclease H, T" 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



i.8e-ii 



Locus Name 



biriJObVttV 



Acc# 



JC5787 



488 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



107203:17 tl 7b 



or 



|4.0e-22 



Protein name 



Locus Name 



lsp:YC0a_yJiA4!T 



Acc# 



P37261 



Description 

tiYPOfHETlCAL 21.1 KD PkOTEltf IN FtJSl-AG£>l INTERGENIC RECjIUM 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



TTZT 



Score Probability 
|2.2e-07 



TIT" 



Protein name 



Locus Name 



hypothetical protein BBI16 



bir:<37024l 



Acc# 



G70241 



Description 



ORF Name 



NTID 



NT AA 

— — , Score Probabil ity 
AAID Length Length 



Protein name 



DNA topoi some rase III tops 



Description 



TUT 



Locus Name 



pir:H697i>4 



Acc# 



H69724 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 




Score Probability 



T5T 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



1670 



AAID 



NT 



AA 



— — , Score Probab ility 
Length Length 





rnr 



Locus Name 



Acc# 



Description 



NO-HIT 



489 



ORF Name 



NT AA 

— — Score Probability 
NT ID AAID Length Length 



T5TT 



|2.7e-32 



Protein name 



Description 



Locus Name 



|gp:AB0il>yS7 



Acc# 



AB012957 



Vibrio choierae genes tor o-antigen synthesis , strain 022, completecas . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



1F72- 



S.Se-lfi 



Protein name 



Locus Name 



putative giycosyl transferase 



gp:AF048/4y 



Acc# 



AF048749 



Description 



Bacteroides iragilis capsular polysaccharide siosyntnesis ope ron, complete 
sequence . 



NT 



AA 



ORF Name 



i2£ulD.B.:/...±^...llb... 



NTID AAID Length Length 





Score Probability 



Protein name 



Locus Name 



Acc# 



Description 



KO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



proJaaJDie swt/snt nelicase 



Description 



NT 



AA 



Length Length 




Score Probability 
|i.3e-8i 



Locus Name 



|pir:Jil7148l 



Acc# 



E71481 



490 



ORF Name 



Protein name 



NT 



AA 



NTID AAID Length Length 

TTCT — 



Score Probability 



Locus Name 



Acc# 



Description 



NO -HIT 



ORF Name 



Protein name 



NT 



AA 



NTID AAID Length Length 
F£T5 1 



Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



MocB (Tn4i99J 



Description 



NTID 



1677 



NT 



AA 



AAID Length Length 



11284 



Score Probability 
17 .6e-20 



Locus Name 



pir :B484B7 



ACC# 



B48487 



ORF Name 



Protein name 



NTID 



AAID 



enhanced entry protein EnnC 



Description 



— — , Score Probability 
Length Length 



3TT 



1ST 



2.2e-22 



Locus Name 



|gp:AP0b77o4 



Acc# 



AF057704 



Legionella pneumophila ElnhA (enhA) , ElnhB (enftB) , and ennancea entryprotem 
EnhC (enhC) genes, complete cds . 



491 



Protein name 



NT 



AA 



ORF Name 


NTID 


AAID 


Length 


Length 


i6135886J:3_i&:J 


i£74 


6901 




103 


312 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



AAID 



NT AA „ ^ , , . n . 
— — , Score Pro bability 
Length Length 



1680 




6902 280 843 




105 





0.00081 



Protein name 



Description 



Locus Name 



Acc# 



sp:Yy21_iiOk^U 



HYPOTHETICAL £>&0tein bbd21 



ORF Name 



NTID 



NT AA „ _ , , . _ . . 
— — Score Probability 
AAID Length Length 



T£W±~ 



TUTT 



l.Se-llfi 



Protein name 



Locus Name 



Acc# 



gp:AB0l£260 



Description 

Agrobacterium tumetaciens plasmid p'ri-SAKURA, complete sequence. 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



T7TT 



|2.6e-176 



Protein name 



Locus Name 



hypotnetical protein 



pir:JQ1020 



Acc# 



JQ1020 



Description 



492 



NT 



AA 



ORF Name 



NTID 



AAID 



1^62G60 t2 134 



Length Length 



Score Probability 
TT3 



0.0074 



Protein name 



Locus Name 



ES/130 



|gp:&F00675r 



Acc# 



AF006751 



Description 



Homo sapiens ES/13 0 mRNA, complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



K34" 



Length Length 



Score Probability 



TM5" 



Protein name 



Description 



Locus Name 



Acc# 



[NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
— 



Score Probability 



T7T 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



iaaaa^i...t2L...aa i 



Length Length 
427 



1284 



Score Probability 
6 . le-106 



1049 



Protein name 



Locus Name 



transposase 



Acc# 



AF038866 



Description 



Bacteroid.es Iragilis transposon Tn552 0 transposase (bxpH) andmcbilization 
protein BmpH (bmpH) genes, complete cds. 



493 



ORF Name 
20213132 c2 406 



Protein name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



1687 






61 


186 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 




Score Probability 



Locus Name 



Acc# 



Description 



KO-HIT 



ORF Name 
\10.B.1B.B.1..±2..±±1.. 



Protein name 



NTID 



AAID 



NT AA 
— , — , Score 
Length Length 



TTT" 



Locus Name 



Probability 



Acc# 



Description 



[NO-HIT 



ORF Name 
2L145t2L&...c3....5tl7... 



Protein name 



NTID 



AAID 



NT AA 
— , — , Score 
Length Length 



TUT 



Locus Name 



Probability 



Acc# 



Description 



NO-HIT 



494 



NT 



AA 



ORF Name 



NT ID 



ti 6 



AAID Length Length 
T5^3 — 



Score Probability 
5.4e-C7 



Protein name 



Description 



Locus Name 



Acc# 



P16947 



M PfcOtfSIN, StiROfYM 45 pfufiCuftSOft 



ORF Name 



NTID 



NT AA 

— , — ■ Score Pr obability 
AAID Length Length JL 



6914 



5W 



Protein name 



Locus Name 



tetracycline resistance element regulator 
RteA 



Description 



pir :A41860 



ACC# 



A41860 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 
IT7 



Score Probability 



Locus Name 



Acc# 



Description 



paw 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



INO-HM 



495 



NT 



AA 



ORF Name 



NT ID 



21679626 tl 45 



T5"S5~ 



AAID Length Length 

mn — 



Score Probability 



T7T 



Protein name 

Description 
MO -HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NT ID 



lZAS.$All...al..A&6. I ITS^ 



AAID Length Length 




Score Probability 
TTC 



1.5e-08 



Protein name 



Description 



Locus Name 



gp:APU72238 



Acc# 



U72238 



Anabaena PCC7120 ORFR1, 0RFR2 , 0RFR3, ORFR4 , and ORFR5 genes , complete 
sequences . 



NT 



AA 



ORF Name 



2Li&a.7.&is...ca...fiia I rrzrr 



NTID AAID Length Length 




TJUT 



Score Probability 
55 



4.3e-i>5 



Protein name 



Locus Name 



phage abortive intection protexn 



pir:T30326 



Acc# 



T30326 



Description 



ORF Name 



NTID 



AAID 



Protein name 



UDP-galactopyranose mutase 



NT AA 

— , — , Score Probability 
Length Length 



TTFT" 



Locus Name 



gp:SPAJ6 986 



Description 

Streptococcus pneumoniae type 33F DNA, capsular gene cluster. 



I4.7e-122 



Acc# 



AJ006986 



496 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length =£ - 



227740S7 t2 126 



T5W 



33" 



0.0024 



Protein name 



Locus Name 



non- structural 5a protein 



gp:HCU56570 



Acc# 



U56570 



Description 



Hepatitis C virus isolate 925821 non- structural 5a (NS5a) gene, partial cds. 



NT 



AA 



ORF Name 



NTID 



TTuTT 



AAID Length Length 
T^l 



Score Probability 

on 



S4 



Protein name 



Locus Name 



sp:SE>RG_x:fi]SfLA 



Acc# 



P36378 



Description 

(OSTEONECTIN) (ON) (BASEMENT MEMBRME PROTEIN BM-40) 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length ^ 



TTDT" 



T5T 



tutt 



i.4e-216 



Protein name 



Description 



Locus Name 



Acc# 



Igp : BNRRTEAB 



Bacteroides thetaiotaomicron rteA and rtaB genes involved mproduction ot 
plasmid-like forms, complete cds, and tetQ gene, 3'end. 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



\11±±115A.±1..±±& I P77TJ2 



TTUT 



2.4e-0S 



Protein name 



Locus Name 



actin binding protein MAYVEN 



gp:AF059569 



Acc# 



AF059569 



Description 

Homo sapiens actin binding protein MAYVEN mRNA, complete cds . 



497 



NT 



AA 



ORF Name 



±2 ±62 



T7UI 



vfpth 7\ "a ~r t\ t — ^ i- -r — ^, Score Probability 
NTID AAID Length Length JL 

— 



Protein name 

Description 
INO-HTT 



Locus Name 



Acc# 



ORF Name 



NT AA 

™„ x — 4_u x — Score Prob ability 
NTID AAID Length Length JL 



±±L$£.$±±..±1..±1$. I IT7uT 



IT 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



\216A±5.5.2..±±...1± 



NTID AAID Length Length 

mn — 



Score Probability 
HT7 



S.3e-85 



Protein name 



Description 



Locus Name 



gp:BFU63096 



Acc# 



U63096 



Bacteroiaes tragilis LJoctAJ gene, complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



TTulT 



6 §2$ 



Length Length 

146 



Score Probability 

m 



6.62$ 



Protein name 



Locus Name 



Acc# 



hypothetical protein 



gp:AF036485 



Description 

Piasmid pNZ4 000, complete sequence. 



498 



ORF Name 



NTID 



NT AA 

AAID Le^th Le^th Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 
x — o-i- -r — Score 
Length Length 



2±2h.3A±i..±iJi&L. I nrrm 



TTTT 



Probability 
S.4e-34 



Protein name 



Description 



Locus Name 



sp:GSPA_BACSU 



Acc# 



P25148 



GEtffi&AL S'l'RKS^ PRO! 1 *!™ A 



NT 



AA 



ORF Name 



NTID 



ii^iA.?.aaft„±L„a4 i [tttts 



, , TrA _ — _ — _ Scor e Probability 
AAID Length Length i - 

15531 — 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



z^ias.i2...±i...S.a.. 



ITTu" 



NT AA 
Length Length 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



499 



ORF Name 



24415883 ±3 171 



Protein name 



NTID 



AAID 



T7TT 



NT AA 
Length Length Probability 



12*3" 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



NTID 



NT AA 

7V7 . T _ _ — — _ Score Probability 
AAID Length Length 



2.ii^l2S.S...ci„^0.b. I [F7T2 



Protein name 



rxJDonuciease ill (rnc) homolog 



Description 



MIT 



©.Oe-25 



Locus Name 



pir :H70187 



Acc# 



H70187 



ORF Name 



±A.bMlk2....a±...51& 



Protein name 



NTID 



vrll protein 



Description 



T7TT 



NT AA 

7V7VT _ _ — — ^, Score Probability 
AAID Length Length z - 



TUT 



Locus Name 



pir :T17388 



0.0018 



ACC# 



T17388 









NT 


AA 






ORF Name 


NTID 


AAID 


Length Length 


Score 


Probability 




1714 


6$16 


637 


1514 


1505 


2.^e-154 j 



Protein name 



; arginme decarboxylase , 2 rprotem 
slr0662 rprotein slr0662 



Locus Name 
jpir:S7677l 



Acc# 



S76771 



Description 



ORF Name 



NT AA 

NTID AAID Length Length Probability 



2Lib.4U8.:/b...±1...24Si.. 



T7TF" 



TZZT 



0 . 00018 



Protein name 



Locus Name 



Acc# 



complement C9 precursor 



pir:C9H(J 



Description 



500 



ORF Name 



NT ID 



NT AA 
^ ^ — L1 — _ Score 
AAID Length Length 



653§ 



Probability 
|5>.5e-69 



Protein name 



Description 



Locus Name 



sp:PDXB_ETOLI 



Acc# 



P05459 



SfttfEHftONATEi - 4 - t>HOSfe>HAja DEHYDROGENASE f 



NT 



AA 



ORF Name 



NTID 



24645461 tl 1$ 



T7TT 



AAID Length Length 
— 



TUT 



Score Probability 




5.5e-85 



Protein name 



Locus Name 



tetracycline resistance element mobilization 
regulatory protein rteC 



pir :A36927 



Acc# 



A36927 



Description 



ORF Name 



NTID 



EF7TF" 



Protein name 



acetylglutamate Kinase 



Description 



NT 



AA 



AAID Length Length 
— 



Score Probability 

rrTT — 



^.9e-35 



Locus Name 



|pir:P69111 



Acc# 



F69111 



ORF Name 



NTID 



NT AA 

_ _ _ „^ — _ — _ Score Probability 
AAID Length Length 21 



:M6A7.M2...a:L..28.i.. 



T7TT 



Protein name 



auxin-responsive GH3-like protein 



Description 



6.1e-22 



Locus Name 



gp:ATAC005356 



Acc# 



AC005396 



AraJoidopsis thaliana chromosome II BAC T26I20 genomic sequence, complete 
sequence . 



501 



NT 



AA 



ORF Name 



NT ID 



AAID 



I246S0067 t3 257 



Length Length 
TTT 



Score Probability 



Protein name 



Locus Name 



Acc# 



Description 
|MO-HIT 



ORF Name 



NTID 



AAID 



NT AA 

t 4-t- t — ^ Score 
Length Length 



1721 



411 



Protein name 



Locus Name 



Probability 



Acc# 



Description 
INO-HIT 



ORF Name 



NTID 



AAID 



NT AA 
Length Length Probability 



Z±l^til6.„±±..±9& I TT7TI 



5544 



Protein name 



Locus Name 



sp:kli!P BUCAP 



Description 
ATy-BJjPE NbiiM 1 DNA W*!LI(JAiSiji REP, 



Acc# 



051889 



ORF Name 



NTID 



NT AA 

AAID Length Length Probability 



Zba^A£.l.±l„.£& 



T7ZT 



1470 



Protein name 



Locus Name 



unknown 



gp:AF14487 9 



Acc# 



AF144879 



Description 

i_.eptospxra interrogans rtJD locus, complete sequence. 



502 



ORF Name 



2bb78201 tl 70 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



TTZT 



Length Length 



Score Probability 
FT7 



i.le-51 



Locus Name 



sp:LWJA_BAGN0 



Acc# 



P39907 



ORF Name 



2b9^7 t2 ill 



Protein name 



NTID 



AAID 



NT AA 

T — ^ T — Score 
Length Length 



TT7T 



Probability 
I RTjB 



Locus Name 



Acc# 



tetracycline resistance protein tety : tetA (Q) 2 



pir : 140188 



Description 



NT 



AA 



ORF Name 



NTID 



26.119A2...C.2..AS.0... 



TTZZ" 



AAID Length Length 
^ — 



TUT 



fZTTT 



Score Probability 
T0"S 



Protein name 



Locus Name 



mobilization protein C 



gp:APii8243 



Acc# 



AF118243 



Description 



Bacteroides tragi! is mobilization protein C (moJoC) gene, completecds . 



NT 



AA 



ORF Name 



NTID AAID Length Length 

1727 | |6$4S> 1 124$ 



Score Probability 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



503 



NT 



AA 



ORF Name 



NT ID 



26571300 £3 173 



T1TT 



AAID Length Length 
— 



Score Probability 



TTT 



TTT 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NT ID 



aasa44iL±i.„ai I irm 



AAID Length Length 
5551 — 



Score Probability 

mz — 



3.4e-25 



Protein name 

Description 
INCOMPATIBILITY MOD-2) 



Locus Name 



sp:HSdO_PODAN 



Acc# 



043109 



NT 



AA 



ORF Name 



NT ID 



2&fifl£iaai..±i.„iii i irrnr 



— — , ^ _ — L1 Score Probability 
AAID Length Length JL 

W5T2 



TUT 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NT ID 



AAID 



26.1$&6.1&.±1..2&$. | fITTT 



555T 



Length Length 
"STT 



TZJT 



Score Probability 
TT1 



4 . 6e-9l 



Protein name 



Locus Name 



ABC transporter (ATP -binding protein) homo log 
ygaD 



pir:G69S15 



Acc# 



G69815 



Description 



504 



ORF Name 



26ttiS752 c2 422 



Protein name 

Description 
NO-HIT 



NT 



AA 



NT ID 



TTTT 



_. — _ — Score Probability 
AAID Length Length L - 

^551 — 



135" 



Locus Name 



Acc# 



ORF Name 



2L9.2L3.125L...t3....25A.. 



Protein name 
Description 

NO-HIT 



NT 



AA 



NTID 



TTJT 



AAID Length Length 
— 



Score Probability 



TTT 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



2$1£&21&..jc±J±13. 



T7JT 



M protein precursor 



Description 



NT 



AA 



AAID Length Length 




Score 



Probability 
10.032 



Locus Name 



pxr :S60858 



Acc# 



S60858 



NT 



AA 



ORF Name 



NTID 



2M7.1il..±2...M I FITS 



Protein name 

Description 
NO-HIT 



AAID Length Length 




Score Probability 



TTT 



Locus Name 



Acc# 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID 



T7T£~ 



F£33~ 



Length Length 



TTJU" 



Score Probability 
252 



2.4e-iS 



Locus Name 



Hypothetical protein Rv0 597c 



pir :H70908 



Acc# 



H70908 



Description 



505 



NT 



AA 



ORF Name 



|ii986S676 tl 44 



T7T7 



_ _ -p-n T — _ — ^, Score Proba bility 
NTID AAID Length Length JL 

— 



|2.7e-09 



Protein name 



Description 



Locus Name 



sp:PRIM_LISMO 



Acc# 



P47762 



DNA PRIMASE, 



NT 



AA 



ORF Name 



NTID 



AAID 



31736291 ±2 93 



11738 



6960 



Length Length 
TTT 



Score 



Protein name 

Description 
KfO-fllf 



Locus Name 



Probability 



Acc# 



NT 



AA 



ORF Name 



NTID AAID Length Length 

nrm — 



Score Probability 



6351 1 1 BUT 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



™„ _ _ _ _ — _ r — _ Score Probability 
NTID AAID Length Length JL 





1740 



53" 



"TUT 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



506 



NT 



AA 



ORF Name 



NT ID 



31892650 Cl 351 



\T7TT 



AAID Length Length 
5553 



Score Probability 



fSUT 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



±±l&b2b£...cil.„l , m I 



NT ID AAID Length Length 

[5551 — 



Score Probability 



7T 



TIT 



Protein name 

Description 
NO -HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



iimft5ft.„ti.„iia i trrzs 



NTID AAID Length Length 

[5555 — 



55T 



T55T 



Score Probability 
TT2 



i-5e-i0 



Protein name 



Description 



Locus Name 



sprVOLD BPP2 



Acc# 



P13520 



OVERCOMING L YSOGEN I 2 AT I ON DEFECT PROTEIN 



NT 



AA 



ORF Name 



33J.D£.25.±..±1...±63. I 11744 



NTID AAID Length Length 

[5553 — 



52T 



Score Probability 
TSS 



2 .2e-14 



Protein name 



Locus Name 



putative RNA polymerase sxgma tactor (ECF 



gp:SCE46 



Acc# 



AL133252 



Description 
Streptomyces coelicolor cosmicl E46 . 



507 



ORF Name 
34037503 t3 170 



Protein name 

Description 
MO-HIT 



NT 



AA 



NT ID 



AAID Length Length 



Score Probability 





1745 




6957 




99 


300 



Locus Name 



Acc# 



ORF Name 
3.40.£3.441..±2...i3.a.. 



Protein name 

Description 
NO-HIT 



NTID 



AAID 



NT AA 
Length Length 



T5W 



Locus Name 



Probability 



Acc# 



ORF Name 



Protein name 

Description 
MO-HIT 



NTID 



NT AA 

_ _ T — T — Score Probability 
AAID Length Length 



TTZT 



T7T 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA 
T — ^ _ — -i Score Probability 
AAID Length Length JL 



2A±Z.2A0.2.±±J±& I nTTTS 



S570 



TTF" 



Protein name 

Description 
(SfO-HlT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



Miaaiaa..±i...2i2 1 uttz? 



AAID Length Length 




Score Probability 
— 



5.5e-124 



Protein name 



Locus Name 



maturase- related protein (mtL intron) 



pir :S77648 



Acc# 



S77648 



Description 



508 



NT 



AA 



ORF Name 



NTID 



1^4268878 £1 27 



T7STT 



AAID Length Length 
— 



Score Probability 
151 



Protein name 



Locus Name 



glucosidase II beta-subumt 



gp:AF06606i 



Acc# 



AF066061 



Description 



Mus musculus glucosidase II beta-suJounit gene, alternativelysplxced 
products, partial cds . 



NT 



AA 



ORF Name 



NTID 



^l7lM7 ci 350 



TT5T 



_ _ _. _ — — ^ Score Probability 
AAID Length Length J - 

m*n — 



TIT 



1320 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



lS152ftH...c£...42a I IT752 



Length Length 
T73~ 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
TST 



Score Probability 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



509 



NT 



AA 



ORF Name 



NTID 



AAID 



3933375 c3 520 



Length Length 
POTT 



Score Probability 
Wl 



0.017 



Protein name 



Locus Name 



hypothetical protein 



pir :B72242 



ACC# 



B72242 



Description 



ORF Name 



NTID 



AAID 



NT AA , , . , . 
— , — ^ Score Probability 
Length Length £ - 



T$T 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 

— 



TIT 



Score Probability 
TT5 



i.5e-45 



Protein name 



Description 



Locus Name 



sp:YLCA_E00LI 



Acc# 



P77380 



PROBABLE 'rkAtfSCRIMlONAL REGULATORY PkOTElN YLCA 



ORF Name 



NTID 



AAID 



NT AA 
— , — , Score 
Length Length 



1757 



15575 



TTuT" 



Probability 
7.5e-45 



Protein name 



Locus Name 



conserved hypothetical protein aq_1224 



pir :G70405 



Acc# 



G70405 



Description 



NT 



AA 



ORF Name 



NTID 



11758 



AAID Length Length 
F9Su — 



573 



Score Probability 




|3.1e-22 



Protein name 



Locus Name 



sniJcimate Kinase 



pir : A70487 



ACC# 



A70487 



Description 



510 



ORF Name 



Protein name 



synthase, 



Description 



NT ID 



AAID 



NT AA 
— — Score 
Length Length 



TT5T 



Probability 
t7.4e-117 



Locus Name 



pir :G69842 



Acc# 



G69842 



ORF Name 



Protein name 

Description 
INO-HTT 



NT 



AA 



NT ID 



AAID Length Length 

mwi — 



Score Probability 



P7TT 



Locus Name 



Acc# 



ORF Name 



NT 



AA 



±1D.11S.0...±±JH 



NTID AAID Length Length 

15553 — 



T75T 



Protein name 



hypothetical protein PH0424 



Description 



Score Probability 




Locus Name 



pir:A71153 



|2.9e-37 



Acc# 



A71153 



ORF Name 



NTID 



NT AA 

^^-r^ T — — — Score Probability 
AAID Length Length J - 



±16.0A1..±1..X&$. I IT752 



Protein name 

Description 
NO-HIT 



5M4" 



5T 



Locus Name 



Acc# 



ORF Name 



Protein name 

Description 
NO-HIT 



NT 



AA 



NTID 



AAID 



T75T 



Length Length 



Score Probability 



Locus Name 



Acc# 



511 



NT 



AA 



ORF Name 



NT ID 



4984628 t3 232 



TT&T 



AAID Length Length 




TTTT 



Score Probability 




|4.2e-30 



Protein name 



Locus Name 



Acc# 



copper resistance sensor kinase pcoS: copper 
sensor 



pir:S52258 



Description 



ORF Name 



NT ID 



AAID 



NT AA 
— — Score 
Length Length 



amai7...±3L.m ....i [tt^ 



TTZT 



Probability 
9.8e-241 



Protein name 



Description 



Locus Name 



sp:ATMA_ECOLI 



Acc# 



P39168 



MG(2+) TRANSPORT ATPASE , P-TYPE 1, 



NT 



AA 



ORF Name 



NTID 



AAID 



bA2.3.2.Q,D...±2..±±l I 



Length Length 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



6.5.aa...ci...^:Ab. 



nrrrr 



AAID Length Length 

— 



Score Probability 
Tl 



0.031 



Protein name 



Locus Name 



sp:VflE<3_fiC0Ll 



Acc# 



Q46787 



Description 

HYPOTHETICAL 15 . 1 KD PROTEIN IN KDUI-LYSS INTERGENIC REGION 



512 



ORF Name 



NT ID 



NT AA 
T — T — ^, Score Probability 
AAID Length Length JL 



TJTT 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NT ID 



aS.lA0.1.7...±1...3.0.., 



T7F5" 



AAID Length Length 
15551 — 



T7F" 



TOT" 



Score Probability 
715 



0.033 



Protein name 



Locus Name 



41kd antigen 



gp:A13461 



Acc# 



A13461 



Description 

P. falciparum gene tor 41kd antigen, clone 41-7. 



NT 



AA 



ORF Name 



NTID 



AAID 



£aaaii2Lx:2...afi& i umu 



Length Length 



Score Probability 




BTTe^TT 



Protein name 



Locus Name 



acyl carrier protein (ACP) 



gp : ABACPF 



Acc# 



X82399 



Description 
A. Jorasilense acpF gene. 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



.7. 8. 1 2. ZtL 21 '. ".. 2, 5. 2L . . 



17717 



^55T 



33T" 



TUT 



b.le-70 



Protein name 

Description 
(MOSMOHEXOKINAflfi) 



Locus Name 



sp:KSPF_SYNY3 



Acc# 



P72830 



513 



ORF Name 



783410 t3 172 



Protein name 



NTID 



TTJT 



AAID 



NT 



AA 



Length Length 



Score Probability 



TT7F" 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



NTID 



AAID 



NT AA , , , n , 
— . — , Score Probability 
Length Length " L 



&5.a27...±2...i5.s i irm 



TUT 



TTT 



kL3e-34 



Protein name 



Locus Name 



phosphor ibosylglycmamide tormyl transferase 



gp:ATPUR3 



Acc# 



X74767 



Description 



Arabidopsis thaiiana mRNA tor phosphoribosyiglycinamidetormyltransrerase 
encoded by PUR3 gene. 



ORF Name 



Protein name 



NTID 



TTTT 



AAID 



NT 



AA 



Length Length 



Score Probability 



24T 



Locus Name 



Acc# 



Description 
IMO-HIT 



ORF Name 



3.3.23.13.0....g3....5l17„. 



Protein name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length JL 



TTTT 



hypothetical protein SCE68.26C 



Description 



TTT" 



Locus Name 



|pir:T36276 



Acc# 



T36276 



514 



NT 



AA 



ORF Name 



NTID 



\TT7T 



AAID Length Length 
mm 



sir 



Score Probability 
TT3 



Protein name 



Locus Name 



nypothetical protein Rv3 069 



bir:F70650 



Acc# 



F70650 



Description 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length J - 



TTTT 



TTT 



Protein name 



Locus Name 



hypothetical protein 



pir : JQ1020 



Acc# 



JQ1020 



Description 



NT 



AA 



ORF Name 



NTID 



i2£A7.2lu1...c2l...2u I frm 



AAID Length Length 
TuTIu — 



TTTT 



Score Probability 
T¥u 



2.ue-27 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



pir : JC6 02 7 



ACC# 



JC6027 



Description 



ORF Name 



NTID 



AAID 



ia£!2L&17.„.cl...21 1 U7TT5 



Protein name 

Description 
InO-HIT 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



515 



NT 



AA 



ORF Name 



NTID 



124218752 ci 18 



AAID Length Length 
— 



TTT 



414 



Score Probability 
| |0 . 015 



Protein name 



Locus Name 



ribosomal protein L5 



|gp:U17009 



Acc# 



U17009 



Description 



Phytophthora mtestans mitochondrion, complete genome. 



NT 



AA 



ORF Name 



NTID 



3163438 C2 19 



TVST 



AAID Length Length 
7M3 — 



TTZT 



Score Probability 
T¥8 



1.4e-07 



Protein name 



Locus Name 



transmembrane sensor 



gp:AI?05l6$l 



Acc# 



AF051691 



Description 



Pseuclomonas aeruginosa stress tactor A (pstA) , ECF sigma tactor (tiulj , 
transmembrane sensor (f iuR) , and hydroxamate-typef errisiderophore receptor 
(fiuA) genes, complete cds. 



NT 



AA 



ORF Name 



NTID 



AAID 



18.0.6.3.3.Z...C.3....2.1.. 



nrmr 



Length Length 



Score Probability 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



6AAB:±i).6...±l..& 



TTZT 



AAID Length Length 




TTT 



Score Probability 




1.7e-14 



Protein name 



Locus Name 



RNA polymerase sigma tactor SigZ-liice protein 



gp:AF137263 



Acc# 



AF137263 



Description 



Bacteroides thetaiotaomicron 3 OS ribosomal protein sie-iikeprotem, tucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds . 



516 



NT 



AA 



ORF Name 


NT ID 


AAID 


Length 


Length 


i3065655_t3_2 


1784 




7005 


67 


201 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



±&Bm2L.±2..A 1 



AAID Length Length 
7TO7 — 



TIT 



Score Probability 




0.044 



Protein name 



Locus Name 



transcription regulator, PJdsX tamily 



pir:H75270 



Acc# 



H75270 



Description 



NT 



AA 



ORF Name 



NTID 



U6.5.2I6.1..±1.„17. 



[F73r 



AAID Length Length 

Turn — 



77" 



Score Probability 
T5I 



2.6e-14 



Protein name 



Locus Name 



hypothetical protein 



pir : JQ1020 



Acc# 



JQ1020 



Description 



ORF Name 



NTID 



NT AA 

_ _ _ _ — _ — Score Probability 
AAID Length Length ^ 



16.S3.2LS.a5....tZ...ll I 11787 



ITS" 



3 . 9e-40 



Protein name 



Locus Name 



hypothetical protein 



pir : JQ1020 



Acc# 



JQ1020 



Description 



ORF Name 



NTID 



mktti2L&..±a...i& i ™ 



Protein name 



NT 



AA 



AAID Length Length 
7TJTT3 — 



Score Probability 
R 



Locus Name 



sp:SPRC_XE^LA 



Description 

(OSTEONECTIN) (ON) — (BASEMENT MEMBRANE PROTEIN BM-40) 



0.031 



Acc# 



P36378 



517 



NT 



AA 



ORF Name 



NT ID 



AAID 



2461562$ c3 24 



Length Length 



Score Probability 
£733 — 



Protein name 



Locus Name 



alpha -glucos Idas e 



gp:BTO66897 



Acc# 



U66897 



Description 



Bacteroides thetaiotaomicron neopulluianase (susA) and.alpha-glucosid.ase 
(susB) genes, complete cds . 



ORF Name 



NT ID 



Protein name 



hypothetical protein 



Description 



NT 



AA 



AAID Length Length 
7UT2 — 



TTT 



Score Probability 
T7T5 



5.1e-l2 



Locus Name 



pir : JQ1020 



Acc# 



JQ1020 



NT 



AA 



ORF Name 



NTID 



±0£±±6.5.2^1JX0£ I 



AAID Length Length 

tuts — 



Score Probability 
— 



6.Se-107 



Protein name 



Locus Name 



Acc# 



sp:YFBS_ECOLI 



Description 

HYPOTHETICAL 65.9 KD PROTEIN IN LRHA-ACKA INTERGENIC REGION 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



±7$2 


7014 


555 


106S 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



518 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



10727040 ci 204 





1753 


7015 


345 


1038 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length *- 



7TT 



1.2e-52 



Protein name 



Locus Name 



ACC# 



P74211 



Description 

&YAlD02tAMlBffi 5'-£>H0S£>HAT2 OXIDASE, (E>B»/PMf> OXIDASE!) 



NT 



ORF Name 
lll5.0A.7..7....cl...3.1.1., 



NTID 



AAID 



Length Length 



AA 

— , Score Probability 



TTZT 



Protein name 

Description 
[NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 

mm, — 



Score Probability 




2 .4e-33 



Protein name 



Locus Name 



Acc# 



O-acetyl transferase 



gp:SAU7730S 



U77308 



Description 

Staphylococcus aureus O-acetyl transferase (capSHj gene, completecds . 



519 



NT 



AA 



ORF Name 



NT ID 



AAID 



12985802 t2 99 



TTTT 



Length Length 
TFT 



7JT 



Score Probability 
53 



0.0053 



Protein name 



Description 



Locus Name 



gp:D84670 



Acc# 



D84670 



Pyrococcus turiosus gene tor DNA polymerase II subunit 1, DNApolymerase II 
subunit 2, complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



T7W 



ToTO" 



Length Length 
JET" 



Score Probability 



Protein name 

Description 
INO-HTT 



Locus Name 



Acc# 









NT 


AA 


ORF Name 


NTID 


AAID 


Length 


Length 


imisa&„.Gi.„i2S 


1799 




162l 


182 


549 







Protein name 

Description 
INO-HTT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



TTWZT 



Length Length 
ST5 



Score Probability 



Protein name 

Description 
INO-HTT 



Locus Name 



Acc# 



520 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 
TUTS — 



TFT 



Score Probability 
I.2e-22 



1^1 



Protein name 



Locus Name 



orlX 



gp:AB014440 



Acc# 



AB014440 



Description 



Staphylococcus aureus genes tor ortl, orrX, orr.2, ort3, partial andcompiete 
cds . 



NT 



AA 



ORF Name 



NT ID 



AAID 



16112SS2 t3 1S2 



Length Length 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NT ID 



X5"UT 



AAID Length Length 




T55" 



^7" 



Score Probability 
HT7 



l.3e-14 



Protein name 



Locus Name 



RNA polymerase ECF-type sigma lactor homolog 
ylaC 



Acc# 



A69872 



Description 



ORF Name 



NT ID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



19.5.7.18.7....c3.„.3.15. I ITEM 



Protein name 



Description 



Locus Name 



Acc# 



WfO-fliT 



521 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



15725575 ci 207 



\TTT 



|1.8e-id 



Protein name 



Locus Name 



unknown 



gp:AF048744 



Acc# 



AF048749 



Description 



Bacteroides tragilis capsular polysaccharide biosynthesis operon, complete 
sequence . 



ORF Name 



20505007 c3 305 



Protein name 



NTID 



AAID 



NT AA 
T - , Score Probability 
Length Length 



IT5T 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



|2ia.7.5.£0...±2....14.3. I ITM 



Protein name 



NT 



AA 



AAID Length Length 
JJ3 



Score Probability 



TTT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



2L2A6.9A5l2L...£3....2.9.8. 



T5W 



7<JJV 



TTT 



513 



1.4e-l0 



Protein name 



Locus Name 



unknown 



gp:AF04 874 9 



ACC# 



AF048749 



Description 



Bacteroides tragilis capsular polysaccharide biosynthesis operon, complete 
sequence . 



522 



ORF Name 



NTID 



NT AA , , . n . 
— , — , Score Probability 
AAID Length Length ^ 



4 . 8e-177 



Protein name 



Locus Name 



putative UDP-galactose-6 dehydrogenase 



|gp:AF048749 



Acc# 



AF048749 



Description 



Bacteroides tragilis capsular polysaccharide biosynthesis operon, complete 
sequence . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



21611611 ±3 163 



lfilO 


7012 142 1025 




$02 





2 . 3e-90 



Protein name 



D- 2 -hydroxy- acid dehydrogenase, 



Description 



Locus Name 



pir;£767S2 



Acc# 



S76782 









NT 


AA 


ORF Name 


NTID 


AAID 


Length 


Length 


2L3L9.^5tia...tl..Aa 


1811 




7033 


104 


315 







Protein name 



Locus Name 



Acc# 



Description 



NO -HIT 



ORF Name 



240.22.2.12....C1...2.3.6... 



Protein name 



NTID 



AAID 



conserved hypothetical protein 



Description 



NT 



AA 



Length Length 



Score Probability 
|4.3e-57 



Locus Name 



lpir:A72335 



Acc# 



A72335 



523 



ORF Name 



24415552 ci 251 



Protein name 



ykvJ protein 



Description 



NT ID 



AAID 



T3TT 



NT 



AA 



Length Length 



Score Probability 
B.ie-70 



Locus Name 



pir :A63868 



Acc# 



A69868 



ORF Name 



NT ID 



NT AA 

W , T „ — — ^, _ — _ Score Probability 
AAID Length Length jL 



1814 



7025 



1151 



TTT 



Protein name 



Locus Name 



tap:AF07<>967 



Acc# 



AF079967 



Description 



Phytomonas serpens 12 S large suPunit riPosomai RNA and 3S small suPunit 
ribosomal RNA, partial sequence; NADH dehydrogenase subunit8 (ND8) 
cryptogene, NADH dehydrogenase subunit 9 (ND9) cryptogene , NADH dehydrogenase 
subunit 7 (ND7) cryptogene, ATPase subunit 6 (A6) cryptogene, G3 cryptogene, 
complete sequence; and MURF1 (MURF1) and MURF1 (MURF1) genes, complete cds . 



ORF Name 



NT ID 



AAID 



NT AA 
T T Score Probability 
Length Length 



&a£a&&&±..g$„.$&1 J fTFTF 



TUTT 



T5T 



5.9e-50 



Protein name 



Locus Name 



N-acetylneuraminic acid condensing enzyme 



|gp:LPN73il 



Acc# 



AJ007311 



Description 



Legionella pneumophila serogroup 1 lipopolysaccharide JDiosynthesisgene 
cluster . 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length i - 



7035 



11157 



3 . 3e-05 



Protein name 



Locus Name 



provable lipopolysaccharide 
N-acetylglucosaminyltransf erase, rfbU 



pir :F64500 



Acc# 



F64500 



Description 



524 



ORF Name 



NT ID 



AAID 



NT AA 

T — Score Probability 
Length Length 



TtJTT 



TTIT 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



2UA2&ll...a±J2±l I 



TuTTF 



Length Length 
[2TRT 



Score Probability 




|8.2e-i5 



Protein name 



Locus Name 



thiol :disul tide interchange protein 



pir:F75549 



Acc# 



F75549 



Description 



NT 



AA 



ORF Name 



NTID 



\2l6Al±26..±t...±9. I fTOT^ 



AAID Length Length 
7MI — 



Score Probability 
73 



0.045 



Protein name 
Description 

3 OS R1B0S0MAL PROTEIN SlOP 



Locus Name 



sp:RS10_METTH 



Acc# 



027133 



NT 



AA 



ORF Name 



NTID 



1820 



AAID Length Length 




TUTT 



Score Probability 
TTZ 



|i.0e-33 



Protein name 



Locus Name 



unknown 



gp:AF144879 



ACC# 



AF144879 



Description 

Leptospira interrogans rtb locus, complete sequence. 



525 



NT 



AA 



ORF Name 



NTID 



144806575 c3 300 



AAID Length Length 

r7U¥3 — 



Score Probability 
T55 



2.6e-0S> 



Protein name 



Locus Name 



galactosyl transferase 



gp:AF030373 



Acc# 



AF030373 



Description 



Streptococcus pneumoniae strain SP-264 alpha, 1-6-glucosidase (dexB) gene, 
complete cds; capsular polysaccharide biosyntheticlocus , complete sequence; 
and oligopeptide binding protein (aliA)gene, complete cds. 



ORF Name 



NTID 



125431551 cl 5i2 



"TWTT 



Protein name 



acetyltransterase, vatB 



Description 



NT 



AA 



AAID Length Length 
7u44 — 



717 



Score Probability 
233 1 |2.le-25 



Locus Name 



pir :T10903 



ACC# 



T10903 



ORF Name 



Protein name 
Description 



NT 



AA 



NTID 



11823 



AAID Length Length 
7TJ^5 



Score Probability 



11544 



Locus Name 



Acc# 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 




T5T 



Score Probability 
551 



2.1e-57 



Locus Name 



conserved hypothetical protein ykvM 



pir :D69868 



Acc# 



D69868 



Description 



526 



NT 



AA 



ORF Name 



NT ID 



AAID 



126847025 t2 124 



7MT 



Length Length 



Score Probability 




|4.ie-84 



Protein name 



Description 



Locus Name 



sp : FLP YEAST 



Acc# 



P03870 



kECOMBltfASS FLP PRttffiltf (PROfElltT ABLE) 



ORF Name 



NTID 



NT AA 
_ — _ _ — _ Score Probability 
AAID Length Length £ - 



7MF" 



TFT 



FU4~ 



MI 



4.1e-84 



Protein name 



Description 



Locus Name 



spTFT^TEAST- 



Acc# 



P03870 



RECOMBINASE FLP PROTEIN (PROTEIN ABLE) 



NT 



AA 



ORF Name 



NTID 



AAID 



2MRM5.2...cl...21& I [T3T7 



Length Length 
TIT 



TJUT 



Score Probability 

— 



|i.2e-93 



Protein name 



Locus Name 



Cps7G 



gp:AFi64515 



Acc# 



AF164515 



Description 



Streptococcus suis putative glycosyltransf erase Cps7E (cps7E) gene, partial 
cds; putative glycosyltransf erase Cps7F (cps7F) and Cps7G(cps7G) genes, 
complete cds; and putative glycosyltransf erase Cps7H(cps7H) gene, partial 
cds . 



ORF Name 



NTID 



NT AA 
_ — _ _ — _ Score Probability 
AAID Length Length jL 



1446 



ST5™ 



2 . Oe-21 



Protein name 



Locus Name 



Cps2 J 



gp:AF02^47l 



Acc# 



AF026471 



Description 



Streptococcus pneumoniae DexB (dexB) gene, partial cds; putatrvetransposase 
gene, complete cds; type 2 capsular polysaccharidebiosynthesis operon, 
complete sequence; and AliA (aliA) gene, partial cds. 



527 



NT 



AA 



ORF Name 



120505275 cl 198 



NT ID AAID Length Length 
7uTI 



ITT 



Score Probability 
TT7 



0.00037 



Protein name 



Locus Name 



STARP antigen 



gp:AP205525 



Acc# 



AF209925 



Description 

Plasmodium falciparum STARP antigen (STARP) gene, complete cds. 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length J - 





7ll 




347 





|1.5e-3l 



Protein name 



Locus Name 



CMP-N-acetiyneuraminic acid synthetase 



|gp:LPN73ll 



Acc# 



AJ007311 



Description 



Legionella pneumophila serogroup 1 lipopoiysaccharide biosynthesisgene 
cluster. 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
W2 



Score Probability 



TOT" 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



\1&&±9.8±1...z1J1&1 1 \T&TZ 



TTT 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



528 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 
— 



T72~ 



Score Probability 




6.3e-17 



Protein name 



Locus Name 



putative ECF sigma tactor RpoEl 



gp:AFo4S107 



Acc# 



AF049107 



Description 



Myxococcus xanthus response regulator FrzZ (rrzZ) gene, partialcds; alanine 
dehydrogenase (aldA) , putative ECF sigma factor RpoEl (rpoEl) , and response 
regulator homolog (frzS) genes, complete cds/and unknown genes. 



NT 



AA 



ORF Name 



NT ID 



AAID 



34552551 c3 236 



Length Length 



Score Probability 



Protein name 
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Protein name 

Description 
INO-HIT 



Locus Name 
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Score Probability 
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■1.3e-2S 



Protein name 



Locus Name 



putative acetyl transferase 



|gp:LPM731i 



Acc# 



AJ007311 



Description 



Legionella pneumophila serogroup 1 lipopoiy saccharide biosynthesisgene 
cluster . 
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Probability 
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Protein name 



Locus Name 



water-stress inducible protein 



|gp:AF010584 



Acc# 



AF010584 



Description 



Oryza sativa water-stress inducible protein (WSI) mRNA, completecds . 
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Protein name 
Description 

CHLOROPLAST 50£ RIBO^OMAL PROTEIN L23 



Locus Name 



sp:ftK23JBUGeft 



Acc# 



P19167 



NT 



AA 



ORF Name 



NT ID 



3.9.ia0.1..±2...1D.l I 



AAID Length Length 
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Score Probability 
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Protein name 



Locus Name 



ubiquinone/ menaquinone .biosynthesis 
methyl transf erase- related protein 
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Description 



ORF Name 
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Protein name 



NT ID 



hypothetical protein S111773 



Description 
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AAID Length Length 
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Locus Name 



pir:S77110 



Acc# 



S77110 
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Protein name 



Locus Name 



0RF8S 



|gp:AB028134 



Acc# 



AB028134 



Description 



Shigella sonnei O-antigen gene cluster tor ORF6S, ORF7S , 0RF8S , 0RF9S , 
ORF10S, partial and complete cds . 
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Protein name 



Locus Name 



alanine dehydrogenase 



|gp:AI?0?07TF" 



Acc# 



AF070716 



Description 



Vibrio proteolyticus alanine dehydrogenase (aid) gene, completecds . 
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Protein name 



Locus Name 



Toul 



gp:AF058689 



Acc# 



AF058689 



Description 



Neisseria meningitidis strain Z2491, genomic sequence. 
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Protein name 



Locus Name 



hypothetical protein 



pir :S75991 



Acc# 



S75991 



Description 
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NTID 
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AAID Length Length 




Score Probability 
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Protein name 



Locus Name 



putative glycosyi transterase 



|gp:AF048749 



Acc# 



AF048749 



Description 



Bacteroides fragilis capsular polysaccharide biosynthesis operon, complete 
sequence . 
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Protein name 



Locus Name 



conserved hypothetical protein 



prr:672224 



Acc# 



B72224 



Description 
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NTID AAID Length Length 
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Score Probability 




2.6e-59 



Protein name 



Locus Name 



quinolmate phosphoribosyi transterase 



pir:B70375 



Acc# 



B70375 



Description 
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Protein name 



Locus Name 



conserved Hypothetical protein 
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Acc# 



A72334 



Description 
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Protein name 
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sp:LGUL_NEIME 



Acc# 
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(S-D-LAOTOylGLuTathIOne methYLGlyOxal lyaSe!) 
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Protein name 

Description 
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Protein name 
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INO-HIT 
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Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 
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Score Probability 
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Protein name 



Description 



Locus Name 



sp:REPI_YEA^T 



Acc# 



P03871 



tRaNS-aCtING FACTOR S (REP1) (PROTEIN BAKER) 
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NTID 
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Probability 
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Protein name 



Locus Name 



putative epimerase/ dehydratase Wbil 



gp:AP0^4070 



Acc# 



AF064070 



Description 



Burkholdena pseudomallei putative dinydroorotase [pyrCl gene, partial cds; 
putative l-acyl-sn-glycerol-3-phosphateacyltransf erase (plsC) , putative 
diadenosine tetraphosphatase (apaH) , complete cds; type II O-antigen 
biosynthesis gene cluster , complete sequence; putative undecaprenyl 
phosphateN-acetylglucosaminyltransf erase, and putative 
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Protein name 



Description 



Locus Name 
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Protein name 



Locus Name 



aminotransterase homolog 



gp:AF00l4$7 



Acc# 



AF001497 



Description 



Campylobacter jejuni polysaccharide biosynthesis protein homologgene, 
partial cds, galactosyl transferase homolog, UDP-galactosephosphate 
transferase homolog, acetyl transferase homolog andaminotransf erase homolog 
genes, complete cds, and polysaccharidebiosynthesis enzyme homolog gene, 
partial cds. 
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Protein name 

Description 
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ribosomal protein L6 
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Description 
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Description 

RIBO^OMAL PROTEIN L17 
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Protein name 



Description 
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ALPiiA CHAltf) (RNA POLYMERASE ALPHA SUBMIT) 
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Protein name 



Locus Name 



Acc# 



gp:AB0l7308 



AB017508 



Description 

Bacillus naiodurans C-125 genomic DNA, 32 JcJd tragment, completecds . 
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5 OS RIBOSOMAL PROTEttf LS (SL6) 
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Protein name 



Locus Name 



Acc# 



ritiosomal protein S14 



pir :R3EC14 



Description 
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Protein name 



Locus Name 



initiation tactor IF1 



gp:AF1152M 



Acc# 



AF115283 



Description 



Leptospira interrogans Si 0~spc- alpha locus, complete sequence. 
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nbosomal protein L15 
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Protein name 
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ribosomal protein S13 



gp:AF115283 



ACC# 



AF115283 



Description 

Leptospira interrogans SlO-spc-alpha locus, complete sequence. 
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Description 

Bacillus nalodurans C-125 genomic DNA, 32 kia fragment, completecds . 
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Description 



|gp:AB0175iy5~ 



AB017508 



bacillus halodurans c-125 genomic DNA, 32 Kb tragment, compietecas . 
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preprotem translocase SecY 



gpiAFll^W 



Acc# 



AF115283 



Description 

Leptospira interrogans SlO-spc-alpha locus, complete sequence. 
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Protein name 



Locus Name 



immunoreactive 8 9KD antigen PG87 



|gp:AF17b72^ 



Acc# 



AF175722 



Description 



Porpnyromonas gingival is strain W50 immunoreactive 8 9KD antigenPG87 gene, 
complete cds . 
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hypothetical protein Rv0584 
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Acc# 



G70934 



Description 
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Protein name 



Locus Name 



beta-galactosidase 



gp:AF0Sb4^ 



Acc# 



AF055482 



Description 



Thermotoga neapolitana gaiactose utilization operon, completes equence . 
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hypotnetical protein 
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Protein name 



Locus Name 



115K outer membrane protein precursor :Susc 
protein 



Description 
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hypothetical protein RP407 



Description 
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Locus Name 



sp:VHI4_kH00A 
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Description 

HYPOTH E T I CAL PROTEIN IN HIM A 3 ' REGION (FRAGMENT) 
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r7TTT 



1ST" 



2.6e-18 
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conserved nypotnetical protein yl£>H 



Description 
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3-dehydroqumate syntnase PAB0298 
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Protein name 



Locus Name 



pobR regulator 



|gp:PyiiJViyb2V 



Acc# 



Y18527 



Description 

Pseudomonas sp. po£A, pobR, pcaQ, pcan and pcaG genes. 



ORF Name 



NT ID 



NT AA 

— — Score Probability 
AAID Length Length 



2$4l280£ ti 12 



T5W 



TTTT 



27J4" 



|2.1e-l6 



Protein name 



Locus Name 



RNA polymerase sigma lactor SigZ-liJce protein | igp : AF1372b3 



Acc# 



AF137263 



Description 

Bacteroides thet aiotaomicron 30^ ribosomal protein sib-iueprotein, tucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds . 
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Protein name 



Locus Name 



immunoreactive 89kD antigen pgbv 



gp:AF175722 



] 



Acc# 



AF175722 



Description 

Porphyromonas gingivalis strain W50 immunoreactive byjcD antigenPGB/ gene, 
complete cds . 
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Protein name 



Locus Name 
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3 -pnospnosniKimate 
1-carboxyvinyl trans f erase, : 5-enolpyruvylshikim 
ah ft - 3 -phosphate synthase . 



pir : JN0758 
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Protein name 



Locus Name 



Acc# 



alanyi-tRNA syntnetase 



gp:AP0ii7b00 



AF027500 



Description 

Aquitex pyrophilus alanyl- tM syntnetase (alaS) gene, compieteccis ; ana 
ATP-dependent Clp protease regulatory subunit (clpA) gene, partial cds. 
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lacto-N-Jaiosidase precursor 



gp:SSU404Bd 



Acc# 
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Description 



Strepbomyces sp. lacto-N-biosidase precursor gene, complete cds . 
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Protein name 



Description 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



|4£3.22u&...c2....i:/.i. 



AAID Length Length 

Twm — 



7T^ 



WW 



Score Probability 
2.6e-4$ 



2S¥ 



Protein name 



Locus Name 



alpJaa-l, 3/4- tucosidase precursor 



gp:SStf35394 



Acc# 



U39394 



Description 



Strepbomyces sp. alpha- 1 , 3/4-lucosidase precursor gene, completecds . 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



Score Probability 
3.8e-bfc 



Protein name 



Locus Name 



putative membrane transport protein. 



bprSCCVSA 



Acc# 



AL133220 



Description 

Streptomyces coelxcoior cosmid C75A. 



546 



ORF Name 



5162628 ±2 60 



Protein name 

Description 
NO-HIT 



NT 



AA 



NTID 



AAID Length Length 
7TT7 — 



Score Probability 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



AAID 



T3UB" 



NT AA 

— , — , Score Probability 
Length Length — — • 



H3F" 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



NT 



AA 



NTID 



AAID 



5t&6a2L5i...G2....:L7.& 



TTIT 



Length Length 



Score Probability 
3.3e-li 



Protein name 



Locus Name 



PobR 



gp:RLU4038B 



Acc# 



U40388 



Description 



Rhizo&ium leguminosarum positive regulator or po&A (pobR) gene, complete 
cds, and 4-hydroxybenzoate hydroxylase (pobA) gene, partial cds . 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length J ~ 



6.2.5.Q3.2L...C2....1SA.. 



ttttt 



1.7e-lS 



Protein name 



Locus Name 



unknown 



E 



p:AP00738i 



Acc# 



AF007381 



Description 



Flavooacterium ]ohnsoniae gliding motility protein (glcLAJ gene, complete 
cds ; and unknown genes . 



547 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



7TJT 



Score Probability 
5.0e-I0 



Protein name 



Locus Name 



receptor antigen (RagA) 



gp:PSIii0a71i 



Acc# 



AJ130872 



Description 



Porphyromonas gingival is W5 0 receptor antigen (rag) locus encodinga major 
immunodominant 55kDa antigen. 



NT 



AA 



ORF Name 



NTID 



53475$ cl 140 



1910 



AAID Length Length 
73 



Score Probability 



TFTZ 



Protein name 



Description 



Locus Name 



Acc# 



NO -HIT 



ORF Name 



NTID 



NT AA 

— — „ Score Probability 
AAID Length Length 



fii&ifis&..x&...ias i 



1TTT 



T5TI 



5T5" 



5.6e-52 



Protein name 



Locus Name 



sp:YWNl!_BAc^U 



Acc# 



P71040 



Description 

HYPOTH E T I CAL 55.5 KD PROTEIN IN SPO IIQ-MTA INTERGENIC REGION 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



7.ftfiLa5CL±2L.5Ltt.. 



1512 



17134 



1025 



TS5" 



i.0e-05 



Protein name 



Locus Name 



transmembrane sensor 



|gp:AF05i<^i 



Acc# 



AF051691 



Description 



Pseudomonas aeruginosa stress ractor A (psrA) , ECF sigma tactor { t ml) / 
transmembrane sensor (fiuR) , and hydroxamate- typef errisiderophore receptor 
(fiuA) genes, complete cds . 



548 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



Protein name 



7TJT 



\7W 



] 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



Protein name 



1514 



TUTS" 



|§.3e-S8 



Locus Name 



Acc# 



DNA processing cnam A 



Description 



IpTrTrmW 



C72399 



ORF Name 



NTID 



NT AA 

— — Score Pr obability 
AAID Length Length 



7TT7" 



3.2e~06 



Protein name 



Description 



Locus Name 



sp:EXSA_PSKAJ±; 



Acc# 



P26993 



EXOfiN£VMEi 5 5Vrt?HKSiS RfigULATu RY PROTEIN i^XSA 



ORF Name 



Protein name 



NTID 



±1±5.1L1L.±1...±Z I 



NT AA 

— — , Score Probab ility 
AAID Length Length 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



NTID 



T5TT 



hypotnetical protein RP870 



Description 



NT 



AA 



AAID Length Length 



Score Probability 
0.0016 



TU7 



Locus Name 



pir :F7164y 



Acc# 



F71649 



549 



NT 



AA 



ORF Name 



NTID 



13678887 ti 67 



AAID Length Length 
\TT5 



Score Probability 



IT 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



"7TZT 



Length Length 
— 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



1N0-M1T 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



7TIT" 



|S.0e-I2 



Protein name 



Locus Name 



RNA polymerase sigma-E tactor 



pir:H7bbbO 



Acc# 



H75550 



Description 



NT 



AA 



ORF Name 



NTID 



1921 



AAID Length Length 




Score Probability 
5 . Oe-ll 



192 



Protein name 



Locus Name 



tonB-lrnKect receptor Tlr 



gp:AFlSb^ 



Acc# 



AF155223 



Description 

Porphyromonas gingivalis tonB-lxnkect receptor Tlr (tlr) gene, complete cas. 



550 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



22460876 12 42 



TTT 



Protein name 



Locus Name 



IT 



gp:AF030027 



Acc# 



AF030027 



Description 

Elquine herpesvirus 4 strain NS8U567, complete genome. 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Pr obability 
Length Length 



23634713 tl 22 



TWTT 



[T7T 



11134 



133" 



|3.Se-ll 



Protein name 



Description 



Locus Name 



gp : PSKOPKC 



Acc# 
D28119 



Pseudomonas aeruginosa oprc gene tor outer membrane protein c, complete cds . 







ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 




l!&15£16.^c±^M 


1524 


7146 


56 


251 


75 


0.035 



Protein name 

Description 
HYPOTHETICAL 44.1 KD PROTEIN Itf ft»B5- 



Locus Name 



ACC# 



P33313 



CDC28 Ml'flRGENlO KECilOJSl 



ORF Name 



NTID AAID 



NT A A 

— — Score Probability 
Length Length 



2±110:±±±.±±..±± 


1525 


7147 


142 429 


145 















li.4e-10 



Protein name 



Locus Name 



Acc# 



sp:VE54__AQTjAg I 067466 



Description 

HYPOTHETICAL 15.3 KD PROTEIN AQ_1454 



551 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 




Score Probability 
i.3e-73 



744 



Protein name 



Locus Name 



putative reductase iron-suitur protein 



gprSCMld 



Acc# 



AL133469 



Description 



Streptomyces coelicoior cosmid M10. 



NT 



AA 



ORF Name 



NTID 



AAID 



124486507 ti 69 



T5TT 



7149 



Length Length 




STT" 



Score Probability 
3.1e-14 



Protein name 



Description 



Locus Name 



sp:Y374JYWTdA 



Acc# 
Q57819 



HYPOTHETICAL PkOTKIN MJ0374 



NT 



AA 



ORF Name 



NTID 



AAID 



2S.19:ibh0..±2...11 



1928 



7150 



Length Length 
ITT" 



Score 



Probability 
8.2e-I9 



Protein name 



Locus Name 



probable UDP-glucose 4-epimerase 



pir :A7ii&3 



Acc# 



A71183 



Description 



ORF Name 



NTID 



NT AA _ _ 
— — Score Pro bability 
AAID Length Length 



2L&3A5LQ2L5t...ca„.a5L 



Protein name 



nimD protein 



Description 



T£4~ 



S.7e-l2 



Locus Name 



Acc# 



pir:I40iaV 



552 



NT 



AA 



ORF Name 



NTID 



31445427 ci Hb 



AAID Length Length 
353 



7T52 



Score Probability 
|8.ie-56 



57F 



Protein name 



Locus Name 



.sp:YMPJiAC£JU 



Acc# 



P37567 



Description 

HY&<WtffiTICAL 37.1 KB teOTElftf In rt)LK-Lrti S 1^T2r^n1C fefeGlOrt 



NT 



AA 



ORF Name 



NTID 



33357252 c3 140 



AAID Length Length 
7153 — 



652 


1555 




2135 





Score Probability 
. 2e-221 



Protein name 



Locus Name 



putative reductase tiavoprotem summit 



|gp:SCMlO 



Acc# 



AL133469 



Description 



Streptomyces coelicolor cosmid M10. 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



13.mM0.u...c3....m I |T932 



7T5T - 



375" 



Score Probability 
7.2e-07 



Protein name 



Locus Name 



sp:EXSA_P3EAE 



Acc# 



P26993 



Description 

aXOEtiZYMa g flYNTkiiiSlS REICjULA'1'6rY frftO'imU I&SA 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



\1±±$3±&L±2J1£ I 



7T53" 



Score Probability 
3.9e-47 



Protein name 



Locus Name 



cation ettiux system nomoiog yd£M 



pir :C697$1 



Acc# 



C69781 



Description 



553 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
1225 



T3~ 



Score Probability 




7¥ 



Protein name 



Locus Name 



Acc# 



citrate syntnase 



jgp:BBU2S076 



U28076 



Description 

Bartonella baciilitormis citrate synthase igitA) gene, partial cas. 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 




1$35 


7157 


317 954 


$S0 


1.2e-d8 


Protein name 








Locus Name 


Acc# 


O-acetyiserme (thiol) -lyase-A 


related protein 


gp:AFl74l3a 


AF174138 


Description 




Methanosarcina bariteri o-acetylserine (tnioi ) -lyase-A 
gene, complete cds. 


reiateaprocem {cys^j 




ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


lOlZllZ^tl^Z 


1336 


715S 


425 1278 


1537 


1.2e-157 


Protein name 








Locus Name 


Acc# 


collagenase 


gp:AB006y73 


AB006973 


Description 














Porphyromonas gingival is DNA 


tor collagenase, 


complete cds . 


















ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


41l!20.2..±2...Al 


1$37 


7lS$ 


321 966 


§1 


0.043 



Protein name 



Locus Name 



|sp:RLl2_JtlAL\yo 



Acc# 
P41197 



Description 

505 RlBo^OMAL PkoTEIN L12 ( 'A' TVPkl) 



554 



ORF Name 



NT ID 



14789162 ci yl 



7TFTT 



Protein name 



conserved nypotneticai protein yvaJ 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



7T3~ 



TT3TT 



Locus Name 



pir:^70027 



Acc# 



G70027 



ORF Name 



Protein name 



NT ID 



AAID 



TTUT 



NT AA 

— — Score Probability 
Length Length 



TUT 



Locus Name 



Acc# 



Description 



NO -HIT 



ORF Name 



Protein name 



probable lipase 



Description 



NT ID 



AAID 



NT 



AA 



Length Length 

m — 



Score Probability 
2.2e-2A 



mi 



Locus Name 



jpir:C71>472 



Acc# 



C75472 



ORF Name 



Protein name 



NT ID 



ld4l 



AAID 



— — Score Probability 
Length Length 



5T" 



195 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



7164 



NT 



AA 



Length Length 

vn — 



Score Probability 



Locus Name 



Acc# 



Description 



[NO-HIT 



NT 



AA 



ORF Name 



NTID 



78170:4 t'l 36 



flMT 



AAID Length Length 
[7T^ — 



TZTT 



Score Probability 
0.0070 



155 



Protein name 



Locus Name 



hypothetical protein 



Acc# 



AJ243397 



Description 

£>seudomonas aureotaciens partial Jdoia gene ana orpl una. 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



867012 C2 102 



IW 



TUT 



Score Probability 
|7.2e-16 



Protein name 



Locus Name 



putative cytochrome B su&unit 



[gprSCMlO" 



ACC# 



AL133469 



Description 



Streptomyces coeiicolor cosmicl Mio. 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score 



7TFT 



Probability 
5.5e-i4 



Protein name 



Locus Name 



Mecl protein 



gprSSKSMUcJA^ 



Acc# 



Y13095 



Description 



S.sciuri mecA2 gene, strain K3 (MM2 ) . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score 



7T5F" 



TTT 



Probability 
10.0010 



Protein name 



Locus Name 



rmmunoglobulin-Fc-JDinamg protein 



|gp:S&FCftAi 



Acc# 



X73159 



Description 

S. pyogenes ±crA2 gene lor ig-Fc-mnamg protein. 



556 



ORF Name 



NTID 



24548410 c2 41 



Protein name 



nypotneticai protein C0624 



Description 



NT 



AA 



AAID Length Length 
TUTT 



Score Probability 
4.2e-37 



Locus Name 



pir:S7S04*l 



Acc# 



S73091 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



11945 



7T7TT 



TJT" 



TXT" 



.0.0037 



Protein name 



Description 



Locus Name 



spiBLARjJTAAU 



Acc# 



P18357 



REGULATOkY PR0TH1N BLAkl 



NT i-i-H. 

— — Score Probability 

NTID AAID Length Length 

17171 



AA 



ORF Name 



\15±Z$£:i2.±lJ±'± I 



153" 



10.022 



Protein name 



Locus Name 



extracellular protein Exp4 precursor 



|gp:LLU9S83<> 



Acc# 
U95836 



Description 



Lactococcus lactis extracellular protein Exp4 precursor, gene, partial cas. 



ORF Name 



NTID 



NT — Score Probability 



AAID Length Length 



|±mi&2...c:L..Aa 



JT7T 



317 



^4" 



2.$e-74 



Protein name 



Description 



Locus Name 



sp:MDU_<JkLAU 



Acc# 



P80040 



MALATE DEHYDROGENASE, 



557 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



4725152 ±4 26 



rTTTT 



TUTJT 



Protein name 



Locus Name 



nypotneticai protein Fi4*vy.b 



bir:T3^774 



Acc# 



T33774 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
TZZZ 



1ZT 



Score Probability 



673 



Protein name 



Locus Name 



115K outer membrane protexn precursor : SusC 
protein 



Description 



toxr:J(!6<W7 



Acc# 



JC6 027 



ORF Name 



Protein name 



Description 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



7T7T 



T5T 



Locus Name 



Acc# 



INO-HTT 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID 



TTTT 



Length Length 




Score Probability 



52 



Locus Name 



Acc# 



Description 



NO-HIT 



558 



ORF Name 



NT ID 



NT AA 

— — Score Probability 
AAID Length Length 



5591061 ci ii 



7T7T 



Protein name 



Locus Name 



Phosphoenolpyruvate carboxyJcinase 



gp:AB0166oO 



Acc# 



AB016600 



Description 

Selenomonas ruminantium gene tor Pnosphoenolpyruvate carboxyKinase , complete ; 
cds . 



ORF Name 



10009595 cl h5 



Protein name 



NT ID 



NT AA „ 

— — Score Proba bility 
AAID Length Length 



7T7TT 



71 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



iimiSfiLi...t3L„.M. 


1957 


7179 


575 


112$ 


24§ 













5.5e-2i 



probable nistidinol phosphatase 



Description 



Locus Name 



Acc# 



F75515 



ORF Name 



Protein name 



NTID 



NT AA 

— — Score Pr obability 
AAID Length Length 



■2.4e-il 



Locus Name 



sp:Rl££_BA<JAM 



Acc# 



Q44681 



Description 

(LUMA^lNk! SYNTHASE) (RIBOFLAVIN SYNTHASE Btfl'A 0HA1N) 



559 



NT 



AA 



ORF Name 



NTID 



AAID 



17032 cl 50 



7I3T 



Length Length 



Score Probability 
5.2e-45 



¥7T 



Protein name 



Locus Name 



Acc# 



AB013492 



Description 

kaciiius naiodurans u-12 5 genomic dna, 



$A/35' fragment, cioneALBACOOl . 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


l$54562_tl_b 


|1960 


7l§2 




1101 


|lS05 


|2.Se-lS4 | 



Protein name 



Description 



Locus Name 



sp:G3f>_BACjFR 



Acc# 



Q59199 



(FRAGMENT) 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



T5FT" 



7T2T 



FIT 



T1TT" 



2.6e-I44 



Protein name 



Locus Name 



sp:GUAA_ii!COLl | 



Acc# 



P04079 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



7TT 



Score Probability 
0.00037 



Protein name 



Description 



Locus Name 



sp:YA57_AC!TA(j 



Acc# 



052728 



HYPOTHETICAL PROTEIN lObV 



560 



ORF Name 



240762 ci 4^> 



Protein name 



NTID 



NT AA _ _ , , . _ . . 

— — Score Proba bility 
AAID Length Length 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



NTID 



NT AA 
— , — , Score 
AAID Length Length 



\1&.12SM±&...±2..±& I [T^Fl 



[7TSF" 



I7T5" 



12151 



Probability 
9.0e-128 



Protein name 



Description 



Locus Name 



| sp:OyDA_E(JoLi 



Acc# 



P27298 



0LtG6PfiPTI£>A££ A, 



ORF Name 



NTID 



NT AA 
— , — , Score 
AAID Length Length 



l4.4.11.&2h...±1...16. I 



7TST 



1W 



Probability 
0.0054 



Protein name 



Description 



Locus Name 



gp:M<3ttt4967 



Acc# 



U34967 



Mycoplasma genitalium repetitive sequence element mgp-r4. 



NT 



AA 



ORF Name 



NTID 



1565 



AAID Length Length 



Score 



TTRT 



Probability 
2.2e-05 



Protein name 



Locus Name 



hypothetical protein 



pir:D72^ 



Acc# 



D72328 



Description 



561 



ORF Name 



NTID 



AAID 



"NTT AA 

— — Score Proba bility 
Length Length 



24645176 ti 41 



7X^5" 



1 . 6e-'/i 



Protein name 



Locus Name 



prot>a£>le oxidoreductase 



|gp : SdFll 



Acc# 
AL132662 



Description 

Streptomyces coeiicolor cosmid Fll. 



ORF Name 



Protein name 



NTID 



AAID 



7190 



KTT AA 

— — Score Probability 
Length Length 



T7T 



FIT 



Locus Name 



Acc# 



Description 



ORF Name 



2££3£&i6...±1...6... 



Protein name 



NTID 



AAID 



7TST 



dCMP deaminase nomolog 



Description 



NT 



AA 



Length Length 
4^4 



T4T 



Score Probability 
1.0e-25 



357 



Locus Name 



pir :G6S470 



Acc# 



C69470 



ORF Name 



Protein name 



NTID 



|3.0.0.^6.i2..±2L...16. I ff57U 



— — Score Probability 
AAID Length Length 



17192 



12028 



Locus Name 



Acc# 



Description 



ORF Name 



Protein name 



NT 



AA 



NTID 



TT7T 



AAID Length Length 




7T51 



Score Probability 
7.4e-30 



Locus Name 



conserved nypotftetical protein aq_i/ii 



pir :C7044y 



Acc# 



C70449 



Description 



562 



ORF Name 



NTID 



34407937 ti ib 



TT7T 



Protein name 



carooxyl- terminal proteinase 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



Locus Name 



pir:F7036y 



Acc# 



F70369 



ORF Name 



Protein name 



NTID 



NT AA 

— — , Score Pro bability 
AAID Length Length 



11830 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



NTID 



— — Score Probabi lity 
AAID Length Length 



3.5.3.7.D.lI2„.c2...7.b. I [TT7^ 



ll.ie-37 



Protein name 

Description 
LARGlS- CONDUCTANCE Mgga?^O^EN5 1TtVB! ChAnNKL 



Locus Name 



spiMSCLJsitJoLl 



Acc# 
P23867 



ORF Name 



NTID 



■NTT AA 

— — Score Pr obability 
AAID Length Length 



|££3.a^...cl...4y... 



1975 



7TST 



Protein name 



Locus Name 



Acc# 



Description 



563 



ORF Name 



NT ID AAID 



Probability 



1782642 c2 77 



Protein name 



Locus Name 



Acc# 



Description 



[MO-HIT 



ORF Name 



NTID 



AAID 



NT AA „ n , , . -, . . 
— — , Score Pro bability 
Length Length 



Protein name 



TT7T 



7TW 



WT 



£5T 



183 



Locus Name 



i.3e-U 



Acc# 



Hypothetical protein 



Description 



pir : JQI02U 



JQ1020 



ORF Name 



2L.7A16.a:/....cl...^b.. 



Protein name 



NTID 



NT AA „ _ , , . n . , 

— — Score Pr obability 
AAID Length Length 



TTHT 



Locus Name 



1.4e-23 



Acc# 



conserved hypothetical protein MTH83 



Description 



|pir:F69210 



F69210 



ORF Name 



Protein name 



NTID 



TTPT 



— — , Score Probability 
AAID Length Length 



7201 



1950 



Locus Name 



S.$e-l76 



Acc# 



threonyl-tRNA syntnetase 



Description 



bir:B743l7 



B75317 



ORF Name 



I47.115.6.2...12...1&., 



Protein name 



NTID 



AAID 



7202 



NT 



AA 



Length Length 




Score Probability 



TIT 



Locus Name 



Acc# 



Description 



[NO-HIT 



564 



NT 



AA 



ORF Name 



NTID 



7032800 cl iU 



AAID Length Length 
TTS 



ITS" 



Score Probability 
i.5e-22 



Protein name 



Description 



Locus Name 



sp:IF3_HAElN 



Acc# 



P43814 



ORF Name 



NTID 



AAID 



"NTT AA 

— — Score Probability 
Length Length 



10741260 c2 il 



74 



FIT 



[ITu~ 



1.9e-06 



Protein name 



Description 



Locus Name 



sp:&L29_BAOsl l 



ACC# 



P04457 



50S kltsoyOMAL JJkO'l'EIM L29 



NT 



AA 



ORF Name 



NTID 



AAID 



H5HT 



17205 



Length Length 



Score Probability 
W5I 



|I.2e-4i 



Protein name 



Description 



Locus Name 



sp:RLU_>YWYi 



Acc# 



P73313 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



M3.5.5.20.2...cl...lb.. 



7206 



735" 



i.7e-62 



Protein name 



Locus Name 



Acc# 



|gp:AfiOl7b08 



AB0175O8 



Description 

Bacillus haiodura ns genomic dna, 32 Kb rragment, completecas. 



565 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



25600512 ci 14 



TS3T 



7207 




210 


633 




486 





2.8e-46 



Protein name 



Description 



Locus Name 



sp:ftL4JaAUiT 



Acc# 



P28601 



505 R160S0MAL PfeOT'EllW L4 



ORF Name 



NTID 



AAID 



31525257 c3 22 



Protein name 



riJDosomal protein siy 



Description 



NT AA 

— — Score P robability 
Length Length 



90 



TTT 



TZZ~ 



2.5e-25 



Locus Name 



Acc# 



pir:H7224y 



ORF Name 



— — Score Probability 
NTID AAID Length Length 



TWTT 



7205 



TTT 



3.6e-85 



Protein name 



Locus Name 



Acc# 



bp:AB01VbOB 



AB017508 



Description 

Bacillus halodurans C-125 genomic dna, 32 JcJd tragment, completecas . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



1988 



97 



7^4" 



TTT 



4 .le-13 



Protein name 



Locus Name 



lsp:RL23__yYNY3 



Acc# 



P73318 



Description 



503 RIB030MAL PROTEIN L23 



ORF Name 



NT ID 



NT AA „ „ -, i_ • -i ■ j_ 
— — Score Probability 
AAID Length Length 



597600V t3 12 



TTTT 



Protein name 



Description 



Locus Name 



Acc# 



(NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



1990 



TTTT 



Length Length 



E7TT 



Score Probability 
1275 



Q.ie-24 



Protein name 



Locus Name 



Acc# 



ribosomal protein Si'/ 



bir:0V^249 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



TZTT 



Length Length 



432 



Score Probability 
2 . 9e-l9 



531 



Protein name 



Description 



Locus Name 



sp:RL22_E^OLi 



Acc# 



P02423 



505 ftlSOaOMAL J^ko'i'S lti L'l'i 



NT 



ORF Name 



NTID 



AAID 



Length Length 



AA 

— Score Probability 



735 



Protein name 



Description 



Locus Name 



Acc# 



567 



ORF Name 



NT ID 



NT AA 

— Score 

AAID Length Length 



122&440b ti 1 



[¥ulT 



YFTT 



Probability 
i7.7e-23 



Protein name 



Locus Name 



Acc# 



P26400 



Description 
£UfrATlVS O-ANTlGmi JftANgPOkT-bik 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



1251537 c2 67 



1^4" 



IBS" 



Score Probability 
TT7TJT3 



ST 



Protein name 



Locus Name 



gp:CEY3$Cl2A 



Acc# 



AL132859 



Description 

Caenorhabditis elegans cosmia 0 9C12A, complete sequence. 



ORF Name 



NTID 



NT AA 
— — , Score 
AAID Length Length 



TUT 



1ST 



Probability 
9.7e-09 



Protein name 



Locus Name 



nypotnetical protein iv.y 



pir:S226l^ 



Acc# 



S22619 



Description 



ORF Name NTID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


miO.S.O^..±1...3.2. lSSS 


721$ 


407 1224 142 


l.le-06 


Protein name 






Locus Name 


Acc# 


putative membrane protein 


gp:SPNmy84 


AJ131984 



Description 

Streptococcus pneumoniae cap3 7 locus. 



568 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



14882818 c2 7b 



|2.9e-i!i 



Protein name 



Description 



Locus Name 



Acc# 



sp:Dmij4At^T 



DISf A- BIDDING PftOTljlN IX (HB) (HUj 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



15822135 ri 31 



TZ2W 



741" 



Protein name 



Description 



Locus Name 



Acc# 



PUTA T IVE GLYC05YL TRANSFER ASE HlOttfett, 



ORF Name 



NTID 



NT 

AAID Length 



±6±5&1..±±..3.Q. 



AA 
Length 





Score Probability 
3.ie-IS 



Protein name 



Locus Name 



Acc# 



WbcD 



gp:YEU46aby 



Description 



Yersinia entero colitica lipopolysaccharxae u-szLde cnainJjiosyntnesis genes. 



ORF Name 



NT 

NTID AAID Length 



AA 



Length 



Score Probability 



TFZT 



i.6e-i^ 



Protein name 



Locus Name 



putative UE>P-(*±cNAc :uncLecaprenylpnospnate 



gp:AF64 874y 



Acc# 



AF048749 



Description 



biosyntnesis operon, complete 



Bacteroides tragxlis capsular polysaccharide 
sequence . 



569 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



1989077 t3 36 



TTTT 



Score Probability 
|2.4e-147 



3W 



Protein name 



Locus Name 



TJDP-glucose-4-epimerase/cLTDP-glucose-4, b 



gp:AF04874i> 



Acc# 



AF048749 



Description 

Bacteroides tragiiis capsular polysaccnaride biosynthesis operon, complete 
sequence . 



ORF Name 



NTID 



NT AA 

— — Score Pro bability 
AAID Length Length 



26556457 r2 lb 



TTTT 



3.6e-33 



Protein name 



Locus Name 



unknown 



Acc# 



AF078135 



Description 



Leptospira borgpetersenn iipopoiysacchanae o-antigen biosyntheticiocus , 
complete sequence. 



ORF Name 


NTID AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


ltB.19±9±..±l..&ti 


2603 7225 


235 


726 129 


| 2.Se-06 



Protein name 



Locus Name 



iipopeptide antibiotic lturm A raosyntnesis 
protein: protein slr0495 :protein slr0495 



pir :S74408 



ACC# 



S74408 



Description 



ORF Name 



|2l5l1M0.u:A..±^...Z9.., 



Protein name 



NT 



AA 



NTID 



AAID 



7226 



Length Length 
— 



Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



570 



ORF Name 



NTID 



23945251 11 11 



^T5TT5" 



Protein name 



NT AA 

— — , Score Probability 
AAID Length Length 



TZZT 



hemolysin- related protein 



Description 



3~5T 



|3.2e-52 



Locus Name 



[pir:F7232^ 



Acc# 



F72326 



ORF Name 



2441tt7.5Ll...c2...B.b... 



Protein name 



NTID 



7228 



NT 



AA 



AAID Length Length 




Score Probability 



Locus Name 



Acc# 



Description 



[MO-HIT 





ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


M426.5M..±1...B. 


2007 


7224 


370 


1113 


594 


l.Oe-57 



Protein name 



Locus Name 



A/G-specilic adenine glycosyiase nomolog ytng | Jpir :A69BU2 



Description 



Acc# 



A69802 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



fZUUW 



72Tu~ 



WTT 



7.4e-30 



Locus Name 



single stranded DNA-omding protein 



gp:SHtf£4098 



Acc# 



U64098 



Description 

Shewanella hanedai single stranded DNA-JDindmg protein [ssb) gene, complete 
cds . 



571 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



2537767 c2 88 



12005 



7251 




105 


315 




75 





0.0077 



Protein name 



Locus Name 



host snut oil virion protein 



gpiCHDNAcMO 



Acc# 



X89471 



Description 

Canine herpesvirus DNA lor capsict and host siiut ott virion proteingenes . 



ORF Name 



25303426 c3 102 



Protein name 



NTID 



2rmr 



NT AA „ , ... 
— , — , Score Probability 
AAID Length Length J ~ 



TZTT 



TT 



234 



Locus Name 



Acc# 



Description 



WO-HIT 



ORF Name 



Protein name 



NTID 



2011 



NT AA 

— — , Score Probability 
AAID Length Length J ~ 



TZTT 



7WT 



Locus Name 



Acc# 



Description 



1NO-HIT 



ORF Name 



3.2£3..1..±3....3.S 



Protein name 



NT 



AA 



NTID 



AAID 



[2uT2" 



727T 



Length Length 
252" 



789 



Score Probability 
|3.6e-2l 



2T5~ 



Locus Name 



rhamnosyl trans terase 



gp:AF097bl9 



Acc# 



AF097519 



Description 



Klebsiella pneumoniae cLTDP-D -glucose 4,6 denydratase 
(rmlB) , glucose -1-phosphate thymidylyl transferase 

(rmlA) , dTDP-4-keto-L-rhamnose reductase (rmlD) , dTDP-4-keto-6-deoxy-D-glucose 
3, 5-epimerase (rmlC) , and rhamnosyl transf erase (wbbL) genes, complete cds . 



572 



NT 



AA 



ORF Name 



NT ID 



34101512 c'A fe8 



purr 



AAID Length Length 
333 



7235 



tut 



Score Probability 
5.2e-13 



TT7 



Protein name 



Locus Name 



Acc# 



conservea nypotnetical protein 


|pir:G7b347 


G75347 


Description 


ORF Name NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


&{&$111Jl±JI 2 014 


7236 


318 


957 


75 


|0.023 


Protein name 


Locus Name 


Acc# 


ribosomal protein S10 


|gp:Ui7003 


U17009 


Description 


Phytophtnora mtestans mitochondrion, 


complete genome 




1 


ORF Name NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


i7.13.n:/.^cl^6.5. 20 lb ' 


7257 


61 


18b 






Protein name 








Locus Name 


Acc# 


Description 














MO-HIT 1 


ORF Name NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


mftm...ti...i& 2 0ib " 


7238 


254 




765 


348 


1.2e-31 



Protein name 



Locus Name 



putative glycosyl transterase 



gp:AF04874y 



Acc# 



AF048749 



Description 



Bacteroides trag ilis capsular polysaccnaricte JDiosyntJiesis operon, complete 
sequence . 



573 



NT 



AA 



ORF Name 



NT ID 



4875677 ±1 y 



purr 



AAID Length Length 
TOTI — 



7239 



Score Probability 
l.2e-45 



£37 



Protein name 



Description 



Locus Name 



IspiST^JiUMAN 



Acc# 



P08842 



StfLFM'K StJLFOHYDKUL ASfi) (AkYljSULFATASE C) 



NT 



AA 



ORF Name 



NTID 



5273377 ti 33 



AAID Length Length 




Score Probability 



7240 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 

wiz — 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



[NO -HIT 



NT 



AA 



ORF Name 



NTID 



aai£2&.»c:i„.&2L 



2020 



AAID Length Length 
TT2 



TIT 



Score Probability 
|2.Se-07 



Protein name 



Locus Name 



Yjctl-liKe protein 



gp:LLLNI&kl 



Acc# 



Y13384 



Description 
Lactococcus lactis msz gene ana 3 ORF 1 s . 



574 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



207 



Score Probability 
0.0011 



Protein name 



Locus Name 



cryptogene protein G4 



tpir:3519I0 



Acc# 



S51910 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



12±0.&lB....zL.£l 



.5.5e-V6 



Protein name 



Description 



Locus Name 



sp:CAFA_HAli!lN 



Acc# 



P45175 



CY T OPLASMIC AXIAL FILAMENT PROTEIN 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



$.9.5.&652..±1J±± I 



1UT 



Protein name 



Description 



Locus Name 



Acc# 



sp:WBBJ_ECOLl 



(EC 2.3.1.-) 



NT 



AA 



ORF Name 



NTID 



iflS5&2LSi„±;£...a 



12024 



AAID Length Length 



Score Probability 
0.0017 



Protein name 



Locus Name 



transposase 



gp : AF038866 



ACC# 



AF038866 



Description 



Bacteroides tragilis transposon Tn5520 transposase (JDipHj anamobilization 
protein BmpH (bmpH) genes, complete cds . 



575 



ORF Name 



NT ID 



NT AA 

— — Score Probab ility 
AAID Length Length 



12351:176 t2 V 



1.4e-138 



Protein name 



Locus Name 



unknown 



Acc# 



AF125164 



Description 



Bacteroides frag ilis 6^&R polysaccharide B (PS B2j jdio sync he sis locus , 
complete sequence; and unknown genes. 



ORF Name 



1610^032 t2 6 



Protein name 



Description 



NT 



AA 



NTID 



AAID Length Length 
fZUl 



Score Probability 



7248 



^3" 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



Protein name 



NTID 



1UTT 



AAID 



TZ4T 



NT 



AA 



Length Length 



— Score Probability 



TuT" 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



— — Score Prob ability 
AAID Length Length 

[7^5 — 



AA 



NTID 



\ixiia6.!b...±2...A I mrs 



1ST 



16.613 



Protein name 



Locus Name 



bp:AB^1078 



ACC# 
AB021078 



Description 
plasmid ColIb-P9 una, complete sequence. 



576 



ORF Name 



NTID 



AAID 



NT AA 

— — , Score Probab ility 
Length Length 



7089527 ±3 li 



Protein name 



7T 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



TZ5T 



NT 



AA 



— - — Score Probability 
Length Length 



Locus Name 



Acc# 



Descr iptf on 



(NO-HIT 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probab ility 
Length Length 



2.flai3..7...±l...i 



Protein name 



2031 



7253 



11089 



Locus Name 



Acc# 



Description 



1N0-HIT 



ORF Name 



Protein name 



NTID 



AAID 



7T5T" 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



K0-H1T 



ORF Name 



NTID 



AAID 



NT AA 

— — Score P robability 
Length Length 



Protein name 



?UJT 



TuT5" 



Locus Name 



Acc# 



Description 



INO-HlT 



577 



ORF Name 



12464906^ tl 2 



Protein name 



NT ID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



7257 



NT AA 

— — Score Probability 
Length Length 



\27T 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



NT AA 

— — Score Pro bability 
AAID Length Length 



\±5.i±i&b.±.±2...B. I pmz 



EI 



|4.5e-18 



Locus Name 



receptor antigen (RagA) 



|gp:PG1130872 



Acc# 
AJ130872 



Description 



Porphyromonas gingivalis W50 receptor antigen iragj locus encodinga major 
immunodominant 55kDa antigen. 



ORF Name 



NTID 



NT AA 

— — Score Pr obability 
AAID Length Length 



purr 



7259 



741 



1ST 



4 .8e-16 



Protein name 



Locus Name 



liSK outer membrane protein precursor : SusC 
protein 



pir : 



Acc# 



JC6027 



Description 



578 



NT 



AA 



ORF Name 



NTID 



1055218^ c3 304 



2038 



AAID Length Length 




TUT 



Score Probability 
5.5>e-i4 



Protein name 



Locus Name 



glucosamine- -rructose- 6 -phospnate 
aminotransferase PAB2201 



bir:P7Wili 



Acc# 



F75212 



Description 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



t03:i^±±.±L.± i l 



7261 



4.4e-i0 



Protein name 



Description 



Locus Name 



sp-.ASNBJjlCOLl 



Acc# 



P22106 



NT 



AA 



ORF Name 



NTID 



AAID 



10.7A40.13....al...i.y.^.. 



2TST0" 



Length Length 




Score Probability 
H |3.0e-0£ 



HIT" 



Protein name 



Locus Name 



hypothetical protein Rvi624c 



pir :P70bb^ 



Acc# 



F70558 



Description 



NT 



AA 



ORF Name 



NTID 



±h$.$±±&&.±±...16. 



12041 



AAID Length Length 
SSI 



Score Probability 
I.3e-25 



mi 



Protein name 



Locus Name 



sp:Tm>FJMUMA 



Acc# 



Q56320 



Description 

(5 1 -PHOSfrHORlBOSyL) ANTHRANi LATE l^QMERASEt, (PRAl) 



579 



ORF Name 



NT ID 



— — Score P robability 
AAID Length Length 



11875400 c2 208 



77T" 



|6.Se-i7 



Protein name 



Description 



Locus Name 



sp:CIRA_ECOLI 



ACC# 



P17315 



COLIC IN 1 RECSPT0& PRECURSOR 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


12l2593l_r3_l44 


2043 


726S 


214 


645 


220 


4 .3e-lfl 



Protein name 



Locus Name 



2,3,4, 5-tetranyaropyricLine-2-carboxy±ate 
N-succinyl transferase- related protein 



pir:H(7224b 



Acc# 



H72245 



Description 



NT 



AA 



ORF Name 
12213.IB.6...±A...14tf... 



NTID 



AAID Length Length 



Score Probability 



2.2e-30 



Protein name 



Locus Name 



tlavodoxin 



tpir:H718bO 



Acc# 



H71850 



Description 



ORF Name 



|1228..7.B.:/...±i....lAfo.. 



Protein name 



NTID 



AAID 



7267 



anthramiate syntnase, component: i 



Description 



N'T AA 

— — Score Probability 
Length Length 



7TF" 



Locus Name 



pir:D72414 



|7.2e-VI 



Acc# 



580 



ORF Name 



NTID 



NT AA 

— — Score Pr obability 
AAID Length Length 



11298876^ ti 14b 



10.026 



Protein name 



Locus Name 



sp:MU5M_PETMA 



Acc# 
Q35543 



Description 

tiADa~ttelQuXN0N2 6xifrOREtiUCTA^ CliAlti b, 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


I4336463_c2_209 


2047 


7265 


340 1023 332 


6.1e-45 | 


Protein name 








Locus Name 


Acc# 


nypotneticai protein 


pir:t)72llb 


D72115 


Description 












ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


li3.5.46A2..±1...8. 


2048 


7270 


449 libO 
















Protein name 








Locus Name 


Acc# 


Description 


















ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


146.3.28.a&...£3....1£2 


2045 


7271 


813 


2442 
















Protein name 








Locus Name 


Acc# 


Description 















[NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



TUT 



Length Length 
TOT" 



Score Probability 
2 . 7e-ll 



Protein name 



Locus Name 



nypotneticai protein 



pir:S76639 



Acc# 



S76639 



Description 



581 



ORF Name 



NT ID 



NT AA 

— — Score Pro bability 
AAID Length Length 



15656537 cJ iuy 



1TTT 



Protein name 



Locus Name 



hypothetical protein aq_1059 



pir:C70:J<*i 



Acc# 



C70391 



Description 



ORF Name 



NTID 



ifiLaa2a&5...c3....zajL.. 



7274 



Protein name 



hypothetical protein 



Description 



NT 



AA 



AAID Length Length 



Score Probability 
12 . 3e-ivv 



11723 



Locus Name 



pir : jgiu^u 



Acc# 



JQ102 0 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 




Score Probability 
|2.4e-l04 



Protein name 



Locus Name 



carbamoyl pnospnate synthetase in 



gp:E i ft24Cill 



Acc# 



Z93780 



Description 



Pugu rubripes genes encoding carbamoyl pnospnate syntnetase ill, myosin 
light chain, MAP2 . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



|i.6e-4:i 



Protein name 



Description 



Locus Name 



spiTRPtiJl'HKMA 



Acc# 



Q08654 



582 



ORF Name 



NTID 



Protein name 



NT AA 

— — , Score Probability 
AAID Length Length 



17277 



nypothetxcal protein HelE 



Description 



I5T 



Locus Name 



^] [pir:T0fl60iT 



10.035 



Acc# 
T08605 



ORF Name 



Protein name 



NTID 



KfT AA 

— — Score Pro bability 
AAID Length Length 



7278 



raw 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


2!±116.1..±i....a:l 


|2057 


7274 




1089 




8.9e-73 

















Protein name 



Locus Name 



Acc# 



Description 



sp:ASGi_ECOLl 



P18840 



(L-A&tfA^ 1) 



ORF Name 



Protein name 



NTID 



"NTT AA 

— — Score Probability 
AAID Length Length 



TIFT" 



Locus Name 



3.4e-^2 



Acc# 



TpsW 



Description 



IgprAflSb^Ob 



AF155805 



Streptococcus suis strain 521& Cps9D (cps^D) gene, partial ccts ; cpsy* 
(cps9E) , Cps9F (cps9F) , and Cps9G (cps9G) genes, completecds; and Cps9H 
(cps9H) gene, partial cds . 



583 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



22033141 cl lb2 



Protein name 



Locus Name 



Acc# 



Description 
IN0-H1T 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



12060 



TIWT 



i.8e-ii 



Locus Name 



receptor antigen (RagA) 



|gp:PGIi30£7T 



Acc# 



AJ130872 



Description 



Porphyromonas gmgivaiis WbU receptor 
immunodominant 55kDa antigen. 



antigen (rag) locus encodinga major 



ORF Name 



NTID 



AAID 



Protein name 



— — Score Probability 
Length Length 



TTnr 



7.5e-l24 



Locus Name 



Acc# 



Description 



spiTk^BJl'lltlMA 



P50909 



T RYPTOPHAN SYNTHASE SETA CHAIN, 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



2062 



Protein name 



TTT 



Locus Name 



Acc# 



Description 



NO-HIT 



584 



NT 



AA 



ORF Name 



NTID 



22850128 cl iyt) 



AAID Length Length 



252 



Score Probability 

on 



5? 



Protein name 



Description 



Locus Name 



IspiSPRCJUWLA 



Acc# 



P36378 



(OSTEONECTIN) — UW) — (kASEMfiNT MElMfeRANEl PROTEIN 6M-4U) 



NT 



AA 



ORF Name 



NTID 



23600812 cl 169 



AAID Length Length 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



(SfO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
TT45 



Score Probability 
i.ie-S7 



57T 



Protein name 



Description 



Locus Name 



sp:YQH£>_h!0OLI 



Acc# 



Q46856 



HYPO T HETICAL OXlbuk E DUCTA^k IN METC-riUF T INTljktiKHIC JmciluN 



ORF Name 



NTID 



— — Score Pr obability 
AAID Length Length 



\21&.5.±s:i2...z±jioa I mzz 



11084 I 



Protein name 



Description 



Locus Name 



sp:PYRi_DICDl 



Acc# 
P20054 



585 



ORF Name 



24022162 ti 147 



Protein name 



Description 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



TZZT 



§.2e-118 



Locus Name 



sp:PU51_VEAyT 



Acc# 



P54113 



ORF Name 



Protein name 



NTID 



NT AA 

— — Score Pr obability 
AAID Length Length 



lM&26.m..±i...n I puss 



im — i imi 



il.0e-% 



Locus Name 



aspartate ammotranst erase related protein I tpir :E6yibtf 



Acc# 
E69168 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



TIWT 



Ii.0e-l74 



Protein name 



Description 



Locus Name 



gp:AB02b342 



Acc# 



AB025342 



Moritella marina genes, complete cds, similar to eicosapentaenoicacia 
synthesis gene cluster. 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



\1±2L1BA±..±2....B± I 



1232 



855 



1.2e-20 



Protein name 



Locus Name 



exopoiyphosphatase 



binEVOJVb 



Acc# 



E70376 



Description 



586 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
TIT 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



ORF Name 



NTID 



NT AA 
— — , Score 
AAID Length Length 



12072 



Probability 
|4.ie-29 



Protein name 



Description 



Locus Name 



sp:THI0 BORBU 



Acc# 



051Q88 



NT 



AA 



ORF Name 



NTID 



\IU7T 



AAID Length Length 
1ST 



Score Probability 
:5.Se-4i 



Protein name 



Description 



Locus Name 



sp:T&PC!__PSEPU 



Acc# 



P20578 



I NDOL E - ^ -GLYCEROL PHO^PHATK SYNTHASE, (le^ri) 



NT 



ORF Name 



NTID 



M5.<i27....cl...m 



AAID Length Length 
[2TS~ 



AA 

— Score Probability 



ITT 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



587 



NT 



AA 



ORF Name 



NT ID 



AAID 



24641962 ti 140 



TZTT 



Length Length 
7^5 



Score Probability 
|6.5e-:itt 



KU7 



Protein name 
Description 

tftVPTO&HA^ SYlffiHfliifcl ALPHA CHAIN, 



Locus Name 



sp:TRPA__METV0 



Acc# 



P14637 



ORF Name 



24545650 c2 26b 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



Length Length 
TTT7 — 



Score Probability 



3F5" 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



1299 



Length Length 
222 



Score Probability 



77 



Locus Name 



Acc# 



IN0-H1T 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



2uTT 



73W 



1255" 



T4F 



§.3e-05 



Locus Name 



probable glycerophosphoryl diester 
phosphodiesterase 



tpir:G7bb0b 



Acc# 



G75506 



Description 



588 



OPT? "NT^mp* 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 




2914202_c3_30S 


2079 


7301 


627 


IS84 


151 


1.2e-23 




Protein name 








Locus 


Name 


Acc# 












_HAEIN 


P43854 


Description 














&HOSt>g&ftlfiOSVLt > YRO&KtOSPHA , rJii AMINOTRANSFERASE) 




(GPATASEy | 
















ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


2926562_l2_99 


20S0 


7302 


352 


1055 


221 


5.2e-lS 



Protein name 



Locus Name 



subunit ot the terminal oxidase witn uriKnown 1 igp : AADOXP2 4H 



Acc# 
Y08730 



Description 

A.ambivalens doxA gene locus witfr aoxD ana ctoxA genes. 



ORF Name 



NT ID 



— — Score Probability 
AAID Length Length 



2M7A&.7....ci..2.S.l 1 EOTT 



HUT 



TTT 



T5T 



S.de-Ii 



Protein name 



Locus Name 



Acc# 



probable tnioredoxm 



pir :T0b27l 



Description 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


1±63£0A:L±1...±±Q. 


20£2 


7304 


692 


2079 


133$ 


1.4e-l36 



Protein name 



Locus Name 



polyphosphate kinase 



I gpiAl^^B 



Acc# 



AF083928 



Description 



Vibrio choierae polyphosphate Kinase tppk) ancL exopoiyphospnatase <ppxj 
genes, complete cds . 



589 



ORF Name 



NTID 



31750887 c3 2bb 



7TuT 



Protein name 



hypothetical protein APE2554 



Description 



NT 



AA 



AAID Length Length 
1 ITS - 



TIT 



Score Probability 
|i.7e-05 



TTT 



Locus Name 



pir :C7248y 



Acc# 



C72489 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 




Score Probability 



SAT 



IT 



0.0024 



Protein name 



Description 



Locus Name 



|gp:CELB04S4 



Acc# 



AF025452 



Caenorhabditis elegans cosmid. B0454. 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



TZT 



TUZT 



|4.ie-52 



Protein name 



Locus Name 



sp : TRPDJ4ETJ A 



Acc# 



Q57686 



Description 
ANTHRAX I LAT E ^M^PHOklBu^VLT RAWSPLlkA^if!, 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



3.3.3.aS.4.1.Z...c2....Z6.6... 



UUT 



TTT 



AST 



TUT 



4 . le-05 



Protein name 



Locus Name 



immunoreactive 36 KDa antxgen PG14 



[gp:AE'14b793~ 



Acc# 



AF145798 



Description 



Porphyromonas gingival is strain W50 immunoreactive 36 JcDa antigenPGl4 gene, 
complete cds . 



590 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
153 



Score Probability 



Protein name 



Locus Name 



Acc# 



Description 
[MO-HIT 



NT 



AA 



ORF Name 



NTID 



3.40.I.V.1^2..±i....±2.b... 



AAID Length Length 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



P74667 



frlAMINOfrl]ytELAl i S f^ MERASfi, (tiAft EiPlMfiRA^hl) 



"NTT mm 

— — Score Pr obability 

AAID Length Length 

I73TI — 



AA 



ORF Name 



NTID 



\iiii$.iOA.±i...i.22. I puss 



I5TT 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



ORF Name 



NTID 



AAID Length Length 



AA 

— Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



591 



ORF Name 



NT ID 



NT AA 
— , — , Score 
AAID Length Length 



JTTT 



Probability 
7.2e-22 



Protein name 



Locus Name 



aspartate aminotransterase related protein 



pirTE^TFF" 



Acc# 



E69168 



Description 



ORF Name 



NT ID 



AAID 



3£U£26.2...c2...2.16. I BfTOJ 



7m" 



Protein name 



provable response regulator 



Description 



NT AA 

— , — , Score Probability 
Length Length 



Locus Name 



pxr:Ti467b 



5.5e-12 



Acc# 



T34675 



ORF Name 



NT ID 



AAID 



NT AA 

— — Score Probability 
Length Length 



Aflftil&i...ci„.ifiSL I prra? 



TUT 



5.2e-45 



Protein name 



Locus Name 



sp:3V[AtfG_£>MDBl 



Acc# 



Q51658 



Description 

METHYLAMINE U T ILIZATION PR OTEIN MAUG PRECURSOR 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length 



±2.19A11..±±...1& 



\ZUT 



5.7e-15 



Protein name 



Locus Name 



JdZIP histictme Kinase 



gp:PE>UY1824^ 



Acc# 



Y18245 



Description 



£>seudomonas putida todX, tod?, todfil, todC2 , todB, toctA, todD,tocLE, tocLG, 
todl, todH, todS, todT genes. 



592 



ORF Name 



5057^1^ rl 4i 



Protein name 



NTID 



NT AA 

— — , Score Proba bility 
AAID Length Length 



73TT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Pro bability- 
Length Length 



a0.6.£12..±l...b, 



12096 



TIT 



T5T 



i.2e-i3 



Protein name 



Locus Name 



PobR protein 



|gp:^U^bi7b>2 



Acc# 



AJ251792 



Description 



protein ana pobA gene torPoJDA 



Pseudomonas putida pobR gene tor pojdr 
protein. 



NT 



AA 



ORF Name 



NTID 



AAID 



nai6..7.a.6....c3....ab.., 



PUTT 



Length Length 
73 



Score Probability 



237 



2.8e-07 



Protein name 



Locus Name 



Acc# 



gp:D9070i 



Description 

Escnencnia coli genomic una. (13 .6 - 14 . 0 mm) 



ORF Name 



NTID AAID 



NT AA 

— — Score Pr obability 
Length Length 



1ET 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



593 



ORF Name 



20203^bl cl bb 



Protein name 



NT ID 



NT 



AA 



AAID Length Length 
7321 



Score Probability 



50" 



273 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 







NT AA 


NT ID 


AAID 


Length Length 


2100 




7322 


1045 3138 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NT ID 



TTUT 



AAID 



TTTT 



NT AA 

— — Score Probability 
Length Length 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 
|243.15..7.ai..±2...2b.., 



Protein name 



Description 



NT ID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



12102 



77 



0.0054 



Locus Name 



|sp:NOLP_kklU> 



ACC# 



P23717 



NODULATION PROTEIN NOLP 



594 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



124727127 c3 102 



TTUT 



|2.ie-20 



Protein name 



Locus Name 



putative oxygen- independent 
coproporphyrinogen 



gp:AF157642 



Acc# 



AF157642 



Description 



Desultitobacterium denaiogenans putative ' 
oxygen-independentcoproporphyrinogen III oxidase (hemN) gene, partial cds; 
Hrd22-1 (hrd22-l) gene, complete cds; and two-component sensor 
histidinekinase homolog (hkhB) gene, partial cds. 



ORF Name 



Protein name 



NTID 



12104 



AAID 



NT AA 

— — Score Probab ility 
Length Length 



255 



Locus Name 



Acc# 



Description 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



2105 



TTZT 



1611 



27£4 



1. le-287 



Protein name 



Locus Name 



mobilization protein C 



gp:AFil&243 



Acc# 



AF118243 



Description 



Bacteroides fragilis mobilization protein c tmobCj gene, compietecas. 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



26£A2±:ih...a±.Al 



T7T* 



:6.7e-i:i 



Protein name 



Locus Name 



RNA polymerase sigma tactor sigz-UKe protein | |g P : AF137263 



Acc# 



AF137263 



Description 



Bacteroides thetaiotaomicron iOS nbosomal protein S16 -liKeprotem, tucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds . 



595 



ORF Name 



3370^427 cl 4i 



Protein name 



NT ID 



NT 



AA 



AAID Length Length 

wm — 



Score Probability 



ITT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NT ID 



AAID 



7TTu~ 



NT AA 

— — Score Probability 
Length Length 



102 



TUT 



Locus Name 



Acc# 



Description 



QsTO-HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID Length Length 
TZT2 



TUT 



Score Probability 
4.0e-55 



Locus Name 



sp: YBDN_ECOLl 



Acc# 



P77216 



HYPOTHETICAL 47. a KB EkO ' l ' tilM IN CaTA-bS BG IMTEkcJBNIC kEciloN 



ORF Name 



NTID 



— — Score Probability 
AAID Length Length 



kaaiaaa..±i.-.i.. I prrs 



Protein name 



Locus Name 



Acc# 



Description 



INO-HTT 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



4103377 i2 2ti 



7333 



JET 



Protein name 



Description 



Locus Name 



Acc# 



IN0-H1T 



NT 



AA 



ORF Name 



NTID 



AAID 



|41&0.1&l...c2...b.8.... 



fZTTT 



Length Length 
T2T 



Score Probability 
|7.3e-^3 



Protein name 



Locus Name 



unknown 



gp:AFll&244 



Acc# 



AF118244 



Description 



Bacteroides rragrlis unknown gene. 



NT 



AA 



ORF Name 



NTID 



AAID 



|440.&$.M...c:l..A6.. 



fZTTT 



Length Length 
£TT5 



5? 



Score Probability 
7.2e-l6 



Protein name 



Description 



Locus Name 



sp:Yk£)Mjil<JoLi 



Acc# 



P77174 



HYPO T HETICAL 21.9 KB PROTEIN IN C5TA -DSBG INTERGENIO mKiloM 



NT 



AA 



ORF Name 



NTID 



AAID 



M^.O^tl^Z ] 

Protein name 

Description 



Length Length 




Score Probability 



Locus Name 



Acc# 



NO-HIT 



597 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



6287687 ri io 



7TTT 



Protein name 



Locus Name 



M protein 



|gp:SSGMT 



Acc# 



X60098 



Description 



Streptococcus sp . { group G) emm gene tor M protein. 



NT 



AA 



ORF Name 



NT ID 



16522533 c2 81 



2116 



AAID Length Length 
532 



Score Probability 
|4.3e-05 



T2T7 



Protein name 



Locus Name 



probaoie transposase tor isibbb 



pir :F7U6/y 



Acc# 



F70678 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 




Score Probability 
0.0087 



Protein name 



Locus Name 



coproporphynnogen III oxidase 



|gp:B^HkcJA 



Acc# 



Y09446 



Description 



B.stearothermopttilus nemN gene (partial) ana nrcA gene. 



— — Score Probability 
AAID Length Length 

[52IT 



AA 



ORF Name 



NT ID 



1^6.33.U.b....C3....^ 



1211$ 



TITS' 



Il.6e-l23 



Protein name 



Locus Name 



5 0 JcD antigen pcii 



|gp:AF14407b 



Acc# 



AF144076 



Description 

Porphyromonas gingivalis strain WbO bu KD antigen PGi gene , complete cas . 



598 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



16603425 ci Id 



T6T" 



i.7e-6* 



Protein name 



Locus Name 



NqrB 



|gp:AFi6^80 



Acc# 



AF165980 



Description 



Vibrio harveyi ^a+- translocating NADH-qumone oxidoreductasecompiex operon, 
complete sequence. 



ORF Name 



Protein name 



NT 



AA 



NT ID AAID Length Length 
TT2 1 



— Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



106$ 



tttt 



l . ie-69 



Locus Name 



sp:BLMH_kAT 



Acc# 



P70645 



BLEOMYCIN HVbUoLA^li!, (BLM MVb KOLA^k) im) 



NT 



AA 



ORF Name 



NTID 



|118.4S.D.b.2....al...4y. I 



AAID Length Length 
— 



Score Probability 




|3.ie-ti!i 



Protein name 



Locus Name 



115K outer membrane protein precursor : Susc 
protein 

Description 



bin 30>&2 l i 



Acc# 
JC6027 



599 



ORF Name 



1953386 t!i 23 



Protein name 



NT ID 



TTZT 



AAID 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 
[MO-MIT 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



13.5£3.5.2&...cl...3.2.. 



TUT" 



|2.8e-05 



Protein name 



Description 



Locus Name 



sp:YWO_HAEIN' 



Acc# 



P44031 



HYPOTHETICAL PROTfitN 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



TIF" 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 
S3 - 



Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



600 



ORF Name 



Protein name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



\116A19A±.±±...l 



Protein name 



NTID 



NT AA 

— , — „ Score Probabi lity 
AAID Length Length 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



TTZT 



AAID Length Length 




7^T 



Score Probability 
l.le-44 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



pir : JC6027 



Description 



Acc# 



JC6027 



ORF Name 



NTID 



AAID 



NT AA 
— , — , Score 
Length Length 



24&£2.D.a:'L.c:L..3.& , 



Probability 
|2.Se-10 



Protein name 



Locus Name 



receptor antigen (RagA) 



gp:PGI130872 



Acc# 



AJ130872 



Description 



Porphyromonas gingival is W50 receptor antigen (rag) locus encoclinga maj 
immunodominant 55kDa antigen. 



or 



601 



NT 



AA 



ORF Name 



NTID 



AAID 



29691536 ±2 12 



TTTT 



7T5T 



Length Length 
7T9 



Score Probability 



P7TT 



Protein name 

Description 
[MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



Ii4i4m...c2...4i I \rm 



Length Length 



Score Probability 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



Z2.±53±2....&3...AZ 



TTTT 



Length Length 
TUT" 



JUE~ 



Score Probability 
TUT 



1.7e-05 



Protein name 



Locus Name 



putative transcriptional regulator 



gp:YPPCPl 



Acc# 



AL109969 



Description 



Yersinia pestis plasmid pPCPl. 



ORF Name 



NTID 



AAID 



NT AA 
T , T Score Probability 
Length Length 



TTT 



TTTT 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



602 



NT 



AA 



ORF Name 



NTID 



13854652 t3 32 



AAID Length Length 
— 



Score Probability 




|2.2e-21 



Protein name 



Description 



Locus Name 



sp:TRA2_£ACFR 



Acc# 



Q45119 



TkANSPOSASE FOR INSERTION SeOUENCE ELEMENT IS21-L1KE 



NT 



AA 



ORF Name 



NTID 



14555510 t3 30 



AAID Length Length 
775$ 



Score Probability 
CT55 



9.£e-37 



Protein name 



Locus Name 



YadS 



|gp:AF198617 



Acc# 



AF198617 



Description 



Aeromonas caviae polar tlagelia locus , complete sequence. 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length *~ 



\TTJT 



TUT 



Protein name 

Description 
KO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



2LUai5Lai2L...CZ...5b. 



2138 



7350 



1341 



3.5e-47 



Protein name 



Locus Name 



lipopolysaccharicte biosynthesis protein bplD 
homolog 



pir :G64487 



Acc# 



G64487 



Description 



603 



NT 



AA 



ORF Name 



NT ID 



25422552 ti 12 



AAID Length Length 
7351 — 



Score Probability 



Protein name 

Description 
KO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NT ID 



26±1±0.11..±1....$. I 



AAID Length Length 
7TS"2 — 



Score Probability 



Protein name 

Description 
[NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NT ID 



2.6.S8.X6.2.5...JL3....3.5... 



AAID Length Length 
73F3 



Score Probability 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
73^ — 



73T" 



Score Probability 

wn — 



10.00035 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



pir : JC6027 



ACC# 



JC6027 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



iO.D.S.5.5.1..±1...10. I PT¥3 



73^5" 



Length Length 
TTu" 



Score Probability 



TST 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



604 



ORF Name 



4087502 cl 45 



Protein name 

Description 
INO-HIT 



NT ID 



NT AA 

, „^ T — ■ — ■ Score Probability 
AAID Length Length ^ 





2144 




736S 




175 


540 



Locus Name 



Acc# 



ORF Name 



Protein name 

Description 
NO-HIT 



NT 



AA 



NTID 



AAID Length Length 

njn — 



Score Probability 



^5" 



Locus Name 



Acc# 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 
73EB — 



TUT" 



Score Probability 
0.005S 



Locus Name 



sp:ZN90_HUMAN 



Acc# 



Q03938 



Description 

ZINC PINTER PROTEIN 30 (ZINC FINOER PROTEIN HTF9) (FRAGMENT) 



ORF Name 
16.5.25.3.&3....C.2...3..7... 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



TWIT 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



605 



NT 



AA 



ORF Name 



NT ID 



32665833 cl 32 



AAID Length Length 
7T7T5 — 



5TT 



T77T~ 



Score Probability 




|3.4e-07 



Protein name 



Locus Name 



immunoreactive 5 3 KD antigen PG123 



|gp:AF14464i 



Acc# 



AF144641 



Description 



Porphyromonas gingival is strain W50 immunoreactive 53 kD antigenPG123 gene, 
complete cds . 



NT 



AA 



ORF Name 



cl 31 



NT ID AAID Length Length 
7T7I — 



Score Probability 



TUT 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



NT 



ORF Name 



NT ID 



AAID 



TTTT 



Length Length 



AA 

— -u Score Probability 



PITT 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NT ID 



AAID 



3.6.5.Z5i3.1.7....c;1...2.a.. 



TT5T 



7T7T 



Length Length 
^5 



Score Probability 



Protein name 

Description 
MO -HIT 



Locus Name 



Acc# 



606 



ORF Name 



Protein name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



TTTT 



ST 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



2153 



AAID 



r7T73" 



NT 



AA 



Length Length 
73— 



Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



UTTT 



NT 



AA 



Length Length 
WD 



Score Probability 



ITS" 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



12155 



AAID 



7377 



NT 



AA 



Length Length 
TZJZ 



Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



NT AA 

— — Score Probab ility 
AAID Length Length 



1TTT 



7ZT 



S.0e-59 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



pir : JC6027 



Acc# 



JC6027 



Description 



607 



NT 



AA 



ORF Name 



NTID 



2451162b ri b 



AAID Length Length 
7T3 - 



[2TT 



— ^ Score Probability 
u7TT£2 



Protein name 



Locus Name 



Acc# 



neat snock transcription t actor HSF21 



pir :S5ybi37 



Description 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 




2158 


7380 


357 


1194 


225 


1.5e-l6 



Protein name 



Locus Name 



dipeptidase nomolog 



gp:AF06U8bb 



ACC# 



AF060858 



Description 



Salmonella dublin regulatory protein CopR icopR) , nisticline Kinase (cops; , 
SPI-4 pathogenicity island containing dipeptidase homolog (pipD) , SopB 
(sopB), PipC (pipC), PipB (pipB) , and PipA (pipA)genes, complete cds; and 
tRNA-Ser gene, complete sequence; andunknown genes. 





ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


llllllZl^tlJl 


2155 


7381 


585 1761 


1153 


5.0e-I18 



Protein name 



Description 



Locus Name 



sp:«K£j3AC^U 



Acc# 



P18159 



PROBABLK PHOSPHOMANNOMUTA^ E, (EMM) 



ORF Name 



NTID 



NT AA 

— — Score Pro bability 
AAID Length Length 



|3.ill3.7.12..±3....^ I 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



608 



ORF Name 



Protein name 



NTID 



NT 



AA 



AAID Length Length 
FT 



Score Probability 



Locus Name 



Acc# 



Description 



MO -HIT 



ORF Name 



NTID 



NT AA 

— — Score Probab ility 
AAID Length Length 



12162 



7TST" 



ITT" 



2.4e-06 



Protein name 



Locus Name 



transmembrane sensor 



gp:At'0bl6yi 



Acc# 



AF051691 



Description 



£>seudomonas aeruginosa stress tactor A (psrA) , ECF sigma tactor { tiul) , 
transmembrane sensor (fiuR) , and hydroxamate-typef errisiderophore receptor 
(fiuA) genes, complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 
7.2e-l6 



Protein name 



Locus Name 



RtfA polymerase sigma tactor SigZ-liKe protein I |gp : AFi3726i 



Acc# 



AF137263 



Description 



Bacteroides thetaiotaomicron 30S rxJDOSomai protein S16 -iiiceprotein, tucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds. 



NT 



AA 



ORF Name 



NTID 



AAID 



12164 



Length Length 
TZW 



Score Probability 
|7.8e-161 



1567 



Protein name 



Description 



Locus Name 



sp : PME_BOk^U 



Acc# 
Q59189 



T 0P0IS0MERA5E I V ^UBUNIT B, 



609 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 





7387 




160 


483 




305 





4.2e~27 



Protein name 



Locus Name 



Acc# 



probable KDQ transterase 



E 



ir:T3b6b2 



T35652 



Description 



NT 



AA 



ORF Name 



NTID 



3.3..7.D.7.7i.l...ci...J.2.. 



AAID Length Length 

Rrai — 



7388 



Score Probability 

1.2e-08 



TTI" 



Protein name 



Locus Name 



Hypothetical protein PH04 74 



pir:E711by 



Acc# 



E71159 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



\4.0!&$.11..±±..X 



TST" 



T14T 



ITT 



|7.4e-30 



Protein name 



Locus Name 



hypothetical protein Jd2 981 



pir :C650§4 



Acc# 



C65084 



Description 



NT 



AA 



ORF Name 



NTID 



Aia£25fl..±i...ii I prro 



AAID Length Length 



7IW 



372 



Score Probability 
3.2e-6i 



Protein name 



Locus Name 



conserved hypothetical protein aq_066 



bir:E7030^ 



Acc# 



E70306 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
TFL 



Score Probability 
|4.0e-38 



Protein name 



Locus Name 



Acc# 



car boxy- terminal processing proteinase 
ctpA, : tail-specif ic endopeptidase Pre 



pir :B6y610 



Description 



610 



ORF Name NTID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


14664017_c3_5 2170 


7392 


209 |627 376 


|1 . 9e-33 


Protein name 


Locus Name 


ACC# 


receptor antigen (RagA; 


gp:PSIii0a72 


AJ130872 


Description 




Porphyromonas gmgivalis WbU 
immunodominant 55kDa antigen. 


receptor 


antigen 


(rag; locus enc 


oamga major 




ORF Name NTID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


2'42S7800_t2_a |2l7l 


7353 | 


398 1197 343 


■"| 4.0e-3l 



Protein name 



Locus Name 



transposase 



gp:AP03£$66 



Acc# 



AF038866 



Description 

Bacteroides rragilis tra nsposon TnSb^O transposase tmpHj anctmocniization 
protein BmpH (bmpH) genes, complete cds . 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


±1L&1L&±±1..±L 


|2172 


7354 


155 


471 


147 


2.5e-10 



Protein name 



Locus Name 



sp : CUTFJilcJoLl 



Acc# 



P40710 



Description 

COHEIR Homeostasis Eko'i'UiN dOTP frRHciUft SOR (Lipoprotein jni^) 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



14&£.10.5..7„.±1...B.. 



TT7T 



7JT 



TUT 



3.le-a6 



Protein name 



Locus Name 



Na+/H+- exchanging protein siri595 :Ma+/a+ 
antiporter :Na+/H+ antiporter 



pir :S7495l 



Acc# 



S74951 



Description 



611 



NT 



AA 



ORF Name 



NT ID 



AAID 



143547^ tJ lb 



7JW 



Length Length 
551 



TT7~ 



Score Probability 

o.oo^o 



517 



Protein name 



Description 



Locus Name 



gp:CEYlilB2<J 



Acc# 



AL132906 



Caenorhabditis elegans cosmid. YIHB2C, complete sequence. 



ORF Name 



NTID 



16S32SS5 ±2 14 



fZTTT 



7397 



Protein name 



nypotnetical protein 



Description 



— — Score Probability 



AAID Length Length 



1ST 



531T 



TUT 



Locus Name 



pir :dyiU2U 



Acc# 



JQ102 0 



NT 



AA 



ORF Name 



NTID 



AAID 



2176 



7398 



Length Length 
£52 



57 



Score Probability 

on 



Protein name 



Description 



Locus Name 



sp : £Pk<J_X±i!NLA 



Acc# 



P36378 



(OSTEON E C T IN) (ON) (BASEMEN T MEMBkANE PROTEIN BM-4UJ 



ORF Name 



NTID 



— — Score Probability 
AAID Length Length 



12177 



751 



TTT7T" 



T5T 



1. le-07 



Protein name 



Locus Name 



KIAAOSbO protein 



gp:AB020bbV 



Acc# 



AB020657 



Description 

Homo sapiens mRNA tor KIAAO&bO protein, complete cas . 



612 



NT 



AA 



ORF Name 



NTID 



AAID 



34181531 cl 33 



7175" 



Length Length 



Score Probability 
0.0058 



Protein name 



Locus Name 



Acc# 



sp:Y414_HAE!N 



Description 

gy&wmii'ieAL p&o'ik in ai64i4 



ORF Name 



5172277 &J> 40 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



2179 



7401 



Length Length 
WZZ I IT4Tn 



Score Probability 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



Locus Name 



Acc# 



Description 



IN0-H1T 



ORF Name 



Protein name 



NTID 



AAID 



hypothetical protein 



Description 



NT AA 

— — Score Probability 
Length Length 



419 



T5T 



3.3e-75 



Locus Name 



Acc# 



G72244 



613 



NT 



AA 



ORF Name 



NT ID 



221b0262 C2 47 



AAID Length Length 

FS^ 



3TT 



Score Probability 
3.5e-bb 



Protein name 



Locus Name 



Acc# 



Description 



ORF Name 




NTID AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


24744l2b_ 


_cl_44 


21S5 740S 


164 


495 


216 


5.0e-24 


Protein 


name 






Locus Name 


Acc# 



macropnage intectivity potentiator 



U92222 



Description 

Legionella spiritensis macrophage intectivity potentiator tmipjgene, 
complete cds . 



NT 



AA 



ORF Name 



NTID 



2184 



AAID Length Length 
— 



ATT 



Score Probability 
|4.9e-i04 



TUTT 



Protein name 



Locus Name 



Na+-translocatmg nadh- ubiquinone 
oxidoreductase, beta chain 



pir :D640b2 



Acc# 



D64052 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



T5T 



1.5e-iu 



Protein name 



Description 
(R.HPA11J 



Locus Name 



sp:T2H2_HAI±iJL>A 



Acc# 



P36433 



614 



NT 



AA 



ORF Name 



ci 42 



NTID AAID Length Length 

nrm — 



12186 



11416 



Score Probability 
II . 5e-5l 



Protein name 



Locus Name 



sp:DEAD_3A(jyU 



Acc# 
P42305 



Description 
£>RO£ABL£ AT^-DfiMNDfiN'T RISfA #2 LIPASE DEAD 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



30105385 c3 61 



T3T 



3.3e-lS 



Protein name 



Description 



Locus Name 



|gp:AB0l4075 



Acc# 



AB014075 



Clostridium histoiyticum genes xor " 
hypoxanthine-guaninephosphoribosyl-transferase (HGPRTase) , GTPase and 12 

ORFs, completeand partial cds . 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



3.41.7.40.6.1...cd...b.b... 



12188 



7410 



|4.1e-61 



Protein name 



Description 



Locus Name 



sp:MO^_HAlfliJM 



Acc# 



P71342 



(MA-ti^R COMPLY dUBU^Il'i' b) 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



T02 



4 . 8e-ub 



Protein name 



Locus Name 



Ma+-translocating nadh- ubiquinone 
oxidoreductase, gamma chain 



|pir:SS$b28 



Acc# 



S65528 



Description 



615 



ORF Name 



14142151 cl 43 



Protein name 
Description 



NTID 


AAID 


NT 
Length 


AA 
Length 




Score 


Probability 


2190 


7412 




109S 




839 


l.ie-tfj* 








Locus 


Name 




Acc# 



sp:SERC_BA<JiJU 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



74TT 



TIT 



Score Probability 
Ifi.le-lcJb 



Protein name 



Locus Name 



D-3-phosphoglycerate dehydrogenase 



|gp:AP07^8Bl 



ACC# 



AF079881 



Description 

Entodmium caudatum T)-3 -ph osphoglycerate dehydrogenase itikna, partial cas . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
TJl 



Score Probability 
1.6e-4i 



Protein name 



Locus Name 



hypotnetical protein 



pir : J01O20 



Acc# 



JQ1020 



Description 



ORF Name 



NTID 



— — Score Probability 



AAID Length Length 



55" 



5.$e-05 



Protein name 



Locus Name 



hypothetical protein MJ160B 



pir:G64S0U 



Acc# 



G64500 



Description 



616 



ORF Name 



12704377 t2 10 



Protein name 

Description 
MO-HIT 



NT 



AA 



NTID 



AAID Length Length 
TUZ — 



Score Probability 



Locus Name 



Acc# 



ORF Name 



lifia3Liflas..±i...fi.. 



Protein name 



NTID 



AAID 



17417 



hypothetical protein 



Description 



NT AA 
T — . , T — , , Score Probability 
Lengtn Lengtn 



TUTT 



Locus Name 



pir : JQ1020 



b.6e-103 



Acc# 
JQ1020 



ORF Name 



lSSt23.15.0....c3....6.1 



Protein name 
Description 

NO-HIT 



NT 



AA 



NTID 



AAID Length Length 
— 



Score Probability 



T7F" 



Locus Name 



Acc# 



ORF Name 



NT 



AA 



NTID 



|20.5.D.0.43.0...±1..3.1 1 



AAID Length Length 
7¥T3 — 



Score Probability 
F72 



0.020 



Protein name 



Locus Name 



P&01314 



gp:AP118084 



Acc# 



AF118084 



Description 
Homo sapiens PR01914 mRNA, complete cds . 



617 



ORF Name 



NT ID 



AAID 



NT AA 
T — T Score Probability 
Length Length 



22860128 12 21 



WT 



0.031 



Protein name 



Description 



Locus Name 



sp : yPkC_XENLA 



Acc# 



P36378 



(OSTfiOtffiMltf) (ON) (feASfiMfiNf MEMSRANfi t'ROT'Slrt BM-40J 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
ITT" 



Score Probability 



Protein name 

Description 
MO-MT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



Tim' 



AAID Length Length 

im — 



Score Probability 



TST" 



Protein name 

Description 
IHO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



a4misi..ci„.3i2„.„ i mvr 



AAID Length Length 
7323 — 



Score Probability 




5 .8e-46 



Protein name 



Locus Name 



conserved hypothetical protein CAB06 2 96 



pir :T17189 



Acc# 



T17189 



Description 



618 



NT 



AA 



ORF Name 



NTID 



34407151 c3 6B 



TZUT 



AAID Length Length 



TU7W 



Score Probability 



7.1e-64 



Protein name 



Locus Name 



low specixrcity L-tnreonine aldolase 



|gp:AE0ai5';7 



Acc# 



AB001577 



Description 

Pseudomonas sp. DNA for low specxticity L-threonine aldolase, complete cds. 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



2211 



Score Probability 

o.ocis 



Protein name 



Locus Name 



Jiypotnetical protein T13D8.29 



|pTr7 



Acc# 



T02292 



Description 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



4.3.zaa3a...c2L...5.i i 12204 



\rnr~ 



7.0e-i§ 



Protein name 



Locus Name 



gp:AP1905S0 



Acc# 



AF190580 



Description 



Pseudomonas syrmgae pv. syrmgae AlgT (algTj gene, complete cds. 



NT 



AA 



ORF Name 



NTID 



AAID 



isiaaaa3„.±i.„4 1 mss 



JTTT 



Length Length 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



IHO-HIT 



619 



ORF Name 



NT ID 



582552 ±3 29 



Protein name 



NT AA 

— , — , Score Probability 
AAID Length Length 1 ~ 



pyruvate deiiycLrogenase 



Description 



1549 



Locus Name 



pir :T34668 



6.3e-159 



Acc# 



T34668 



ORF Name 



Protein name 



NT ID 



NT 



AA 



AAID Length Length 
7¥23 — 



Score Probability 



T/TT 



Locus Name 



Acc# 



Description 
[NO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID 



7.16..7.0.5.Z...C.2...A5. 



Length Length 



fZTT 



Score Probability 
0.(50059 



Locus Name 



[sp:YHVS__YEA3T 



Acc# 



P38853 



Description 

HYPOTHETICAL i3l.l Kb t>ft<MfilKf M RECl04-£OL3 HftfiftflSlKlC REGION 



ORF Name 



NTID 



NT AA 

— — _ Score Probability 
AAID Length Length ^~ 



12209 



7TT 



TuT" 



3 . 3e-06 



Protein name 



Locus Name 



hypothetical protein APE0580 



bir:i)72543 



Acc# 



D72643 



Description 



ORF Name 



NTID 



NT AA 
T — — ^_ Score Probability 
AAID Length Length ^ 



7432 



T5T 



TTZT 



'2.3e-i77 



Protein name 



Locus Name 



Acc# 



hypothetical protein 



pir : JQ102 0 



JQ1020 



Description 



620 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



21519061 12 J* 



7ZTT 



T5T 



T7T" 



3.8e-ii 



Protein name 



Description 



Locus Name 



sp : y611>_METJA 



Acc# 



Q58029 



ORF Name 



NT ID 



226S50W ri lb 



12212 



7434 



Protein name 



amxnotranst erase ^AspC tamiiy; 



Description 



NT 



AA 



AAID Length Length 



— Score Probability 



illdl 



5.5e-76 



Locus Name 



bir:B7032b 



Acc# 



B70325 



ORF Name 



NT ID 



AAID 



2213 



— — Score Probability 
Length Length 

o.o^i 



FT 



Protein name 



Locus Name 



sp:5PRC_XliWLA 



Acc# 



P36378 



Description 

(OSTEONECTIN) (ON) (BAijlilMEN T MEMBkANE PROTEIN bm~4u; 



ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


2162L$.hD....a2..±Z-JL. 


2214 


1416 


306 | 


$21 


551 


3.2e-36 



Protein name 



Locus Name 



sptSPLJicJuiil 



Acc# 



P06995 



Description 

ACID ALDOLASE) ( N - AC ET ¥ LN LI UkAMIN ATE PVR UVATL1 LtfASE) waiass) 



621 



ORF Name 



NT ID 



AAID 



NT AA 

— T — Score Probability 
Length Length ■ £ - 



124225577 c3 127 



7ITT 



Protein name 

Description 
IkJO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NT ID 



AAID 



\2&129.b.&l...al..±0A 1 



Length Length 
TTU7 



Score Probability 
TTL 



■9.0e-7I 



Protein name 



Locus Name 



receptor antigen (RagA) 



gp:PGI130§72 



Acc# 



AJ130872 



Description 



Porphyromonas gingival is W50 receptor antigen Tragi locus encodinga major 
immunodominant 55kDa antigen. 



NT 



AA 



ORF Name 



NTID 



AAID 



TZTT 



Length Length 



2052 



Score Probability 
7TZ 



<S.2e-77 



Protein name 

Description 
DNA PRIMA3E, 



Locus Name 



|sp : E>RIM_CL0A5 



Acc# 



P33655 



ORF Name 



NTID 



AAID 



NT AA o ^ , , , _ , ^ 
— , — , Score Probability 
Length Length 



Em" 



1.3e-38 



Protein name 



Locus Name 



sp : STS__RAT 



Acc# 



P15589 



Description 

SULFATE StJLFOH^DROLASfij (j&YLSULtfAfAStf C) (ASO) 



622 



ORF Name 



NTID 



AAID 



NT AA 
— , — , Score 
Length Length 



26595402 t2 38 



TIFT 



7MT 



1068 



Protein name 



Locus Name 



pttospno-2-cieJiyciro-3-cieoxyheptonate 
aldolase/chorismate mutase 



pir .-A75449 



Description 



Probability 
|4.7e-35 



Acc# 



A75449 



ORF Name 



Protein name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



Locus Name 



3.0e-33 



Acc# 



Description 



sp : PHEA_ERWHE 



Q02286 



ORF Name 



3.S2.3.4aa0....cl...8.8... 



Protein name 



NTID 



NT AA 
T — _ — Score Probability 
AAID Length Length ' L 



TZTT 



744T 



B5T 



Locus Name 



1.3e-38 



Acc# 



Description 



sp : STS _M0USE 



P50427 



SULFATE SULFOHYDROLASE) (ARYLSULFATASE C) (A3C) 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID 



TZTT 



7444 



Length Length 
W5~ 



Score Probability 



Locus Name 



Acc# 



Description 
IMO-HIT 



623 



ORF Name 



NTID 



— — Score Probability 
AAID Length Length 



3930163 c3 l2y 



TTZT 



7445 



TTT" 



167 



| 4.3e-10 



Protein name 



Locus Name 



transmembrane sensor 



gp:AF0Sl^yl 



Acc# 



AF051691 



Description 



frseudomonas aeruginosa stress lactor A (pslA) , &w sigma ractor itiui j , 
transmembrane sensor (fiuR) , and hydroxamate-typef errisiderophore receptor 
(fiuA) genes, complete cds . 



ORF Name 



NTID 



AAID 



NT 

Length Length 



AA 

— , Score 



3$37$27 t3 83 



2224 



7446 



ST" 



Tuir 



TuT" 



Probability 
1.3e-05 



Protein name 



Locus Name 



unknown 



lgp:St>Uby23b 



Acc# 



U59236 



Description 



gynechococcus KJC7^42 nbosomal protein ril ot 3us risosome irpsi j , u^ /i, 
ORF231, ORF341, carboxyltransf erase alpha subunit (accA) ,ORF245, ORF227, and 
GTP cyclohydrolase I (folE) genes, completecds, and ORF2 05 gene, partial 
cds . 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



aaattiAi^c^m | 



T3TT 



PTT 



4 . 7e-12 



Protein name 



Locus Name 



RNA polymerase sigma ractor sigz-liJce protein I igp : AFi3 rzbi 



Acc# 
AF137263 



Description 



Bacteroides thetaiotaomicron 30^ nboso mal protein aib-iiKeprotem f rucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds . 



624 



ORF Name 



NTID 



NT AA 

— — Score Pr obability 
AAID Length Length 



4117167 ci liU 



TT2TT 



TTT 



4.6e-71 



Protein name 



Locus Name 



receptor antigen (RagA) 



bp:PSIii0ti72 



Acc# 



AJ1308 72 



Description 

£orphyromonas gm givalis W50 receptor antigen [rag) locus encodinga major 
immunodominant 55kDa antigen. 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


4675<50l_c2_l26 


2227 


744$ 


453 


1462 


112 


9.1e-l4 



Protein name 



Locus Name 



unknown 



gp: 1)96771 



Acc# 



U96771 



Description 



Prevotella bryan tii putative polygalacturonase, B-l, 4-enaoglucanase, ana 
mannanase genes, complete cds ; and unknowngenes . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



|8.3e~b4 



Protein name 



Description 



Locus Name 



sp:GCHl_>YNYi 



Acc# 



Q55759 



GTE> CTCLOHVDfeOLA^ii! 1, (<jT*p-<Jh- 1) 



NT 



AA 



ORF Name 



NTID 



AAID 



r74^r 



Length Length 



Score Probability 
p^S — 



|1.2e-ll 



Protein name 



Description 



Locus Name 



S p:YCBGJiACjfciU 



Acc# 



P42239 



625 



ORF Name 



NTID 



NT AA 

AAID Length Length Probability 



5553151 ci §7 



TZ7T 



T7T 



4 . 3e-13 



Protein name 



Locus Name 



unknown 



|gp:U5677i 



Acc# 



U96771 



Description 



Prevotella bryantii putative polygalacturonase, B-l, 4-endoglucanase, and 
mannanase genes, complete cds; and unknowngenes . 



ORF Name 



NTID 



AAID 



11553382 ±2 22 



Protein name 



hypothetical protein F19H6.4 



Description 



NT 



AA 



Length Length 
TTT 



Score Probability 
3B 



Locus Name 



pir:T2ll23 



0.00012 



Acc# 



T21123 



NT 



AA 



ORF Name 



NTID 



i&h.o:i±i:L±±..±2 1 Yim 



AAID Length Length 
7¥51 



Score Probability 



Protein name 
Description 

pr^nrr 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



16.116.5.5.5....C2...7.6. I 



Length Length 



Score Probability 

Tn — 



5.2e-25 



Protein name 

Description 
DIHYDROFOLATE REDUCTASE, 



Locus Name 



|sp:DYftJtiei<30 



Acc# 



P04174 



626 



NT 



AA 



ORF Name 



NT ID 



168314885 tl 18 



AAID Length Length 
TZZZ — 



ffU5~ 



TITS' 



Score Probability 

— 



|8.2e-166 



Protein name 



Locus Name 



hypothetical protein 



pir : JQ1020 



Acc# 
JQ1020 



Description 



ORF Name 



±LB£21L2...c±...Z.Z. ...J 



NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


2235 


7457 


291 




875 


112 


0.001B 



Protein name 



Locus Name 



Hypothetical protein 



pir :T29116 



Acc# 



T29116 



Description 



NT 



AA 



ORF Name 



lfift&££lA..±:L..l 



„ mTT ^ , , ™ — _ — _ Score Probability 
NTID AAID Length Length 

P74^5 



TO" 



T2T 



5.6e-07 



Protein name 



Locus Name 



probable oxidoreductase 



pir : F70970 



Acc# 



F70970 



Description 



ORF Name 



NTID 



TFTT 



Protein name 



AAID 



7T5T 



hypothetical protein PH1073 



Description 



NT AA 

t — ^ T — 4-^ Score 
Length Length 



Probability 
|2.6e-07 



Locus Name - 



pir:P71i01 



Acc# 



F71101 



NT 



AA 



ORF Name 



NTID 



2Q9.ai53.1...C3....9.9...... 



12238 



AAID Length Length 
HTZQ 



wnr 



Score Probability 
TUJG — 



|6.3e-104 



Protein name 



Locus Name 



tnymiaylate syntnase 



gp:NGU86637 



Acc# 



U86637 



Description 



Neisseria gonorrhoeae thymictylate synthase (tnyA) gene, completecds . 



627 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



22165923 ci 51 





223S 




7451 190 


57:4 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NT ID 



2.2.8.6lQ12.8....£.2....3.4.. 



TIFT 



AAID Length Length 

rr^i — 



3T" 



\Z5T 



Score Probability 
SI 



0.031 



Protein name 



Locus Name 



sp : SPRC_XENLA 



Acc# 



P36378 



Description 

(OSTfiOtfECTlM) (Otf) (BASfirtfitW ME!m6rAn£! P&OtEttf feM-40) 



ORF Name 



NTID 



AAID 



NT AA 

S^o^e Probability 
Length Length 



^3..7.0A40...±3....3..7„ 



£24lT 



74ST 



TS3T 



Protein name 



Locus Name 



probable oxictorecLuctase 



pir :F70970 



Acc# 



F70970 



Description 



ORF Name 



Protein name 



NTID 



12242 



NT 



AA 



AAID Length Length 




Score Probability 

— 



Locus Name 



sp:YA22_MET i TH 



Description 

PUTATIVE BIOPOLYMER TRANSPORT PROTEIN EXBB H0M0L0G 



&.0e-i7 



Acc# 



027101 



628 



ORF Name 



24509675 13 44 



Protein name 

Description 
NO-HIT 



NT 



AA 



NT ID 



AAID Length Length 



Score Probability 





2243 




7465 




155 


458 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA „ ^ , , . _ . ^ 
— , — , Score Probability 
AAID Length Length J ~ 



\U:i&lb£l.±±..£. I 



7^" 



2.2e-07 



Protein name 



Locus Name 



sideropnore -mediated iron transport protein 



pir :F7l82y 



Acc# 



F71829 



Description 



NT 



AA 



ORF Name 



NTID 



121U011.±1...1& 



AAID Length Length 
— 



Score Probability 
S.9e-35 



377 



Protein name 



Locus Name 



UBE-la 



|gp:AB030BTJT 



Acc# 



AB030503 



Description 
Mus mus cuius mRNA tor UBE-la, complete cds . 



NT 



AA 



ORF Name 
3.Z^2ZQl...tZ.„3.D... 



NTID 



AAID 



Length Length 



Score Probability 



Protein name 

Description 
KO-fllT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



TXFT 



AAID Length Length 
7^3 



Score Probability 




0.00056 



Protein name 



Locus Name 



hypothetical protein PH1889 



pir:t)7l202 



Acc# 



D71202 



Description 



629 



ORF Name 



34504015 ti 8 



Protein name 



Description 



NT 



AA 



NTID 



AAID Length Length 
7T7I5 — 



Score Probability 



713" 



Locus Name 



Acc# 



MO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID 



Length Length 



Score Probability 
|i.4e-49 



5TT 



Locus Name 



bitunctionai snort chain isoprenyl 
diphosphate synthase (idsA) homolog 



bir:F^S3b 



Acc# 



F69535 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



l$.15£S£2.±3...3.b. 1 



8.8e-S>8 



Protein name 



Locus Name 



citrate synthase 



Acc# 



AF088222 



Description 



Lactococcus lactis subsp. iactis citrate synthase, acomtateJtiydratase , ana 
truncated isocitrate dehydrogenase genes, completecds . 



NT 



AA 



ORF Name 



NTID 



AAID 



7T7T 



Length Length 



Score Probability 
I3TI 



Protein name 



Description 



Locus Name 



sp:AStf<iJlArild 



Acc# 
P44337 



REGULATORY PROTEIN ASUC 



630 



ORF Name 



ci 67 



Protein name 



LytB protein 



Description 



NTID 



AAID 



NT 



AA 



Length Length 
TTT 



553" 



Score Probability 
3.9e-40 



Locus Name 



pir:G70449 



Acc# 



G70449 



ORF Name 



AftftISifl™c3L...ICta.. 



Protein name 



Description 



NTID 



AAID 



NT AA 
— , — , Score 
Length Length 



2253 



464 



Probability 
|6.0e-44 



Locus Name 



sp:KCY_BACSU 



Acc# 



P38493 



(CMP KINASE) 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



|S114QfiL2L„.a3L...IIl I 



555" 



3TT 



B.7e-83 



Protein name 



Description 



Locus Name 



sp:K6Pi_THUTH 



Acc# 



P21777 



NT 



AA 



ORF Name 



NTID 



5.8.&^5.3..7...±1...3. I 



AAID Length Length 



780 



Score Probability 
|2.5e-3& 



411 



Protein name 



Locus Name 



sp: YABD_BACSU 



Acc# 



P37545 



Description 

HYPOTHET I CAL 25.2 KB PROTE I N IN ME T 5-KSGA I N TERGENIO REGION 



631 



ORF Name 



829417 ti ii 



Protein name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length 



5T" 



T35" 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



TTTT 



|4.5e-u6 



Locus Name 



hypothetical protein 



|pir:T2Ml6 



Acc# 



T29116 



Description 



NT 



AA 



ORF Name 



NTID 



|itt55SLitt2..±i...tiibL.. 



2258 



AAID Length Length 
TTT51 



Score Probability 
8.4e-22 



Protein name 



Locus Name 



conserved hypothetical protein AF2231 



Description 



Acc# 



G69528 



ORF Name 



Protein name 



NTID 



AAID 



74S1 



NT AA 

— — , Score Probability 
Length Length 



152 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 
TF5 



Score Probability 



53 



Locus Name 



Acc# 



Description 



[NO-HIT 



632 



NT 



AA 



ORF Name 



NTID 



12147561 ci 120 



12251 



AAID Length Length 




Score Probability 
155 



S.ie-iO 



Protein name 



Locus Name 



immunoreactive 53 JcD antxgen PG123 



|gp:AF144641 



Acc# 



AF144641 



Description 



Porphyromonas gingivaiis strain W50 immunoreactive 53 kD antigenPG123 gene, 
complete cds . 



ORF Name 



l2657l§0 c5 207 



Protein name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



74^4- 



T75 1 [TTT7 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



X6.±^.1S.1..±±...11 



Protein name 



NTID 



AAID 



2255 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



7^" 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



633 



NT 



AA 



ORF Name 



NT ID 



1354412 11 29 



AAID Length Length 
T7T 



Score Probability 
TBI 



5.8e-i4 



Protein name 



Locus Name 



RNA polymerase ECF-type sigma tactor sigw 



pir:H69706 



Acc# 



H69706 



Description 



ORF Name 



NTID 



NT AA „ ^ 
— , — , Score Probability 
AAID Length Length 



Protein name 



Locus Name 



hypothetical protein aq_l2 2 0 



pir :C704Ub 



Acc# 



C70405 



Description 



ORF Name 



22A&6.$.1&.±2..&$... 



Protein name 



NTID 



12267 



AAID 



NT 



AA 



Length Length 




Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



NTID 



\2iioa6.iz.±i..±±& I mzz 



Protein name 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



551 



TS^~ 



Locus Name 



6.$e-224 



Acc# 



chaperonm groEL 



Description 



pir:547«0 



S47530 



ORF Name 



NTID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



2±2A2112..xx...& I 



2269 



7491 



1017 



9. 5e-90 



Protein name 



Locus Name 



Acc# 



cysteine synthase 



gp:MLCB22 



Z98741 



Description 



Mycobacterium leprae cosmid B22 . 



634 



ORF Name 



124414553 cl 14^ 



Protein name 



Description 



NT 



AA 



NT ID 



AAID 



TFTT 



Length Length 
153 



Score Probability 



T5T 



Locus Name 



Acc# 



ORF Name 



Protein name 



NT ID 



NT 



AA 



AAID Length Length 



Score Probability 





2271 7493 




181 


545 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



2272 



AAID 



7494 



NT 



AA 



Length Length 

ttt 



Score Probability 



Locus Name 



Acc# 



Description 



INO-HIT 



ORF Name 



2.iS.0.5.3.0.2..±1...16.. 



Protein name 



NTID 



TUT 



hypotnetical protein yugP 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



TJT 



WTT 



Locus Name 



pir:P70011 



1.5e-40 



Acc# 



F70011 



635 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



24806517 t2 49 



i.6e-26 



Protein name 



Locus Name 



dipeptidyi peptidase in 



gp:D89340 



Acc# 



D89340 



Description 



Rattus norvegicus mRNA tor dipeptidyi peptidase III, complete cds . 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



25423S37 c2 154 



7497 



Protein name 



hypothetical protein 



Description 



6 . le-122 



Locus Name 



pir : JQ1020 



Acc# 



JQ1020 



ORF Name 



NTID 



ZS£.B.5.1B.l...al...22A I YTTK 



Protein name 



AAID 



NT AA 

— , — 1 Score Probability 
Length Length 



5.5e-i2 



Locus Name 



Acc# 



Description 
HYPOTHETICAL PROTEIN MJ0374 



sp : Y3 74_METJA 



Q57819 



ORF Name 



NTID 



TTTT 



Protein name 



AAID 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



[NO-HIT 



636 



NT 



AA 



ORF Name 



NT ID 



TZ1W 



AAID Length Length 



Score Probability 
i.le-17 



SIT 



Protein name 



Locus Name 



Acc# 



sp:FUR_CAMJK 



Description 

FERRIC tfPTAKfi kEGtlLAxlOtf £>kOTElJSf (PfiRRlC UPtAKB REGULATOR) 



ORF Name 



NT ID 



AAID 



NT AA 

— — , Score Proba bility- 
Length Length 



cl 142 



ET7T 



1827 



2.2e-l33 



Protein name 

Description 
ATP-Dtl^tlMDEMT DNA HELICAL kK(J(j, 



Locus Name 



spiRECOJiAi^lN 



Acc# 



P71359 



ORF Name 



NT ID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



T3¥" 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



|23.23.S17..±1...1:A 



T5UT 



Length Length 
TUTS 



Score Probability 



1^" 



Protein name 



Description 



Locus Name 



Acc# 



[NO-HIT 



637 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length ^ 



129455156 ci 129 



2282 



7504 



Protein name 



acylamino-acid- releasing enzyme, 
(acyl-peptide hydrolase) (aph) 
( acylaminoacyl -peptidase ) PAB13 0 0 



Description 



T375" 



Locus Name 



bir:H75007 



|2.ie-40 



Acc# 



H75007 



ORF Name 



Protein name 



NTID 



NT AA 

— , — , Score Probab ility 
AAID Length Length 



Locus Name 



Acc# 



Description 



ORF Name 



3.14^.7..7.Q2....C1„.1Z8... 



Protein name 



NTID 



NT AA 
— , — , Score 
AAID Length Length 



273 



WIT 



5T5" 



Probability 
|2.9e-5l 



Locus Name 



immunoreactive 89KD antigen PG87 



gp:AF175722 



ACC# 



AF175722 



Description 



Porphyromonas gmgivalis strain W50 immunoreactive 89KD antigenPG87 gene, 
complete cds . 



ORF Name 







NT 


AA 


NTID 


AAID 


Length 


Length 


2285 




7507 


345 


1038 



Protein name 



Locus Name 



Acc# 



Description 



NO -HIT 



638 



ORF Name 



33238187 c2 iby 



Protein name 



NTID 



AAID 



7F0TT 



NT 



AA 



Length Length 



Score Probability 



TT7IT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



2287 



AAID 



17509 



NT AA 
— — Score 
Length Length 



1521 



Locus Name 



Probability 



Acc# 



Description 



[NO-HIT 



ii : i: 



ORF Name 



3.Mfl.20.iil...ci>...lb.0.., 



Protein name 



NTID 



2288 



AAID 



75Tu~ 



NT AA 

— — Score Probability 
Length Length 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



3.5L7.8.15..7.£...±I...AD.. 



Protein name 



NTID 



12289 



7511 



conserved hypothetical protein 



Description 



NT 



AA 



AAID Length Length 



ITT 



Score Probability 
1.9e-56 



5B3 



Locus Name 



|pxr:D7bbb7 



Acc# 



D75557 



NT 



AA 



ORF Name 



NTID 



AAID 



|19.D.63.&0...±2....b.i... 



Length Length 

pro — 



Score Probability 
0.033 



777 



Protein name 



Locus Name 



hypothetical protexn Y68A4B.3 



pir :T27307 



Acc# 



T27307 



Description 



639 



NT 



AA 



ORF Name 



NT ID 



14022312 13 113 



AAID Length Length 

iwn — 



ST 



Score Probability 




|2.0e-36 



Protein name 



Description 



Locus Name 



sp:CH10_KHKSI 



Acc# 



P42376 



10 KB CHftPEftOtilti (PROTEIN CfrtilO) (gftOTfiM flftOfclii) 



ORF Name 



4695302 c2 17S 



Protein name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



WIT 



Locus Name 



Acc# 



Description 



ORF Name 



Protein name 



NTID 



NT AA 

— , — , Score Probabi lity 
AAID Length Length 



i7.lSu7.&0...±3....8.8. I 



conserved hypotnetical protein MTH83 



Description 



or 



|4.le-l3 



Locus Name 



pir: F69210 



Acc# 



F69210 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



12294 



1161 



■1.3e-25 



Protein name 



Locus Name 



Acc# 



conserved hypotnetical protein 



(pir:C72340 



C72340 



Description 



640 



NT 



AA 



ORF Name 



NT ID 



14883437 ±1 15 



AAID Length Length 
TZBI — 



Score Probability 
|3,0e-97 



Protein name 



Locus Name 



sp : PURA_YEAST 



ACC# 



P80210 



Description 

ADENYLOSUCCINATE Sttfflffi'tASJii, (lMt>- -ASP ARTATE LttiAiiifl) 



ORF Name 



NT ID 



526£37 ti 96 



Protein name 



probable glycosyl nydrolase 



Description 



NT 



AA 



AAID Length Length 
4TT 



TTTT 



Score Probability 
3.ue-46 



Locus Name 



bir:«6467 



Acc# 



T36467 



ORF Name 



Protein name 



Description 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



31T 



?ZUT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



|£43.2Ll8....Gl..m I [2793 



5.9e-42 



Protein name 
Description 

HYPOTHETICAL PROTEIN HI1246 



Locus Name 



|sp:YC46_MAbllM 



Acc# 



P44135 



641 



NT 



AA 



ORF Name 



NTID 



6462751 t'2 bU 



AAID Length Length 
TIT 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



.7.222125....cl...2IS. I OT 



Length Length 
— 



Score Probability 
i.6e-52 



Protein name 



Locus Name 



putative secreted, protein 



gp:5CMii 



Acc# 



AL133278 



Description 

Streptomyces coelicolor cosmid. Mil. 



ORF Name 



NTID 



AAID 



NT AA 
— — , Score 
Length Length 



aimi...ciL..iaa i pr 



7TT 



TTT 



Probability 
|5.4e-l0 



Protein name 



Locus Name 



Heme receptor 



gprVlMUTA 



Acc# 



L27149 



Description 
Vibrio cnolerae neme receptor (hut A) 



gene, complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



7ZZT 



Length Length 
— 



VTT 



Score Probability 
1.5e-6b 



Protein name 



Description 



Locus Name 



sp : SYHJIUMAW 



Acc# 



P12081 



(UISRS) 



642 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length 



Protein name 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



NT ID 



NT AA 

— — , Score Probability 
AAID Length Length 



Protein name 



7526 



IFF* 



3.2e-0S 



Locus Name 



Acc# 



response regulator 



Description 



gp:SPAJ6a<*6 



AJ006396 



Streptococcus pneumoniae rr07 ana hk:07 genes; two component system07, 



ORF Name 



NTID 



NT AA „ _ , , . _ . . 
— — , Score Probability 
AAID Length Length 



iiaa&5.£ia...ci...i I [s^s 



7527 



1115 



Protein name 



Locus Name 



|6.2e-ll3 



Acc# 



£>eta-glucosidase 



gp:AF006658 



Description 



AF006658 



Bacteroides tragiiis Joeta-giucosiaase gene, complete cds. 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



643 



ORF Name 



1276700 ti i 



Protein name 



NT ID 



NT 



AA 



AAID Length Length 




Score Probability 



TUT" 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NT 



AA 



NT ID 



AAID Length Length 
— 



7530 



TUT 



Score Probability 
7.0e-I76 



T7W 



Locus Name 



115K outer membrane protein precursor : Susc 
protein 



Description 



pir : JC6027 



Acc# 



JC6027 



ORF Name 



NT ID 



NT AA 

— — Score Probability 
AAID Length Length 



±6£A01ll...a±..±16. 



TZTT 



1.4e-39 



Protein name 



Locus Name 



hybrid, histidme kinase 



bp:AF029704 



Acc# 



AF029704 



Description 



Dictyostelium discoideum hybrid histidme Kinase (dhkD) mRNA, complete cds . 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



±$5.6A0.11...zl..±ll I 



t^tt 



Twer 



WZT 



|2.1e-20 



Protein name 



Locus Name 



probable chromate transport protein 



pir :G71379 



Acc# 



G71379 



Description 



644 



ORF Name 



NT ID 



NT AA 
— — Score 
AAID Length Length 



20830137 c2 13b 



2311 




7533 




459 


1380 


267 



Probability 
7.9e-25 



Protein name 



sucrose transporter l 



[gp 



Locus Name 



Acc# 



AF191024 



Description 

Asarina barclaiana sucrose transporter 1 CSUTl) mRNA, complete cds. 



ORF Name 



NTID 



NT AA 
— , — , Score 
AAID Length Length 



22534^33 c3 175 



TTTT 



7534 



1294 



1005 



Probability 
|2.Se-207 



Protein name 



Locus Name 



sp : £Uk4_DkOME 



Acc# 



P35421 



Description 

(AD E NOSINE- 2) ( FOAMS) ( FOkM V LGL YC I NAM ID S RIBOTlbE i^M'HIilTA;^) 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



STIFF" 



Score Probability 
13 .6e-62 



Protein name 



Locus Name 



cation etnux system tAcrB/AcrD/AcrF tamiiyj I Ipir :F70342 



Acc# 



F70342 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



TSTT 



Length Length 
4TT 



Score Probability 



TTT4" 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



645 



ORF Name 



Protein name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



75TT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



AAID 



NT AA 
— — , Score 
Length Length 



Probability 
li.ie-21 



Protein name 



Locus Name 



cation err lux system (czcB-liicej 



pir :C7041b 



Acc# 



C70415 



Description 



NT 



AA 



ORF Name 



NTID 



12317 



AAID Length Length 
TTT 



Score Probability 
!5.2e-17i 



Protein name 



Locus Name 



sp:YVDKJSACaU 



Acc# 



006993 



Description 

HYPOTHETICAL W . 3 KB £>fe0TB3CJsf flsT CLp£-CrH ^TeIr^ SNIC RMcjiOJ^ 



NT 



AA 



ORF Name 



NTID 



AAID 



2318 



Length Length 
T7 - 



— Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



646 



ORF Name 



I2&892568 fi y± 



Protein name 



NTID 



NT AA 

— — , Score Probabil ity 
AAID Length Length 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



transcription regulator, Lad family 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



Locus Name 



|1.4e-42 



Acc# 



F72282 



ORF Name 



NTID 



NT AA 

— — Score Proba bility 
AAID Length Length 



l$AAb.6.2..±±..M. 



12521 



Protein name 



Locus Name 



YngK 



Acc# 



AF184956 



Description 



Bacillus subtil is mycosubtilin operon, complete sequence. 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



|42l2l.7.7.6....c3....1B.1.. 



3TT 



Score Probability 
|4.7e-234 



Protein name 



Locus Name 



excmuclease ABC cnaln AruvrA protein 



pir :H65l57 



Acc# 



H69157 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



2323 



7^" 



77T 



T5T 



7.9e-26 



Protein name 



Locus Name 



Acc# 



H. intluenzae predicted coding region HI112 7 



Description 



Haemophilus influenzae fed section 107 ot 163 ot the completegenome . 



ORF Name 



NTID 



NT AA ^ _ , , . . . . 
— — Score Pr obability 
AAID Length Length 



cl 126 



Protein name 



pro&aJole chromate transport protein 



Description 



l.le-2l 



Locus Name 



bxr:G70068 



ACC# 



C70068 



ORF Name 



NTID 



NT AA 

— — Score Pr obability 
AAID Length Length 



Sliu.7.12...cl...l^.. 



2325 



75TT 



5.2e-ii 



Protein name 



Locus Name 



immunoreactive 52KD antigen PG41 



|gp:APi7B7l6 



Acc# 



AF175716 



Description 



Porphyromonas gmgivalis strain WbO immunoreactive b2KD antigenPG4i gene, 
complete cds. 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
S3 - 



Score Probability 



270 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



648 



NT 



AA 



ORF Name 



NTID 



828142 tl 4 



\77TT 



AAID Length Length 

rres — 



Score 



T7T 



Probability 
|2.ie-34 



Protein name 



Locus Name 



YngK 



gp:AF1849b6 



Acc# 



AF184956 



Description 



Bacillus subtilis mycosufctilin operon, complete sequence. 



ORF Name 



NTID 



AAID 



NT AA 
— — Score 
Length Length 



17048551 c3 £0 



TTZT 



Probability 
|l.2e-32 



Protein name 



Description 



Locus Name 



sp:AMM_S'lRLl 



Acc# 



Q11010 



(ALANINE AMIMOPEM'lbAiJl*!) 



ORF Name 



NTID 



AAID 



NT AA 
— — Score 
Length Length 



2l6A&S2b....a±...5A I 



52T 



TB3" 



Probability 
2.0e-07 



Protein name 



Locus Name 



isp:AAPl_YEAST 



Acc# 



P37898 



Description 

ALANINE/ AR^tNlNS AMlNOPSPl'ibAs^!, 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 
|2.1e-S5 



Protein name 



Locus Name 



hypothetical protein SC4H2.17 SC4H2.iv 



pir :T351l6 



Acc# 



T35116 



Description 



649 



NT 



AA 



ORF Name 



NTID 



25355961 cl 47 



T5TT 



AAID Length Length 
FT7TT 



Score Probability 
2.Ie-07 



XT? 



Protein name 



Locus Name 



receptor antigen B (RagB) 



gp:Paiiioa72 



Acc# 



AJ130872 



Description 



£>orphyromonas gingivalis W50 receptor antigen (rag) locus encodmga ma^or 
immunodominant 55kDa antigen. 



ORF Name 



NTID 



25350375 c3 by 



7^4 



Protein name 



nypotneticai protein TP0851 



Description 



NT 



AA 



AAID Length Length 
T4U1 — 



Score Probability 
.l.de-iO 



Locus Name 



bir:C7l274 



Acc# 



C71274 



ORF Name 



|lii^6.0.b.:/....cl..Ay. 



Protein name 



Description 



NT 



AA 



NTID 



2333 



AAID Length Length 




Score Probability 



7555 



TIT 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



7556 



|4.6e-&6 



Locus Name 



115K outer membrane protein precursor : Susc 
protein 



pir: JC60^7 



Acc# 



JC6027 



Description 



650 



ORF Name 



NTID 



NT AA 
— — Score 
AAID Length Length 



14738761 c2 bb 



Probability 
|I.2e-166 



Protein name 



Locus Name 



Acc# 



tumarate nyclratase, tumB, 
iron- dependent : fumarase B 



foir:M4bll 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



T7W 



FIT 



ffTTT 



3.ie-38 



Protein name 



Locus Name 



Acc# 



hypothetical protein F36H12.3 



pir:T334b7 



T33457 



Description 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length J - 



Protein name 



\TTT 



6.5e-38 



Locus Name 



Acc# 



hypothetical protein F36H12.3 



Description 



|pir:T334b7 



T33457 



ORF Name 



NTID 



NT AA 

— — , Score Probab ility 
AAID Length Length 



Protein name 



2338 



7560 



TSTT 



567 



Locus Name 



Acc# 



hypothetical protein F3 6H12.3 



Description 



|pir:M:i4b7 



T33457 



ORF Name 



Protein name 



NT 



NTID 



AAID 



2339 



Length Length 



AA 

— , Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



651 



ORF Name 



NTID 



10742905 r2 ^0 



Protein name 



conserved Hypothetical protein 



Description 



NT 



AA 



AAID Length Length 
1351 



Score Probability 
4.2e-05 



Locus Name 



pxr :A7222l) 



Acc# 



A72220 



ORF Name 



Protein name 



CysQ protein 



Description 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



1ST 



|3.Se-63 



Locus Name 



foir:A7(mi> 



Acc# 



A70330 



ORF Name 



Protein name 



NTID 



iimfi&i&...ci...a4ii I 12^42 



NT 



AA 



— , Score Probability 
AAID Length Length 



T5T 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



NT AA 

— — Score Prob ability 
AAID Length Length 



1ST" 



Locus Name 



16 .Oe-67 



Acc# 



Description 

HYPOTHETICAL 66.7 Kb PRO T E I N ^LL0640 



S p:Y64 0_£Y]tfY3 



P72958 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



I1M8.5.6.1..C1...15.1 1 



7^" 



7TF" 



|i.5e-70 



Protein name 



Locus Name 



Acc# 



hypothetical protein 



pir : JQ±020 



JQ102 0 



Description 



ORF Name 



12857817 ±2 54 



Protein name 



otnG protein 



Description 



NTID 



NT AA 

— ■ — , Score Probability 
AAID Length Length 



TTT 



FIT 



T01 



0.0055 



Locus Name 



pir :S/uyb4 



Acc# 



S70954 



ORF Name 



Protein name 



galactoJcmase 



Description 



NTID 



NT AA 

— — Score Pr obability 
AAID Length Length 



1176 



i.4e-67 



Locus Name 



ir:C722W 



[pir: 



Acc# 



C72283 



ORF Name 



146A&3.12...ca...l2SL 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



7569 



3.0e-33 



Locus Name 



immunoreactive 42kD antigen PG33 



gp:AF , l7^^1b 



ACC# 



AF175715 



Description 

Porphyromonas gingivalis strain WbO immunoreactive 42KD antigenpc^3 gene/ 
complete cds . 



ORF Name 



NTID 



— — Score Probability 
AAID Length Length 



1S.8.0.1S.0.:L±1..3.2.. 



7570 



MI 



'9.9e-ld 



Protein name 



Locus Name 



glutamme ABC transporter, periplasmic 
glutamine -binding protein (glnH) homolog 



Acc# 



G69278 



Description 



653 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



7571 



T7T 



Score Probability 



JT2 



Protein name 



Locus Name 



hypotneticai protein 



Description 



pir : jgiu^u 



Acc# 
JQ1020 



ORF Name 



ll±&l...al..:21±.. 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



Length Length 
TS5 



Score Probability 



Locus Name 



Acc# 



MO-HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



757T 



— — Score Probability 
Length Length 

— 



1ST" 



Locus Name 



Acc# 



[MO -HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



2352 



17574 



Length Length 
— 



Score Probability 



TIT 



Locus Name 



Acc# 



NO- HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



2353 



17575 



Length Length 



— Score Probability 



£7 



Locus Name 



Acc# 



INO-HIT 



654 



UKr INdltic 


NT ID 


NT 

AAID Length 


AA 
Length 


Score 


Probability 


22470301_t2_73 




7576 131 396 


71 


0 .026 


Protein name 








Locus Name 


Acc# 


unknown protein 


gp : B AC AT PA 




Description 




B.megatenum ATP syntnase i,a 
genes, complete cds, and ORF . 


,c,b, delta, alpha, gamma, beta anaepsiion suoumt 




ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


^roDdDiiiuy 


22<S£0002_r3_l2£ 


2355 


7577 


103 312 






Protein name 








Locus Name 


ACC# 


Description 














NO-HIT 












1 
1 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


Zl&6.0.Xl&..±l...Xilh 


2355 


757§ 


S3 252 


54 


0.631 


Protein name 








Locus Name 


Acc# 










spiSPRCJUWLA 


P36378 


Description 














(OSTEONECTIN) (ON) 


(BASEMENT 


MEMBRANE PROTEIN 


BM-40) 




i 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


llZ.BAL&Utl.^ 


2357 


7579 


305 








Protein name 








Locus Name 


Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Pro bability 
Length Length 



3.4e-30 



Protein name 



Description 



Locus Name 



sp:GLUP_likUAii 



Acc# 
Q44623 



GLUCOS£/GaLaCtO££ HuufStOR'i'KR 



ORF Name 



23907$S6 rl 22 



Protein name 



NTID 



AAID 



75ST 



NT 



AA 



Length Length 
B52 



Score Probability 



FT 



Locus Name 



Acc# 



Description 



WO-HlT 



ORF Name 



Protein name 



NTID 



AAID 



hypothetical protein sirinv 



Description 



NT AA 

— — Score P robability 
Length Length 



4 .4e-48 



Locus Name 



pir :S744tfU 



Acc# 



S74480 



ORF Name 



Protein name 



NTID 



AAID 



12361 



— — Score P robability 
Length Length 



iii5i 



|6.1e-74 



Locus Name 



Isp : GALM__A(Ji(JA 



Acc# 
P05149 



Description 
ALDOS E l- E PIMERA&E PRECURSOR, (MUTAROTASE) 



656 



NT 



AA 



ORF Name 



NT ID 



124242327 ±3 143 



AAID Length Length 
17584 



[T7T 



Score Probability 
|T^5 



|i.3e-0<> 



Protein name 



Locus Name 



sp:RFAY_XAkl(Jl> 



Acc# 
P46358 



Description 
frkOSABLK ktfA P0LYMijkAS3 Sl^MA FACTOR RFAY 



ORF Name 



NTID 



NT AA 

— — Score Probabil ity 
AAID Length Length 



24255316 c3 239 



Protein name 



Description 



Locus Name 



Acc# 



KfO-HlT 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



TTT 



1.4e-l6 



Protein name 



Locus Name 



vitellogenin 



gp:CHKVlTH 



Acc# 



K02113 



Description 



Gallus galius vitellogenin gene coamg tor pnosvitm, exons 23 ana^4 . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
OTT 



Score Probability 



7587 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



657 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



2366 



TUT 



8.5e-13 



Protein name 



Locus Name 



hypotnetical protein TMiVba 



pir:G72214 



Acc# 



G72214 



Description 



ORF Name 



NTID 



2367 



7583 



Protein name 



sugar-phosphate isomerase 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



4T5TT 



TFT 



l.le-33 



Locus Name 



BTrTTT77^r 



Acc# 



H72296 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



\l&A±5&l:L.al...lll I 



[TOT 



153" 



O.OOlS 



Protein name 



Description 



Locus Name 



Acc# 



sp:Y8$6_HASlN 



HYPOTHETICAL PkOTEIN HI0896 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



2369 



7591 



5~21T 



1587 



FT" 



0.0075 



Protein name 



Locus Name 



Acc# 



outer membrane protein 



|gp:HEAOMW>lH 



Description 

Haemophilus influenzae outer membrane protein (OMPPi) gene , complete <jds . 



658 



NT 



AA 



ORF Name 



NTID 



\TTTU~ 



AAID Length Length 
T75~ 



7^ 



TITS" 



Score Probability 
9.5>e-35 



377 



Protein name 



Locus Name 



immunoreactive 42kD antigen 



[gp:AF1757lb 



Acc# 



AF175715 



Description 



£>orphyromonas gingivalis strain W50 immunoreactive 42KD antigenPG3 3 gene, 
complete cds . 



NT 



AA 



ORF Name 



NTID 



— — , Score 



AAID Length Length 



c;i 271 



2371 



7593 



^58 



T7T~ 



Probability 
l.Oe-5* 



Protein name 



Locus Name 



PksB 



gp:AF0l9986 



Acc# 



AF019986 



Description 



Dictyostelium discoideum pksb tpksBj mRNA, complete cds. 



ORF Name 



NTID 



NT AA 

— — Score Pro bability 
AAID Length Length 



2372 



7594 



2.2e-$4 



Protein name 



Locus Name 



Acc# 



sulfate aaenyiyitransterase, small 
chain: ATP -sulfurylase : sulfurylase 



pir :D6bUbb 



Description 



ORF Name 



Protein name 



NTID 



2373 



7^ 



NT 



AA 



AAID Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



[NO-HIT 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



2560026^ ci 278 



7596 



F7I~ 



|S.6e-13 



Protein name 



Locus Name 



ATP suiturylase small suJDumt 



gp:AF03b60a 



Acc# 



AF035608 



Description 



£>seudomonas aeruginosa ATP suiturylase small subunit (cysD; ana 
ATPsulfurylase GTP-binding subunit/APS kinase (cysN) genes, completecds. 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 




Score 


Probability 


257$2l27_i3_l24 


2375 


75S7 


§4 


255 








Protein name 








Locus 


Name 


Acc# 


Description 
















NO -HIT | 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


16.1^.16.^1^11^ 


2376 




7598 


4di 


1476 




1334 


3.$e-l36 



Protein name 



Locus Name 



ATP suiturylase GTP-bmaing surmnit/APS 
kinase 



gp:AF(I>3^0^ 



Acc# 



AF035608 



Description 



Pseudomonas aeruginosa ATP suiturylase small suSunit ^cysD; ana 
ATPsulfurylase GTP-binding subunit/APS kinase (cysN) genes, completecds, 



ORF Name 


NTID AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


2tt5.2J.b.a...zX...±b.'<L 


2377 7599 


82 


245 


61 


0.04& 


Protein name 






Locus 


Name 


Acc# 








sp:C0X3 


JPYLL1 


Q37600 


Description 













660 



ORF Name 



NTID 



2929aii0b C2 261 



Protein name 



7600 



NT 



AA 



AAID Length Length 



Score Probability 



|¥3F" 



Locus Name 



3.5e-16 



Acc# 



nypotneticai protein 



Description 



pir:S76868 



S76868 



ORF Name 



Protein name 



NTID 



\ZTTT 



NT AA 

— — Score Probability 



AAID Length Length 



TuTT 



2.1e-144 



Locus Name 



Acc# 



nypotneticai protein 



Description 



bir:J01^0 



JQ102 0 



ORF Name 



Protein name 



NTID 



AAID 



— — Score Pr obability 
Length Length 



558 



Locus Name 



9.6e-37 



Acc# 



conserved nypotneticai protein yKnA 



Description 



pir :F698b7 



F69857 



ORF Name 



Protein name 



NTID 



12381 



AAID 



— — Score Probability 
Length Length 

— 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



3.2l42l25.5.1..±I...A1.. 



Protein name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



0.044 



Locus Name 



Acc# 



Description 



sp:YC8fe_METJA 



P81319 



HYPOTHETICAL WtOTUlN MJ12«2.2 



661 



NT 



AA 



ORF Name 



NTID 



TITT 



AAID Length Length 



TTTTT 



Score Probability 
|4.ie-i00 



552 



Protein name 



Locus Name 



hypothetical protein TM1759 



E 



ir:H7^14 



Acc# 



H72214 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
559 



Score Probability 



7606 



TTT 



Protein name 



Description 



Locus Name 



Acc# 



|N0 -HIT 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



12385 



TTT 



WZT 



5T 



0.039 



Protein name 



Locus Name 



PfEMPl fragment £>FBl045w 



pir:F71600 



Acc# 



F71600 



Description 



ORF Name 



NTID 



— — S core Probability 
AAID Length Length 



|3A4D..15.3.I..±3....1^ I 



TTT 



mr 



|2.5e-22 



Protein name 



Locus Name 



Acc# 



|sp:YI^S_BA(JSU 



Description 

HYPOTHETICAL JV.b L>kOTEl>J IN DLl^A-MPRb INTEk GEislicJ kkGiuN 



662 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



7609 



T7T 



ITT 



0.0016 



Protein name 



Locus Name 



complement cv 



|gp:AF16^74 



Acc# 



AF162274 



Description 

Sus scrota complement C7 mRNA, complete eels 



ORF Name 



136226562 cl lyo 



Protein name 



NTID 



NT AA 

— — Score Proba bility 
AAID Length Length 



Locus Name 



Acc# 



Description 



NO-Hltf 



ORF Name 



Protein name 



— — Score Probability 
NTID AAID Length Length 



7ZTT 



6.7e-l00 



Locus Name 



sp:RriLEl_hlC 4 OLl 



Acc# 



P25888 



Description 

PUTATIVE A T P-DEJ^NDEWT mk M ELil'AriK kHLK 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


3.$.m3.D....c3....26.& 


2390 


7612 


322 


999 


458 


i.7e-46 


Protein name 








Locus Name 


Acc# 



|pir:Uii>y4B 



H69848 



Description 



663 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



7ZTT 



558 



Protein name 



Locus Name 



2-oxoacid- -terredoxm oxidoreductase, beta 
chain: 2-oxoisovalerate oxidoreductase alpha 
chain (mi si. dent if ication) 



Description 



pir :B6yiy4 



1 .4e-25 



Acc# 



B69194 



ORF Name 



40888 cl 200 



Protein name 



NTID 



NT AA 

— — Score Pr obability 
AAID Length Length 



7ST4" 



T55" 



Locus Name 



l.le-26 



Acc# 



Description 



sp:£>EF_THh!MA 



P96113 



DEFORMYLA^E) 



ORF Name 



Protein name 



NTID 



2393 



— — , Score P robability 
AAID Length Length 



7615 



Locus Name 



i.3e-i0ii 



Acc# 



putative transketolase 



Description 



gp:BOUlbl74 



U15179 



Sacteroides ovatus arabinosicLase (asctll) gene, complete cds anaputative 
transketolase, partial cds. 



ORF Name 



4145.3.3.1..±1...^... 



Protein name 



NTID 



AAID 



NT AA 

— — Score Prob ability 
Length Length 



TT3T" 



Locus Name 



|3.9e-63 



Acc# 



Description 



sp:TKT_BAc!^U 



P45694 



TPlAN^KUTOLA^, 



664 



NT 



AA 



ORF Name 



NTID 



14485917 ta b6 



2395 



AAID Length Length 



7ST7 



Score Probability 



S3 



Protein name 



Locus Name 



XylS/AraC! tamily transcriptionai regulatory | lgp : AF039^07 



Acc# 



AF039207 



Description 

Listeria monocytogenes putative transcriptional attenuator leaaerpeptiae 
(attM) , LapA (lapA) , XylS/AraC family transcriptionalregulatory protein 
homolog (lapB) , and NADH- dependent dehydrogenasehomolog (lapC) genes, 
complete cds . 



ORF Name 


NTID AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


47l90ll_c3_277 


2396 7618 


205 


618 


476 


3.2e-45 


Protein name 






Locus Name 


Acc# 








sp:NOE>Q_AZOBk 


P28604 


Description 












-SULPtmYLASE) (MODULATION PROTEIN Q) | 


ORF Name 


NTID AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


mm£x..±jL...ui 


2397 7619 


89S 


2697 




| 4.0e-S5 


Protein name 






Locus Name 


Acc# 








sp : BGALJTHLWT 


P77989 


Description 












BETA- GALA0T0& 1DA^ , 


(LACTASE) 








i 


ORF Name 


NTID AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


&.&63±6±..al.„X9.8. 


239S 7620 


418 


12£7 



















Protein name 



Description 



Locus Name 



Acc# 



[NO-HIT 



665 



NT 



AA 



ORF Name 



NTID 



5253437 c3 



AAID Length Length 



7WF 



Score Probability 
0.00070 



TUG 



Protein name 



Locus Name 



otnG protein 



pir :S70954 



Acc# 



S70954 



Description 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


5mm..±;i...m 


2400 


7622 


73 


222 


SI 


0.0035 



Protein name 



Locus Name 



gas vesicle protein Gvpp 



gp:AF0537£>S 



Acc# 



AF053765 



Description 



Bacillus megater ium AraC (araC) gene, partial cas; gas vesicieprotems Gvpu 
(gvpU) , GvpT (gvpT) , GvpJ (gvpJ) , GvpK (gvpK) , GvpS (gvpS) , GvpL (gvpL) f GvpG 
(gvpG) , GvpF (gvpF) , GvpN (gvpN) , GvpR(gvpR), GvpB (gvpB) , GvpQ (gvpQ) , GvpP 
(gvpP) , and GvpA (gvpA) genes, complete cds; and unknown gene. 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



WTT 



Score Probability 
1.2e-06 



Protein name 



Locus Name 



hypothetical protein (eggsfteix protein gene 
region) 



pir :D4480b 



Acc# 



D44805 



Description 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


5imo.<L.ci...m 


2402 


7624 


390 


1173 


393 


|2.0e-36 



Protein name 

Description 
HVfrOftifiTieAL 46.1 frfeMEllt CY339.43 



Locus Name 



|sp:YU43_MY<J'l 1 U 



Acc# 
Q50695 



666 



ORF Name 



6381^ 12 b3 



Protein name 



NTID 



ins 



NT 



AA 



AAID Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


&&4my.«.ti^3L& 


2404 


7626 


64 








Protein name 








Locus 


Name 


Acc# 


Description 














MO-HIT | 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


6A5.15.6.1...L2..&1 


2405 


7627 


257 


774 




7.3e-^ 















Protein name 



Locus Name 



2 - oxoacia- - 1 erreaoxm oxiaoreductase, Joeta 
chain: 2 -oxoisovalerate oxidoreductase alpha 
chain (misidentif ication) 



pir :B6yiy4 



Description 



Acc# 



B69194 



ORF Name 



£&7....g2...£3a.. 



Protein name 



integrase 



Description 



NTID 



— — Score Pr obability 
AAID Length Length 



2406 



7^" 



410 



TTST 



5.3e-36 



Locus Name 



|gp:BFU7b37i 



Acc# 



U75371 



Bacteroides tragi lis transposon Tn455b 1'npA (tnpAj , integrase (int) , Tnpc 
(tnpC) , excisionase (xis) , mobilization protein (mobA) , and beta- lactamase 
(cfxA) genes, complete cds; and unknown genes. 



667 



ORF Name 



5762302 ti 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 
3T~ 



Score Probability 



T5T 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 
552 



Score Probability 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 
— — Score 
Length Length 



7631 



[567 



Locus Name 



sensory transduction system regulatory 
protein slrl982 :protein slrl982 rprotein 

fl1rl982 



Description 



bir:S75663 



Probability 
3.0e-17 



Acc# 



S75663 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



SIT 



BUT" 



Locus Name 



Acc# 



Description 



HO-HIT 



668 



ORF Name 



NTID 



7ZTT 



Protein name 



hypotnetical protein 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



ITT 



TTIT 



Locus Name 



bir:C72^5i 



1 . ie-06 



Acc# 



C72351 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



Protein name 



Description 



TTTT 



Locus Name 



Acc# 



[MO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



2413 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



Length Length 



TZZT 



Score Probability 
3.1e-70 



717 



Locus Name 



spTBTOFmCSIT 



Acc# 



P22806 



LlSAfiE) 



669 



ORF Name 



NT AA 

— — , Score Pro bability 
NTID AAID Length Length 



13S325S7 tl 44 



7STT 



B.3e-ii 



Protein name 



Locus Name 



hypothetical protein 



bp:AP15fli714 



Acc# 



AF158372 



Description 

Flavobacterium johnsoniae hypothetical protein gene, partial cds; GldB 
(gldB) , GldC (gldC), and hypothetical protein genes, completecds; and 
hypothetical protein gene, partial cds. 





ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


1445S6iV_tl_49 




7£3S 


eos 


lSlS 207 


4 .3e-14 


Protein name 








Locus Name 


Acc# 



hypotnetical protein 



AF158372 



Description 

Flavobactenum johnsoniae hypotnetical protein gene, partial cds; Ulcus 
(gldB), GldC (gldC), and hypothetical protein genes, completecds; and 
hypothetical protein gene, partial cds. 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
TJ2 



Score Probability 



TIT" 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 





ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


145.6.40.0.^t^.I 


2413 


7640 


25>0 


873 


140 


4.2e-07 



Protein name 



MigA 



Locus Name 



|gp:PAU7072y 



Acc# 



U70729 



Description 

Pseudomonas aeruginosa MigA (migA; gene, complete cas. 



670 



ORF Name 



NT ID 



NT AA 

— — , Score Pr obability 
AAID Length Length 



14531406 ti 14 



TIT 



2.6e-2S 



Protein name 



Locus Name 



glycosyl trans t erase 



|gp:AF1465J2 



Acc# 



AF146532 



Description 

Klebsiella pneumonxae waa gene cluster. 





ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


l4g42l75_£l_3& 


2420 


7642 


103S 3lOS 


56 


4.2e-07 



Protein name 



DNA nelicase nomolog 



Description 



Locus Name 



pir :G6$4$4 



Acc# 



G69494 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


±UA6.9.2b...±l...±&± 


2421 


7543 


141 | 


426 175 


2.5e-13 



Protein name 



Locus Name 



hypotnetical protein 



Acc# 



AF158372 



Description 

Plavobacterium johnsoniae nypotnetical protein gene, partial ccts;GiaB 
(gldB) , GldC (gldC) , and hypothetical protein genes, completecds; and 
hypothetical protein gene, partial cds . 



NT 



AA 



ORF Name 



NTID 



2422 



AAID Length Length 
T2TT 



Score Probability 



353 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



671 



ORF Name 



1550 t2 &6 



Protein name 



NTID 



7645 



NT 



AA 



AAID Length Length 



Score Probability 



7W 



Locus Name 



Acc# 



Description 



NO -HIT 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



7T 



TUT 



I.2e-i6 



Protein name 



Locus Name 



mtegrase 



gp:Db04:itt 



Acc# 



D50438 



Description 



Serratia marcescens DNA tor mtegrase, 
metallo-bata-lactamase, aminoglycoside acetyltransf erase, complete cds. 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



£423" 



0.00011 



Protein name 



Locus Name 



sp:YBEU_ECOLi 



Acc# 



P77427 



Description 

HYPOTHE T ICAL 2 7 .0 KB PRO ' l ' L! IN IN LL!U^-<5LTL lJsl ' i ' Llk gENIC kUciioN 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



\±6£o:±ri:L±±...$± I 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



672 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID Length Length 
U5~ 



Score Probability 



Locus Name 



Acc# 



[NO-HIT 



ORF Name 



Protein name 



Description 



NTID 



AAID 



NT AA 

— — , Score P robability 
Length Length 



7650 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID Length Length 



7^T 



Score Probability 
3.0e-26 



297 



Locus Name 



sp:Y40K_klU^N 



Acc# 



P55632 



PU T A T IVE IHTBaRA^ljyRliiCOMmJ^A SB V4(jK 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 
0.0001$ — 



TTu" 



Protein name 



Locus Name 



TanTC 



|gp:AF0^02^b 



Acc# 



AF080235 



Description 



Streptomyces cyanogenus landomycm Joiosynthetic gene cluster , complete 
sequence . 



673 



ORF Name 



2091^7 t3 Ibl 



Protein name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



T08 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



hypotnetical protein 



Description 



— — Score P robability 
Length Length 



fTTT 



Locus Name 



pir :P7S4y4 



9.7e-07 



Acc# 
F75494 



ORF Name 



Protein name 



ClpB 



Description 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



7^" 



WIT 



4.ie-14b 



Locus Name 



gp:AB0123yo 



Acc# 



AB012390 



■Jhermus thermophi lus genes for £>naK, GrpE, DnaJ , DatA, cipB, complete cas . 



ORF Name 



NTID 



— — Score Pro bability 
AAID Length Length 



73" 



Protein name 



Locus Name 



Acc# 



Description 



[NO-HIT 



674 



ORF Name 



Protein name 



NTID 



NT AA 

— — Score Probability 
AAIP Length Length 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAIP Length Length 



Score Probability 
0.048 



T2 



Locus Name 



ubiquinone taosyntnesis protein coqv (coqv) 
RP190 



bir:A717it) 



Acc# 



A71730 



Description 



ORF Name 



NTID 



Protein name 



outer memJarane protein 



Description 



NT 



AA 



AAIP Length Length 



Score Probability 



Locus Name 



pir:C70412 



0.0028 



Acc# 



C70412 



ORF Name 



NTID 



1±0£5.$2±.±1..M...... I 



Protein name 



AAID 



— — S core Probability 
Length Length 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



24257700 ci :1b J 



|2.3e-170 



Protein name 



Locus Name 



sp:YJJK_ECOLl 



Acc# 



P37797 



Description 
A6C T'kAtfS PORTER A'rP- BINDING P&OTSIN VJJK 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
TUB" 



Score Probability 
S.le-32 



T7T 



Protein name 



Locus Name 



rhamnosyl transterase related protein PAB0795 I bir :F750$9 



Acc# 



F75099 



Description 



NT 



AA 



ORF Name 



NTID 



2&19&IA2.±±..A5. 1 I^T 



AAID Length Length 



Score Probability 
0.00014 



35 



Protein name 



Locus Name 



hypothetical protein 



|gp:AFIb^372 



Acc# 



AF158372 



Description 



Flavobacterium jonnsomae Hypothetical protein gene, partial cds,-GldB 
(gldB) , GldC (gldC) , and hypothetical protein genes, completecds; and 
hypothetical protein gene, partial cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



2M0..7.5.6.3...±l...lb.b.., 



Length Length 
TT7T5 



Score Probability 



3W 



Protein name 



Description 



Locus Name 



Acc# 



IN0-H1T 



676 



ORF Name 



NT ID 



NT AA 

— — , Score Probability 
AAID Length Length 



246408S1 ti 24 



Protem name 



hyaluronan syntnase related PAB1314 



Description 



TTZT 



TUT 



1.2e-i4 



Locus Name 



pir:<37l->00b 



Acc# 



G75005 



ORF Name 



Protein name 



NTID 



NT AA 

- — — Score Probabil ity 
AAID Length Length 



T5T 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



12445 



AAID Length Length 



7^7 



Score Probability 

ttutz 



77 



Locus Name 



|gp:AB0J0B2b 



Acc# 



AB030825 



£>seudomonas aeruginosa genomic DNA, partial sequence, strain :PA01. 



NT 



AA 



ORF Name 



NTID 



12445 



AAID Length Length 
TTT 



Score Probability 



Protein name 



Locus Name 



Acc# 



Description 



677 



ORF Name 



Protein name 



NTID 



NT AA 

— — Score Pro bability 
AAID Length Length 



TTHT 



3.3e-ii 



Locus Name 



Acc# 



Description 



gp : 



L39794 



£>lasmid pWQ199 klSfAlI and RJSfAl genes, complete sequence; RJMAlmoduiator 
protein (Rom), mobilization proteins (mbeC, mbeA, mbeB, and mbeD) , 
N-acetylmannosamine transferase (wbbE) , wbbF, andUDP-N-acetylglucosamine 
2-epimerase (wecB) genes, complete cds . 





ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 




2448 


1610 


$5 282 


66 


| 0.035 



Protein name 



Locus Name 



Acc# 



plasma membrane protein 



Description 



pir :T03bbU 



T03680 



ORF Name 



Protein name 



NTID 



AAID 



7T7T 



NT 



AA 



Length Length 
3U3 



Score Probability 



TOT" 



Locus Name 



Acc# 



Description 



ORF Name 



NTID 



AAID 



— — Score P robability 
Length Length 



TTTTT 



\TZU4- 



10.0010 



Protein name 



Description 



Locus Name 



|sp:AFC_JiALMU 



ACC# 
Q00474 



0- ANTIGEN POLYMERASE 



678 



ORF Name 



Protein name 



NTID 



NT 



AA 



AAID Length Length 




Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



FITT 



1ST 



Locus Name 



outer membrane protein mom'/ 2, 
72K: hypothetical protein slll667 hypothetical 
protein slll667 



Description 



pir :S'/46bb 



1.2e-15 



Acc# 



S74665 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
— 



Score Probability 

m 



0.028 



Protein name 



Locus Name 



MHC class ii protein 



bp:AF03oaV2 



Acc# 



AF030872 



Description 



Poeciliopsis occidentals occidentalis MHC class IX protein gene, partial 
exon II, and partial cds . 



ORF Name 



NT AA 

— — Score Pro bability 
NTID AAID Length Length 



uttz 1 rare* 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



679 



NT 



AA 



ORF Name 



NTID 



34154777 ±1 2b 



AAID Length Length 
TOT" 



Score Probability 
|2.4e-36 



390 



Protein name 



Locus Name 



sensory transduction histidine Kinase 
slr2104 :protein slr2104 :protein slr2104 



Description 



pir:375135 



Acc# 



S75136 



ORF Name 



Protein name 



Description 



NTID 



NT AA 

— — Score Prob ability 
AA1D * Length Length 



2455 



Locus Name 



Acc# 



[NO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 

muE — 



Score Probability 
■1.5e-i0 



Locus Name 



115K outer membrane protein precursor : Suse 
protein 



pir:JC602V 



Acc# 



JC6027 



Description 



ORF Name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



Protein name 



84 



255 



Locus Name 



Acc# 



Description 



NO-HIT 



680 



Protein name 



Description 



NT 



AA 



ORF Name 


NTID 


AAID 


Length 


Length 


35I88568_jtl_l^ 




7581 


12 


215 



Locus Name 



Probability 



Acc# 



MO-HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



Length Length 



Score Probability 



Locus Name 



Acc# 



MO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID 



Length Length 



1WT 



Score Probability 
l.Oe-25 



TT2 



Locus Name 



outer membrane protein mom72, 
72K: hypothetical protein slll667 : hypothetical 
protein s!11667 „ 



toir:S7466b 



Acc# 



S74665 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Prob ability 
Length Length 



(I3T 



Protein name 



Locus Name 



Acc# 



Description 



MO-HIT 



681 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



T7T 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



|ilfmib,.±:L..5.6. 



12464 



NT 



AA 



— — , Score Probability 
AAID Length Length 

nzzz — 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



ill0.2.1I..±1...21.. 



Protein name 



NTID 



AAID 



NT 



Length Length 
TUT 



AA 

— Score Probability 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



T51T 



¥7T 



0.0046 



Locus Name 



ABC transporter protein 



gp:CJAJ0&S6 



Acc# 



AJ000856 



Description 

Campylobacter jejuni KpsM, KpsT genes. 



682 



ORF Name 



Protein name 



NTID 



TOTS 



NT 



AA 



AAID Length Length 



— Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



hypothetical protein MTH1606 



Description 



NT 



AA 



AAID Length Length 




5T" 



Score Probability 
|7.5e-07 



TTT" 



Locus Name 



pir :hJbyU8I 



Acc# 



E69081 



ORF Name 



Protein name 



NTID 



AAID 



75ST" 



NT 



AA 



Length Length 
553 



Score Probability 



1ST 



Locus Name 



Acc# 



Description 



NO -HIT 



ORF Name 



NTID 



maiiLcLit:/. 



Protein name 



AAID 



NT AA 

— — Score Probability 
Length Length 



^5" 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



12471 



AAID 



NT AA „ 

— — Score Probability 
Length Length 



ITT" 



Locus Name 



Acc# 



Description 



NO-HIT 



683 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



1644125 c2 iOy 



r7^r 



PUT" 



Score Probability 



157 



Protein name 



Description 



Locus Name 



sp : INVA_BA£BA 



Acc# 



P35640 



ttfVASXOlNf ft&OtfiJW A 



ORF Name 



NTID 



— — ^ Score Probability 



AAID Length Length 



P47T 



1248 



Protein name 



Locus Name 



biosyntnesis or texcnuronic acid tuab 



Description 



Acc# 



D69727 



ORF Name 



Protein name 



NTID 



12474 



NT 



AA 



AAID 



] [ 



Length Length 
775 1 



Score Probability 



Locus Name 



Acc# 



Description 



ORF Name 



Protein name 



NTID 



AAID 



7697 



— — Score Probability 
Length Length 



Locus Name 



Acc# 



Description 



INO-Hlf 



684 



ORF Name 



Protein name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



75W 



TIT 



JUT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— — Score Pr obability 
Length Length 



2477 



2.4e-i5 



Locus Name 



hypothetical protein 



lgp:AF1^7^ 



Acc# 



AF158372 



Description 



Plavobacterium johnsoniae hypothetical protein gene, partial cds/GldB 
(gldB) , GldC (gldC) , and hypothetical protein genes, completecds; and 
hypothetical protein gene, partial cds . 



ORF Name 



aisi&i..±aL...i4a 



Protein name 



NTID 



2478 



TTuU 



NT 



AA 



AAID Length Length 




Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



aaai!3...±i..£6i.. 



Protein name 



NTID 



£4T3~ 



AAID 



wrmr 



NT 



AA 



Length Length 
^2 



Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



685 



ORF Name 



NT ID 



"NTT AA 

— — Score Pr obability 
AAID Length Length 



9959463 t3 14b 



2480 



7702 



ITT" 



9.7e-07 



Protein name 



Locus Name 



serme/tnreonine-specitic protein Kinase, 
PFB0150C 



bir:H7i62l 



Acc# 



H71621 



Description 



ORF Name 



NTID 



AAID 



|23.5D.3.u6...±3....12... 



7703 



"NTT AA 

— — Score Probability 
Length Length 





TTKT 



Protein name 



Locus Name 



nypotnetical protein vuozr/ 



pir: (371^44 



Acc# 



G71244 



Description 



ORF Name 



NTID 



NT — Score Probability 



AAID Length Length 



12482 



17704 



T5T 



TUT 



5.le-<56 



Protein name 



Locus Name 



nypotnetical protein Phiu^iy 



| [pir:A7ll>4r 



Acc# 



A71245 



Description 



ORF Name 



Protein name 



NTID 



24S3 



AAID 



7705 



NT 



AA 



Length Length 
^2 



Score Probability 



Locus Name 



Acc# 



Description 



MO-Hlf 



686 



NT 



AA 



ORF Name 



NTID 



|il756iiB0 t'2 12 



AAID Length Length 

rrm — 



1 



Score Probability 
— 



|i.3e-ib« 



Protein name 



Description 



Locus Name 



Igp: 01^1114 A 



Acc# 
AL132952 



Caenorhabditis elegans cosmid YblH4A, complete sequence. 



ORF Name 



NTID 



— — Score P robability 
AAID Length Length 



il$$52& cl by 



TTUT 



81 



Protein name 



Description 



Locus Name 



Acc# 



K0-H1T 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



7708 



TUT 



2.4e-« 



Protein name 



Locus Name 



sp:SCE4_ME i rEX 



Acc# 



Q49135 



Description 

PUTATIVE SZkim C^cJL E EWZmU (okP4) 



NT 



AA 



ORF Name 



NTID 



AAID 



±26.T116:±.±±...A j 



|24§7 



Length Length 
553 



TIT 



Score Probability 
|7.9e-49 



Protein name 



Locus Name 



histiame ammonia- lyase 



pxr :F/bbIU 



Acc# 



F75610 



Description 



687 



ORF Name 



NT ID 



NT AA 

— — Score Probability 
AAID Length Length 



77Tu~ 



TUT 



930 



F55" 



2.2e-S5 



Protein name 



Locus Name 



hypothetical protein TMUb4i 



[pxr:D72^26 



Acc# 



D72326 



Description 



ORF Name 



NTID 



77TT 



Protein name 



hypothetical protein aq_862 



Description 



NT 



AA 



AAID Length Length 

— 



TUT 



Score Probability 
UTUTZ 



77 



Locus Name 



pir:P70574 



Acc# 



F70374 



ORF Name 



NTID 



I213.3.216.1...13...A5..... I 



Protein name 



AAID 



77TT 



NT AA 

— — Score Prob ability 
Length Length 



75" 



TTW 



Locus Name 



Acc# 



Description 



ORF Name 



Protein name 



NTID 



AAID 



21$.b.lb£....a2..£.$. I 



77TT 



hypothetical protein PH021V 



Description 



NT AA n „ , . . -. . . 

— — Score Pr obability 
Length Length 



TUT 



111 



Locus Name 



pir :G7l244 



l.Se-06 



ACC# 



G71244 



ORF Name 



Protein name 



NTID 



AAID 



77TT" 



NT 



AA 



Length Length 

wn — 



Score Probability 



TVT 



Locus Name 



Acc# 



Description 



WO-ftlT 



688 



NT 



AA 



ORF Name 



NT ID 



24804b61 tl b 



AAID Length Length 




7715 



Score Probability 
|4.7e-06 



TI2 



Protein name 



Locus Name 



proJDaJDXe transcription regulator 



pir:T2yub^ 



Acc# 



T29062 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
TCSS 



7716 



TUZT 



Score Probability 
|i.3e-126 



Protein name 



Locus Name 



Acc# 



acritlavme resistance protein (acrB) 



lgp:AE001i^b 



Description 

Borreiia burgdorteri (section 11 ot 7u; ot tne complete genome. 



NT 



AA 



ORF Name 



NTID AAID Length Length 

rj5 — 



Score Probability 



TTTT 



FT" 



Protein name 



Description 



Locus Name 



Acc# 





ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


m4&b.6.&...c3....10.1 


2496 


771$ 


75 22b 


61 


0.044 



Protein name 



Locus Name 



MADS Jaox-liKe protein 



gp:AS003^J 



Acc# 



AB003323 



Description 

Oryza sativa mRMA tor box-like protein, complete cas, clone rh^uyby . 



689 



ORF Name 



344096^ ri 2 



Protein name 



Description 



NTID 



AAID 



— — Score Probability 
Length Length 



77T3- 



i.5e-84 



Locus Name 



sprHUTIJW^U 



Acc# 



P42084 



HYDROLASE) 



ORF Name 



3465l3&6 t2 24 



Protein name 



NTID 



NT AA 

— — Score Pro bability 
AAID Length Length 



TT7u~ 



Locus Name 



Acc# 



Description 



IN0-H1T 



ORF Name 



Protein name 



NTID 



AAID 



TTTT 



NT 



AA 



Length Length 
W% 



Score Probability 



TTT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



TTTT 



hypotnetical protein S110141 



Description 



NT 



AA 



AAID Length Length 



Score Probability 
S.7e-l8 



Locus Name 



tair:SV64:i4 



Acc# 



S76434 



690 



ORF Name 



NTID 



NT AA rt _ -i -1 > t ■ « 
— — Score Probability 
AAID Length Length 



14141887 J8 



T72T 



8.5e-84 



Protein name 



Locus Name 



immunoreactive 52kD antigen PG41 



gptAFlVSVlb 



Acc# 



AF175716 



Description 



Porphyromonas gingivalis strain W50 immunoreactive 52KD antigenPG4i gene, 
complete cds . 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


4422S00_t^_37 


2602 


7724 


159 


480 


308 


1.4e-26 



Protein name 



Locus Name 



Acc# 



P42357 



Description 

HiaTlDltJE AMMONIA- LYASE, (Hl^TlbAdUj 





ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


5.3.3.1£Ab...±l...y. 


2503 


7725 


525 


1578 


745 


l.le-77 



Protein name 



Locus Name 



Algl 



gp:AF027499 



Acc# 



AF027499 



Description 



Azotobacter vmelandu mannuronan c-5-epimerase (a±gG) gene, partial cds; 
and AlgX, alginate lyase (algL) , Algl, and AlgV genes , complete cds. 



ORF Name NTID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


- Probability 


10.$.12&1.±'1..±§. 2504 


1126 




1S2 88 




Protein name 






Locus Name 


Acc# 








gp:D85752 


D85752 


Description 











bacH and bad genes, complete cds. 



691 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



14664052 ci 2b 



TTTT 



8.9e-i& 



Protein name 



Locus Name 



RNA polymerase sigma tactor Sigz-iike protein | (gp : AP1371>6^ 



Acc# 



AF137263 



Description 



Bacteroides thetaiotaomicron 30S ri£>osomal protein si6-liJteprotem, tucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds. 



ORF Name 



NTID 



NT AA 

— — , Score Pr obability 
AAID Length Length 



23455556 cl 24 



Protein name 



TT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



NT AA 

— — Score P robability 
AAID Length Length 



24flLiaiflLl...Cl...2L7... 



12507 



7729 



511 



ilSfc outer membrane protein precursor : SusC 
protein 



Description 



Locus Name 



pir:J«01>7 



5.2e-46 



Acc# 



JC6027 



ORF Name 



NTID 



AAID 



ma6ai*L.ci...a& | 



7730 



Protein name 



— — Score Probability 
Length Length 



TTT" 



0.000S7 



Locus Name 



Acc# 



Description 



sp : tfECftJfcioLl 



P23485 



FECR PROTEIN 



692 



ORF Name 



NT ID 



NT AA 

— — , Score Probability 
AAID Length Length 



35328314 c2 31 



TTTT 



1WT 



Protein name 



Locus Name 



receptor antigen (RagAj 



|gp:PGIi30tf7T 



Acc# 



AJ130872 



Description 



Porphyromonas gingivalis W50 receptor antigen (rag) locus encodmga ma^or 
immunodominant 55kDa antigen. 



ORF Name 



390S250 ±2 14 



Protein name 



NTID 



2510 



"NTT AA 

— — Score Pr obability 
AAID Length Length 



TTTT 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



T7TT 



NT 



AA 



AAID Length Length 



— Score Probability 



95 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



7734 



NT 



Length Length 



AA 

— Score Probability 



109 



Locus Name 



Acc# 



Description 



NO-HIT 



693 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



!3.0e-57 



Protein name 



Locus Name 



|sp:PUR2 RAW IN 



Acc# 



P43845 



Description 





ORF Name NTID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


l<S93?55l_c2jil(> 2514 




246 


741 94 




Protein name 








Locus Name 


Acc# 


DNA alkylation repair enzyme 


gp:BAJlOlM 


AJ010128 



Description 

Bacillus cereus bc2y/a, aiKD genes and partial glys gene. 



ORF Name 



NTID 



NT AA 

— — , Score P robability 
AAID Length Length 



77TT 



S3" 



PIT 



0.04^ 



Protein name 



Locus Name 



alpna 1,2 tucosyltransterase 



lgp:AF042743 



Acc# 



AF042743 



Description 

kattus norvegicus alpna 1,2 tucosyltransterase mRNA, partial ccts . 



NT 



AA 



ORF Name 



NTID 



119A6.11...alJl±A I 



AAID Length Length 
ETF7 



Score Probability 



Protein name 



Locus Name 



Acc# 



Description 



MO-HIT 



694 



NT 



AA 



ORF Name 



NT ID 



AAID 



TUT 



Length Length 



Score Probability 
|8.ie-lll 



Protein name 



Description 



Locus Name 



sp:UVkC_i3A(JfcW 



Acc# 



P14951 



ORF Name 



NT ID 



NT AA 

— — , Score Pr obability 
AAID Length Length 



l2542£57 t3 ill 



T74u~ 



|2.Se-5l 



Protein name 



Locus Name 



ABC-type transport protein sir2U44 :protem 
slr2044 :protein slr2044 



bir:S7Sl97 



Acc# 



S75197 



Description 



ORF Name 



NT ID 



AAID 



NT AA „ 

— — , Score Probability 
Length Length 



i2.^..7.M6.6....cl...lb.B... 



T7TT 



FT0~ 



T7B" 



l.Oe-U 



Protein name 



Locus Name 



Cps2J 



|gp:AF02^>4Vi 



Acc# 



AF026471 



Description 



Streptococcus pneumoniae DexB idexBj gene, partial cas; putativetransposase 
gene, complete cds; type 2 capsular polysaccharidebiosynthesis operon, 
complete sequence; and AliA (aliA) gene, partial cds. 





ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


lku4:/.b.U^..X:L.A 


2520 


7742 


451 1356 


434 


3.0e-4i 



Protein name 



Locus Name 



spore maturation protein B : hypothetical 
protein slll6 77 : hypothetical protein slll677 



pir:S74647 



Acc# 



S74647 



Description 



695 



ORF Name 



NTID 



15228385 ±3 ±06 



Protein name 



alkaline pnosphatase nomoiog yicox 



Description 



NT 



AA 



AAID Length Length 

— 



T5T 



Score Probability 
|4.7e-12 



Locus Name 



pir :B69861 



Acc# 



B69861 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



T7M 



TTZT 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



10.11iL&A...alL..±A6. 



7ST 



|2.3e-i<i 



Protein name 



Locus Name 



Acc# 



sp:YEHT_Ii!C!OLl 



Description 

HYPOTHETICAL 27.3 KB PR0T3IN W MOLR-B GlX INTERGEfcfie Rtidloti 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Proba bility 
Length Length 



1033£2.1&..±±J11 I 



I2524 



7HT 



Protein name 



Locus Name 



or£98 



|gp:A?l60^64 



Acc# 



AF160864 



Description 

Tetranymena pyritormis mitocnonctriai una, complete genome. 



696 



NT 



AA 



ORF Name 



NTID 



AAID 



20882827 ti U 



TTTT 



Length Length 
ra^T5 — 



Score Probability 



TTTT" 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



VZTIT 



Length Length 
5TB 



Score Probability 
|2.2e-26 



175 



Protein name 



Locus Name 



conserved nypothetical protein 



|pir:(J7bJ68 



Acc# 



C75368 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



7749 



TIT 



i.5e-4V 



Protein name 



Locus Name 



adhesion protein 



pir:C69i80 



Acc# 



C69180 



Description 



NT 



AA 



ORF Name 



NTID 



TTZT 



AAID Length Length 
TZT 



Score Probability 



TTT 



Protein name 



Description 



Locus Name 



Acc# 



NO -HIT 



NT 



AA 



ORF Name 



NTID 



2:3.^.S.ni...ci.2bl.. 



TTTT 



AAID Length Length 
TTT 



7751 



WTT 



Score Probability 
1 . 7e-30 



357 



Protein name 



Locus Name 



hypothetical protein sllllSl 



|pir:S748^2 



Acc# 



S74882 



Description 



697 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



T33" 



TTT 



|i.5e-13 



Protein name 



Locus Name 



Acc# 



conserved hypotnetxcal protein TP06 5U 



pir:A71i00 



A71300 



Description 



ORF Name 



Protein name 



NT ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



242LE9.6..7..7...±a...iai 


2521 


7753 


120 


353 











Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



Protein name 



AAID 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



INO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



7755 



NT AA 

— — Score Probability 
Length Length 



Locus Name 



Acc# 



Description 



NO-HIT 



698 



NT 



AA 



ORF Name 



NT ID 



263594*6 ±1 17 



AAID Length Length 

wn — 



TTu" 



Score Probability 
JTTUTE 



5T 



Protein name 



Description 



Locus Name 



Acc# 



gp:PFMAL3P8 



Plasmodium taiciparum MAL3P8, complete sequence. 



ORF Name 



NTID 



NT AA 
— , — , Score 
AAID Length Length 



26598453 t3 96 



77TT 



Probability 
|2.0e-52 



Protein name 



Description 



Locus Name 



S p:DEOC_CAEElL 



Acc# 



Q19264 



(PHOSPHODEOXYRiijOALDOLA^) (D B OXVUlbOA LDQLAaii!) 



ORF Name 



NTID 



NT AA 
— , — , Score 
AAID Length Length 



[2"5T5~ 



77^" 



Probability 
1.8e-51 



Protein name 



Locus Name 



X-Pro dipept idyl -peptidase, 



pir: J05142 



Acc# 



JC5142 



Description 



ORF Name 



NTID 



Protein name 



AAID 



NT 



AA 



Length Length 

m$ — 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



699 



ORF Name 



NT ID 



NT AA 

— — , Score Probability 
AAID Length Length 



|304797i)0 t'2 bb 



[TFT 



[T9T~ 



|5.ie-ib 



Protein name 



Locus Name 



sprYPJDJWJ^U 



Acc# 
P42979 



Description 

HYPOTHETICAL 13 . 0 KD P&OTfllN l3Sf 0gRC-t3APB INfflRflfllfllC RfifilON 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



TTT 



\TTJT 



1 . 8e-05 



Protein name 



Locus Name 



Acc# 



glucose-binding protein 



U74323 



Description 

Pseudomonas putida glucose -.binding protein (gitBj gene, compietecas . 





ORF Name 


NTID AAID 


NT AA 
Length Length 


Score 


Probability 


3.1MiD.12...ci...l6.1 


2540 7762 


304 915 


51 


0.017 



Protein name 



Locus Name 



GlylORFi 



gp:AF00394i 



Acc# 



AF003941 



Description 

Neisseria gonorrhoeae GlylORFl and GlylORF2 genes, complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
— 



Score Probability 
0.0005S 



TZ1 



Protein name 



Locus Name 



hypothetical protein F56H9.1 



pir :T22808 



Acc# 



T22808 



Description 



700 



ORF Name 



3367SiS£ c2 202 



Protein name 



Description 



NT 



AA 



NTID 



AAID Length Length 
T5T 



Score Probability 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



7765 



hypothetical protein 



Description 



NT 



AA 



Length Length 
T77 



TEW 



Score Probability 
|6.ie-35 



Locus Name 



pir:S3a&74 



Acc# 



S39974 



ORF Name 



Protein name 



Description 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



7766 



576 



Locus Name 



Acc# 



MO-HIT 



ORF Name 



\±1±±6.11.±±...22.. 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



7757 



"JUT 



TT2T 



9.5e-69 



Locus Name 



sp:^H_ayNi>7 



Acc# 



Q59967 



Description 

SS&im ACfi*YLTRANSE i fiRASfi, PlA&Uib t (SAT) 



701 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



4450538 t3 53 



2546 



7768 



!3.5e-i45 



Protein name 
Description 

GLUCOSS ItiHlBlHlD DIVISION Pk OTElN A 



Locus Name 



Acc# 



P25756 



ORF Name 



NTID 



NT AA „ , 1*. * n ■ i_ 
— — Score Probability 
AAID Length Length 



14575011 ±3 100 



^4T 



1.8e-65 



Protein name 

Description 
HYPOTHETICAL 43.0 KB PROTEIN 5LR0064 



Locus Name 



sp:Y064_SYNY3 



Acc# 



Q55156 



ORF Name 



NTID 



NT AA ^ _ , , . _ . . 
— — , Score Probability 
AAID Length Length J ~ 



4£&7..7.&7....cl...242.. 



T77TT 



T7uT" 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



NT 



AA 



ORF Name 



NTID 



\<il±9£.2.&..±±...16. 



2545 



AAID Length Length 



77TT 



WTT 



Score Probability 
14 . Ie-52 



Protein name 



Locus Name 



sp:YJES_E00LI 



Acc# 



P39288 



Description 

HYfrOTftfiJlCAL 43.1 Kf) PROTEIN In £>Sd-aMI S tNTfiftflEUlC kSGlOti U'379) 



702 



NT 



AA 



ORF Name 



NT ID 



14766461 t2 b2 



AAID Length Length 
7772 — 



UTT 



Score Probability 

mz — 



|9.6e-37 



Protein name 



Description 



Locus Name 



|sp:APTl_WHEAT 



Acc# 
Q43199 



Afifilrfllrffi f>ttOSf>HOfeIfiOSYLtRANJSFfiRASEl 1, lAE>RT) 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



4804652 ci 14E> 



777T 



F7T 



3.6e-i5 



Protein name 



Description 



Locus Name 



Acc# 



gp:D90868 



E.coli genomic DNA, Koiiara clone #414(53.8-54.2 mm.). 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
"2TT7 - 



777T 



Score Probability 
2.6e-18 



Protein name 



Description 



Locus Name 



sp : YFU2_BACST 



Acc# 



Q04729 



NT 



AA 



ORF Name 



NTID 



AAID 



6.2M4S..7....C.1...M&.. 



777B" 



Length Length 
TITS — 



T7T" 



Score Probability 
li.4e-93 



Protein name 

Description 
HOLLIDAY JUNCTION DNA HELICASE RUVB 



Locus Name 



sp:kWfe_P£E)Aii! 



Acc# 
Q51426 



703 



ORF Name 



NT ID 



NT AA „ . . 
— , — , Score Probability 
AAID Length Length 



14525 ci ibi 



T77F" 



T5T" 



|2.0e-45 



Protein name 



Locus Name 



octylprenyl diphosphate synthase -like protein 



|gp75FTST7TT 



Acc# 



AF153713 



Description 



Pseudomonas sp. BG33R strain BG33R octylprenyl diphosphatesynthase-liKe 
protein gene, complete cds . 



ORF Name 



707S157 c2 200 



Protein name 



NT ID 



7777 



NT 



AA 



AAID Length Length 



Score Probability 



411 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



S.$A0.5.&..±2L...&iL 



Protein name 



NT ID 



NT AA o _ , , . . . . 
— — , Score Probability 
AAID Length Length 



7775" 



5.0e-2l4 



Locus Name 



DNA polymerase I 



gp:AF121780 



Acc# 



AF121780 



Description 



Rhodothermus obamensis DNA polymerase I (polA) gene, complete cds. 



NT 



AA 



ORF Name 



NTID 



±Q±B3£A2.±±...2(>. I |3557 



AAID Length Length 



777S 



T7T 



Score Probability 
6.3e-05 



TY1 



Protein name 



Locus Name 



immunity region protein in prophage homo log 
ydcM 



pir:B69774 



Acc# 



B69774 



Description 



ORF Name 



Protein name 



NT ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 





2558 




7780 




71 


216 



Locus Name 



Acc# 



Description 
INO-HIT 



ORF Name 



Protein name 



|lCL3LS4ail.„c2„.m I 



NT 



AA 



NT ID AAID Length Length 

TTWL — 



Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NT 



AA 



NT ID AAID Length Length 

pnrs5 — 



Score Probability 



T7WT 



1ST 



Locus Name 



Acc# 



Description 
[MO-HIT 



ORF Name 



NT AA 

— , — , Score Probability 
NT ID AAID Length Length JL 



Protein name 



755i 1 rrrm 1 m i iz75 



Locus Name 



Acc# 



Description 
IN0-H1T 



705 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length 



1220517 t2 113 



JUT 



9.3e-23 



Protein name 



Description 



Locus Name 



sp:DDH_C0RGL 



Acc# 



P04964 



MESO-DlAMltfOPIMELATfi D- DEHYDROGENASE , 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



12305502 tl 44 



TTWT 



3.4e-32 



Protein name 



Locus Name 



spiKGUAJ/EAST 4 



Acc# 



P15454 



Description 

GUANYLATE KINASE, (GMP KINASE) 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



I2L3.3.Ii0.1..±1...3.i 



1.4e-35 



Protein name 



Locus Name 



probable permease D1828 



pir :D64^44 



Acc# 



D64944 



Description 



ORF Name 



lii.7.40.2S„.±3„.i0.i.. 



Protein name 



NTID 



TTWT 



NT 



AA 



AAID Length Length 



Score Probability 
B.5e-71 



35F 



Locus Name 



sp:NRDD_HAEIN 



Description 

ANAEROBIC RIBONUCLEOTIDE-TRIPHOSPHATE REDUCTASE, 



Acc# 



P43752 



706 



NT 



AA 



ORF Name 



NT ID 



1281260 c2 372 



AAID Length Length 



Score Probability 
FTCI 



6.8e-43 



Protein name 



Locus Name 



O-unit tlippase-lilce protein 



|gp:YPE25iyii 



Acc# 



AJ251713 



Description 



Yersinia pest is strain EV76 nemH gene (partial) ana o-antigen genecluster 
for ddhD gene, ddhA gene, ddhB pseudogene, ddhC gene, prtgene, wbyH gene, 
wzx gene, wbyl pseudogene, wbyJ gene, wzypseudogene, wbyK gene, gmd 
pseudogene, fcl pseudogene, manC gene,wbyL gene, manB gene, wzz gene and gsk 
crene (partial) . 



ORF Name 



Protein name 



NT ID 



NT 



AA 



AAID Length Length 

fzws — 



Score Probability 



ST" 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



p.5.ai^.a.as....c^....4.4.a 



2568 



7750 



1005 



|1.0e-32 



Protein name 



Locus Name 



galactosyl transterase 



igpiAPoaoavi 



Acc# 



AF030373 



Description 



Streptococcus pneumoniae strain SP-264 alpna, l-6-glucosidase (clexBj gene, 
complete cds; capsular polysaccharide biosyntheticlocus, complete sequence ; 
and oligopeptide binding protein (aliA)gene, complete cds. 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



2.6.Q5.6.6.82....ca...2.45.. 



7791 



IT 



0.015 



Protein name 



Locus Name 



hypothetical protein 



|gp:55tfl^9lir 



Acc# 



Y18930 



Description 

Sultolobus soitataricus 281 kjd genomic DNA tragment, strain P2 . 



707 



ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


1555i535_r3_192 


2570 


7752 


118 


357 


113 


5.3e-07 



Protein name 



Locus Name 



unknown 



gp:LLU35625 



Acc# 



U35629 



Description 



Lactococcus lactis plasmid. pSRQ802 abortive mtection protein K(abiK) gene, 
complete cds . 



NT 



AA 



ORF Name 



NTID AAID Length Length 




^57T 



Score Probability 




Protein name 



Locus Name 



capsular polysaccharide biosynthesis nomolog 
ywqE 



Description 



pir:H70066 



Acc# 



H70066 



ORF Name 



Protein name 



Description 



NTID 



AAID 



NT AA 

— — , Score Probability 
Length Length ■ — 



12572 



7754 



TFT" 



Locus Name 



Acc# 



INO-HIT 



ORF Name 



NT 



AA 



mS.25.1!.±2...1D.<i I 



NTID AAID Length Length 

rrm — 



TIT5" 



Score Probability 
9.5e-14 



TT5 



Protein name 



Locus Name 



Acc# 



Phosphmothricm ace tyl trans t erase (EC 



gp :D90784 



Description 

E.coli genomic DNA, Kohara clone #273 (32.5-32.8 mm.). 



708 



ORF Name 



23486536 c2 340 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 
6T5 



Score Probability 



TFT 



Locus Name 



Acc# 



Description 



[MO-HIT 



ORF Name 



Protein name 



PhnW 



Description 



NT 



AA 



NTID 



AAID Length Length 
TTT3 



T7T7 



T7IT 



Score Probability 
|7.3e-ii0 



Locus Name 



gp:STU69493 



Acc# 



U69493 



Salmonella typhimurium ThU ana Ortl genes, partial cds, and PhnX,PftnW, 
PhnR, PhnS, PhnT, PhnU and PhnV genes, complete cds . 



ORF Name 



Protein name 



NTID 



12576 



NT AA 

— — , Score Probability 
AAID Length Length 



7733 



FT" 



210 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



ITFT 



743" 



5.0e-$6 



Locus Name 



dTDPglucose 

4, 6 -dehydratase, : dTDP-D-glucose-4 , 6-dehydratas 
e : dTDP-glucose dehydratase 



pir :T001U^ 



Acc# 



T00102 



Description 



709 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



78TJTT 



TTTT 



8.2e-3i 



Protein name 



Locus Name 



Acc# 



conserved hypothetical protexn aq__1964 



pir :D70468 



D70468 



Description 



ORF Name 



Protein name 



NTID 



12579 



NT 



AA 



AAID Length Length 
£TS 



Score Probability 



PIT 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



Z3.63.28.8.Q...G2...3.41 1 12580 



731T 



7.2e-16 



Protein name 



Locus Name 



hypothetical protein 



gp:EFY17737 



Acc# 



Y17797 



Description 



Enterococcus taecalis gph, ydjH, ydjG, yd] I, pbp4 and ycliC, ORF2and ORF3 
genes, partial. 



ORF Name 



NTID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



2.3.S.3.7.0.5.i...a2...3.5.7.., 



|4.2e-27 



Protein name 

Description 
HOLLIDAY JUNCTION DNA HELICASE RUVA 



Locus Name 



sp:RWA_£>SEAfi 



Acc# 



Q51425 



710 



ORF Name 


NTID 


AAID 


NT AA 
— — Score 
Length Length 


Probability 


24066055J:3_i73 


2582 


7804 




310 


933 107 




0.022 



Protein name 



Locus Name 



Acc# 



gp:SCYDL057W 



Description 

S.cerevisiae chromosome IV reading trame ORF YDL057w. 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 




202 



Score Probability 

mi — 



1 .le-26 



Protein name 



Locus Name 



sp:Y0EJ_SACSU 



Acc# 



P54455 



Description 

HYPOTHETICAL 22.2 KD PROTEIN IN A&OD-COMES. INTERGENIC REGION 



NT 



AA 



ORF Name 



NTID 



AAID 



1A&119.S2...G1..A52 1 



Length Length 
Ml 



Score Probability 
2.4e-08 



Protein name 



Locus Name 



Acc# 



cold snocK protein nomoiog cspc 



pir:S43618 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



2l4^6.25l5^...C1...27.5.... 



7WT" 



Length Length 



1317 



Score Probability 
^ 



1 . 8e-99 



Protein name 



Locus Name 



UDP-N-acetylglucosamme 
1 - carboxyvinyl transferase (murA) homolog 



|pxr:G70ib8 



Acc# 



G70158 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID 



24644068 fc3 20^ 



Length Length 
1ST" 



Score Probability 
6.6e-24 



Protein name 



Description 



Locus Name 



sp:NkDG_HAEIN 



Acc# 



P45080 



(EC 1.37.1.-) 



ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


24645138_c3_444 


2587 




7803 


210 




633 


110 


JT5e-06 



Protein name 



Locus Name 



unknown 



gp:AF048749 



Acc# 



AF048749 



Description 



Bacteroides tragilis capsular polysaccharide biosynthesis operon, complete 
sequence . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



7810 



1425 



1.4e-42 



Protein name 



Locus Name 



Acc# 



sp:VEBU_ECOLI 



Description 

HYPOTHETICAL 53.2 KB EfeOTSta IS P&C-P&PA ^TeftflM fC RflCIOti 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 

7m — 



tutt 



Score Probability 
II . 3e-204 



1980 



Protein name 



Locus Name 



cation ettlux system protein czcA-1 : protein 
slr0794 :protein slr0794 



pir :S77008 



Acc# 



S77008 



Description 



712 



ORF Name 



NT ID 



Protein name 



response regulator nomoiog 



Description 



NT 



AA 



AAID Length Length 




ITT 



Score Probability 
T3I 



|5.Ie-i5 



Locus Name 



foir:A6953i 



Acc# 



A69531 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 
0.00070 



Protein name 



Locus Name 



receptor antigen B (RagBJ 



bp:PSI130871i 



Acc# 



AJ130872 



Description 



Porpnyromonas gmgivalis W50 receptor antigen (rag) locus encodmga major 
immunodominant 55kDa antigen. 



ORF Name 



NT AA 

— , — , Score Probability 
NTID AAID Length Length 



7814 



|i.6e-66 



Protein name 



Locus Name 



| sp:VWMEJWaU 



Acc# 



P71040 



Description 

HYPOTHETICAL 55.8 KE> PROTEIN IN SPOIIQ-MTA 1NTERGEN1C REGION 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



2L5L5La^l5t2L...al...2L&a., 



2593 



T34T 



4.2e-22 



Protein name 



Locus Name 



probable phosphoesterase, ykuE 



plr :B69865 



ACC# 



B69865 



Description 



713 



NT 



AA 



ORF Name 



NTID 



26212S37 c3 46!> 



AAID Length Length 



Score Probability 
TT7I — 



7.4e-172 



Protein name 



Locus Name 



polyA polymerase 



[gp:AB02£867 



Acc# 



AB022867 



Description 



Prevotella rummicola genes tor polyA polymerase, D-alaninegiycmepermease 
and cellulase, complete cds. 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length — ■ 



2«05l36 c3 418 



7817 



JUT 



|l.Se-08 



Protein name 
Description 

163 RRNA PROCESSING PROTEIN RIMM 



Locus Name 



sp:RlMM_HAfilKf 



Acc# 



P44568 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



T5T 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



T5TT 



T2TE" 



|4.7e-06 



Protein name 



Locus Name 



cation ettlux system membrane protein czcC 



pir:«3^0 



Acc# 



C33830 



Description 



714 



ORF Name 
26600682 ±2 110 







NT 


AA 


NT ID 


AAID 


Length 


Length 


2558 


7820 


176 


531 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



[NO-HIT 



ORF Name 



NT ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



TWIT 



T5T 



$.6e-09 



Protein name 



Locus Name 



hypotnetxcal protexn aq_1477 



pir :D70428 



Acc# 



D70428 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



£F01T 



Length Length 
ITS 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



6.3e-104 



Protein name 



Locus Name 



glucose -1-phosphate thymiciylyltranst erase, 



tprr:C!69i06 



Acc# 



C69106 



Description 



ORF Name 



19AZ53xl...aX...2.16. 



Protein name 

Description 
REDUC TO I S OMERAS E ) 



NTID 



AAID 



2602 



7824 



NT 



AA 



Length Length 

rnrs — 



Score Probability 
1.5e-86 



Locus Name 



lsp:DXR_£jyN¥;i 



Acc# 



Q55663 



715 



ORF Name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



301641^7 422 



Protein name 



i.6e-43 



Locus Name 



Acc# 



2 -phosphonoacetaidehyde hydrolase 



Description 



gp:PAU4520<> 



U45309 



Pseudomonas aeruginosa 2 -phosphonoacetaidehyde hydrolase gene, complete cds. 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



Protein name 



Locus Name 



Acc# 



Description 



ORF Name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



1±&16.SA1.±1....5£. I [Z5uT 



Protein name 



75TT 



i.6e-06 



Locus Name 



Acc# 



sperm- speci lie protein component 



Description 



lgp:£)MUM^7 



U90537 



Drosophila melanogaster sperm-specitic protein component {dj) mRNA, complete 
cds . 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



7WTW 



\2.2e-$2 



Protein name 



Locus Name 



Acc# 



probable permease bl828 



pir :D64944 



D64944 



Description 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



33252182 c3 437 



12607 



7829 



Protein name 



ribose 5 -phosphate isomerase (rpi) nomolog 



Description 



713" 



|2.5e-36 



Locus Name 



pir:G63367 



Acc# 



G69367 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



tut 



2106 



T7TT 



l.ie-09 



Protein name 
Description 

HYPOTHETICAL WD -REPEAT PROTEIN &LR1409 



Locus Name 



|sp:YE09JJYN«:i 



Acc# 



P73594 



ORF Name 



Protein name 



NTID 



2609 



AAID 



73317 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 
NO-HIT 



ORF Name 



Protein name 



NTID 



2610 



AAID 



NT AA 

— — Score Probability 
Length Length 



55" 



3W 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



12611 



7STT 



hypothetical protein 4 



Description 



NT 



AA 



Length Length 
TTT 



TT7T 



Score Probability 

o.oooia — 



T77 



Locus Name 



bir:fi22§45 



Acc# 



E22845 



717 



NT 



AA 



ORF Name 



341&1500 ±i 6 



NTID AAID Length Length 




Score Probability 
551 



l.ie-fii 



Protein name 



Locus Name 



immunoreactive 106 JcDa antigen PG115 



|gp:AF153767 



Acc# 



AF153767 



Description 



Porphyromonas gingivalis strain W5 0 immunoreactive 106 kDa antigenPGHS 
gene, complete cds . 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length J ~ 



34l§l5§7 ±2 103 



TUT 



4 .6e-15 



Protein name 



Description 



Locus Name 



|gp:U§368§ 



Acc# 



U93688 



Staphylococcus aureus toxic shock syndrome toxin- l (tst) ; enterotoxm (ent) , 
and integrase (int) genes, complete cds. 



NT 



AA 



ORF Name 



M5.7.2.m...c3...A£a I [2ST¥ 



NTID AAID Length Length 

rmiE — 



WIT 



Score Probability 
0.00026 



TT7 



Protein name 



Locus Name 



conserved hypothetical protein aq_8 54 



pir:B70374 



Acc# 



B70374 



Description 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



16.12$A1L.±1„.5.± I 



T&TT 



7uTT 



TTUT 



Locus Name 



3.4e-iIS 



Acc# 



sp : PPKJilCOLl 



Description 
POLYPHOSPHATE KINASE, 



P28688 



718 



ORF Name 



NTID 



35242137 c2 306 



Protein name 



AAID 



hypothetxcal protein 



Description 



NT AA 

T T^i-in Score Probability 
Length Length 



T2D" 



6.0e-44 



Locus Name 



pir : JQ1020 



Acc# 



JQ1020 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



Length Length 
1 



T7T 



Score Probability 

nm — 



0.0045 



Locus Name 



gp:UMCRGl 



Acc# 



X92509 



U.mayclis crgi gene. 



ORF Name 



NTID 



±D:2.A£.5.2...±3....XZ5. 



Protein name 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



1203 



Locus Name 



Acc# 



Description 
NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length ^ 



iD.6.2S.1.7...±1...5.a I 



2TT 



l.Oe-17 



Locus Name 



cation etflux system iczcB-iiKe) 



pir:£70342 



Acc# 



E70342 



Description 



NT 



AA 



ORF Name 



NTID 



4054787 t3 174 



12620 



AAID Length Length 
1W£2 



Score Probability 
7TE 



|1.5e-73 



Protein name 



Locus Name 



receptor antigen (RagA) 



gp:PGI130a72 



Acc# 



AJ130872 



Description 



Porphyromonas gingival is W5 0 receptor antigen (rag) locus encodings major 
immunodominant 55kDa antigen. 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



4100377 c2 370 



7MT 



2GT 



4.4e-l6 



Protein name 



Locus Name 



unknown 



gp:AF048749 



Acc# 



AF048749 



Description 



Bacteroides tragilis capsular polysaccharide biosynthesis operon, complete 
sequence . 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



4.5e-07 



Protein name 

Description 
HYPOTHETICAL TRANSCRIPTIONAL REGULATOR Y4DJ 



Locus Name 



|sp:Y4M_RHISN 



Acc# 



P55409 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



7§45 



7.6e-2S 



Protein name 

Description 
CAPG PROTEIN 



Locus Name 



sp:CAPG_STAAtf 



Acc# 



P39856 



720 



NT 



AA 



ORF Name 



NTID 



AMD Length Length 



Score Probability 



41146^2 'A J 2y 



12624 



Protein name 



ABC-type transport protein sir0864 rprotem 
slr0864 :protein slr0864 



Description 



Locus Name 



pir :S74B4y 



4.£e-10a 



Acc# 



S74849 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



7S47 



¥7T 



11422 



TUT 



0.025 



Protein name 



Locus Name 



middle molecular weight neurot i lament protein I |gp : XLU85969 



Acc# 



U85969 



Description 



Xenopus laevis middle molecular weignt neurotilament proteinuF-M U) mRJNA, 
complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



262£ 



784S 



Length Length 
57o 



TTTT 



Score Probability 
li.2e-06 



TT0~ 



Protein name 



Locus Name 



sp:YH74J^!TTk 



Acc# 



027802 



Description 
HYPO T HETICAL PRO TEIN MJ1774 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 
!6.6e-06 



Protein name 



Locus Name 



sp:lHFA_HAEIJM 



ACC# 



P43723 



Description 

INT E GRA T ION HOST FACTOR ALPHA- tSLmiW IT (1HF-AL1>HA) 



721 



NT 



AA 



ORF Name 



NTID 



AAID 



14457750 c2 U2 



Length Length 




Score Probability 



Protein name 

Description 
NO -HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



|A3.iaaft5...c2L...i44.. 



7F5T" 



Length Length 



Score Probability 
l.le-15 



Protein name 



Locus Name 



lipoprotein 



gp:AF00094S 



Acc# 



AF000945 



Description 



Vibrio cholerae lipoprotein (nlpDj gene, partial cds, sigma S (rpoS) gene, 
complete cds, and methyl -directed mismatch repairprotein (mutS) gene, 
partial cds . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 




T3T 



558 



Score Probability 
7.5e-2l 



Protein name 



Locus Name 



AlgT 



gp:AF190580 



Acc# 



AF190580 



Description 



Pseudomonas syrmgae pv. syrmgae AlgT (algT) gene, complete cds . 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



4&7.S.2.5.2...C3....447.., 



0.0058 



Protein name 



Locus Name 



galactosyl transterase 



bp:SPN2iS004 



Acc# 



AJ239004 



Description 

Streptococcus pneumoniae type 8 capsular gene cluster. 



722 



NT 



AA 



ORF Name 



NT ID 



AAID 



4885937 ti 4^ 



Length Length 




JUT 



Score Probability 
4.8e-42 



Protein name 



Locus Name 



Acctt 



putative 1, 4-dihydroxy-2-naphthoate 



|gp:AP10i047 



Description 



Haemophilus ducreyi putative 
1 , 4-dihydroxy-2-naphthoateoctaprenyltransf erase, YadR (yadR), cytidine 
5 'monophosphateN-acetylneuraminic acid synthetase (neuA) , 
lipooligosaccharidesialyltransf erase (1st) , and putative 
dTDP-D-glucose4 , 6 -dehydratase (rmlB) genes, complete cds; and 



NT 



AA 



ORF Name 



NTID 



AAID 



l5±3jafljQjQ...±i..JLl.4.. 



Length Length 
772 



Score Probability 
3.2e-24 



TI5 



Protein name 



Description 



Locus Name 



sp:HLV^_M^ 



Acc# 



P54176 



HEMOLYSIN 111 (HLY-111) 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score 



L5.8632.2.7....C2...3.7X... 



1107 



Probability 
8.3e-58 



Protein name 



Locus Name 



11m protein 



|pir:A5585^ 



Acc# 



A55856 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



isjBj92ijaa...±i..iaja J pnr 



1.3e-07 



Protein name 



Locus Name 



Acc# 



conserved hypothetical protein MTH7 00 



pir:E6&19;J 



E69193 



Description 



723 



NT 



AA 



ORF Name 



NTID 



AAID 



5337787 t3 2TI 



— — , Score Probability 
Length Length 

T71T 



[T5T 



|2.0e-10 



Protein name 
Description 

pWaMve PHOSfrHAfS frgftMBASS Ml 16 04 



Locus Name 



sp:YG04_HAUlW 



Acc# 
P45268 



ORF Name 



606£040 t3 204 



Protein name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



T5T 



894 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



2£3§ 



75W 



7u4~ 



£.2e-33 



Locus Name 



conserved nypothetical protein yioc 



pir :A69878 



ACC# 



A69878 



Description 



ORF Name 



NTID 



AAID 



SA2&A3.u...c3....45.0... 



Protein name 



cpsF protein, 40, 6K 



Description 



NT 



AA 



Length Length 
TTJ1 — 



T75~ 



Score Probability 
7.8e-74 



7¥S 



Locus Name 



|pir:S70l5V 



Acc# 



S70157 



ORF Name 



Protein name 



NTID 



&ft3L52La5„.c3L...iaa ...| p^nr 



AAID 



7862 



— — Score Probability 
Length Length 



Locus Name 



Acc# 



Description 



WO-HIT 



724 



ORF Name 



7148412 ±3 240 



Protein name 



NTID 



AAID 



7WT 



NT 



AA 



Length Length 
FIT" 



Score Probability 



Locus Name 



Acc# 



Description 
NO-HIT 



ORF Name 



|14Iiam...cl„.ICLL 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 



Score Probability 



or 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



7^3" 



NT AA 

— , — , Score Probability 
Length Length 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



2.3A63J3.2....Z2...3A 



Protein name 



NTID 



AAID 



NT AA 

— , — , Score Pr obability 
Length Length 



0.010 



Locus Name 



sp:VGP8_EBV 



Acc# 



P03224 



Description 

PROBABLE MEMBRANE ANTIGEN GPS 5 



725 



NT 



AA 



ORF Name 



NT ID 



ti 21 



AAID Length Length 

mm — 



2FT 



Score Probability 
3.0e-32 



tits 



Protein name 



Locus Name 



115K outer membrane protein precursor : Susc 
protein 



Description 



pir : JC6027 



Acc# 



JC6 02 7 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID Length Length 



2646 



1623 



Score Probability 
|i.2e-226 



Locus Name 



sp : PPCK_ANA£U 



Acc# 



009460 



ORF Name 



Protein name 



Mere protein 



Description 



NTID 



AAID 



NT AA 
— , — , Score 
Length Length 



73W 



Probability 




Locus Name 



gp : EAMMRTRAN 



ACC# 



Y08992 



E . agglomerans pKLH272 incomplete unit ot mosaic mercury 
resistancetransposon . 



ORF Name 



NTID 



AAID 



NT AA 
— — Score 
Length Length 



IMi:/.ia:/...±i...22 1 



T3T 



Protein name 



Locus Name 



Probability 



Acc# 



Description 



NO-HIT 



726 



NT 



AA 



ORF Name 



NTID 



24803760 i2 



AAID Length Length 



7S7T 



Score Probability 
|i.4e-152 



Protein name 



Locus Name 



putative oxidoreductase alpha- subun it 



gp:SCAH10 



Acc# 



AL132824 



Description 



Streptomyces coelicolor cosmid AH10. 



ORF Name 



NTID 



NT AA 

— — Score Pr obability 
AAID Length Length 



25444787 c^ ya 



7.3e-78 



Protein name 



Locus Name 



Uracil pnospnoriJDOsyltransterase 



gp:AB016085 



Acc# 



AB016085 



Description 



Porphyromonas gmgivaiis Porr, upp, and prtg genes, complete cas. 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



26A5.6.BAB..±Z..:±J. | 



7T7T" 



WIT 



TUTF 



i.4e-8i 



Protein name 



Locus Name 



pro£>a£>le oxidoreductase 



pir :E708b4 



ACC# 



E70864 



Description 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



727 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



265627S0 ci 



f¥TF" 



Protein name 



Locus Name 



transposase 



gp:AF038866 



Acc# 



AF038866 



Description 



Bacteroides tragilis transposon Tn5520 transposase (bipH) anctmobilization 
protein BmpH (bmpH) genes, complete cds . 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 
7SZ 



Score 



TTT 



Locus Name 



Probability 



Acc# 



Description 



[NO -HIT 



ORF Name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



iwrr 



tott 



|4.2e-0& 



Protein name 
Description 

MSM OPKRON R E GULATORY PROTEIN 



Locus Name 



sp:M5MR_3TRMU 



Acc# 



Q00753 



ORF Name 



NTID 



NT AA 

— — Score Proba bility 
AAID Length Length 



3.2i5.S.0.:/.:/...cl....7.i I msz 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



728 



NT 



AA 



ORF Name 



NTID 



AAID 



2657 



Length Length 




Score Probability 



5T 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 

mz — 



TTT 



Score Probability 
Ii.9e-a4 — 



280 



Protein name 



Locus Name 



glycerophosphodiester pnospnodiesterase 
homolog yhdw 



|pir:H6^V 



Acc# 



E69827 



Description 



NT 



AA 



ORF Name 



NTID 



a52mM...ci...&iA 



2659 



AAID Length Length 




Score Probability 



tost 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



NT 



AA 



ORF Name 



NTID 



2660 



AAID Length Length 

rim — 



7W 



Score Probability 
5 . 7e-il 



376 



Protein name 



Locus Name 



115K outer membrane protein precursor : sus<J 
protein 



bir:JCb027 



Acc# 



JC6027 



Description 



729 



ORF Name 



cl 72 



Protein name 



— — , Score Probabil ity 
NT ID AAID Length Length 



7533 1 [57^ 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NT 



AA 



NT ID AAID Length Length 

tss — IF7T3 — 



Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID AAID Length Length 

TTTD 



7885 



Score Probability 
0.00058 



TO 



Locus Name 



|gp:D42067 



Acc# 



D42067 



Porphyromonas gingxvaiis DNA tor Fimnnlin, ORF1-4, complete cas. 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
P73 



Score Probability 



Protein name 



Locus Name 



Acc# 



Description 



MO-HIT 



730 



ORF Name 



NT ID 



— — Score Probability 
AAID Length Length 



19933530 c2 77 



2665 



17887 



Protein name 
Description 



Locus Name 



Acc# 



[WO -HIT 



ORF Name 



NTID 



AAID 



— — Score P robability 
Length Length 



23.4&7AB.u..±Z...l3... 



7888 



443 



TUT 



TTO 



|3.ie-10 



Protein name 



Locus Name 



conserved hypothetical protein yjaiz 



pir :Jbi6ybby 



Acc# 



E69858 



Description 



ORF Name 



NTID 



\2661 



7889 



Protein name 



conserved nypotnetical protein yvrM 



Description 



— — Score Probability 



AAID Length Length 



1.4e-0b 



Locus Name 



pir:G70047 



Acc# 



G70047 



ORF Name 



Protein name 



NTID 



\l±6.b&lD.D..±1..22 ....I 



AAID 



NT 



AA 



Length Length 
ITTT 



Score Probability 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


3.u3Al&ul...c3...A6. 




7851 


504 


$15 132 


3.4e-G*J 


Protein name 








Locus Name 


Acc# 










gp:AP000342 


AP000342 


Description 













Plasmid Rioo genomic ujma 



731 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
TTSu — 



Score Probability 
0.00015 — 



127 



Protein name 



Description 



Locus Name 



sp:V7y7_METJA 



Acc# 
Q58207 



HYPOTHETICAL £>R0TEtti MJ0797 



NT 



AA 



ORF Name 



NTID 



AAID 



3$32<>37 12 17 



7893 



Length Length 
T7M 



Score Probability 
7.5e-l5 



ITS 



Protein name 



Description 



Locus Name 



sprY&J^JiiCOLl 



Acc# 



P75831 



HYPOTH E TICAL ABC TRANSPORTE R ATP-B1NDING PRoTein ybuz, 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



i.4e-bfc 



Protein name 



Description 



Locus Name 



sp:YF0S_METJA 



Acc# 



Q58903 



HYPOTHETICAL A£C TRANSPORTE R AT£- 6 ll^StrtC proTein MJibus 



NT 



AA 



ORF Name 



NTID 



— — Score Probability 



AAID Length Length 



MI" 



12UF" 



(12TT 



o.ooo^a 



Protein name 



Description 



Locus Name 



lgp:Mb7b2 



Acc# 



D85752 



laecalis plasmxd pPE)i bac A, bacB, bacc, sacD, J3acK,bacF, sacG, 
genes, complete cds. 



Enterococcus 
bacH and bad 



732 



ORF Name 



NT ID 



AAID 



NT AA 
— , — , Score 
Length Length 



17220142 12 11 



7896 



Probability 
B.le-06 



Protein name 



Locus Name 



conserved Hypothetical protein yvrM 



Description 



|pir:G70047 



Acc# 



G70047 



ORF Name 



LD.ZS.S.D.X5.. ..£,!,. ..1,5... 



Protein name 

Description 
NO-HIT 



NT 



AA 



NTID 



AAID 



Length Length 



Score Probability 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



AAID 



2676 



7898 



hypothetical protein S110241 



Description 



NT AA 
, — , — J , Score Probability 
Length Length 



402 



0.035 



Locus Name 



pir :S75099 



Acc# 



S75099 



ORF Name 



1.Q5A1.8.8....CX..3.5.8... 



Protein name 

Description 
NO-HIT 



NT 



AA 



NTID 



AAID 



TF7T 



Length Length 



Score Probability 



Locus Name 



Acc# 



ORF Name 



NT 



AA 



NTID 



AAID 



inaata&2...£i...iia i ettf 



Length Length 
TUT 



Score Probability 



E 



7T 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



733 



NT 



ORF Name 



NTID 



AAID Length Length 



AA 

— Score Probability 



iii096b^ ±1 



|2.3e-4y 



Protein name 



Locus Name 



immunoreactive 21 KD antigen puiu 



|gp:APi4407V 



Acc# 



AF144077 



Description 

£orphyromonas gingivalxs str ain WbU immunoreactive 21 ku antigenpmu gene, 
complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



11132800 tl Jl 



7WF 



Score Probability 
|5.0e-l« 



Protein name 



Description 



Locus Name 



sp: SAH^MMfci (Jk 



Acc# 



P93253 



ORF Name 


NTID 


AAID 


NT 

Length 


AA 

— , Score 
Length 


Probability 


±±125.±±'A..±l.Jlh± 






120 


353 240 


3.2e-20 


Protein name 








Locus Name 


Acc# 










sp:RS15JiOk±JU 


051744 


Description 












30S ftlBOSOMAL PROtSItf SlS | 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


l2.lM3..7....Cl..A2.b. 


2682 


7904 


532 


1599 |255 


|l.le-34 


Protein name 








Locus Name 


Acc# 










sp : S_hUmA*J 


P08842 


Description 




ITT c?TTT T!i*!"P^dL> 


— m — rac 


r\ 





734 



NT 



AA 



ORF Name 



NTID 



AAID 



115757180 ±2 220 



7905 



Length Length 



T2T 



Score Probability 
715 



0.035 



Protein name 



Locus Name 



Hypothetical protein APE1598 



pir :A72539 



ACC# 



A72539 



Description 



ORF Name 



NTID 



NT AA 

_ „ T — _ — L1 Score Probability 
AAID Length Length ■ L 



l^a6.3..7.5.D....tl..aZ^ I 12684 



17906 



HITS" 



2.6e-05 



Protein name 



Locus Name 



hypothetical protein PH0212 



pir :B71244 



Acc# 



B71244 



Description 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length ^~ 



HA0JAiil..xii.„ilfi I 



17507 



Protein name 



hypothetical protein APE0625 



Description 



TTT 



ITUT 



l.le-05 



Locus Name 



pir:C72649 



Acc# 



C72649 



ORF Name 



Protein name 



NTID 



NT 



AA 



AAID Length Length 
7908 



Score Probability 



78 



Locus Name 



Acc# 



Description 
NO-Hl* 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



TIT 



|2.2e-5^ 



Locus Name 



clindamycin resistance transfer ractor JotgB 



pir:B4l£56 



Acc# 



B41656 



Description 



ORF Name 



Protein name 



NT ID 



7910 



NT 



AA 



AAID Length Length 
T55 



Score Probability 



5T" 



Locus Name 



Acc# 



Description 





3RF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


iB.o.i9.ii:z...c^..Au.bL 


... |26 89 


7911 | 


682 2U4y 








Protein name 








Locus Name 


Acc# 




Description 
















MO-HIT | 




ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— Score 
Length 


Probability 




±6im±.±l....±±2. 


.... 2690 


7512 1 


323 972 




5.7e-lb 




Protein name 






Locus Name 


Acc# 




hypothetical protein SCJ12.2/C 


pir :TJ7044 


T37044 




Description 




ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 




IBMAlb^al^U^ 


.... 2691 


7913 1 


261 




b9b 


7.8e-ba 




Protein name 


Locus Name 


Acc# 




hypotnetical prot 


em jhplisu 




pir:A71dJtt 


A71838 




Description 




ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— Score 
Length 


Probability 







2652 


7914 


262 | 789 


22b 


l.Se-lS 




Protein name 








Locus Name 


Acc# 




hypotnetical protein £2381 


p±r:B6b012 


| B65012 



Description 



736 



ORF Name 



NTID 



15525177 t2 232 



Protein name 



thiamin monphosphate kinase 



Description 



NT 



AA 



AAID Length Length 
— 



T5T 



Score Probability 
H23 



1 . 3e-39 



Locus Name 



pxr:G6$052 



Acc# 



G69052 



NT 



AA 



ORF Name 



NTID 



ianaai2...t3...3A6. 



TOT" 



AAID Length Length 
UTZ 



1323 



Score Probability 
WT2 



H.5e-40 



Protein name 



Locus Name 



Ncol DNA modification methyl transt erase 



gp:AP06S75i 



Acc# 



AF068761 



Description 



Nocardia corallina Ncol DNA modi ti cat ion methyltransrerase (ncoIM) and Ncol 
restriction endonuclease (ncoIR) genes , complete cds. 



NT 



ORF Name 



NTID 



AAID 



iaaa3.tt7....ci„Aa2 .....i ps55 



TWIT 



Length Length 



AA 

— . , Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



2&l±l±Ll...tl..3A& I 



TZTT 



Length Length 
TTT 



Score Probability 



TTT 



Protein name 

Description 
MO-HI? 



Locus Name 



Acc# 



737 



ORF Name 



210537 c2 462 



Protein name 



NT ID 



TZTT 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



TFT 



Locus Name 



Acc# 



Description 
MO-HIT 



ORF Name 



NT ID 



&isaM3L&..±a„iAa 1 psss 



Protein name 



AAID 



TTTT 



NT 



AA 



Length Length 
TSTT 



Score Probability 



Locus Name 



Acc# 



Description 
[MO-HIT 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



\l±6.a.0.6.11.±1..21D. I 



17521 



I5TT 



7.0e-8S 



Protein name 



Description 



Locus Name 



sp : SPPA_SYNY2 



Acc# 



P73689 



PROTEASE IV H0M0L0G, (END0PEPTIDA5E IV) 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



ttwt 



T5ZT 



ill 6 I 



or 



2.2e-62 



Protein name 



Locus Name 



receptor antigen (RagA) 



gp:PGI130^72 



Acc# 



AJ130872 



Description 



Porphyromonas gingival is W50 receptor antigen iragj locus encoamga ma] or 
immunodominant 55kDa antigen. 



738 



ORF Name 



21673232 cl 424 



nvzr 



1971 



Protein name 



Locus Name 



Acc# 



Description 
[MO-HIT 



ORF Name 



Protein name 



NT ID 



217.5.7.7.ul..±l...£>3. 



TTWT 



NT AA 

— , — , Score Probability 
AAID Length Length J ~ 



Locus Name 



Acc# 



Description 
MO-HIT 



ORF Name 



NT AA 

— , — , Score Probability 
NTID AAID Length Length JL 



220£A0.B.l...al..AS£. I fTTUl 1 |7525 1 |55 | [2TJT 



1ST" 



0.016 



Protein name 



Locus Name 



cytochrome oxidase subunit II 



gp:TIMY18 821 



Acc# 



Y18821 



Description 



Timarcna metallica mitochondrial tRNA-Leu and. partial COII genes , isolate 
Forest d'Anlier. 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length JL 



IZ2LU.7.8.3.9.1...13....3.3.3... 



Protein name 



Locus Name 



Acc# 



Description 
IMO-HIT 



ORF Name 



NTID 



NT AA 
_ — ^. r — ^. Score Probability 
AAID Length Length J - 



22462842 cl 397 



Protein name 



7927 



Locus Name 



Acc# 



Description 
NO -HIT 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length JL 



2L2fiLa.7.a2L.7....C3L...55X I 12706 



9.Se-05 



Protein name 



Locus Name 



Acc# 



nypotJietical protein 



Description 



pir:H75507 



H75507 



NT 



AA 



ORF Name 



NTID 



£2fiaa4A£...cl„.5Sfl 



[TTuT" 



AAID Length Length 
— 



13756 



Score Probability 

cm — 



5.6e-47 



Protein name 



Locus Name 



hybrid histidine kinase 



|gp:AF029704 



Acc# 



AF029704 



Description 



Dictyostelium discoideum hybrid histidine kinase (dhkD) mRNA, complete cds. 



ORF Name 



|2.3LAa2L.7.a&...Cl...3LSl5t.... 



Protein name 



NTID 



IT7TTS" 



NT AA 

— , — , Score Probability 
AAID Length Length -L 



Locus Name 



Acc# 



Description 

nsro-niT 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length d£ - 



Z3.5.a3.U6..Xl...La&. I 12709 



7931 



Protein name 



hypothetical protein PH0217 



Description 



\TUT 



Locus Name 



pir:G71244 



Acc# 



G71244 



740 



NT 



AA 



ORF Name 



NTID 



22527750 ±2 1SS 



£7TTT 



AAID Length Length 



7ST2 



T7uT" 



Score Probability 
£F5 



l 3.5e-2i 



Protein name 



Locus Name 



immunoreactive 53 JtD antigen PG123 



gp:AF144641 



ACC# 



AF144641 



Description 



Porphyromonas gmgivaiis strain W50 immunoreactive 53 JtD antigenPG123 gene, 
complete cds. 



NT 



AA 



ORF Name 



I2362S061 tS 247 



E7TT 



NTID AAID Length Length 
7331 



Score Probability 



741" 



Protein name 

Description 
ffT^TTTT 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA o n _ _ . _ . 
_ — _ — _ Score Probability 
AAID Length Length JL 



ll&l$Ab±±l...lA.4. I \T7T7 



TUT 



a.3e-!3 



Protein name 



Locus Name 



hypothetical protein (avrc 3 1 region) 



pir :C43649 



Acc# 



C43649 



Description 



ORF Name 



NTID 



AAID 



[TTTT 



733S 



Protein name 

Description 
NO-HI* 



NT 



AA 



Length Length 



Score Probability 



T3UT 



Locus Name 



Acc# 



741 



NT 



AA 



ORF Name 



I22S75636 t2 151 



NTID AAID Length Length 

— 



Score Probability 



E7IT 



TT 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 




7937 



Score Probability 
TT7UTE 



53 



Protein name 



Locus Name 



TrkA 



gp:BSU62055 



Acc# 



U62055 



Description 



Bacillus subtilis CzcD (czcD) gene, partial cds, TrkA (trkA) gene, complete 
c&s . 



ORF Name NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


aiaiaai£..±i...iAL 2715 | 










4.5e-56 


Protein name 








Locus 


Name 


Acc# 










Sp:VP6B 


_METJA 


Q58960 


Description 














HYPOTHETICAL PROTEIN MJ156 5 


















ORF Name NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


mft5fia.7...±2...21i (2717 


7339 


1>S5 


S5§ 


736 


8.9e-73 



Protein name 



Locus Name 



purine nucleoside pnospnorylase 



pir:H72217 



Acc# 



H72217 



Description 



742 



NT 



AA 



ORF Name 



NTID 



24250953 t3 254 



tfht 



AAID Length Length 
7S¥U — 



Score Probability 
55T 



1.9e-98 



Protein name 



Locus Name 



hypothetical protein F10M10.30 



pir :T04772 



Acc# 



T04772 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



M2£.7.m..±3....2.9.2 I \FTT5 



5W 



|4.9e-173 



Protein name 



Locus Name 



methionyl-tRNA synthetase (mets) PAB2364 



pir :B75074 



Acc# 



B75074 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



2I2MIIZcX33!ZZZII| 12720 



7942 



Length Length 



Score Probability 



p5T 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 

— , , — it Score Probability 
Length Length 



2&3.33.13.:L..£2...222. I 12 721 



7MT 



S.le-06 



Protein name 



Locus Name 



hypothetical protein PH0219 



pir:A?l24El 



Acc# 



A71245 



Description 



ORF Name 



NTID 



Z4.3A3..7.5l6....£3....25.5. | H7T2 



Protein name 
Description 

mo-mn 



AAID 



7944 



NT 



AA 



Length Length 
52" 



Score Probability 



HUT" 



Locus Name 



Acc# 



743 



NT 



AA 



ORF Name 



NT ID 



AAID 



24401002 c2 500 



2723 



17945 



Length Length 
TIT 



Score Probability 



Protein name 

Description 
IWO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



2A£±2L26...±2..±A& I [T72¥ 



17946 



2049 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



ORF Name 



NT ID 



AAID 



NT AA 
Length Length 



Score Probability 



ia44&iG...ca...£ai i 



7947 



TIT 



Protein name 

Description 
JWO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



2 726 



7W 



Length Length 



Score Probability 




0.012 



Protein name 



Locus Name 



DNA-directed RNA polymerase, beta' -2 
chain :RNA polymerase rpoC2 



pir :S722B4 



Acc# 



S72284 



Description 



744 



NT 



AA 



ORF Name 



NTID 



124454057 t2 165 



TFTTT 



AAID Length Length 



T5T" 



TU7T 



Score Probability 
[TT8 



Protein name 



Locus Name 



thiol : disulfide interchange protein homolog 
yneN 



Description 



pir :E69891 



l.le-08 



Acc# 



E69891 



ORF Name 



NTID 



NT AA „ , > _ i 
— , — _ Score Probability 
AAID Length Length ^~ 



245ai5Q2...t2...14a. 



Protein name 



Locus Name 



hypothetical protein 



lgp:SSU18930 



Acc# 



Y18930 



Description 



Sultolobus soltataricus 281 kb genomic DNA fragment, strain P2 . 



NT 



AA 



ORF Name 



NTID 



12725 



AAID Length Length 

rra — I &T% — 



Score Probability 



Protein name 
Description 

pimrrT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



12730 



AAID Length Length 



Score Probability 

pro — 



|2.7e-16 



Protein name 



Locus Name 



RNA polymerase sigma tactor SigZ-liJce protein 



gpTAFTTT^T 



Acc# 



AF137263 



Description 



Bacteroides thetaiotaomicron 30S ri&osomai protein sie-iikeprotein, tucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes , 
complete cds . 



745 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



24647188 c2 $02 



TTTT 



7^T 



77 



Protein name 



Locus Name 



0.019 



Acc# 



gp:HIVY16028 



Description 



Y16028 



HIV-1 vit, vpr , tat, vpu genes, strain 95CAMP44 8. 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



37T 



1.4e-41 



Protein name 



Locus Name 



Acc# 



macrophage mtectivity potentiator 



gp:LAU916 06 



U91606 



Description 



Legionella adelaiclensis macrophage mtectivity potentiator (mipj gene, 
complete cds . 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length JL 



2L4&4a4i2...cl...5L5i I 



7955 



TTT 



Protein name 



Locus Name 



0.00012 



Acc# 



gp:SCYDL0B7W 



Description 



S.cerevisiae chromosome IV reading irame ORF YDL057W. 



NT 



AA 



ORF Name 



NTID 



AAID 



TUT 



Length Length 



Tim- 



Protein name 



Score Probability 



Locus Name 



Acc# 



Description 



746 



ORF Name 



24798157 o'l 4b0 



Protein name 



NT ID 



AAID 



NT 



AA 



Length Length 
SSu 



Score Probability 



TIT 



Locus Name 



Acc# 



Description 



KO-HIT 



ORF Name 



Protein name 



NT ID 



AAID 



7958 



NT 



AA 



Length Length 
T7TT 



Score Probability 



nnr 



Locus Name 



Acc# 



Description 



IN0-H1T 



ORF Name 



Protein name 



AlgZ 



Description 



NT ID 



— — Score Probability 
AAID Length Length 



TTTT 



3.1e-14 



Locus Name 



gp:PAU5243i 



Acc# 



U52431 



Pseudomonas aerugi nosa Algk-cognate sensor AigZ laigz; gene , complete cas . 



NT 



AA 



ORF Name 



NT ID 



— Score Probability 
AAID Length Length 



12738 



1143 



Protein name 



Locus Name 



Acc# 



Description 



BTO-HIT 



747 



ORF Name 



NT ID 



— — Score Probability 
AAID Length Length 



25632882 tl lUB 



T7JT 



TIT 



S.5e-2y 



Protein name 



Locus Name 



conserved Hypothetical protein aq_ibbb 



bir:<!7044i 



Acc# 



C70443 



Description 



ORF Name 


NTID 


AAID 


NT AA 
— — Score 
Length Length 


Probability 


2629MM..±L.25£. 


2740 


1962 


1087 


T264 422 


l.le-73 j 



Protein name 



Locus Name 



11SK outer membrane protein precursor : sus<j 
protein 



foinJObOaV 



Acc# 



JC6027 



Description 



NT 



ORF Name 



NTID 



AAID Length Length 



AA 

— Score Probability 



638 



tstt 



TTUT 



2.4e-l?b 



Protein name 



Locus Name 



GTP- binding elongation tactor tamiiy protein 
TypA/BipA 



pir :E!75426 



Acc# 



E75426 



Description 



ORF Name 



NTID 



\26A±52LL..a±„.ll& I 



Protein name 



immunoreactive 52KD antigen ±>U4± 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



TUT 



5.8e-i9 



Locus Name 



gp:APl7b7ib 



Acc# 



AF175716 



Porphyromonas gm givalis strain W50 immunoreactive b2KD antigenFG4l gene, 
complete cds . 



748 



ORF Name 



26595663 C3 550 



Protein name 
Description 

NO-HIT 



NTID 



NT AA 
T — ^ T — , * Score Probability 
AAID Length Length • L 



12 743 



TUTT 



Locus Name 



Acc# 



ORF Name 



NT 



AA 



NTID 



2&&125i&Z.„c2...5ta4 1 12744 



AAID Length Length 
T5T 



7966 



Score Probability 
£71 



Protein name 



Locus Name 



coenzyme PQQ synthesis protein (pqqEJ homolog 



Description 



Acc# 



F69551 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length i - 



^6.6.^aaa.7....JL3....25.2 1 IT7¥S 



Protein name 

Description 
NO-HIT 



2745 




7967 







Locus Name 



Acc# 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— , Score Probability 
Length J - 


Z^7.M..±3....3.20. 


2745 


7368 


60 


183 



Protein name 

Description 
1N0-HIT 



Locus Name 



Acc# 



ORF Name 



NT 



AA 



NTID 



AAID 



I 



T7VT 



Length Length 



TTF 



Score Probability 
TZZ 



Protein name 



Locus Name 



probaftie transcription regulator 



pir :T34578 



Acc# 



T34578 



Description 



749 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



29475676 c2 461 



T7TT 



3.5e-16 



Protein name 



Description 



Locus Name 



Acc# 



DNA- BINDING PROTEIN II (HB) (HOT 



ORF Name 



NTID 



AAID 



125775466 cl 427 



7971 



Protein name 



Hypothetical protein ape 062 6 



Description 



NT 



AA 



Length Length 



Score Probability 
TU3 



l.le-05 



Locus Name 



pir:£>72649 



Acc# 



D72649 



NT 



AA 



ORF Name 



NTID 



AAID 



2MASAll...at..A20. I |T7^u" 



7TTTT 



Length Length 



Score Probability 
3.4e-B7 



Protein name 



Locus Name 



hypothetical protein 



bp:PGI237a9fl 



Acc# 



AJ237898 



Description 



Porpnyromonas gingival is olpA and r£>lA genes and ORF3 (partial) , strain 
(ATCC33277. 



NT 



AA 



ORF Name 



NTID 



AAID 



3.D.5.6.3.15.2L...c3....5i7A.. 



12751 



7973 



— — , Score Probability 
Length Length 

ST" 



TFT 



Protein name 

Description 
1W0-HTT 



Locus Name 



Acc# 



750 



ORF Name 



306315^1 ±J ill 



Protein name 



NTID 



7TT% 



— — Score Probability 



AAID Length Length 
TT5 



TUT 



Locus Name 



Acc# 



Description 



[W0-H1T 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



7975 



TUFT 



|2.5e-i07 



Locus Name 



sp:KPYK>OkBU 



Acc# 



051323 



ORF Name 



NTID 



Protein name 



hypothetical protein UKFZpb66Ui«^4 . i 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



TUT 



TZTT 



[TuTT 



Locus Name 



pir :T14767 



ACC# 



T14767 



ORF Name 



NTID 



AAID 



1275b 



7T7T 



Protein name 



NT 



AA 



Length Length 



— Score Probability 



Locus Name 



Acc# 



Description 



751 



ORF Name 



31750058 12 128 



Protein name 

Description 
IMO-HIT 



NT ID 



AAID Length Length Probability 



737F" 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



TFZT 



hypothetical protein slr0904 



Description 



NT 



AA 



T — ^, T — ^ Score Probability 
AAID Length Length ^ 

7373 — 



T357"" 



Locus Name 



pir :S75721 



1.2e-I39 



Acc# 



S75721 



ORF Name 



NT ID 



AAID 



NT AA , , . , . 
— ^ — L1 Score Probability 
Length Length -L 



3.^a&^is>3L...cx...3La:/. i 12758 



735tt 



T.9e-i43 



Protein name 



Locus Name 



UDP-ManNAc dehydrogenase 



|gp:AP125i64 



Acc# 



AF125164 



Description 



Bacteroides tragilis 638R polysaccharide B (PS B2) niosynthesislocus, 
complete sequence; and unknown genes. 



NT 



AA 



ORF Name 



NT ID 



12U5.10..±l..lll I [2753 



AAID Length Length 
73SI — 



T73~ 



Score Probability 




ll.le-07 



Protein name 



Locus Name 



hypothetical protein 



pir:G«375 



Acc# 



G75375 



Description 



752 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
7982 



Score Probability 
10.047 



TUT 



Protein name 



Locus Name 



ototerlin 



gp:AF107403 



Acc# 



AF107403 



Description 



Homo sapiens ototerlin (OTOF) mRNA, complete eels . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
JW3 



Score 



7&T 



Probability 
0.020 



Protein name 



Description 



Locus Name 



Acc# 



gpTWMM^PT 



Plasmodium falciparum MAL3P7, complete sequence. 



NT 



AA 



ORF Name 



NTID 



lliaS.140.5....al...l7.6. I 



AAID Length Length 
7584 



FT 



"ITT 



Score Probability 
1. 9e-06 



TOT 



Protein name 



Locus Name 



hypothetical protein PHS004 



pir :F71245 



Acc# 



F71245 



Descri ption 



NT 



AA 



ORF Name 



NTID 



AAID 



l±a$AX£X..±2...Z±& I WTZT 



Length Length 
TIT 



Score Probability 
15? 



2.6e-ll 



Protein name 



Locus Name 



clindamycin resistance transfer t actor btgA 



fpir:A416S6 



Acc# 



A41656 



Description 



NT 



AA 



ORF Name 



NT ID 



^417^1^7 c2 543 



T7W 



AAID Length Length 
7533 — 



TTTT 



TIT" 



Score Probability 
373 



0.022 



Protein name 



Locus Name 



transposase 



bp:EFEKITIJO 



Acc# 



Y16413 



Description 



Enterococcus taecium entl and entJ genes and two open readmgtrames, 



NT 



AA 



ORF Name 



34176550 c3 549 



NT ID AAID Length Length 



TU4T- 



Score Probability 




1.7e-13 



Protein name 



Locus Name 



mtegrase IntNl 



gp:BUU51917 



Acc# 



U51917 



Description 



Bacteroides unitormis insertion element NBUl iragment, integraselntNl gene, 
complete cds . 



NT 



AA 



ORF Name 



NTID AAID Length Length 

7988 



[ETT 



1ST 



Score Probability 
55 



0.024 



Protein name 



Locus Name 



immunoglobulin heavy chain variable region 



gp:BTU497S3 



Acc# 



U49783 



Description 



Bos taurus immunoglobulin rearranged heavy chain variable regionmRNA, 
partial cds . 



ORF Name 



NTID 



AAID 



NT AA 
T r~^u Score Probability 
Length Length 



2767 



7WT 



TuT" 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



754 



ORF Name 



JblV^lbU ±2 152 



Protein name 



Description 
INO-HIT 



NTID 



AAID 



NT AA 
Length Length Probability 



7990 



7T 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



AAID 



Hypothetical protein 



Description 



NT AA 
Length Length Probability 

T7TT 



TTTT 



TT7TT 



Locus Name 



pir : jgi020 



'l.be-150 



Acc# 



JQ1020 



ORF Name 



Protein name 



NTID 



T77TT 



Hypothetical protein £>H0554 



Description 



NT AA 

~~ h Score Probability 



AAID Length Length 




TTT 



Locus Name 



pir :e vioyi 



Acc# 



E71091 



ORF Name 



NT 



NTID 



AAID 



±xr£lhl.±l„.z±!k I [T77T 



Protein name 

Description 
IMO-HIT 



Length Length 
74— 



AA 

— ^ Score 



Locus Name 



Probability 



Acc# 



ORF Name 



Protein name 

Description 
INO-HIT 



NTID 



AAID 



NT AA 
Length Length S °° re 



WTTT 



7 994 



55" 



207 



Locus Name 



Probability 



Acc# 



755 



ORF Name 



NT ID 



NT AA 

AAID Length Length Probability 



40^890 ci 425 



TTTT 



7995 



Protein name 



sensory transduction nistidme kinase 
slr2098:protein slr2098 :protein slr2098 



Description 



TFT?" 



Locus Name 



pir:S7S130 



-i .2e-40 



Acc# 



S75130 



ORF Name 



Protein name 



Description 
|M0-HIT 



NT ID 



AAID 



NT AA 
Length Length Probability 



TT7T 



T¥T 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



7997 



NT AA 
Length Lenjth Probability 



b.ie-52 



Protein name 

Description 
HYPOTHET ICAL PROTEIN 



Locus Name 



sp: YWOJffiTJA 



Acc# 



Q58010 



ORF Name 



Protein name 



neiicase 



NTID 



AAID 



12775 



Description 
Knodotnermus mannus dnaB gene. 



NT 



AA 



Length Length 



Score Probability 
U.3e-58 



Locus Name 



gp : RNDNAB 



Acc# 



Y13813 



756 



ORF Name 



NT ID 



NT AA 
AAID Length Length SC ° re 



I4117J36 ti $0 



TT7T 



7^T 



Protein name 



hypothetical protein jnp0094 



Description 



T8T 



Locus Name 



pir:E71575 



Probability 
|8.9e-34 



Acc# 



E71975 



NT 



ORF Name 



AA 



NTID 



, , _ — ^, Score Probability 
AAID Length Length iL 



iih.ia.B.a^i^iM I vrm 



8000 



[TT3T 



li.le-122 



Protein name 



Locus Name 



putative UDP-N-acetyigiucosamme 2-epimerase I Igp : ALW243431 



Acc# 
AJ243431 



Description 



Acmetooacter iwottn wzc, wzb, wza, weeA, weeB, wceC, wzx, wzy,weei), weeE, 
weeF, weeG, weeH, weel, weeJ, weeK, galU, ugd, pgi,galE, pgm (partial) and 
mip (partial) genes (emulsan biosyntheticgene cluster), strain RAG- 1 . 



ORF Name 



NTID 



AAID 



NT AA 
Length Length Probability 



4i.^u^ij.a...a2...d7.A j vrrr? 



8001 



1014 



5045 



7TT 



3.0e-68 



Protein name 



Locus Name 



Acc# 



Description 
HYPOTHETICAL PROTEIN HI0895 



ORF Name 



Protein name 

Description 
pjU-Hl ' l 1 



NT AA 

NTID AAID Length Length Probability 



2780 



8002 



Locus Name 



Acc# 



757 



NT 



AA 



ORF Name 



NT ID 



AAID 



4491411 ±2 133 



2781 



Length Length Probability 



b .4e-05 



Protein name 



Locus Name 



putative large secreted protein 



gp:SCF12 



Acc# 



AL117669 



Description 

Streptomyces coelicolor cosmia F12 . 



ORF Name 



NT ID 



NT AA 

AAID Length Length Probability 



4494028 t3 2$$ 



1191 



l.le-16 



Protein name 



Locus Name 



capsular polysaccnanae rnosyntnsis protein 



Description 



pir :F70441 



ACC# 



F70441 



NT 



AA 



ORF Name 



NTID 



AAID 



\&b.bM.8.1..±±..3£. 



2783 



8005 



Length Length 



Score Probability 



Protein name 

Description 
IN0-H1T 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID Length Length Probability 

mm — 



TUT 



Protein name 

Description 
IMO-fllf 



Locus Name 



Acc# 



758 



ORF Name 



4fe94410 ta ^43 



Protein name 



Description 



WO-HIT 



NTID 



AAID 



NT AA 
t — r — , -i Score 
Length Length 



Locus Name 



Probability 



Acc# 



ORF Name 



!4aaib.aii„±3....io.a.. 



Protein name 



NTID 



Hypothetical protein F13H8 . 1 



Description 



NT 



AA 



T — T — Score Prob ability 
AAID Length Length JL 

— 



5B" 



Locus Name 



pir :T16066 



0.0052 



Acc# 



T16066 



ORF Name 



NTID 



\±&6ALbA±l..±± I 12737 



Protein name 



ct602 nypotnetical protein 



Description 



NT 



AA 



AAID Length Length 

mu3 — 



Score Probability 
75 



Locus Name 



| |pir:F72036 



0.0095 



Acc# 



F72036 



ORF Name 



Protein name 

Description 
(NO-HIT 



NTID 



NT AA 
T — _ _ — ^. Score Probability 
AAID Length Length ■ L 



WIT 



Locus Name 



Acc# 



ORF Name 



NT AA 

NTID AAID Length Length Probability 



| 4B.&l6.&l...cl...j.b.b. 1 



[ITT 



9.3e-07 



Protein name 



Locus Name 



protein gp5 7 



pir :T13144 



Acc# 



T13144 



Description 



759 



ORF Name 



NT ID 



NT AA 
_ — _ — Score Probability 
AAID Length Length L - 



48b66b2 £3 280 



£7W 



8012 



\T77T 



0.045 



Protein name 



Locus Name 



nypotnetical protein PFB0765W 



pir :E71606 



Acc# 



E71606 



Description 



ORF Name 



NT ID 



AAID 



NT AA 

t — ^ r — Score 
Length Length 



2791 



Probability 
3 . Oe-56 



Protein name 



Locus Name 



tyrosine recomDmase XerD 



lgp:AF0^548 



Acc# 



AF093548 



Description 



stapnylococcus aureus tyrosine recomtunase XerD (xerD) gene, complete cds. 



NT 



AA 



ORF Name 



NTID AAID Length Length 



Score Probability 
ET3 



7.8e-15 



Protein name 



Locus Name 



probable mannosyltranst erase 



pir :C75423 



Acc# 



C75423 



Description 



ORF Name 



NTID 



NT AA 

AAID Lenjth Length Probability 



8015 



T7T" 



1.3e-16 



Protein name 



Locus Name 



nypotnetical protein APE1457 



pir ;A72625 



Acc# 



A72625 



Description 



NT 



AA 



ORF Name 



b.D.6.5.Q.2...t2...14:£ 



£754 



NTID AAID Length Length 

wnz — 



Score Probability 
ST2 



|4.;Je-50 



Protein name 



Locus Name 



epoxidase 



pir :F69187 



Acc# 



F69187 



Description 



760 



ORF Name 



buyj/yo c3 571 



Protein name 

Description 
WO-HIT 



NT 



AA 



NT ID 



AAID Length Length Probability 
37TT7 — 



T7TT 



Locus Name 



Acc# 



ORF Name 



Protein name 

Description 
NO-HIT 



NT 



AA 



NT ID 



AAID Length Length 
— 



Score Probability 



TuTT 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



AAID 



TTWT 



8019 



probable acia--uoA ligase, MTH657 



Description 



NT 



AA 



Length Length 



Score 



flWTT 



Probability 



Locus Name 



pir :D6yi87 



Acc# 



D69187 



ORF Name 



Protein name 

Description 
INO-HIT 



NT 



AA 



NTID 



7VAT _ _ — — _ Score Probability 
AAID Length Length ^ 





FT 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 
r — ^ r — , -i Score 
Length Length 



TuTT 



Probability 
|5.0e-17 



Locus Name 



cation ertlux system (czcB-liite) 



|pir:C70415 



Acc# 



C70415 



Description 



761 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 

mn — 



2sr 



Score Probability 
73 



0.054 



Protein name 



Locus Name 



Hypothetical protein PH022 0 



pir :B71245 



ACC# 



B71245 



Description 



ORF Name 



NTID 



NT AA 
— — ^rorp 

AAID Length Length 



25T 



Protein name 

Description 
INO-HTT 



Locus Name 



Probability 



Acc# 



ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 




2^2 


&024 


143 432 


379 




6 .ie-35 



Protein name 



Locus Name 



3-dehydroqumate dehydratase, : carbonic 
3 - dehydroquinase : protein slllll2 : carbonic 
3 -dehydrogenase: prote in sllll 1 2 



pir:S7755i 



Acc# 



S77551 



Description 



NT 



AA 



ORF Name 



6.4&.7..7.6.2„.a2...fi:L& I para 



NTID AAID Length Length 

8025 



TTT 



Score Probability 

jzz — 



i.0e-32 



Protein name 



Locus Name 



KB FA, putative 



gp:PGt23?S£§ 



Acc# 



AJ237898 



Description 



porptiyromonas gmgivaiis olpA and rJDtA genes and 0RF3 (partial) , strain 
IATCC332 77. 



762 



NT 



AA 



ORF Name 



NTID 



AAID 



6586 ±2 129 



Length Length 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



L&2yAb.2..±iJi$& I nms 



AAID Length Length 

wm — 



TTJT 



Score Probability 




3.2e-52 



Protein name 



Locus Name 



0- antigen repeat unit transporter Wzx 



bp:Afi72324 



Acc# 



AF172324 



Description 



Escherichia coli GalF (galF) gene, partial cds; O-antigen repeatunit 
transporter Wzx (wzx), WbnA (wbnA) , O-antigen polymerase Wzy(wzy) , WbnB 
(wbnB) , WbnC (wbnC) , WbnD (wbnD) , WbnE (wbnE) , UDP-Glc-4-epimerase GalE 
(galE) , 6-phosphogluconate dehydrogenaseGnd (gnd) , UDP-Glc- 6 -dehydrogenase 
Ugd (ucfd) , and WbnF (wbnF) genes, complete cds; and chain length determinant 



NT 



AA 



ORF Name 



NTID 



AAID 



W7W 



Length Length 



Score Probability 
TZZ 



Protein name 



Locus Name 



Hypothetical protein RP336 



pir :B71690 



Acc# 



B71690 



Description 



ORF Name 



Protein name 



NTID 



2807 



O-methyl transferase 



Description 



NT 



AA 



AAID Length Length 

mrs — 



Score Probability 




Locus Name 



bir:B7043i 



3.0e-33 



Acc# 



B70431 



763 



ORF Name 



Vsmil ±2 234 



Protein name 



Description 



[MO-HIT 



NT ID 



AAID 



NT AA 
t — , ^ T — ^, Score 
Length Length 



1ST 



Locus Name 



Probability 



Acc# 



ORF Name 



aiiasna...£2L...iaa.. 



Protein name 



NT 



AA 



NT ID 



AAID Length Length 
BOTI — 



TuT" 



Score Probability 
TT2 



±.2b-06 



Locus Name 



nypotnetical protein PH1791 



pir:G71i§5 



Acc# 



G71189 



Description 



ORF Name 



ab.Ul...cl..AZZ.. 



Protein name 

Description 
INO-HIT 



NT 



AA 



NTID 



AAID 



Length Length 
WIT 



Score Probability 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 
— — Score 
Length Length 



TIT 



7T 



Probability 
10.014 



Locus Name 



yhcV nomolog 2 : inos me -monophosphate 
dehydrogenase (guaB-2) homolog (misnomer) 



pir :F69514 



Acc# 



F69514 



Description 



764 



NT 



AA 



ORF Name 



NTID 



37S577 t2 150 



AAID Length Length 
— 



Score Probability 

— 



6 . le-05 



Protein name 



Locus Name 



hypothetical protein 



gp:SSUl§530 



Acc# 



Y18930 



Description 

Sultolobus soltataricus 2 81 kb genomic DNA tragment, strain P2 . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
SuT5 



Score Probability 




1.2e-24 



Protein name 

Description 
REGULATORY PROTEIN ASNC 



Locus Name 



sp : ASNC_ECOLI 



ACC# 



P03809 



ORF Name 



NTID 



NT AA 

_ _ _ _~ — ^, — _ Score Probability 
AAID Length Length ^ 



ffTT 



TUT 



0.0013 



Protein name 



Locus Name 



thiol : disul tide interchange protein 



bir:C70314 



Acc# 



C70314 



Description 



ORF Name 



NTID 



10.13.13.1.?...±1„.2lD. I I2S1S 



Protein name 



hypothetical protein Jv0534 



Description 



NT 



AA 



AAID Length Length 
WUTJ — 



Score Probability 
53 



Locus Name 



gp:AP121005 



2 .le-06 



Acc# 



AF121009 



Mycobacterium tuberculosis H37Rv hypothetical protein Jv0534 ( Jv0534) mkNA, 
complete cds . 



765 



NT 



AA 



ORF Name 



NT ID 



1456<;5$2 ci 74 



AAID Length Length 

mm — 



Score Probability 

wn — 



l.2e-43 



Protein name 



Locus Name 



sensory transduction mstidine Kinase 
slr2098 :protein slr2098 rprotein slr2098 



bir:S75130 



Acc# 



S75130 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



i&5L7.&a6a...ci...aa 



12817 



VU3T 



Length Length 
T3 - 



Score Probability 



Protein name 

Description 
IN^TTTT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
SMu — 



Score Probability 

— 



7.1e-80 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



pir : JC6027 



Acc# 



JC6027 



Description 



ORF Name 



NTID 



AAID 



\!±3A&53A...al...±6A I |28l£ 



8041 



Protein name 

Description 
NO-HIT 



NT 



AA 



Length Length 



Score Probability 



TUT 



Locus Name 



Acc# 



766 



NT 



AA 



ORF Name 



NT ID 



AAID 



Length Length 



Score Probability 




2 . 4e-19 



Protein name 



Locus Name 



hypothetical protein RP32 9 



pir :C71689 



ACC# 



C71689 



Description 



ORF Name 



NTID 



ia5L3.a.7..7....tl...Z&. I 12821 



Protein name 



NT 



AA 



AAID Length Length 

mm — 



Score Probability 



Locus Name 



Acc# 



Description 
MO-HIT 



ORF Name 



NTID 



±^m2:L±2...ii I 



Protein name 



AAID 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 
NO-HIT 



ORF Name 



2.DA3..7.6.6.2...13....5.4.. 



Protein name 



NTID 



AAID 



2823 



regulatory protein hpaA 



Description 



NT AA 

t^+-h r^^-h Score Probability 
Length Length 



JUT 



TFT" 



|2.2e-u8 



Locus Name 



Acc# 



pir :A55349 



NT 



AA 



ORF Name 



NTID 



AAID 



22iA&26.i...al„±l± I 12324" 



Length Length 
1ST 



Score Probability 
STu 



|4.6e-46 



Protein name 



Locus Name 



cation erf" lux (AcrB/AcrD/AcrF family) 



[pir:F7u3S8 



Acc# 



F70368 



Description 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length 



22365790 ti 6 



fott 



75" 



TIT 



0.012 



Protein name 



Locus Name 



unknown 



gpiAPOOVlbV 



Acc# 



AF007157 



Description 

Homo sapiens clone 23856 unknown mRNA, partial cds . 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



2354SSS0 c2 120 



282S 


S04S 


2S3 


752 




2?S 





|6.3e-24 



Protein name 



Locus Name 



Acc# 



Description 

HYPO T H E TICAL 27.9 KB PROT E IN IN MOLR -BGLX INTERGENIC REGION 



NT 



AA 



ORF Name 



NTID 



AAID 



T^TT 



PIT 



Length Length 



Score 



T7uT" 



Probability 
4 . le-50 



Protein name 



Description 



Locus Name 



sp:A(!DB_]AAti{U 



Acc# 



P45857 



ACYL-COA DEHYDROGENASE, 



ORF Name 



NTID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



Protein name 



Locus Name 



Acc# 



Description 



768 



ORF Name 



24407552 c2 102 



Protein name 



Description 



NT 



AA 



NT ID 



AAID 



Length Length 
353 



Score Probability 



\1.2e-55 



Locus Name 



Acc# 



sp:DMAJ_THETH 



DNAJ PROTEIN 



ORF Name 



Protein name 



Description 



NT 



AA 



NT ID 



AAID Length Length 



Score Probability 
|S.le-70 



7TU 



Locus Name 



sp : FIXB_CLOAB 



ACC# 



P53578 



PTXB PROTEIN 



ORF Name 



Protein name 



Description 



NT AA 

— „ — , Score Probability 
NT ID AAID Length Length 



l.Se-41 



Locus Name 



gp:ATAC005S51 



Acc# 



AC005851 



Arabidopsis thai i ana chromosome II BAC F24D13 genomic sequence, complete 
sequence . 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length 



\2&A$.0£L±1..5.6. 



STO4" 



7u" 



TTT 



Protein name 



Locus Name 



Acc# 



Description 



769 



NT 



AA 



ORF Name 



NTID 



AAID 



29476!^ ti 6 



Length Length 



— Score Probability 



T3T 



Protein name 

Description 
[MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



Score 



Probability 
|i.ie-07 



Protein name 



Locus Name 



cysteine proteinase CP1 



pir:SS748i 



Acc# 



S67481 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



ZZI pTSTT" 



Protein name 



Description 



Locus Name 



Acc# 



[NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



llL±$5&l...z±...$.6. 



2836 



7TT 



i.6e-0$ 



Protein name 



Locus Name 



conserved hypothetical protein 



bir:B75482 



Acc# 



B75483 



Description 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



805$ 



3.$e-47 



Protein name 
Description 

TRANSFER FLAV0PR0TEIN SMALL StIBDMIT) (ETF^S) 



Locus Name 



sp:ETFB_CLOAIi 



Acc# 



P52040 



770 



NT 



AA 



ORF Name 



NTID 



AMD 



34172130 ±3 65 



Length Length 

— 



Score Probability 
|i.3e-I70 



1253" 



Protein name 
Description 

BfiTA-GAlACTOSlfiASS , (La«aS£) 



Locus Name 



|sp:BGAL_BAdMl! 



Acc# 



052847 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



35556713 c2 100 



$061 



7T 



234 



Protein name 



Description 



Locus Name 



Acc# 



[NO- HIT 



ORF Name 



NTID 



NT AA 

— , — , Score Prob ability 
AAID Length Length JL 



IMF 



TTTT 



|2.8e-i8 



Protein name 



Locus Name 



Acc# 



sp:¥fiHU_El(I0Ll 



Description 

HYPO T HETICAL 62.1 KB PROTEIN IN MOLk-kGLX INTERGE NIC REGION PRaOUk^Qk 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Prob ability 
Length Length 



4Q2.aa6.2...cZ...103... 



TUT 



0.00026 



Protein name 



Locus Name 



transcription regulator MerR tamiiy 



pir:D70361 



Acc# 



D70361 



Description 



771 



NT 



AA 



ORF Name 



NT ID 



I44752S1 tl 14 



AAID Length Length 
— 



5TT7" 



Score Probability 
3.4e-ii 



T7F7 



Protein name 



Locus Name 



unknown 



|gp:U96771' 



Acc# 



U96771 



Description 



Prevotella bryantii putative polygalacturonase, B-l, 4-endoglucanase, ana 
mannanase genes, complete cds; and unknowngenes . 



ORF Name 



669458S c3 lis) 



Protein name 



NT ID 



NT AA 

— — , Score Probability 
AAID Length Length 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



NTID 



AAID 



NT AA 
— — ■ Score 
Length Length 



\X$.6.11$.11..±1...± 



TUT 



Probability 
|0.0oui£ 



Protein name 



Locus Name 



penicillin-binding protein 2 



gp:AF14744^ 



Acc# 



AF147448 



Description 



Pseudomonas aeruginosa strain PAOl penicillin-binding protein 2 tpbpA; , 
rod- shape-determining protein (rodA) , membrane -bound lytictransglycosylase 
(mltB) , rare lipoprotein A (rlpA) , penicillin-binding protein 5 (dacA) , and 
lipoate biosynthesisprotein B (lipB) genes, complete cds; and unknown gene. 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 

rm — 



Score Probability 
Oe^ 



TS7 



Protein name 



Description 



Locus Name 



sp:RODA_HAI<!Ijsl 



Acc# 



P44468 



ROD SHAPE -DETERMINING PROTEIN RODA 



772 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



14865875 t3 5 



TMT 



1WT 



7.3e-78 



Protein name 



Locus Name 



sp:FEOB_METJA 



Acc# 
Q57986 



Description 
FERROUS IRON TRANSPORT SROTfiltf B HOMOLOG 



ORF Name 



Protein name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 





§065 


$4 


2SS 



Locus Name 



Acc# 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



lul3A&3.£..±2>...lCl I 



ST3TT 



ii.0e-05 



Protein name 



Locus Name 



unknown 



gp:AFl24349 



Acc# 



AF124349 



Description 

Zymomonas mobilis ZM4 tosmid clone 41A4, complete sequence. 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



\±03A0.95.2..±±JX I 



\SU7T~ 



ITT 



|2.ie-05 



Protein name 



Locus Name 



Acc# 



|sp:PMB_RAT 



035264 



Description 

ACTIVATING FACTOR ACEx YLHYDROLAS 2 ALPHA 2 SUBMIT) (PaF-AH ALPHA 2) 



773 



ORF Name 



24694837 ri 2 



Protein name 



NT ID 



AAID 



NT 



AA 



Length Length 



Score Probability 



S7TT 



Locus Name 



Acc# 



Description 



KO-HIT 



ORF Name 



2A25.2116....Q1JX1. 



Protein name 



NT ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



TWO" 



$.4e-54 



Locus Name 



Acc# 



aaenylate cyclase nomolog 



Description 



pir:T171yV 



T17197 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



wit 



1008 I \wrr 



WIT 



Protein name 



Locus Name 



6.9e-82 



Acc# 



receptor antigen (RagA) 



gp:PGIU0^72 



Description 



AJ130872 



Porphyromonas gingivalis W50 receptor antigen (rag) locus encodinga major 
immunodominant 5 5kDa antigen. 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



6.2.5..7.8.2.fi...i,3....12... 



TTT 



0.012 



Protein name 



Locus Name 



Acc# 



cyclic beta 1-2 giucan syntnetase 



pir :T31419 



T31419 



Description 



774 



ORF Name 



Protein name 



Description 



NT 



AA 



NT ID 



AAID Length Length 



T7T 



Score Probability 
|1.3e-I4 



TF7 



Locus Name 



sp:Y32&_yYNY3 



Acc# 
Q55535 



IEIC 3.1.3.48} 



ORF Name 



11930451 tl 2 



Protein name 



Description 



NT 



AA 



NT ID 



AAID Length Length 



Score Probability 



2F51T 



TTT 



TT3" 



|2.§e-M 



Locus Name 



ACC# 



Q55535 



(EC 3 ,1.3 .40) 



ORF Name 



21S.7.5..7.7.6....al...i7... 



Protein name 



NTID 



AAID 



NT AA ^ _ - , . . . . 
— — Score Probability 
Length Length 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



WU7T 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



775 



NT 



AA 



ORF Name 



NT ID 



AAID 



24525761 ti 1 



Length Length 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



2Sa5ai25Lt:L.5L 



AAID Length Length 



TTTT 



Score Probability 
\A.2e-M 



557 



Protein name 

Description 
(GLtJftS) 



Locus Name 



sp : SYE_BACST 



ACC# 



P22249 



NT 



AA 



ORF Name 



NTID 



|Sl2fi5&fi3™c2L„.lfiL I I^TT 



AAID Length Length 
SUSS — 



Score Probability 
|4.le-68 



Protein name 



Locus Name 



Acc# 



sp:VOPf_fiAdSU 



Description 

HYPOTH E TICAL 79. a KB PROT E IN IN PHOH-DGKA INTUR GENIC imtilOJM 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



msmiiu 



TIT 



7.5e-i0 



Protein name 



Locus Name 



|sp:YEAQ_E<L*OLl 



Acc# 



P76246 



Description 

HYPOTHETICAL 5 . 7 Kt> PROTSIN IN GAPA-RNft MfERSBKfl C REGION 



776 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



mm 



Score Probability 
FFTSe^ 



Protein name 



Locus Name 



3 OS ribosomai protein sv 



bp:AP0874i4 



Acc# 



AF087414 



Description 



Haemophilus ducreyi OapA loapA) , OapB loapB) , RtaF (rtaF) , 30Srit>osomai 
protein S12, and 30S ribosomal protein S7 genes, completecds; and elongation 
factor G gene, partial cds . 



NT 



AA 



ORF Name 



NTID 



1$2§12 cl 2§ 



AAID Length Length 
— 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



NTID 



NT AA ^ _ , , . - . . 
— — , Score Probability 
AAID Length Length dL 



maiflCi...c3L„.eii 



TOM- 



TIT 



444 



Protein name 



Locus Name 



hypothetical protein Jv0166c 



gp:AP12i004 



Acc# 



AF121004 



Description 



Mycobacterium tuberculosis H37Rv nypotneticai protein Jv0i66c ( Jv0i66cj 
mRNA, complete cds . 



ORF Name 



NT AA 

— — , Score Probability 
NTID AAID Length Length 



mM&a2i...ci„.2a I pro? 



7.0e-20 



Protein name 
Description 

HYPO T HE TI CAL 66.3 KB PRO T E IN IN HAG 2 5 1 REGION 



Locus Name 



sp:YHA2_UlK(J0 



Acc# 



P35649 



777 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



122460877 £2 5 



3T5W 



i.4e-45 



Protein name 

Description 

kIBOaOMAL PROTglM S12 



Locus Name 



sp:RS12_AWANl 



Acc# 



P18662 



NT 



AA 



ORF Name 



NT ID 



24i75M6 c^ b6 



2867 



AAID Length Length 
— 



Score Probability 



90 



273 



Protein name 



Description 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



24.&M7.1.1...H...& 



709 



2130 



2946 



070 



Protein name 



Locus Name 



EP-G 



bp:Afi03546<> 



Acc# 



AB035469 



Description 



Porphyromonas gingivalis gene tor ef-g, complete cas, strain : SUNY1021 . 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 
TTS 



Score Probability 
2.5e-29 



Protein name 



Locus Name 



nbosomal protein S10 



gp:AF1152&:i 



Acc# 



AF115283 



Description 

Leptospira interrogans SlO-spc -alpha locus, complete sequence. 



778 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



342752S0 16 



FT 



PET" 



Score Probability 
TT7UT2 



Protein name 



Locus Name 



D-23 protein 



gprGHLEA^ 



Acc# 



X13203 



Description 



Cotton set 5A Lea gene tor seed protein D-29. 



NT 



AA 



ORF Name 



NTID 



AAID 



14064027 cl 2$ 



12871 



Length Length 



1149 



Score Probability 
1.0e-10 



Protein name 



Locus Name 



hypothetical protein RP338 



pir :D7ieyu 



Acc# 



D71690 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
™ — 



sum 



Score Probability 
[DTT5 



Protein name 



Locus Name 



Acc# 



033431 



Description 

BETA 1 CHAIN) (RNA POLYM E RASE BETA ' SIM MIT) ( Jb'kA(iMKJsJTj 



NT 



AA 



ORF Name 



NTID 



HMULi I pro 



AAID Length Length 
TITT 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



INO-HIT 



779 



NT 



AA 



ORF Name 



NTID 



I45SI260 ±1 6 



AAID Length Length 
TIT 



Score Probability 
7.9e-26 



Protein name 



Description 



Locus Name 



sp:RL3_THJ!!TH 



Acc# 



P52860 



505 ftlfiOSOMAi MlOTEirt L3 



NT 



AA 



ORF Name 



NTID 



501401 cl 21 



AAID Length Length 



Score 



109 



Probability 
|3.2e-05 



Protein name 



Description 



Locus Name 



l sp:YHA2JiiikcJO 



Acc# 



P35649 



HYPOTHETICAL 66 .J JKE) PROTEIN IN HAG1> 5 1 REGION 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score 



Probability 
|2.6e-S9 



Protein name 



Locus Name 



DNA- dependent RNA polymerase summit Joeta 



gp:LMY16468 



Acc# 



Y16468 



Description 



Lxsterla monocytogenes umdentir ied gene and partial rpoB gene. 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 




Score Probability 
7.0e-20 



Protein name 
Description 

HYPOTHETICAL 66 .j RD PROTEIN lis! HMi'2 5 ' REGION 



Locus Name 



Acc# 



P35649 



780 



ORF Name 



NTID 



110271W ci 174 



Protein name 



hypothetical protein PH0133 



Description 



NT 



AA 



AAID Length Length 



Score Probability 
II .4e-0b 



TU7 



Locus Name 



pir :C7il>i4 



Acc# 



C71234 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



lllftttiil^t^a | pr? 



prnr 



I3T 



|7.5e-0b 



Protein name 



Locus Name 



IsprV^JM EdoLl 



Acc# 



P42594 



Description 

HYPOTHETICAL ib.O KB J^kOTKlN IN EBticJ-UXAA IN ' l'Uft SENlC kUciluU 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probabi lity 
Length Length 



148B 



|5.8e-iS<> 



Protein name 



Description 



Locus Name 



sp : IMDH__AgUAE 



Acc# 



067820 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Proba bility 
Length Length 



TTTUT 



7M~ 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



781 



NT 



AA 



ORF Name 



NT ID 



13757180 n ivi 



AAID Length Length 




SIM 



FIT 



Score Probability 
TTTTm 



Protein name 



Locus Name 



nypotnetical protein APE1598 



Description 



bir:A72b^9 



Acc# 



A72539 



ORF Name 



Protein name 



Description 



NT 



AA 



NT ID 



AAID 



12883 



Length Length 



Score Probability 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NT ID 



AAID 



Length Length 




Score Probability 



27 



Locus Name 



Acc# 



(NO-HIT 









NT 


AA 


ORF Name 


NTID 


AAID 


Length 


Length 


l±&MAll...Q±Jli):l | 


2885 


8107 


85 


270 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



NT 



AA 



NTID 



AAID 



2885 



Length Length 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



[NO-HIT 



782 



ORF Name 







NT 


AA 


NT ID 


AAID 


Length 


Length 


2887 


8109 


102 


309 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



i£&mi2...ci...m 



1288& 



18110 



Length Length 

— 



Score Probability 
3.1e-0fl 



TTJ 



Protein name 



Locus Name 



nypotneticai protein aq_nu,3 



pir:A703yb 



Acc# 



A70395 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



l£.&12Mb....cl...li*4.. 



TOT 



TTZT 



TT7T 



2.3e-177 



Protein name 



Locus Name 



hypotnetical protein 



pir : jgiu^u 



Acc# 



JQ1020 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score 



WTTT 



1ST 



Probability 
|6.0e-l2 



Protein name 



Locus Name 



sp:£L32_BA<JST 



ACC# 



P07840 



Description 

505 RIBOSOMAL PROTEIN ( RIBOSOMAL PROTEIN I) ibi^vj 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



I£8.3.a0.u2...cl...l&l.. 



TTTTT 



TIT 



TTTT 



TTTT 



3.8e-12o 



Protein name 



Locus Name 



DNA nelicase Recg 



pir:G7b41J 



Acc# 



G75413 



Description 



783 



ORF Name 



15537^62 c^ 2bl 



Protein name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



TTTT 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



|ia.7.2S.2b.6....cl...l9.2 1 



NT AA 

— — , Score Probability 
AAID Length Length 



[3TTF - 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



|£0.0.<m&7...cl...2.M I 



WW 



V7T 



H.6e-66 



Protein name 
Description 

G T P- BINDING PRO T EIN E RA HOMO LOG (BEX PROTEIN) 



Locus Name 



sp :ERA_BACSU 



Acc# 



P42182 



ORF Name 



NTID 



NT AA 

— — , Score Probabi lity 
AAID Length Length 



WW? 



TT7T 



3.1e-22 



Protein name 



Locus Name 



probable lipopolysaccharide 
N-acetylglucosaminyltransf erase, rfbU 



pir :F64500 



Acc# 



F64500 



Description 



784 



ORF Name 



NT ID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



Protein name 



WHIT 



¥3"5~ 



T7T" 



|2.1e-i0 



Locus Name 



Acc# 



Description 



sp:Y90 7_MKTJA 



Q58317 



ORF Name 



NT ID 



AAID 



NT AA 

— — Score Pro bability 
Length Length 



21450752 cl lSl 



Protein name 



1242 



Locus Name 



Acc# 



Description 



ORF Name 



NTID 



AAID 



NT AA „ . « i-i • , 
— — Score Probabil ity 
Length Length 



Protein name 



Fi2ir 



7T 



2TT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



aiaa.7.aaa...ci...2Lia | 



Protein name 



AAID 



NT 



AA 



Length Length 




Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



785 



ORF Name 



Protein name 



NT ID 



NT 



AA 



AAID Length Length 



Score Probability 



2500 


8122 




214 


S45 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



wrzr 



252 



0.031 



Locus Name 



Acc# 



P36378 



Description 

(OSTEOKtBtfrfltf) (OKf) (SASEMtiNT M EMSfeAtifi PkOTSlISf BM-40) 



ORF Name 



NTID 



NT AA _ _ , , . _ . . 
— — Score Probability 
AAID Length Length 



Protein name 



\11&&&2B.±.±1..:±1 I 



18124 



H7lS 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



NT 



AAID Length Length 



AA 

— Score Probability 



75 



Locus Name 



Acc# 



Description 



NO-HIT 



786 



ORF Name 



24252127 ci 2±b 



Protein name 



NT ID 



AAID 



NT AA 

— — Score Proba bility 
Length Length 



TT7T 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



WttT 



NT 



AA 



Length Length 
735 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



S12S 



NT AA 

— — Score Probability 
Length Length 



Locus Name 



Acc# 



Description 



INO-HIT 



ORF Name 



NT 



AA 



NTID 



AAID 



2±5M.Lll...z±..±ll 



2907 



WTZT 



Length Length 



TJTT 



Score Probability 
12. Oe-^ 



Protein name 



Description 



Locus Name 



Acc# 



P50743 



REGION 



787 



ORF Name 



NTID 



NT AA 
— — , Score 
AAID Length Length 



Probability 
|8.0e-06 



Protein name 



Description 



Locus Name 



|sp:PRyA_BA<Jfc!U 



Acc# 



P24327 



^kOT^ltf tfXPO&T PkOT^ltN PRSA PRECURSOR 



NT 



AA 



ORF Name 



NTID 



24651442 cl 1S7 



AAID Length Length 



8131 



1884 



Score Probability 
0.60058 



T2T 



Protein name 



Locus Name 



MAR binding tilament-like protein i:MFPl 
protein 



Description 



pir :T07lll 



Acc# 



T07111 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 
TTT5 



Score Probability 



STT- 



Lo cus Name 



Acc# 



Description 



ORF Name 



NTID 



25.7.aii2b.b....Gi....2L5i. I 



Protein name 



AAID 



mrrr 



— — Score Probability 
Length Length 



TUT 



TIT 



Locus Name 



Acc# 



Description 



788 



ORF Name 



125907687 ri 'Ah 



Protein name 



NTID 



NT 



AA 



AAID Length Length 

cms — 



Score Probability 



"ST 



Locus Name 



Acc# 



Description 



INO-HIT 



ORF Name 



Protein name 



synthase III 



Description 



NTID 



AAID 



NT AA 
— — . Score 
Length Length 



TUTT 



Locus Name 



pir :F7U^y4 



Probability 
£.5e-b9 



Acc# 



F70394 



ORF Name 



\265.9.b.b..±'L.h:<L 



Protein name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



TTTT 



\Z7T 



|3.3e-i8 



Locus Name 



RNA-bindmg protein 



Igp : SYOkBM 



Acc# 
L48548 



Description 



Synechococcus sp. PCC 7942 ftNA-bincling protein (rJDpAj gene , complete cas. 



NT 



AA 



ORF Name 



\23.±ll..si2J±2L I mrE 



NTID AAID Length Length 
[2TT7 



Score Probability 



Protein name 



Locus Name 



Acc# 



Description 



789 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



77T" 



Score Probability 
1.2e-^ 



333 



Protein name 



Description 



Locus Name 



Igp : PSPUT 



Acc# 



X97228 



P. gingival is gpdxJ, put, and yhbG-pg genes. 



NT 



AA 



ORF Name 



NTID 



31447126 Cl 203 



12917 



AAID Length Length 




Score Probability 
l.Se-25 



Protein name 



Locus Name 



CMP-N-acetylneurammic acid synthetase 



gp:MNKJ62lb 



Acc# 



AJ006215 



Description 



Mus musculus mkNA tor CMP-JM-acetyineurammic acid syntnetase. 



NT 



AA 



ORF Name 



NTID 



116A0.9/i:L.al...lTl 



AAID Length Length 



Score Probability 
I.8e-i3 



Protein name 



Locus Name 



trigger tactor 



bir:C704l6 



Acc# 



C70416 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



2FT 



Score Probability 
1 . 9e-06 



TTTJ 



Protein name 



Locus Name 



hypothetical protein PHSUU4 



pxr:t7ii24b 



Acc# 



F71245 



Description 



790 



NT 



AA 



ORF Name 



NTID 



34155705 ±2 114 



AAID Length Length 




Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



[NO-HIT 



ORF Name 



NTID 



NT AA 
— , — , Score 
AAID Length Length 



\lA±&L6Ab..±l..±3A... I 



\TTT 



Probability 
3.ie-08 



Protein name 



Locus Name 



sp:YGJN_lilC!OLl 



ACC# 



P42595 



Description 



ORF Name 



NTID 



AAID 



NT AA 
— , — , S core 
Length Length 



8144 



TST" 



[43T 



Probability 
8.7e-43 



Protein name 



Locus Name 



probable ribonucleotide transport: ATP-Jomaing 
protein mkl (mkl) RP097 



BTrTTTTTTTS" 



Acc# 



H71718 



Description 



ORF Name 



15.6.6±52B..±±..&\L 



Protein name 



NTID 



hypothetical protein 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



3T" 



Locus Name 



|gp:SS27b208 



|2.7e-07 



Acc# 



Z75208 



B.subtilis genomic sequence 89009£>p. 



791 



ORF Name 



5525055 c!4 260 



Protein name 



NT ID 



NT AA 

— — Score Probability 
AAID Length Length 



TUT 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



imiEEZiczizza: 



Protein name 



NTID 



AAID 



NT AA 

— — , Score Probabil ity 
Length Length 



TZTT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID 



Length Length 

ros — 



Score Probability 
0.020 



Locus Name 



sp:MTl3__MyTiiJD 



Acc# 



P80248 



Description 
METALLOTHIONEIN iO -III (MT-iO-lli) 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



2527 



TFT" 



504 



Protein name 



Locus Name 



Acc# 



Description 



[MO-HIT 



792 



ORF Name 



42700 cl 19b 



Protein name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



Locus Name 



Acc# 



Description 



(NO -HIT 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probabil ity 
Length Length 



2929 



S151 



WIST 



5.8e-iiV 



Protein name 



Locus Name 



ClpX protein 



gpiBSOLWiyiW 



Acc# 



X95306 



Description 



B.suotilis cipx gene. 



ORF Name 



NTID 



— — Score P robability 
AAID Length Length 



4a.7..7.6....c3....^^.. 



74T" 



$.0e-57 



Protein name 



Locus Name 



ATP -dependent protease proteolytic suPunit 
ClpP 



gp:APl27082 



ACC# 



AF127082 



Description 



Myxococcus xanthus ATP -dependent protease proteolytic summit cipPtcipP), 
ATP-dependent protease ATPase subunit ClpX (clpX) , prolylendopeptidase 
precursor Pep (pep), ATP-dependent protease LonV(lonV), oligopeptide 
permease homolog OppA (oppA) , oligopeptidepermease homolog OppB (oppB) , and 
oligopeptide permease homologQppC (qppC) genes, com plete cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



&&&2&.Q1....C2...221 J I333T 



Length Length 



Score Probability 



Protein name 



Locus Name 



Acc# 



Description 



[NO-HIT 



793 



ORF Name 



4957677 Cl 183 



Protein name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



T5T 



|5.2e-i9 



Locus Name 



Acc# 



Description 



:sp:SORA_ECOLl 



SURA) , (PPtASE) (rOTAMaSS C) 



ORF Name 



NTID 



AAID 



66$£4l2 fi iSy 



Protein name 



NT AA 

— — , Score P robability- 
Length Length 



TTT 



TJT 



4.5e-30 



Locus Name 



Acc# 



conserved .Hypothetical protein aq_3 55 



Description 



pir:£70«l 



E70331 



ORF Name 



Protein name 



NTID 



AAID 



3T5T 



NT AA 

— — Score Pr obability 
Length Length 



T5UT 



I.4e-83 



Locus Name 



Acc# 



Description 
DNA M I SMATCH R E PA I R PRO T EIN MUTL 



sp:MUTLJWJyU 



P49850 



ORF Name 



Protein name 



NTID 



AAID 



3T5T 



NT AA 

— — , Score Proba bility 
Length Length 



711 



Locus Name 



Acc# 



vsrD protein 



Description 



pir :±40b4U 



140540 



794 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


818785_C3_262 


2936 




410 


1233 






Protein name 








Locus 


Name 


Acc# 


Description 














NO-HIT 1 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


UAll&...alJlAl 


2937 


8159 


539 | 


1520 


1138 


2.3e-lli> 

















Protein name 



Locus Name 



phosphor xbosylaminoimidazolecarooxamiae 
formyl transferase 



pir:C7046« 



Acc# 



C70468 



Description 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



\116MAlL...c.L.±Tl 



TZT 



T7T" 



6.7e-13 



Protein name 



Locus Name 



gp:AB024S64 



Acc# 



AB024564 



Description 



Bacillus halodurans gene tor TtiM, iilkMk , VCBJ , YHCG, YMcJ^ ana YHChi, complete 
cds . 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


l±5:mB12^1JLlb. 


.... 2939 


8151 


220 


663 


94 


0.0032 


Protein name 








Locus 


Name 


Acc# 



Description 



|sp:£XSl_XANCJP 



034259 



BIOPOLYMER TRANSP ORT 5XBD1 PROTEIN 



795 



ORF Name 



NTID 



AAID 



NT AA 

Score 

Length Length 



15714000 t'A Bi 



2940 



8162 



T7T 



Probability 
|i.3e-89 



Protein name 



Description 



Locus Name 



Acc# 



gp : BNRRTEAB 



Bacteroides thetaiotaomicron rteA ana rtaB genes involved mproduction or 
plasmid-like forms, complete cds, and tetQ gene, 3'end. 



ORF Name 



NTID 



16632SS5 11 22 



12941 



Protein name 



nypotnetical protein 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



411 



HTTZT 



2.3e-l77 



Locus Name 



pir : JQ1U2U 



Acc# 
JQ1020 



ORF Name 



NTID 



NT AA 

— — Score Pro bability 
AAID Length Length 



17.&.7.fift&a...cl...l7.3. I 



18164 



TUT 



2 . 5e-10 



Protein name 



Locus Name 



unknown 



|gp:AP0^647 



Acc# 



AF062647 



Description 



Butyrivibrio librisolvens butyrivi&riocin OR79 (JDvx/yj gene, complete cas; 
and unknown genes. 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



Protein name 



Locus Name 



Acc# 



Description 



MO-HIT 



796 



ORF Name 



NTID 



NT AA 
— , — , Score 
AAID Length Length 



155812 ci 17tJ 



2544 



pur 



269 



75 



Probability 

Tmm 



Protein name 



Locus Name 



Acc# 



IgprYSCMTRm 



Description 

Yeast (S.uvarumJ mitocnondria RF2 gene, segment i 





ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


22860i28_t2_77 


2945 


§167 


83 252 


64 


|0.03l 


Protein name 








Locus Name 


Acc# 










sp : Sp£C_XeNLA 


| P36378 


Description 














(OSTEONECTIN) (ON) 


(GARMENT 


MEMBRANE PROTEIN 


BM-40) 




















ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 




21bM.tib...±±...±L9. 


2546 


5168 


100 303 


105 


6.6e-06 




















Protein name 








Locus Name 


Acc# 


nypotnetical protein PH0217 


pir :G71244 


G71244 


Description 














ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 




iasiSiiLci^m.™ 


2547 


8165 


283 852 


157 


1.2e-l5 




Protein name 


Locus Name 


Acc# 



TolQ protein 



gp : PP\>AL1 



X74218 



Description 

Pseudomonas putida ruvB, tolQ, tolR, toiA, tolB ana oprL genes. 



797 



ORF Name 



23870287 ti ±2b 



Protein name 



NTID 



NT 



AAID Length Length 



AA 

— Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



3T7T" 



hypotnetical protein PHU^iy 



Description 



NT AA 

— — Score P robability 
Length Length 



[ST 



Locus Name 



tpir:A7i24b 



Acc# 
A71245 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



mrrr 



TTTT 



TUT 



0.0002^ 



Protein name 



Locus Name 



outer membrane protein Ompyb 



gp:AP0^i24b 



Acc# 



AF021245 



Description 



Neisseria meningitidis outer membrane protein Omp85 lomp8b) gene , complete 
cds . 



ORF Name 



NTID 



NT AA 

— — Score Prob ability 
AAID Length Length 



WT7T 



6.7e-36 



Protein name 



Locus Name 



NorM 



|gp:AB0lO463 



Acc# 



AB010463 



Description 

Vibrio parahaemoiyticus gene tor NorM, complete cds . 



798 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
TUU — 



ITT 



Score Probability 
2.0e-75 



7ST 



Protein name 



Locus Name 



rod shape determining protein MreB 



pir:B703Va 



Acc# 



B70373 



Description 



NT 



AA 



ORF Name 



NTID 



\l^B2&ti±±l..&i 



AAID Length Length 



Score Probability 



ITT 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



NT 



AA 



ORF Name 



NTID 



\2l6A±$2B....zl.A&l I 125^ 



AAID Length Length 
WTTZ — 



Score Probability 
ITTD 



[07TT3T 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



bir:JC6027 



Acc# 
JC6027 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



8177 



1440 



P771T 



l.2e-10 



Protein name 



Locus Name 



conserved hypothetical protein MTH83 



BTrTF^TTr 



Acc# 



F69210 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



799 



ORF Name 



266067 ti T) 



Protein name 



NTID 



12957 



AAID 



mrrr 



NT 



AA 



Length Length 
TUT 



— Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



2.6.6..7.3.16.2...X3....lb.Z.. 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 

nrs — 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



|2£.7£.:ZM2...c2...2.ib.., 



Protein name 



NT AA 

— — , Score Probability 
NTID AAID Length Length 



2955? 



5181 



442 



i.8e-41 



Locus Name 



penicillin-binding protein 2 



gp:AP147449 



Acc# 



AF147448 



Description 



Pseudomonas aeruginosa strain PAOl penrcillin-JDinding protein 2 tpbpAj , 
rod- shape -determining protein (rodA) , membrane -bound lytictransglycosylase 
(mltB) , rare lipoprotein A (rlpA) , penicillin-binding protein 5 (dacA) , and 
lipoate biosynthesisprotein B (lipB) genes, complete cds; and unknown gene. 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



|m3.tta..±2L...lflL3. I [^TT 



Protein name 



Locus Name 



Acc# 



Description 



MO-HIT 



800 



ORF Name 



NTID 



NT AA 
— , — , Score 
AAID Length Length 



29550402 cl 196 



TUT 



Probability 
5.0e-14 



Protein name 



Locus Name 



rod snape- determining protein (mrecj nomoiog 1 ipir :C70l8y 



Acc# 



C70189 



Description 



NT 



AA 



ORF Name 



NTID 



29£2 



AAID Length Length 

?uzz — 



Witt 



Score Probability 
|9.8e-ia6 



Protein name 



Locus Name 



PepO 



gp:AB0iO440 



Acc# 



AB010440 



Description 



Porphyromonas gmgivalis gene tor Pepo, complete cas . 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



Score Probability 
|5.0e-49 



Protein name 



Description 



Locus Name 



sp:Y4WA_kHl^N 



Acc# 



P55679 



HYPOTHETICAL ZINC PROTEASE V4WA, 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



INO-HIT 



801 



NT 



AA 



ORF Name 



NTID 



AAID 



3321087!> cl 1*2 



Length Length 



T5T 



Score Probability 
|2.4e-S3 



Protein name 



Description 



Locus Name 



Acc# 



aOTL Carrier p&Otei n RfiDtfcrArih!) 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



340S1405 c4 '220 



2966 



TUT 



1. 9e-06 



Protein name 



Locus Name 



hypothetical protein PKSUU4 



pir:?7l245 



Acc# 



F71245 



Description 



ORF Name 



NTID 



NT AA „ _ 
— — Score Probability 
AAID Length Length 



FIST 



2.5e-14 



Protein name 



Locus Name 



erythromycin esterase nomolog yJDtO 



bir:A697i>0 



ACC# 



A69750 



Description 



ORF Name 



Protein name 



NorM 



NTID 



NT AA „ _ -t -i t i , 
— — Score Pro bability 
AAID Length Length 



§150 



2u7T 



[STTT 



li.Se-11 



Locus Name 



|gp:AB0i04bi 



Acc# 



AB010463 



Description 

Vibrio paranaemolyticus gene tor NorM, complete cas. 



802 



NT 



AA 



ORF Name 



NTID 



354135 c3 211 



AAID Length Length 



TIT 



Score Probability 
5.0e-^4 



Protein name 



Locus Name 



Acc# 



P74346 



Description 

AYtO'tHfiTieAIi 36.0 KD PkOTfillsf SLR1629 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



4u4uS?$ c3 '±±3 



2304 



;l.de-7S 



Protein name 



Locus Name 



tetracycline resistance element regulator 
RteA 



pir :A4186U 



Acc# 



A41860 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



TTJT 



Protein name 



Locus Name 



|4.6e-I4 



Acc# 



phosphate ABC transporter, peripiasmic 
phosphate-binding protein 



pir:C72^76 



Description 



C72276 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



\&&A6.1TL.a±..±l± I 



TUT 



Protein name 



Locus Name 



|4.2e-06 



Acc# 



ExbD2 



Description 



gp:AP047^74 



AF047974 



Vibrio cholerae Tol£ (tolft ) , fixbB2 (exbB2) , ElxbD2 (exbD2) , andTonB2 (tonB2j 
genes, complete cds; and unknown genes. 



803 



NT 



AA 



ORF Name 



NTID 



AAID 



4S77157 'A'AA 



TTTT 



Length Length 
WIS — 



Score Probability 
8.2e-09 



TWI 



Protein name 



Description 



Locus Name 



|sp:TOWBJWLl>Y 



Acc# 



025899 



TONS PkOTBllN 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



4S80452 c^ ^00 



Protein name 



Locus Name 



ABC transporter MutF 



gp:AP0^21Bi 



Acc# 



AF082183 



Description 



Streptococcus mutans ABC transporter MutF <mutF) , membrane spannrngprotexn 
MutE (mutE) , and membrane protein MutG {mutG) genes , complete cds; and 
fructose bi-phosphate aldolase (fba) gene, partial cds. 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



TTW 



WIW 



Protein name 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



NT 



AA 



NTID 



si2Lflm...c2„.iaa .....| \ittz 



AAID Length Length 

iw%- 



wwr 



Score Probability 
3.3e-43 



[457 



Protein name 



Locus Name 



ABC transporter, ATP -binding protein nomoiog I foir :D7017i 



Description 



Acc# 



D70171 



804 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



5132837 c2 'AU 



Protem name 



Locus Name 



Acc# 



ABC-type transport protein s±r0864 :protem 
slr0864 :protein slr0864 



Description 



pir :S74B4y 



S74849 



ORF Name 



Protein name 



NTID 



NT AA 

— — , Score Probab ility 
AAID Length Length 



8200 



Locus Name 



6.2e-10 



Acc# 



Description 



|sp:Y^3_£{TAAU 



P23217 



HYPOTH E T I CAL TRANSCRIPTIONA L REGULATOR IN OACA 5 1 REGION (ORF IBS) 



ORF Name 



NTID 



— — , Score Probabil ity 
AAID Length Length 



Protein name 



Locus Name 



Acc# 



Description 



MO-MIT 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



805 



ORF Name 



NT ID 



NT AA 
— , — , Score 
AAID Length Length 



±2 109 



7T 



7ZW 



51" 



Probability 





Protein name 



Locus Name 



unknown 



|gp:MHU7bbW 



Acc# 



U75508 



Description 



Marinococcus nalopnilus plasmid. pPLl, complete sequence. 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



ci Tib 



2332" 



JUT 



Score Probability 
1.3e-22 



Protein name 



Locus Name 



CGI-124 protein 



gp:AFl5ia&2 



Acc# 



AF151882 



Description 



Homo sapiens CGI -124 protein mRNA, complete eels . 



ORF Name 



NT ID 



|&6.5..7.5.7„.±i...iD.3. 1 



Protein name 



NT AA 
— , — , Score 
AAID Length Length 



B2U5" 



T7W 



Locus Name 



Probability 



Acc# 



Description 



IN0-M1T 



ORF Name 



Protein name 



NT ID 



233T" 



AAID 



NT 



AA 



Length Length 

— 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



806 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 





2$85 


8207 


255 771 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



NT 



AA 



ORF Name 



NT ID 



3.0.2.7.20.£>..7...±2....6. I W 



AAID Length Length 
£T3I — 



Score Probability 
|4.7e-56 



Protein name 



Locus Name 



hybrid histidme Kinase nomolog 



|gp:AP0246i^ 



Acc# 



AF024619 



Description 



E>seudomonas tluorescens ny£>rid nistidme Kinase nomolog (stySj andresponse 
regulatory protein (styR) genes, complete ccis . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



12A0.O2AD....cl...lb.., 



1611 



Score Probability 
0.00050 — 



Protein name 



Locus Name 



STARP antigen 



gp : PFSTAKP 



Acc# 



Z26314 



Description 



P.talciparum gene tor STARP antigen. 



NT 



AA 



ORF Name 



NTID 



AAID 



S.3.D.0.0.3...±2...10.. 



7WW 



8210 



Length Length 
73~ 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



807 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



5862501 t2 3 



WITT 



Protein name 



Description 



Locus Name 



Acc# 



(NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



ia5.4.mi..±l...i I IS33TF 



WITT 



TIT 



3.8e-10 



Protein name 



Locus Name 



hypothetical protein siioyjy 



|pir:S74723 



ACC# 



S74723 



Description 



NT 



AA 



ORF Name 



NTID 



I£&3.2aa5....c3....&6. 



AAID Length Length 
POTT 



Score Probability 
|2.3e-177 



TTIT 



Protein name 



Locus Name 



hypothetical protein 



pir : JQIU2U 



Acc# 



JQ1020 



Description 



ORF Name 



22ll$.l&.l...c2..&2.. 



Protein name 



NTID 



AAID 



envelope glycoprotein 



Description 



NT 



AA 



Length Length 
E3I 



1Z 



Score Probability 




7? 



Locus Name 



|gp:Agll35^6 



Acc# 



AF113578 



HIV-1 isolate 3 02^04 group O trom Spain envelope glycoprotein (env)gene, 
partial cds. 



808 



NT 



AA 



ORF Name 



NT ID 



1 22648^61 ti b3 



AAID Length Length 

— 



Score 



TUT 



Probability 
i.0e-05 



Protein name 



Locus Name 



conserved hypothetical protein MTH6 y b 



Description 



pir:F&9192 



Acc# 



F69192 



ORF Name 



Protein name 



NT ID 



AAID 



NT 



AA 



Length Length 




Score Probability 



37 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



5217 



conserved hypothetical protein 



Description 



NT AA rt 

— — Score Pro bability 
Length Length 



ST7" 



2.7e-09 



Locus Name 



pir:<S'm8b 



Acc# 



G72385 



ORF Name 



215.2$.$l&....al..±Q±.. 



Protein name 



NTID 



AAID 



WHS" 



NT AA 

— — , Score Probabil ity 
Length Length 



25" 



33TT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



25&l6.B£l.±2..Xl.. 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 



Score Probability 



mr 



Locus Name 



Acc# 



Description 



NO- HIT 



809 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



26598385 c3 ^6 



Protein name 



transcription regulator NtrC tamily 



Description 



fTTTT 



772 



Locus Name 



1 . 4e-76 



ACC# 



C70396 



ORF Name 



NT AA 

— , — , Score Probability 
NTID AAID Length Length 



|4.2e-07 



Protein name 



Locus Name 



Immunoreactive 52KD antigen PG41 



gp:APi757i6 



Acc# 



AF175716 



Description 



Porphyromonas gmgivaiis strain W50 immunoreactive 52KD antigenPG4l gene, 
complete cds. 



ORF Name 



Protein name 



l3Luafiftii.„c3t...itt2L I puinr 



NT 



AA 



NTID AAID Length Length 

T53 — \mi — 



Score Probability 



wrrr 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— — Score Pr obability 
Length Length 



7T 



Locus Name 



Acc# 



Description 



810 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



33244627 i3 34 



TUUT 



TTT 



Score Probability 
10.00016 



T77 



Protein name 



Locus Name 



hypothetical protein HO^Fuy.J 



pir :T33369 



Acc# 



T33369 



Description 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



\TUUT 



18225 



I5T" 



I0.00S1 



Protein name 



Locus Name 



|sp:YF03_MYO>N 



Acc# 



P75445 



Description 

HYPOTHETICAL 85.3 KD PROTEIN P10 OkPVbO 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Pr obability 
Length Length 



TJUT 



TTT 



|4.3e-10 



Protein name 



Locus Name 



hypothetical protein aq_224 



pir:H70326 



Acc# 



H70326 



Description 



ORF Name 



lAH.ll^L.cxl.Al 



Protein name 



sensor 



Description 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



TIT 



1290 



Locus Name 



gprPSEFLKSK 



2.8e-20 



Acc# 



L41213 



Pseudomonas aeruginosa (strain PAKJ putative tleR kinase (ties; 
andtranscriptional activator (fleR) genes, complete cds . 



811 



ORF Name 


JN 1 1JJ 


NT 

AAID Length 


AA 
Length 


Score 


Probability 




3006 


8228 417 1254 


115 


2.5e-06 


Protein name 






Locus Name 


ACC# 


integrase 


gp:BPU , 7 53 71 


U75371 


Description 












Sacteroides tragiiis transposon Tn455b TnpA ttnpA) , integrase (mt) , Tnpc 




(tnpC) , excisionase 


(xis) , mobilization protein {mobA),and beta 


-lactamase 




(cfxA) genes, complete cds; and unknown genes. 






















ORF Name 


NT ID 


NT 

AAID Length 


AA 
Length 


Score 


Probability 


4l79002_cl_67 


5007 


822$ 210 633 


106 


0.014 


Protein name 






Locus Name 


ACC# 


nigtt-molecular-weignt surtace 


-exposed protein 


pir :A43855 


A43855 


HMW1 













Description 



ORF Name 



Protein name 



NT ID 



AAID 



NT 



AA 



Length Length 
T75 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



yhgF protein 



Description 



NT ID 



NT 



AA 



AAID Length Length 



Score Probability 



TIT 



690 



Locus Name 



pir:B65136 



|4.1e-S2 



Acc# 



B65136 



812 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



10664125 c2 lib 



WZTF 



Protein name 



Locus Name 



putative secreted protein 



gprtJOMll 



Acc# 



AL133278 



Description 



Streptomyces coeiicoior cosmld Mil. 



NT ±±ti 

— — , Score Probability 
AAID Length Length 

lo.ooosi — 



AA 



ORF Name 



NTID 



110822142 c2 148 



itbt 



Protein name 



Locus Name 



transposase 



|gp:AF0^66 



Acc# 



AF038866 



Description 



Bacteroides tragiiis transposon Tnbb2U transposase ttapH) anamor>i±ization 
protein BmpH (bmpH) genes, complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
JTE 



TUT" 



Score Probability 
2.6e-0a 



TT7 



Protein name 



Description 



Locus Name 



sp:YDEti_acJH±>0 



Acc# 
Q10449 



Hy&OtHfiflCAL 57.2 k£> PROTg tti Cl2felu.l£0 1VS chromusomk i 



NT 



AA 



ORF Name 



NTID 



JUTT 



AAID Length Length 



Score Probability 
2.1e-S7 



F74 



Protein name 



Locus Name 



115K outer membrane protein precursor : Susc 
protein 



tair: J06027 



Acc# 



JC6027 



Description 



813 



ORF Name 



19688750 ±2 41 



Protein name 



NTID 



NT 



AA 



AAID Length Length 
T3S 



Score Probability 



Locus Name 



Acc# 



Description 



[MO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



WITT 



NT 



AA 



Length Length 
7S - 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



8238 



NT AA 

— — Score Pr obability 
Length Length 



TETT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



T0T7" 



WZ3T 



alkaline pnospnatase 



Description 



NT AA o _ , , . _ . . 

— — Score P robability 
Length Length 



T5T 



TT7T 



AST 



S.4e-53 



Locus Name 



pir:B72410 



Acc# 



B72410 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



Protein name 



3018 



TUT 



7.3e-05 



Locus Name 



Acc# 



giucoicinase 



Description 



bir:F7^4e 



F72246 



814 



ORF Name 



24226532 fi y2 



Protein name 



NT ID 



NT 



AA 



AAID Length Length 



Score Probability 



TTZT 



TIT 



Locus Name 



i.3e-35 



Acc# 



Description 



sp:XYLkJiAl«!lN 



P45043 



XYLOSS RSGOLATORY PROTEIN 



ORF Name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



Protem name 



T3T 



T5T 



Locus Name 



2.0e-0S 



Acc# 



transposase slrObll rprotem sirObll :protem 
slr0511 



Description 



pir: 87664:4 



S76643 



ORF Name 



\116AZ.112.±1..KL 



Protein name 



NTID 



TUTT 



WITT 



NT 



AA 



AAID Length Length 



Score Probability 



TM" 



Locus Name 



0.0026 



Acc# 



nomolog yvqc 



Description 



pir :E'/U04b 



E70045 



ORF Name 



NTID 



NT AA 

— — Score Prob ability 
AAID Length Length 



Protein name 



W5~ 



Locus Name 



Acc# 



Description 



NO -HIT 



815 



ORF Name 



NT ID 



NT AA 

— — Score Probabi lity 
AAID Length Length 



Witt 



TT5T 



TTT 



0.0015 



Protein name 



Locus Name 



Acc# 



immunoreactive 42KD antigen PGii 



|gp:AP17B71b 



AF175715 



Description 



£>orphyromonas gingivalis strain WbO immunoreactive 42KD antigenPG33 gene, 
complete cds . 



ORF Name 



NT AA 

— — Score Pro bability 
NT ID AAID Length Length 



2£2 c2 134 



Protein name 



8246 



I4T 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NT ID 



AAID 



NT AA 

— — Score Probability 
Length Length 



Protein name 



8247 



1ST" 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



AAID 



— — S core Probability 
Length Length 



Protein name 



§245 



Locus Name 



Acc# 



Description 



NO-HIT 



816 



ORF Name 



NT ID 



NT AA 
— , — , Score 
AAID Length Length 



IIT7F" 



Probability 
|2.5e-77 



Protein name 



Locus Name 



sp:YDEG_&CHR> 



Acc# 
Q10449 



Description 

KlfPOTflfiTlCAL 57.2 PkOTBl^ Cl2B10.l £C lit ^MkOMCSOME 1 



ORF Name 



Protein name 



Description 



NT ID 



NT AA 

— — , Score Pro bability 
AAID Length Length 



vr 



Locus Name 



Acc# 



WfO-HlT 



ORF Name 



Protein name 



NT ID 



AAID 



NT AA „ „ , , . n . . 

— — Score Probability 
Length Length 



1UT 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



1131 



Score Probability 
5.0e-24 



fZTT 



Locus Name 



immunoreactive 53 KD antigen PG123 



gp:AF144b41 



Acc# 



AF144641 



Description 



Porphyromonas gingivaiis strain WbU immunoreactive 53 KD antigenPGl23 gene, 
complete cds. 



817 



ORF Name 



NT ID 



AAID 



"NTT AA 

— — Score P robability 
Length Length 



3514587 cl ll'A 



T5T 



8.6e-^0 



Protein name 



Locus Name 



nypotnetical protein 



pir : S7bUb3 



Acc# 



S76053 



Description 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


4iMS.i.7....cl...lii 


3032 


8254 


381 


1146 


115 


0.00058 















Protein name 



Locus Name 



clostripam-related protein 



bir:B ' ^ibb 



Acc# 



B72365 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
TZZ 



Score Probability 
0.0081 



Protein name 



Locus Name 



hypothetical protein TU3K/.4 



pir :T244U4 



ACC# 



T24404 



Description 



ORF Name 



Protein name 



transposase 



Description 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



T4~4ir 



|3.6e-14 



Locus Name 



Acc# 



AF038866 



Bacteroides iragilis transposon Tnbb^u transposase UDipH) andmoJDilization 
protein BmpH (bmpH) genes, complete cds . 



818 



ORF Name 



NT ID 



NT AA 

— — Score Probability 
AAID Length Length 



4423m cli ibV 



F7T 



T5T 



|2.6e-08 



Protein name 



Locus Name 



Hypl protein 



gp:HVHVJr>lPkU 



Acc# 



Y09797 



Description 



H. vulgar is mRNA tor Hypl protein. 



ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


4804632_cl_ll6 


3036 


8258 


389 


1170 


197 


8.8e-l3 



Protein name 



Locus Name 



conservect nypotnetical protein TPoyii 



pir:t>7l264 



Acc# 



D71264 



Description 



ORF Name 


NT ID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


&&lbh}xh..±±..A* 


3037 


8259 


715 2148 181 


1.9e-i0 


Protein name 








Locus Name 


Acc# 


immunoreactive 


53 KD antigen 


PG123 




gp:AFI4464I 


AF144641 


Description 




Porpnyromonas 
complete cds . 


gingivalis strain W5U 


immunoreactive b3 KD antigenPUi^J gene, 
















ORF Name 


NT ID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 




3038 


8260 


487 1464 230 


2 .4e-2l 



Protein name 



Locus Name 



Acc# 



Description 



gp:ATAC004411 



AC004411 



Arabidopsis thaliana chromosome II BAC F14M4 genomic sequence , complete 
sequence . 



819 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
T5T 



Protein name 



conserve©: Hypothetical protein TP0931 



Description 



F7TT 



Score Probability 
|4.9e-ii 



T7T" 



Locus Name 



bir:D71264 



Acc# 



D71264 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



Length Length 
T3TT 



— Score Probability 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



Protein name 



Description 



MO -HIT 



NT 



AA 



NTID 



AAID 



Length Length 
S3 - 



Score Probability 



T5T 



Locus Name 



Acc# 



ORF Name 



Protein name 



unknown 



Description 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



TTTT 



|2.2e-07 



Locus Name 



gp: 1196 7 71 



Acc# 



U96771 



£>revotella bryantii putative polygalacturonase, B-l , 4-enaog±ucanase, ana 
mannanase genes, complete cds; and unknowngenes . 



820 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



cl 120 



Protein name 



751T 



2.5e-20 



Locus Name 



Acc# 



probable transmembrane protein 



Description 



bir:T346bi 



T34651 



ORF Name 



aa2iii..ci..M., 



Protein name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



T5W 



1.2e-2i 



Locus Name 



Acc# 



Hypothetical protein pabiou2 



Description 



pir:G75064 



G75064 



ORF Name 



Protein name 



NTID 



NT 



AA 



AAID Length Length 




Score Probability 



74" 



Locus Name 



Acc# 



Description 



IMO-StT 



ORF Name 



Protein name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



F9~ 



207 



7T" 



Locus Name 



Acc# 



Description 



gp:feA1^42byi 



AJ242593 



Bacteriophage Alls complete genome . 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probabi lity 
Length Length 



Protein name 



RW7~ 



77" 



Locus Name 



Acc# 



Description 



NO-HIT 



821 



ORF Name 



Protein name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



(MO-HIT 



ORF Name 



±±&£A&±l...a±..A± 



Protein name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length J - 



wrnr 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



NT 



AA 



AAID Length Length 
2uT 



Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



iEnissoo: 



Protein name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



Locus Name 



conserved nypotnetical protein BB02 62 



Description 



pir :F70132 



9.1e-17 



Acc# 



F70132 







NT 


AA 


NTID 


AAID 


Length 


Length 


3052 


8274 




138 


417 



ORF Name 



Protein name 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



822 



ORF Name 



NT ID 



22946087 ±1 3 



TUTT 



Protein name 



NT AA 

— — , Score Probability 
AAID Length Length 



TTTT 



nypotnetical protein 6 2 



Description 



TUT 



TIT 



IT 



Locus Name 



pir :T3102£ 



0 .033 



Acc# 



T31025 



ORF Name 



23.5.Mab.7...±i....3.b. 



Protein name 



NT ID 



TU^T 



AAID 



TUT 



NT 



AA 



Length Length 
TUB — 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



3055 



AAID 



§277 



NT 



AA 



Length Length 

nrs — 



Score Probability 



Tl 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NT 



AA 



NTID 



AAID 



maaai7...±a...aa 



TToT 



TTTT 



Length Length 
TJT 



WUT 



Score Probability 
|4.2e-06 



TT7 



Protein name 



Locus Name 



conserved hypothetical protein 



bir:G72:Jao 



Acc# 



G72380 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



JUTT 



TUT 



Length Length 
TT§ 



Score Probability 



T5 



Protein name 



Description 



Locus Name 



Acc# 



[NO-HIT 



823 



NT 



AA 



ORF Name 



NT ID 



AAID 



29321538 rl 16 



8280 



Length Length 

m% 



Score Probability 
7.0e-4i 



Protein name 
Description 



Locus Name 



sp:VP2^_HAii!l^ 



Acc# 



P44243 



ORF Name 



NTID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



3<33S5305_c3_$0 


3055 


S2S1 


2?l 


816 


411 















|2.5e-38 



Protein name 



Description 



Locus Name 



sp:S0J_6AC^U 



Acc# 



P37522 



SO J PROTEIN 



NT 



AA 



ORF Name 



NTID 



AAID 



3.10A9.B.0A..±l..:ll I prr^T 



8282 



Length Length 
TIT 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



MO- HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



13.8.B.28.3.7....c3....ai , I 



Length Length 



Score Probability 
0.024 



TO 



Protein name 



Locus Name 



hypothetical protein F20D10.230 



p±r :T05638 



Acc# 



T05638 



Description 



824 



ORF Name 



Protein name 



NT ID 



3062 



AAID 



NT 



AA 



Length Length 
-TUT 



Score Probability 



Locus Name 



Acc# 



Description 



[MO-HIT 



ORF Name 



Protein name 



NT ID 



AAID 



NT AA 

— — Score Probability 
Length Length 



TOT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 




Score Probability 



or 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



WIST 



NT 



AA 



Length Length 
TTT 



Score Probability 



Locus Name 



Acc# 



Description 



bsfO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



S2S8 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



825 



ORF Name 



ci bb 



Protein name 



NTID 



JUST 



NT 



AA 



AAID Length Length 




Score Probability 



ST 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



41S.22lI1..±1...40... 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 
TFT 



Score Probability 



TTWT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



2070 



AAID 



§292 



NT AA 

— — , Score Probability 
Length Length 



T27T 



Locus Name 



Acc# 



Description 



InO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 




Score Probability 



Locus Name 



Acc# 



Description 



WO-HIT 



826 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



11851375 t2 45 



TTT 



445 



Protein name 



Locus Name 



sp:YACN_BAC^U 



Acc# 



Q06756 



Description 

kY£>0l 1 £E , r2CAL 17.1 Kb PROTEIN list MECb-GltX iNTERGEttIC rEG£On 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



12929693 t3 78 



2FT" 



|i.3e-34 



Protein name 



Locus Name 



sp:YAAA_EC6LI 



Acc# 



P11288 



Description 

HYPOTHE TI CAL 23.6 KB PRO T EIN I N T HRC -TALB INTERGENIC REGION 



ORF Name 



NTID 



— — , Score Probability 
AAID Length Length 



ia&7Ma„±a...ai 



1434 



i.0e-35 



Protein name 



Description 



Locus Name 



sp:RLUU_BA<JiJU 



Acc# 



P35159 



(£>5ETO0tfRlDYLATE SYNTHASE) (URACIL HYDROLVASiil) 



ORF Name 



NTID 



NT AA o n 
— — , Score Probability 
AAID Length Length 



±±AA0.B.1..±2..SA 



3075 



§257 



Protein name 



Locus Name 



Acc# 



Description 



PSTO-HIT 



827 



ORF Name 



14547277 c5 235 



NTID 
|3076 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



1014 



TZTZ~ 



i.2e-123 



Protein name 



Locus Name 



immuno reactive 36 kDa antigen PG14 



gp:AFl4bvya 



Acc# 



AF145798 



Description 



immunoreactive 36 KDa antigenPGl4 gene, 



Porphyromonas gingivalis strain W50 
complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



14S86027 ±3 Is 



JU7T 



7.3e-39 



Protein name 



Description 



Locus Name 



sp:MAA_BAt^U 



Acc# 



P37515 



TRAMS ACETVLAS E ) 



ORF Name 



NTID 



13.7.9.3.0.I3...±2...I2 



Protein name 



AAID 



NT AA 

— — Score Probability 
Length Length 



T3TT 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



2o:iriLia..±±.m 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



T7F" 



TTJT 



3 .6e-i40 



Locus Name 



adenylosuccinate lyase 



gp:LMFP1421 



Acc# 



AL132764 



Description 

Leishmania major Friecllin chromosome 4 PAC P1421. 



828 



NT 



AA 



ORF Name 



NTID 



AAID 



121679756 c2 1^0 



Length Length 
— 



Score Probability 
|2.3e-ii 



TT7 



Protein name 



Description 



Locus Name 



Acc# 



P56430 



TMORfilSOXlN (TWO 



ORF Name 



NTID 



AAID 



— , — , Score Probability 
Length Length 



25553426 ±2 57 



724" 



2.6e-S2 



Protein name 



Description 



Locus Name 



sp:lLVE_HAEIN 



ACC# 



P54689 



NT 



AA 



ORF Name 



NTID 



AAID 



S7TJ¥" 



Length Length 




Score Probability 
|6.Ie-SS 



Protein name 



Locus Name 



conserved hypothetical protein 



pir :E72226 



Acc# 



E72226 



Description 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



2&o.iiA±i.±i..&$. I mm 



S4T 



l.le-SS 



Protein name 



Locus Name 



Acc# 



ribosomal protein S2 (rpsB) :r±JDOsomal protein 
BS1 



pir :A696yy 



Description 



829 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



24026561 ci 148 



7W 



0.0093 



Protein name 



Locus Name 



late expression lactor 2 nomoiog lei -2 



kfp:AF002732 



Acc# 



AF002732 



Description 



Cydia pomonella granulovirus late expression l actor 2 nomoiog iet-2gene / 
complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



c2 171 



M5" 



Length Length 



Score Probability 
0.00037 



111 



Protein name 



Description 



Locus Name 



Acc# 



P54451 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



|24412S.ll..±A...yj.., 



TSTT 



TTT 



2.6e-34 



Protein name 



Locus Name 



ribosomal protein L13 



pir:P71677 



Acc# 



F71677 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— — , Score Probabi lity 
Length Length 



T5T 



7.$e-2£ 



Protein name 



Locus Name 



Acc# 



conserved hypothetical protein ytmQ 



pir :B6yyyv 



B69997 



Description 



830 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



24651013 cl 122 



MIT 



l.Oe-17 



Protein name 



Locus Name 



lipase-UKe protein 



bir:A64706 



Acc# 



A64706 



Description 



NT 



AA 



ORF Name 



NTID 



\2iiami.„aL.±2i I pros 



AAID Length Length 

— 



Score Probability 
1357 



Protein name 



Locus Name 



conserved hypothetical integral membrane 
protein HP1486 



Description 



pir :F6470b 



|I.ie-33 



Acc# 
F64705 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



8312 



T7l 



2 .4e-44 



Protein name 



Locus Name 



alpha -glucosidase 



|gp:6TO66fi97 



Acc# 



U66897 



Description 



Bacteroides thetaiotaomicron neopuliuianase tsusA) anaaipha-giucosiaase 
(susB) genes, complete cds . 



NT 



AA 



ORF Name 



NTID AAID Length Length 





Score Probability 



nrm 1 



Protein name 



Locus Name 



Acc# 



Description 



[NO-HIT 



831 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



JUST 



Protein name 



Locus Name 



i.3e-32 



Acc# 



hypothetical protein ydiH 



Description 



pir:A55757 



A69787 



ORF Name 



NTID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



Protein name 



JUWT 



37T 



H5" 



Locus Name 



0.0016 



Acc# 



splicing regulatory protein SWAP homolog 
(alternatively spliced, clone pFL2) 



pir:A540i7 



Description 



A54037 



ORF Name 



NTID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



Protein name 



Locus Name 



Acc# 



Description 



(NO-HIT 



ORF Name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



liimaaa..±i„.a2L I 



Protein name 



411 



TITS' 



Locus Name 



8 . le-40 



Acc# 



probable exodeoxyribonuc lease VII large 
subunit 



pir :C7bb4y 



Description 



C75549 



832 



ORF Name 



NT ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length ±L - 



wrnr 



TTS~ 



l.le-41 



Protein name 



Description 



Locus Name 



sp:EFTS_MYCTU 



Acc# 



Q10788 



NT 



AA 



ORF Name 



NTID 



AAID 



1264543$! cl 128 



WITT 



Length Length 
TTT 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



26£.0.0.26±±1...&A.. 



Length Length 




TJiT 



Score Probability 
l.8e-28 



TIB" 



Protein name 
Description 

503 RTBOSOMAL PROTEIN S9 



Locus Name 



sp:RS3J4YCTU 



Acc# 



006259 



NT 



AA 



ORF Name 



NTID 



AAID 



215A121...a2J20±. 



WITT 



Length Length 
S3 - 



Score Probability 



Protein name 

Description 
INO-HTT 



Locus Name 



Acc# 



833 



ORF Name 



129407552 ±i 13 



Protein name 



Description 



MO-HIT 



NT 



AA 



NTID 



AAID 



TTUTT 



Length Length 
Si- 



Score Probability 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



AAID 



-5TZT 



hypotnetxcal protein RP306 



Description 



NT 



AA 



Length Length 



TZUT 



Score Probability 
6.6e-61 



Locus Name 



pxr :E71686 



Acc# 



E71686 



ORF Name 



Protein name 



Description 



[NO-HIT 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



sir 



£u"7~ 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



1±6.165±&..±1...9.5. 



3102 



8325 



1014 



7.7e-67 



Protein name 

Description 
MRP f>ftOTEM H0M0L0G 



Locus Name 



sp:MRP_SYNY3 



Acc# 



P53383 



834 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



33398425 i2 72 



JTUT 



2586 



T7W 



|4.3e-ii 



Protein name 



Locus Name 



unknown 



gprAFOO'mi 



Acc# 



AF007381 



Description 



Flavobacterium ]ohnsoniae gliding motility protein (glcLA) gene, complete 
cds ; and unknown genes . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



33£l250 £2 73 



S.7e-l2 



Protein name 



Locus Name 



RNA polymerase sigma tactor SigZ-liJce protein | |gp : AF137263 



Acc# 



AF137263 



Description 



Bacteroides thetaiotaomicron 30S riJoosomai protein si6-iiKeprotein, tucose 
gene cluster/ and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds . 





ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


111216.ltt^tl^± 


3105 


8328 


153 


462 


202 


6.6e-15 



Protein name 



Locus Name 



subtil is in sendai nomolog 



pir :ce>y4bb 



Acc# 



C69456 



Description 



ORF Name 



Protein name 



NTID 



AAID 



TTUT 



phase-1 tlagellin 



Description 



NT AA 

— — , Score Probability 
Length Length 



Locus Name 



pir:333191 



0.021 



Acc# 



S33191 



835 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



TTUW 



TT7T 



TJE~ 



|4.ie-06 



Protein name 

Description 
TONB PROTEIN 



Locus Name 



Acc# 



025899 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



34g4$582 ±3 118 



Score Probability 
3.2e-06 



134 



Protein name 



Locus Name 



transmembrane sensor 



gp:AF05l691 



Acc# 



AF051691 



Description 



Pseudomonas aeruginosa stress tactor A (pstA) , ECF sxgma tactor (tiul) , 
transmembrane sensor (fiuR) , and hydroxamate-typef errisiderophore receptor 
(fiuA) genes, complete cds. 



ORF Name 



15.1±2L%2..±±J.IA 



Protein name 



NT 



AA 



NTID 



AAID 



3110 



8332 



Length Length 
TSSI 



Score Probability 



Locus Name 



Acc# 



Description 



NO- HIT 



ORF Name 



3.5.6.25£0.7....c3....ML. 



Protein name 



NTID 



RTTT" 



AAID 



TTJTT 



NT AA 

— — , Score Probability 
Length Length 



Locus Name 



Acc# 



Description 



NO-HIT 



836 



ORF Name 



NTID 



NT AA „ , , > _ . 
— — , Score Probability 
AAID Length Length ^ 



35944140 ±i 90 



Protein name 



TUT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



16.5.1M$A...a2..±15. I 



PTTT 



Protein name 



3¥7~ 



T97 



Locus Name 



|2.?e-13 



Acc# 



hypothetical protein jhpl380 



Description 



pir:S7i8i!> 



G71815 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probab ility 
Length Length 



7T 



0.0037 



Protein name 



Locus Name 



Acc# 



exodeoxyr iJdoiiuc lease VII, small chain 



pir : JQ0664 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



liiA2L.7i.2...ti...s. I 



47TT 



1413 



Protein name 



Locus Name 



|1.0e-l42 



Acc# 



sp:5Ytf_J^Yi 



Description 



P52276 



LTGASE) (ASMRS) 



837 



NT 



AA 



ORF Name 



NTID 



AAID 



4375427 c3 240 



TTT5" 



Length Length 



T7T 



Score Probability 
1.2e-08 



Protein name 

Description 
REPRESSOR PROTEIN) 



Locus Name 



sp:BLAI_STAAU 



Acc# 



P18415 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



I44S5312 c3 271 



STTT 



74S~ 



I225T" 



5.$e-47 



Protein name 

Description 
(MU TOXIN) 



Locus Name 



sp:tfAGH_CLOI>E 



Acc# 



P26831 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



i8.S.15.0..7....cl...l5.2L I I3TTS 



TFT 



5.8e-14 



Protein name 

Description 
HYPOTHETICAL li.J 10) £>£>0TfiM SLL0546 



Locus Name 



sp:Vb46_^VNVJ 



Acc# 



Q55397 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



5235212iiiS I [3TT£ 



8341 



Protein name 



Locus Name 



isomerase like protein 



gp : ATFCA5 



Acc# 



297340 



Description 

Arabidopsis tnaliana DNA chromosome 4, ESSA I fca contig tragmentNo. 5. 



838 



ORF Name 



Protein name 



NT ID 



NT 



AA 



AAID Length Length 
T$2 



Score Probability 



Locus Name 



Acc# 



Description 
My^TTTT 



ORF Name 



Protein name 



Description 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length aL 



TTZT 



TWIT 



3.4e-56 



Locus Name 



sp:BGAL_XMMN 



Acc# 



P48982 



£ETA-GALaCTOS±DASeI PRlSCtJkSOk, (LACTASE) 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



Locus Name 



Acc# 



Description 
MO-HIT 



ORF Name 



NTID 



AAID 



lD.a2.3.3.D.a...c;2.„.iaZ I 



Protein name 



NT 



AA 



Length Length 

\s±z — 



Score Probability 



TUB" 



Locus Name 



Acc# 



Description 
ITO^TTT 



839 



NT 



AA 



ORF Name 



NT ID 



10603388 c3 138 



AAID Length Length 

mzz — 



Score Probability 

jzi — 



6.4e-31 



Protein name 



Locus Name 



sp:YEIH_EC10LI 



Acc# 



P33019 



Description 

HYPOTHETICAL 36.3 KB PROTEIN W LYSP-NFO ItiTERGEtilC ftEGIOti 



NT 



AA 



ORF Name 



NT ID 



11882812 ±3 64 



AAID Length Length 

srn — 



T4T 



WST 



Score Probability 
1.5e-0S 



Protein name 



Locus Name 



conserved Hypothetical protein TP0412 



Description 



(pir:B7l3i>7 



Acc# 



B71327 



NT 



AA 



ORF Name 



NTID 



AAID 



12Lfl4aift...c2...1Q& I WFZZ 



Length Length 



Score Probability 



WIT 



Protein name 



Description 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 

— , — . Score Probability 
Length Length 



riOABAZ.B....al..±ll I \5TTT 



8345 



T7T 



Tu"5T 



I.5e-i09 



Protein name 



Locus Name 



hypothetical protein 



pir: JQ1020 



Acc# 



JQ1020 



Description 



840 



ORF Name 



14730313 t3 ^ 



Protein name 



Description 



NTID 



AAID 



NT AA 

— — n Score Probability 
Length Length 



Locus Name 



sp:CLPB_SYNY3 



Acc# 



P74361 



CLPB PftOfElBf 



ORF Name 



15541008 c3 132 



Protein name 



NTID 



HIT 



AAID 



NT AA 

— — Score Probabil ity 
Length Length 



079 



Locus Name 



Acc# 



Description 



psfO-HlT 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



TTZT 



|2.3e-l77 



Locus Name 



hypothetical protein 



pir : JQ102Q 



ACC# 



JQ1020 



Description 



NT 



AA 



ORF Name 



NTID AAID Length Length 

— 



1ST 



Score Probability 
7.7e-35 



Protein name 



Locus Name 



conserved hypothetical protein aq_4 9 5 



pir :E70344 



Acc# 



E70344 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



1S.7.6.2.7...±1..A | wm 



TZTF 



TFTT 



i.6e-151 



Protein name 



Locus Name 



hypothetical protein slr0049 



pir:S74i47 



Acc# 



S74347 



Description 



841 



NT 



AA 



ORF Name 



NTID 



15703511 c2 108 



TT5T 



AAID Length Length 
— 



TUB" 



Score Probability 
TZ1 



3.8e-07 



Protein name 



63 JcDa protein 



Locus Name 
|gp:MBU73ggT~ 



Acc# 



U73653 



Description 



Mycobacterium bovis 6 3 JcDa protein, 4 7 kDa protein and cipB gene, compiete 
cds . 



NT 



AA 



ORF Name 



NTID 



l$7l88 c2 95 



AAID Length Length 
— 



TTT 



Score Probability 
TT3 



1. Oe-ll 



Protein name 



Locus Name 



clrgA protein: protein slr!719 :protexn slr!719 



pir : S75047 



Acc# 



S75047 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



313b 


8357 


210 


633 




241 



2.5e-20 



Protein name 



Locus Name 



conserved hypothetical protein 



pir:D75341 



Acc# 



D75341 



Description 



NT 



AA 



ORF Name 



NTID 



2.2l8£QI28....c3....140. I 



AAID Length Length 




Score Probability 




0.031 



Protein name 



Locus Name 



Acc# 



P36378 



Description 

lOSffiONfiCflKt) (OUt) (BASEMENT MfiMfel^El PROTEltf BM-40) 



842 



NT 



AA 



ORF Name 



NTID 



AAID 



\JTTT 



Length Length 
TT5~ 



T5T 



Score Probability 
I4.7e-12 



Protein name 



Locus Name 



conserved Hypothetical protein yr£>F 



pTr7E3W7T- 



Acc# 



E69972 



Description 



ORF Name 



NTID 



AAID 



2&5MA&2.±l...b:.L 



Protein name 



hypothetical protein SC7H2.05 



Description 



NT AA n _ i— ■ t ■ *. 
— — Score Probability 
Length Length 



T73~ 



5TT 



ST5TT 



5.6e-16 



Locus Name 



pir:T35736 



Acc# 



T35736 



ORF Name 



Protein name 



NTID 



AAID 



glucose/galactose transporter 



Description 



NT AA 

— — Score Probability 
Length Length 



TTTT 



7 .6e-44 



Locus Name 



pir:A7l^0 



Acc# 



A71850 



ORF Name 



Protein name 



NTID 



l±6A0.9.12..±2...1l). I 



NT 



AA 



AAID Length Length 
TIT 



Score Probability 



Locus Name 



Acc# 



Description 



ORF Name 



2^6AZ3.LZ...tl...l7..... 



Protein name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



pep t idyl -tRNA hydrolase 



Description 



i.3e-32 



Locus Name 



bir:S72229 



ACC# 



B72229 



843 



ORF Name 



Protein name 



Description 



RECA PROTEIN 



NT 



AA 



NTID 



AAID 



Length Length 



Tim- 



Score. Probability 
— 



S.4e-164 



Locus Name 



ACC# 



P22841 



ORF Name 



24881300 C2 111 



Protein name 

Description 
NO-HIT 



NT 



AA 



3143 



NTID AAID Length Length 

mzs — 



Score Probability 



700 



Locus Name 



Acc# 



ORF Name 



Protein name 



Description 



NTID 



NT AA 

, ^-..^ _ — ^ _ — _ Score Probabi lity 
AAID Length Length JU 



16.5.$Ab±C)....c.2.A4. I 



TUT 



$.4e-3£> 



Locus Name 



gp:AB024531 



Acc# 



AB024531 



Enterococcus seriolicicia SA2F01-1, =~2~; -3 genes, partial andcomplete cds . 



ORF Name 



NTID 



NT AA 

_ _ _ _ — ^. T — ^ Score Probability 
AAID Length Length JL 



3.U2J.!La2S....C3....14&.. 



|6.2e-17 



Protein name 



Locus Name 



conserved hypothetical protein 



bir:D75333 



Acc# 



D75333 



Description 



844 



ORF Name 



NTID 



AAID 



NT AA 
T — . , T — . . Score Probability 
Length Length. 



30519457 r2 29 



Protein name 



Locus Name 



Acc# 



Description 

NO-HIT 



ORF Name 



3.141.7.ia.7....ci...&i.. 



Protein name 



NTID 



tutt 



AAID 



NT 



AA 



Length Length 
735 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



sttit 



NT AA 

— , — , Score Probability 
Length Length 



7T" 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



AftIflai3....c2L...lCL5 1 



Protein name 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



TU7T 



TUT 



0.022 



Locus Name 



Acc# 



hypothetical protein ybbR 



Description 



pxr :A69745 



A69745 



845 



NT 



AA 



ORF Name 



NTID 



426337 il 7 



AAID Length Length 
W5T2 — 



TTJ" 



Score Probability 
TUZ 



6.0e-06 



Protein name 



Locus Name 



RNA polymerase sigma ractor SigZ-liJte protein 



[gp:AF137Z£T 



Acc# 



AF137263 



Description 



Bacteroides tnetaiotaomicron 3 OS ribosomal protein si6-iikeprotein, fucose 
gene cluster, and RNA polymerase sigma factorSigZ-like protein (sigZ) genes, 
complete cds . 



ORF Name 



4720511 r3 4§ 



Protein name 



NTID 



AAID 



WTTT 



NT AA 

— , — , Score Probability 
Length Length 



Locus Name 



Acc# 



Description 
MO-HIT 



ORF Name 



Protein name 



Description 



NTID 



NT AA 
_ — _ — Score Probability 
AAID Length Length 



T±5T 



MI 



1.7e-l7 



Locus Name 



sp:YJJU_EOTLI 



Acc# 



P39407 



HYPOTHETICAL 39.8 KD PROTEIN IN OSMY-DEOC INTERGENIC REGION (0357) 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length i * 



§375 



TuT" 



0.017 



Protein name 



Locus Name 



KIAA063 6 protein 



|gp:AB0i453£ 



Acc# 



AB014536 



Description 

Homo sapiens mRNA for KIAA0636 protein, complete cds . 



846 



NT 



AA 



ORF Name 



NTID 



4373765 ti 8 



AAID Length Length 
— 



Score Probability 
3.4e-08 



T2I 



Protein name 



Locus Name 



SigG 



gp:AFm&49 



Acc# 



AF121849 



Description 



Synechococcus PCC7002 SigG (sigG) and hypothetical protein genes , complete 
cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



5i2Sl562 c5 126 



wrrr 



Length Length 



Score Probability 
l.le-Od 



Protein name 



Locus Name 



putative secreted protein 



gp : SC4A7 



Acc# 



AL133423 



Description 

Streptomyces coelicolor cosmia 4A7 . 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



TO" 



Score Probability 
3.3e-34 



T72 



Protein name 



Locus Name 



transcription regulator LysR tamily 



pir:F70356 



Acc# 



F70356 



Description 



ORF Name 



NTID 



AAID 



aftllll...ci...ia£ | 13157 



Protein name 



N utilization substance protein B 



Description 



NT AA 

— — , Score Probability 
Length Length 



TUTT 



TTT 



Locus Name 



pir:D72212 



B.le-l2 



Acc# 



D72212 



847 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length J - 



976536 r3 6S 



Protein name 



probable ribosomal protein L2 5 



Description 



1UT 



PUT 



8.0e-i7 



Locus Name 



pir:H71S65 



Acc# 



H71665 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 
TZ1 



TUT 



Score Probability 

m 



Locus Name 



dihydro folate reductase, / thymidylate 
synthase, 



pir :T01684 



Acc# 



T01684 



Description 



NT 



AA 



ORF Name 



NTID 



10..7.3.S0.0.2...C2...1S. I PTSU 



AAID Length Length 




Score Probability 
l.Se-13 



TTJ 



Protein name 



Locus Name 



RNA polymerase sigma tactor SigZ-liKe protein 



|gp;AFl372gT" 



Acc# 



AF137263 



Description 



Bacteroides tnetaiotaomicron 30S ribosomal protein S16-HKeprotein, tucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



123.3.3.3.3. 0....t 2.. ..2.....! 



T7T 



|2.2e-05 



Protein name 



Locus Name 



hypothetical protein F14F9.5 



pir :T33774 



Acc# 



T33774 



Description 



848 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length ~ — ^ 



i.2e-57 



Protein name 



Locus Name 



Acc# 



115K outer membrane protein precursor : SusC 
protein 



bir:JC60ii7 



JC6027 



Description 



ORF Name 



NTID 



Protein name 



probable phosptioesterase, yvnB 



Description 



NT 



AA 



AAID Length Length 




Score Probability 
T*3 



Locus Name 



pir :C70044 



|3.0e-06 



Acc# 



C70044 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



£iflaaj.3L±i...a.... 



TTT 



0.00031 



Protein name 



Locus Name 



transmembrane sensor 



|gp:AF05l64l 



Acc# 



AF051691 



Description 



Pseudomonas aeruginosa stress tactor A (pstA) , ECF sigma tactor (tiulj , 
transmembrane sensor (fiuR) , and hydroxamate-typef errisiderophore receptor 
(fiuA) genes, complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
1791 



5B" 



Score Probability 
l.Se-42 



Protein name 



Locus Name 



Acc# 



mobilization protein C 



gp;APHB243 



AF118243 



Description 

Bacteroides fragilis mobilization protein C (mobCJ gene, completecds . 



849 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



5135158 cl 10 



TuTT 



3TF5" 



|S.0e-44 



Protein name 



Locus Name 



mobilization protein C 



gp:AP115243 



Acc# 



AF118243 



Description 



Bacteroides fragilis mobilization protein C (mobC) gene, completecds . 



NT 



AA 



ORF Name 



7112683 13 6 



NTID AAID Length Length 

3735 — 



Score Probability 

Tim — 



2.7e-l03 



Protein name 



Locus Name 



mobilization protein B 



|gp:AFll824i> 



Acc# 



AF118242 



Description 



Bacteroides tragilis mobilization protein B (mobB) gene, completecds . 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



1.3e-17 



Protein name 



Locus Name 



sp:HLYD_PA3HA 



Acc# 



P16534 



Description 

L2UK0T0XM SECRETION £>R0TE1N d 



NT 



AA 



ORF Name 



NTID 



AAID 



10.3.7.3..7.5.1...13....1.7.& 



Length Length 



Score Probability 
f^7T5 



8.8e-66 



Protein name 



Locus Name 



Acc# 



P28303 



Description 
DNA- DAMAGE- INDUCIBLE PROTEIN P 



850 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length 



1173187 c± U l l 



rrnr 



TTF" 



0.011 



Protein name 



Locus Name 



Hypothetical protein, MAL1P3.07 



|gp:PPMALlP3 



Acc# 



AL031746 



Description 



Plasmodium falciparum MAL1P3, complete sequence. 



ORF Name 



Protein name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



TT5T 



Locus Name 



Acc# 



Description 



MO-SI* 



ORF Name 



Protein name 



NTID 



JT7T 



AAID 



NT 



AA 



Length Length 
TUT" 



Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



TTTT 



AAID 



NT AA 

— — Score Probability 
Length Length 



5.0e-24 



Locus Name 



Acc# 



ligase 



Description 



pir:A70351 



A70351 



851 



NT 



AA 



ORF Name 



NTID 



134S87 ci 221 



TTPT 



AAID Length Length 



Score Probability 
3TJ3 



5.4e-27 



Protein name 



Locus Name 



Acc# 



sp:THIM_PEA 



Description 

THlORSDOXltf M-TY^El, CHLOROPLAST PRECURSOR (T&K-M) 



ORF Name 



NTID 



Protein name 



hypothetical protein slr20l3 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



Locus Name 



pir:£-75346 



Acc# 



S75346 



ORF Name 



NTID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



li2S5.5.0.S..±2...120. .J PT7F 



FF7~ 



5.5e-07 



Protein name 



Description 



Locus Name 



[gpTTmF7T 



Acc# 



U93872 



Kaposi's sarcoma-associated, herpesvirus glycoprotein M, DNAreplication 
protein, glycoprotein, DNA replication protein, FLICEinhibitory protein and 
v-cyclin genes, complete cds, and tegumentprotein gene, partial cds. 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



TT7T" 



TZT 



i.2e-32 



Protein name 



Locus Name 



hypothetical protein mexF 



pir :T30830 



Acc# 



T30830 



Description 



852 



ORF Name 



14557131 11 33 



Protein name 



Description 



NT 



AA 



NTID 



AAID Length Length 




Score Probability 



Locus Name 



Acc# 



MO-HIT 



ORF Name 



Protein name 



Description 



NTID 



AAID 



NT AA 

— — Score Pr obability 
Length Length 



TTTT 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



I5.7.6.0.2b.a...t2...lal.. 



Protein name 



YvrN protein 



Description 



NT 



AA 



NTID 



AAID Length Length 
TJTE 



441 



Score Probability 
|4.0e-3I 



TO 



Locus Name 



gp:BS43KBDMA 



Acc# 



AJ223978 



Bacillus subtilis 42.7kB DNA tragment trom yvsA to yvgA. 



NT 



AA 



ORF Name 



NTID 



3181 



AAID Length Length 



7JT 



Score Probability 
2 . 8e-ll 



170 



Protein name 



Locus Name 



hypothetical protein Rv36 95 



pir :H70792 



Acc# 



H70792 



Description 



853 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
MM — 



Score Probability 
ffT3 



0.00035 



Protein name 
Description 

HYPOTHETICAL 28.7 KD PROTEIN 1 iKf RElCA 3 ' REGION 



Locus Name 



sp:YRECjSYNE>2 



Acc# 



P19737 



NT 



AA 



ORF Name 



NTID 



AAID 



1700$642 c3 357 



34uT" 



Length Length 



1TT 



Score Probability 




|4.1e-20 



Protein name 



Locus Name 



rprY protein 



Acc# 



S33662 



Description 



NT 



AA 



ORF Name 



NTID 



3184 



AAID Length Length 




Score Probability 
0.00018 



TT3 



Protein name 



Locus Name 



pX02-46 



|gp:AF188«5 



Acc# 



AF188935 



Description 



Bacillus anthracis plasmict pX02 , complete sequence. 



NT 



AA 



ORF Name 



NTID 



2Q.5.U8.3.12....£1...3.9... 



AAID Length Length 




34u7 



■JUT 



Score Probability 




3 .le-14 



Protein name 



Locus Name 



acriilavin resistance protein AcrE 



pir:A703Si 



Acc# 



A70361 



Description 



854 



NT 



ORF Name 



NTID 



AAID Length Length 



AA 

— Score Probability 



T5T 



TUZT 



0.00085 



Protein name 



Description 



Locus Name 



sp:Y§76_Mli!TJA 



Acc# 



Q58286 



ptfTAflVfi AbC Tran sporter PERMEASE protein Muuavb 



ORF Name 



214540^1 12 ill 



Protein name 



NTID 



— — Score P robability 
AAID Length Length 



I23T" 



Locus Name 



Acc# 



Description 



DsfO-MiT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



S4TIT 



Length Length 

r7^ — 



Score Probability 
5 . 6e-64 



653 



Locus Name 



sp:t>YkH_li!<JoLi 



Acc# 



P29464 



(UMP KINASE) — (SMBA PkOTkllNJ 



NT 



AA 



ORF Name 



NTID 



AAID 



8411 



Length Length 
T¥H5 — 



Score Probability 



^3" 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



855 



NT 



AA 



ORF Name 



NTID 



AAID 



2266S577 ci 234 



Length Length 
1533 



"TUT 



Score Probability 
fTS 



0.024 



Protein name 
Description 

SAkLY 5.0 GLYCOPROTEIN 



Locus Name 



sp:E3ii_ADl!l0i 



Acc# 



P11317 



NT 



AA 



ORF Name 



NTID 



AAID 



2343376 tl 32 



34TT 



Length Length 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



DsfO-HlT 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



116.1$Xll.±-L.AX I pr^i 



8415 



573 



WIT 



l.le-40 



Protein name 



Locus Name 



ribosome recycling factor 



pir:C753S6 



Acc# 



C75386 



Description 



856 



NT 



AA 



ORF Name 



NTID 



AAID 



|23626b6l ti 188 



Length Length 
OTJ2 — 



Score Probability 



Protem name 



Description 



Locus Name 



gp:VCU47542 



Acc# 



U47542 



Vibrio cholerae ADP-L-giycero-D-mannoheptose-6-epxmerase (rtaD)gene, 
complete cds . 



ORF Name 



NTID 



NT AA 

— — , Score Proba bility 
AAID Length Length 



tl 



IMS" 



ITT 



2.0e-29 



Protexn name 



Locus Name 



catxon ettlux system (czcB-lxke) 



[pir:C704lS 



Acc# 



C70415 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



Protein name 



3T55" 



FT 



Locus Name 



Acc# 



Description 



KO-HIT 



ORF Name 



Protein name 



NTID 



7TST 



AAID 



NT 



AA 



Length Length 
E2TJI 



Score Probability 



SF 



Locus Name 



Acc# 



Descriptxon 



MO-SIT 



857 



ORF Name 



Protein name 



NTID 



NT AA 

— — Score Prob ability 
AAID Length Length 



■M71T 



TT5" 



Locus Name 



Acc# 



Description 



MO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



2iD.16.D..7..7...±l...ia... 



1TW 



Length Length 



Score Probability 
li.Se-08 



T7T 



Protein name 



Description 



Locus Name 



sp:Y7^_Mk!TJA 



Acc# 



Q58203 



tiY&OTftfi'l'lCAL PkOThJXN MJO'/yi 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



3200 



T7TT 



2.Se-40 



Protein name 



Locus Name 



DNA polymerase 



gp:AE , 063S4i 



Acc# 



AF083949 



Description 



Treponema denticola DNA gyrase sufcumt B igyrbj ana cnromosomairepiication 
initiator protein (dnaA) genes, complete cds; and DNApolymerase (dnaE) gene, 
partial cds. 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



\TZUT 



8423 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



858 



NT 



AA 



ORF Name 



NTID 



AAID 



2439687V ti Ifai 



JTUT 



Length Length 




Score Probability 
0.0025 



Protein name 



Description 



Locus Name 



sp:Y794JVWTJA 



Acc# 



Q58204 



HYPO'JHEl' 1 CAh PROTKiti MJ07^4 



ORF Name 



24406886 c3 320 



Protein name 



NTID 



AAID 



NT AA 

— — Score Pr obability 
Length Length 



TUB" 



3TB" 



Locus Name 



Acc# 



Description 



IN0-H1T? 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



— — Score P robability 
Length Length 

C5G5 



TUT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



18427 



7W 



2223 



1.9e-S3 



Locus Name 



heterocyst ditterentiation protein HetC 



pir:T31072 



Acc# 



T31072 



Description 



859 



ORF Name 



NT ID 



NT AA 

— — Score Pr obability 
AAID Length Length 



ci 338 



|7.7e-08 



Protein name 



Locus Name 



laminannase 



gp:AF04700J 



Acc# 



AF047003 



Description 

&hodothermus marinus strain 1Ti278 laminannase UamR) gene, complete cds . 







NT 


AA 


Score 


Probability 




ORF Name 


NTID AAID 


Length 


Length 






244S$377_t3_l75 


5207 §425 


149 450 


154 


b .be-uy 


Protein name 






Locus Name 


Acc# 


hypothetical protein SC2El.iy S02E1.19 


pir:«4787 


T34787 


Description 
















NT 


AA 


Score 


Probability 




ORF Name 


NTID AAID 


Length 


Length 






lUA^Al^tl^l'lL 


3208 8430 


120 363 


168 


i . 4e-i^ 


Protein name 






Locus Name 


Acc# 


MmcQ 


gp:AF127J74 


AF127374 


Description 




Streptomyces lavenauiae LinA nomoiog, 


cytochrome P4bu nyaroxyi. 


aseuKU'4 , 




cytochrome P450 hydroxylase 0RF3 , MitT 


(mitT) , 


MitS (mitS) ,MitR 


(mitR) , MitQ 




(mitQ) , MitP (mitP) , 


MitO (mitO) , MitN 


(raitN) f MitM (mitM) , MitL 


(mitL) , MitK 




(mitK) , MitJ (mitJ) , 


MitI (mitI),MitH (mitH) , MitG (mitG) , MitF 


(mitF) , MitE 




(mitE) , MitD (mitD) ,MitC (mitC) , MitB (mitB) , MitA (mitA) , MmcA 


(mmcA) , MmcB 








NT 


AA 


Score 


Probability 




ORF Name 


NTID AAID 


Length 


Length 




2AMM2:L..qX.:±A& 


3209 8431 


232 699 


118 


u . UUU1Z 


Protein name 






Locus Name 


Acc# 


conserved nypotheti 


cal protein 




pir :A7222U 


A72220 



Description 



860 



NT 



AA 



ORF Name 



NTID 



24883260 t2 liU 



TTDT 



AAID Length Length 

mrm — 



Score Probability 
7 . 8e-90 



897 



Protein name 



Locus Name 



acritiavin resistance protein D tacru) RP170 I |pir :F71727 



Acc# 



F71727 



Description 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 




3211 


8433 


186 




359 


7.0e-32 


Protein name 








Locus 


Name 


Acc# 



immunoreactive 8 9KD antigen PCi8 7 



gp:AP17b722 



AF175722 



Description 

Porphyromonas gingivalis strain WbU immunoreactive 89&D antigenPCi«7 gene, 
complete cds . 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Pr 


obability 


255.0±5A2..±2...i2b. 


3212 


8434 




1578 


117 




0.00071 


Protein name 








Locus 


Name 




Acc# 



beta-i, 4-gaiactosyitranst erase IV 



|gp:AB02445F 



AB024436 



Description 

Homo sapiens mRNA tor beta- 1 , 4-galactosyltranst erase IV, compietecas. 



ORF Name 



NTID 



NT AA 

— — Score Probab ility 
AAID Length Length 



TUT 



MT5" 



74" 



TZT 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



861 



NT 



AA 



ORF Name 



NTID 



2«708&a fi 148 



AAID Length Length 



Score Probability 
4 . le-20 



Protein name 



Locus Name 



conserved nypotnetical protein yaiB 



pir:C6«J7tib 



Acc# 



C69786 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



^5Mm.Li...m t |32T? 



1596 



Score Probability 
3.9e-73 



Protein name 



Description 



Locus Name 



sp:DMAK_HALMA 



Acc# 



Q01100 



DNAK PRtfl ' mM (HEAT PkOTii lH 70) (Htit>70) 



ORF Name 



NTID 



NT AA 
— — , Score 
AAID Length Length 



WTT 



[ITB" 



Probability 
0.0012 



Protein name 



Locus Name 



enterotoxm 



|gp:AF1^276^ 



Acc# 



AF192766 



Description 



Bacillus cereus strain AelO enterotoxm mRNA, complete cas. 



NT 



ORF Name 



NTID 



AAID Length Length 



AA 

— , Score 



|29.5i115lZ6....c1...ZZQ.. 



JZTT 



TUTT 



\TZTT 



1446 



Probability 
1.3e-l70 



Protein name 



Locus Name 



|sp:DP3AJJokbU 



Acc# 



051526 



Description 
DNA POLVMflRASE ill, ALPHA CHAIN , 



862 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



31327701 rl iB 



273 



5£T 



Protein name 



Locus Name 



conserved hypotnetrcal protein MTH6 Ufa 



Description 



pir :fc;6yi8u 



1.5e-54 



Acc# 



E69180 



ORF Name 



Protein name 



NTID 



— — Score P robability 
AAID Length Length 



18441 



Locus Name 



Acc# 



Description 



N0-U1T 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



Length Length 

— 



149 



Score Probability 
7.8e-42 



444 



Locus Name 



gp : PGPGAAc^N 



Acc# 



X95938 



£>.gingivalis rnhB & pgaA genes & orts lbo, 197, 'zoz & ±yy. 



NT 



AA 



ORF Name 



|3.3.a.7.6.0.6.7....c2....15.S. I |332T 



NTID AAID Length Length 

msz — 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



863 



ORF Name 



34254387 ci '21$ 



Protein name 



NTID 



TZTT 



NT 



AA 



AAID Length Length 




Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



3.44§.2J>.2i*...ci..J.4u... 



Protein name 



NTID 



3223 



AAID 



NT 



AA 



— — Score P robability 
Length Length 



Locus Name 



Acc# 



Description 



MO -HIT 



ORF Name 



3.6.120.a42...c3....3.b.y.., 



Protein name 



NTID 



NT AA 

— — Score Pr obability 
AAID Length Length 



7.7e-0!> 



Locus Name 



immunoreactive B3KD antigen puav 



|gp:API7b7^2 



Acc# 



AF175722 



Description 



Porphyromonas gingivalis strain WbU immunoreactive 8 yKD antigenPGBV gene, 
complete cds . 



NT 



AA 



ORF Name 



3.&21&&&3...±2...B.3.., 



NTID AAID Length Length 

m 11275 — 



Score Probability 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



864 



ORF Name 


\TPTfi 
iN J. ±U 






NT 
Length 


AA 

— , Score 
Length 


Probability 




3226 




8448 


311 


936 546 


1.2e-52 | 


Protein name 












Locus Name 


Acc# 


conserved nypotnetical prote 


m 






pir:A7221y 


A72219 


Description 


ORF Name 


NT ID 




AAID 


NT 
Length 


AA 

— l Score 
Length 


Probability 


3.9.3.7.:7.:/.b....CZ..^.lL 


3227 




8449 


1065 


3X38 468 


~j 4.9e-83 


Protein name 












Locus Name 


Acc# 


115K outer membrane 


protein 


precursor : SusC 




pir:JC6U2V 


JC6027 


protein 
















Description 
















ORF Name 


NT ID 




AAID 


NT 
Length 


AA 

— Score 
Length 


Probability 


1SA4.±S.Q...±±.&2 


322$ 




8450 


143 


432 




Protein name 












Locus Name 


Acc# 


Description 
















NO-HIT | 


ORF Name 


NTID 




AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


iafi£3iS5.a...ci...2L4i 


3225 




84Si 


73 


222 |li3 


1.3e-07 


Protein name 












Locus Name 


Acc# 


immunoreactive 89KD 


antigen 


PG87 






gp:AP17!DV22 


AF175722 



Description 



Porphyromonas gingivalis strain W50 immunoreactive 82KD antigenPG87 gene, 
complete cds. 



865 



ORF Name 



4031301 c2 281 



Protein name 



NTID 



TUT 



NT 



AA 



AAID Length Length 
2UT 



Score Probability 



55" 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



NTID 



tttt 



AAID 



NT 



AA 



Length Length 



Score Probability 



TIT 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



TUT 



S454 



TTT 



TUT 



Locus Name 



microbial coiiagenase, precursor : cog protein I ipir : JC43y3 



Description 



0.00074 



Acc# 



JC4393 



ORF Name 



Protein name 



Description 



NTID 



TUT 



NT 



AA 



AAID Length Length 



Score Probability 



TTT 



TIT 



Locus Name 



sp:APX_STRUk 



|4.6e-17 



ACC# 



P80561 



866 



/^"O "C* "NT —I TV! ^ 


NTID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


472553«_ti_i7i 




§455 


239 


720 225 


4.8e-iy | 


Protein name 








T .or*! l c» "fsja mp 

±J^J \^ LL O J.NCLL1LG 


Acc# 










sp:P^_HELW 




Description 












(PHOSPflAflftVLSfiftlNli SYNTriASE) | 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


4S65802_t3_l70 




§457 


251 


372 


3.3e-34 


Protein name 








Locus Name 


ACC# 



unKnown 



gp;NGU5476 0 



U34760 



Description 

Neisseria gonorrhoeae UvrA (u vrA) and ORF2bb? genes, complete cds , 



ORF Name 



NTID 



— — Score Probability 



AAID Length Length 



\A9£Al&l...z2..J.ti.± 



TUT 



11227 



8.5e-12b 



Protein name 



Locus Name 



immunoreactive 8 9KD antigen j*iav 



|gp:AF17b7Z2~ 



Acc# 



AF175722 



Description 

Eorphyromonas gmgivalis str ain Wbu immunoreactive wkd antigenPG87 gene, 
complete cds. 









NT 


AA 


Score 


Probability 


ORF Name 


NTID 


AAID 


Length 


Length 








|5.3.5.2lB.7....cl...i5.6. 


5257 


8459 


505 


912 


755 


3.3e-7b 





Protein name 



Locus Name 



hypothetical protein phuv 7b 



pir:£7ll26 



Acc# 



B71126 



Description 



867 



ORF Name 



NTID 



547525 c2 2vl 



Protein name 



hypothetical protein sir 14 7 B 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



Locus Name 



pir:S7b694 



Acc# 



S75694 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 
2679 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



RprX 



Description 



NTID 



AAID 



— — Score Probability 
Length Length 



17W 



2.2e-21 



Locus Name 



gp:S5yuuu 



Acc# 



S59000 



ORF Name 



Protein name 



NTID 



AAID 



nypotneticai protein 



Description 



— — Score Probability 
Length Length 

\2TT 



AA 



fZJT 



|7.0e-i8 



Locus Name 



bir:A?S6i:i 



Acc# 
A75613 



ORF Name 



Protein name 



NTID 



AAID 



TIFT 



— — Score Probability 
Length Length 



ITT 



Locus Name 



Acc# 



Description 



(NO-HIT 



868 



ORF Name 



11907501 c2 10b 



Protein name 



Description 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



Locus Name 



Acc# 



[MO -MIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



3244 



Length Length 
JT7 



Score Probability 



TUT 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



Protein name 



VicK protein 



Description 



NT 



AA 



NTID 



AAID 



Length Length 
— 



Score Probability 
5.6e-!^ 



Locus Name 



gp:EPA01^oSu 



Acc# 



AJ012050 



Enterococcus taecaiis vie operon and tlanJting genes. 



ORF Name 



NTID 



— — Score P robability 
AAID Length Length 



2.113.aL2L...tZ..3.6. 



T7ZZ~ 



TIT 



WTT 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



869 



ORF Name 



NT ID 



122062410 c2 lUB 



Protein name 



— — Score Probability 
AAID Length Length 



hypotnetical protein 



Description 



PI 



Locus Name 



tpir:^76lb2 



|4.ie-i39 



Acc# 
S76152 



ORF Name 



Protein name 



Description 



NT 



AA 



NT ID 



AAID Length Length 



Score Probability 



1855 



Locus Name 



Acc# 



IN0-H1T 



ORF Name 



Protexn name 



NT ID 



NT 



AA 



AAID Length Length 
TTT 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



— — Score Probability 



AAID Length Length 



26.16.026±±1..U, 



3250 



582 



nrrr 



i.8e-0^ 



Protein name 



Locus Name 



gp : MMSACi 



ACC# 



X84710 



Description 

M.mazei surface antigen genes ort4y2, orti37b ana orf/BJ. 



870 



ORF Name 



NT 



AA 



NTID MID Length Length 



Score Probability 



23804754_c2_9:i 


3251 


847^ 


413 


1242 








Protein name 








Locus 


Name 


Acc# 


Description 
















MO-HIT 1 


ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


%A0m6.11.±2...1B. 


3252 


8474 | 


702 


2109 




155 


| 4.Se-l^ 



Protein name 



Description 



Locus Name 



Acc# 



sp:M'Ub_KC0LT | P06129 



VITAMIN Bl2 feEdHP'l'Ok PftE^Uk^Ok 



ORF Name 



NTID 



AAID 



— — Score Probability 

Length Length 



Protein name 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



|&m.7.S.l..±3....b.b.., 



Protein name 



NTID 



8476 



NT 



AA 



AAID Length Length 



Score Probability 



ITT 



ITT 



FT7T 



Locus Name 



6.£e-4$ 



Acc# 



response regulator DrrA 



Description 



pir :D722l>8 



D72228 



871 



NT 



AA 



ORF Name 



NT ID 



4878387 c3 ±22 



AAID Length Length 



Score Probability 



WT7 



TZUT 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


49.5.46.9.1^1^3.2 


3256 


8478 


270 


810 


213 


|2.4e-17 



Protein name 
Description 

PROBABLE! H^aT^DlJ^jOL ~M05!>MAl h Asll! / 



Locus Name 



lsp:HIS9_SCHP0 



Acc# 



014059 



NT 



AA 



ORF Name 



NTID 



AAID 



ftaiiaai..ci...ai.. 



M7T 



Length Length 



Score Probability 
$.4e-06 



1^ 



Protein name 



Locus Name 



hypothetical protein 



bir:A756l3 



Acc# 



A75613 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Prob ability 
Length Length 



10.3.5.13.8.i...cl...Zlfo.., 



1230 



5TT" 



1 .4e-49 



Protein name 



Description 



Locus Name 



sp:GLtTP__BkUAB 



Acc# 
Q44623 



GLUCOS E /GALACTOSE TRANSPORTER 



872 



NT 



AA 



ORF Name 



NTID 



AAID 



110683 t2 ill 



Length Length 



Score Probability 
ME 



1.0e-09 



Protein name 



Description 



Locus Name 



|gp:PVPVAi 



Acc# 



X92485 



P . vivax pval gene . 



NT 



AA 



ORF Name 



NTID 



AAID 



112506407 c2 261 



Length Length 



Score Probability 
T&S 



I2.fie-i7 



Protein name 
Description 

PHOS PHOGL YCOLATE PHOSPHATASE, (PGP) 



Locus Name 



Acc# 



P32662 



ORF Name 



NTID 



NT AA 
_ — _ — Score Probability 
AAID Length Length JL 



TIFT 



Protein name 
Description 



Locus Name 



sp:DBHA_SALTy 



Acc# 



P15148 



NT 



AA 



ORF Name 



NTID 



AAID 



1112.7.0.5.0...±2...10.2 1 



Length Length 



Score Probability 
TT2 



6.0e-06 



Protein name 



Locus Name 



RNA- directed DNA polymerase, , msDNA 
specific : DNA nucleotidyltransferase 
(RNA-directeti) ; reverse transcriptase ;revertase 



pir :S19248 



Acc# 



S19248 



Description 



873 



NT 



AA 



ORF Name 



NTID 



14226625 ±1 1 



_ — — — Score Probability 
AAID Length Length ^ 

MSB — 



TFT 



Protein name 



Description 



Locus Name 



sp:YG7 7_METJA 



Acc# 



Q59071 



HYPOTHETICAL PROtffilM MJl6l1 



ORF Name 



NTID 



NT AA , , . . 

, , — — _ — Score Probability 
AAID Length Length JL 



14<>58557 cl 200 



8486 



I3T5" 



5.9e-l24 



Protein name 



Description 



Locus Name 



Acc# 



XANTHOSI S PERMEASE (XANTHOSIS TRANSPORTER) 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
BT5"7 — 



Score Probability 
[2TJ2 



3.5e-l6 



Protein name 



Locus Name 



transcription regulator Crp/Fnr tamily 



pir :A70344 



Acc# 



A70344 



Description 



ORF Name 



Protein name 



NTID 



NT AA 
— — Score 
AAID Length Length 



7^- 



Locus Name 



Probability 
|1.8e-58 



Acc# 



Description 



sp : KD&A_CHLP<5 



Q46225 



8 -PHOSPH ATE ^NTH S TAaE) (KDO 8-P SYNTHASE) 



874 



NT 



AA 



ORF Name 



NTID 



AAID 



15205215 c3 514 



Length Length 

its — I mi — 



Score Probability 




l.le-56 



Protein name 



Locus Name 



probable tRNA 
delta (2) - isopentenylpyrophosphate transferase 
(miaA) 



pir:B71501 



Acc# 



B71301 



Description 



NT 



AA 



ORF Name 



NTID 



l£45$6S2 cl 20§ 



AAID Length Length 

mm — 



Score Probability 



TZTF 



Protein name 



Locus Name 



immunoreactive 106 JcDa antigen PG115 



gp:AFl53767 



Acc# 



AF153767 



Description 



Porphyromonas gmgivaiis strain W50 immunoreactive 106 JcDa antigenPG115 
gene, complete cds . 



ORF Name 



NTID 



NT AA 
T — ^, r — Score Probability 
AAID Length Length ^~ 



±6.B3£M3....zl...l2S. I 



|2.5e-26 



Protein name 



Locus Name 



lsp:MUi7_T0]JA<! 



Acc# 



P30155 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



l7M4&3La...al...2L&l I [TT7U 



Length Length 
T7u~ 



Score Probability 



1113 



Protein name 

Description 
INO-UtT 



Locus Name 



Acc# 



875 



ORF Name 



Protein name 



Description 

smmsm 



NT 



AA 



NTID 



AAID 



Length Length 



Score 



[TU2T 



Probability 
|2.Se-i7 



Locus Name 



sp : GLGA_BACST 



ACC# 



008328 



ORF Name 



12189:1753 t3 liS 



Protein name 

Description 
pFTITT 



NTID 



AAID 



NT AA 
T — -I, t — Score Probability 
Length Length JL 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

t 4_u t — 4-i- Score Probability 
Length Length L 



3273 



|b.3e-20 



Locus Name 



Description 
j HYPOTHETICAL 18.1 KD PROTEIN RV1529 



Acc# 



Q50604 



NT 



AA 



ORF Name 



Zib.l7.0.6.2...c2...2!10. I I3T7¥ 



NTID AAID Length Length Probability 

|5?55 — 



IXTST 



i.4e-S5 



Protein name 



Locus Name 



Acc# 



tiypotnetical protein 



gp:L>y0734 



Description 

hiscnericiiia colx genomic DNA . - 22 . 3 min) 



876 



ORF Name 



NTID 



NT AA 

AAID Lenjth Lenjth Probability 



24025302 c2 264 



\TZ5T 



TT¥TT 



i.4e-ii5 



Protein name 



Locus Name 



glutamate syntnase, beta subunit 



pir :H"/2230 



Acc# 



H72230 



Description 







NT 


AA 


NTID 


AAID 


Length 


Length 


..| 3276 


8498 


507 


1524 



ORF Name 



2425.7.S.0.7....ci...2L21 1 HTTZ 



Protein name 
Description 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



13277 



AAID Length Length 
18499 



[Ju3" 



I9TF" 



Score Probability 
1553 



Ei.le-89 



Protein name 



Locus Name 



sp : YPGA_PORGI 



Description 

HYPOTHETICAL R£> £&0TS1N Itif &mt h-PGAA lOTERGErtTg REGION 



Acc# 
Q51834 



NT 



AA 



ORF Name 



NTID 



AAID 



8500 



Length Length 



Score Probability 



Protein name 

Description 
jMO-HTT 



Locus Name 



Acc# 



877 



NT 



AA 



ORF Name 



NT ID 



MtiOlte 1 } c3 340 



TTTT 



AAID Length Length 

mm — 



Score Probability 




i.3e-23 



Protein name 



Description 



Locus Name 



sp:Y03M MYCTU 



Acc# 



Q10647 



hypothetical 25.7 kd pfeOTHtisr CY130.22 



ORF Name 



NT ID 



NT AA 1 1 . n , 

AAID Le^th Length Probability 



2482l0$4 c2 231 



1WT 



T5T 



l.6e-3S 



Protein name 



Locus Name 



putative vicilm storage protein 



ATAC00613S 



Acc# 



AC006135 



Description 



Arabidopsis tnaliana chromosome II BAC F24H14 genomic sequence, complete 
sequence . 



NT 



AA 



ORF Name 



NT ID AAID Length Length 

— 



K77T 



Score Probability 

r77 



0.045 



Protein name 



Locus Name 



Hypothetical protein aq_l25 



pir:B70312 



Acc# 



B70312 



Description 



ORF Name 



NTID 



2.5.5.Q£.5.53...±1...$A 



Protein name 

Description 
INO-HIT 



AAID 



NT 



AA 



Length Length 



Score Probability 



1170 



Locus Name 



Acc# 



878 



NT 



AA 



ORF Name 



NTID 



25589087 c3 322 



AAID Length Length 

w&js — 



Score 



T7F" 



Probability 
|4.5e-23 



Protein name 

Description 
LIPOPROTEIN SPR PftfiCOftSOft 



Locus Name 



Acc# 



sp:SPR_EC0LI 



NT 



AA 



ORF Name 



NTID 



AAID 



±3 113 



Length Length 



Score Probability 




l.3e-57 



Protein name 
Description 

PUTATIVE ALPHA- AMYLASE , 



Locus Name 



sp :AMYjVf£'rJA 



Acc# 



Q59006 









NT 


AA 


ORF Name 


NTID 


AAID 


Length 


Length 


^Za:/.5L5ii™t2.™6.fe 


32S5 


^507 


61 


186 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 

or 



Score Probability 




I.3e-16 



Protein name 
Description 

50S RIBOSOMAL PROTEIN L21 



Locus Name 



sp:£L21__HAEIN 



Acc# 



P44359 



879 



NT 



AA 



ORF Name 



NT ID 



AAID 



26449224 cl 169 



Length Length 
T53" 



T7T" 



Score Probability 

— 



5.7e-23 



Protein name 



Locus Name 



hypothetical protein C11G6.3 



pir :T19201 



Acc# 



T19201 



Description 



NT 



AA 



ORF Name 



NTID 



Z6.mCL03.D....r2....XD3. I 13288 



AAID Length Length 
FSTu" 



Score Probability 
fZZTS — 



|4.7e-2S7 



Protein name 



Locus Name 



heme uptake protein A and B 



gp:AF143945 



Acc# 



AF143945 



Description 



Porphyromonas gingivaiis heme uptake protein A and B gene, completecds . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 




1341 



Score Probability 
WTE 



1 . 2e-35 



Protein name 



Locus Name 



sensory transduction histidme kinase 
slr2098 :protein slr2098 :protein slr2098 



bir:^75130 



Acc# 



S75130 



Description 



NT 



AA 



ORF Name 



NTID 



ittM5M.5....ziJ±b.6. I rrzwu 



AAID Length Length 
— 



Score Probability 
7T7 



Protein name 



Locus Name 



Vexp2 



gp:AP1407S4 



Description 



7.7e-19 



Acc# 



AF140784 



Streptococcus pneumoniae Vexpl (vexl) , Vexp2 (vex2) , Vexp3 (vex 3 ) , and P2 8 
(pep27) genes, complete cds . 



880 



ORF Name 



NT ID 



NT AA . . 
^„ ^ _ — ^. ,. — ^. Score Probability 
AAID Length Length JL 



30743750 ±2 100 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NT ID 



AAID 



im3.7.m..±2...ai.. 



I55TT 



Length Length 



Score Probability 
7ST5 



|i.3e-77 



Protein name 



Locus Name 



thiore&oxm reductase 



|gp:AF124757 



Acc# 



AF124757 



Description 



Zymomonas mobilis tosmicl clone 43D2, complete sequence. 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 




7T4 - 



Score Probability 
TU1 



0.00050 



Protein name 



Locus Name 



Hypothetical protein SC66T3.28C 



|pir:T3S3S5 



Acc# 



T35385 



Description 



NT 



AA 



ORF Name 



\±±26±ti3±..Ql..216. I [TIM 



NTID AAID Length Length 

mrz — 



Score Probability 
Ttt 



7.7e-36 



Protein name 



Locus Name 



putative vicilm storage protein 



gp:ATAC006l35 



Acc# 



AC006135 



Description 



AraJoidopsis thaliana chromosome II BAC F24H14 genomic sequence , complete 
sequence . 



881 



ORF Name 



.35595442 tl 55 



Protein name 



NTID 



NT AA 
T — ^ T — ^, Score Probability 
AAID Length Length A ~ 



18517 



T7T 



Locus Name 



Acc# 



Description 
NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



^5T3~ 



NT AA 
— — Score 
Length Length 



7T" 



Locus Name 



Probability 



Acc# 



Description 
NO-HIT 



ORF Name 



NT 



AA 



NTID 



3.£.mMI...Gl...l7.0. I 



AAID Length Length 




Score Probability 




3.1e~15 



Protein name 



Description 



Locus Name 



Acc# 



gp:SCYKL202W 



S.cerevisiae chromosome XI reading trame ORF YKL202w. 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length ^ 



3.6.2. 2. 5. 2.5.0.... 1 3... .112... 



|4.3e-40 



Protein name 



Locus Name 



probable lipopolysaccharicle 
N-acetylglucosaminyltransf erase, rfbU 



pir :F64500 



Acc# 



F64500 



Description 



882 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length ^ 



13913411 c3 



1.4e-119 



Protein name 



Locus Name 



beta-N-acetylglucosaminidase 



gp:AF072374 



Acc# 



AF072374 



Description 



Pseudoalteromonas sp. 39 beta-N-acetylglucosamimdase (chiQ) gene, complete 
cds . 



ORF Name 



4004016 11 Si 



Protein name 



NTID 



jjutt 



AAID 



NT 



AA 



Length Length 




Score Probability 



35 



Locus Name 



Acc# 



Description 
MO-HIT 



ORF Name 



I4ii22iaci..±i..i 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 
|1.4e-27 



Locus Name 



glycogen debranchmg enzyme -related protein 



pir:H75549 



Description 



Acc# 



H75549 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



iAaa5aa7....c2L...24i i 



223S 



Protein name 



Locus Name 



sp:GLNA_6AC?& 



Acc# 



P15623 



Description 

GLUTAMINE SYNTHETASE, I GLUTAMATE - - AMMONIA LIGASE) (GS) 



883 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


4i046S7_c2_260 


|3303 


8525 


238 | 


717 


343 


4.0e-3i 



Protein name 



Locus Name 



abc transporter, ATP -.binding protein PAB16 96 



pir:H75077 



Acc# 



H75077 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 
F73 



6.3e-85 



Protein name 



Locus Name 



sp : SYS_AQUAE 



Acc# 



066647 



Description 

SERYL-TRNA SYNTHETASE, (SERINE- -TRNA LIGASE) (SERRS) 



NT 



AA 



ORF Name 



NTID 



AAID 



[J3TT5" 



8 52 7 



Length Length 
TTT 



Score Probability 
POT 



S.Oe-17 



Protein name 

Description 
HYPOTHETICAL PROTEIN HI0303 



Locus Name 



sp:YOTJ_HAlillN 



Acc# 



P44627 



NT 



AA 



ORF Name 



NTID 



AAID 



I&4D.&S.17....C1...2.0.& I 



Length Length 



Score Probability 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



884 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length -£ - 



|4490«7 c2 



7W 



WIT 



Protein name 



Locus Name 



ACC# 



P37556 



Description 

HY£ ) OTriEiT t lCAL 66 .1 KB PROTEIN IN MgD-DtVitC tfrffiRflENlC ftBGlOti 



NT 



AA 



ORF Name 



NT ID 



AAID 



SHUT 



Length Length 



Score Probability 
10.0012 



Protein name 

Description 
HYPO T HE T ICAL PkOTLlJIN MJ06A7 



Locus Name 



sp:^87_METJA 



Acc# 



Q58100 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



|48..7.£5.a7...±1...5.7. 



|4.6e-195 



Protein name 



Locus Name 



valine- -tRNA ligase, 



pir :D72206 



Acc# 



D72206 



Description 



ORF Name 



Protein name 



NTID 



conserved hypothetical protein 



Description 



NT 



AA 



AAID Length Length 




3TT 



Score Probability 




|2.1e-25 



Locus Name 



pir :F72386 



ACC# 



F72386 



885 



NT 



AA 



ORF Name 



NTID 



AAID 



14897750 c J 2 2B!> 



Length Length 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



|AMd0.1I..±2...10.S I 13312 



T5T 



7.0e-lfl 



Protein name 



Locus Name 



RNA polymerase sigma-E tactor 



Description 



Acc# 



B72234 



ORF Name 



£110.3.£2...a2...24$... 



Protein name 



DNA helicase 1 



Description 



NTID 



AAID 



NT AA 

— t — , Score Probability 
Length Length 



TUT 



|i.3e-17 



Locus Name 



pir :T14895 



Acc# 



T14895 



ORF Name 



NTID 



PITT" 



Protein name 

Description 
TRANSCRIPTASE ) 



AAID 



NT 



AA 



Length Length 
2FT 



FF4" 



Score Probability 

pus — 



Locus Name 



sp:Rf65_MYXXA 



2.2e-34 



Acc# 



P23071 



886 



NT 



AA 



ORF Name 



NTID 



657555 ti 44 



AAID Length Length 
— 



Score Probability 
IMS — 



Protein name 



Locus Name 



Acc# 



sp:SP3B_BACSU 



Description 

STAGE ill SpOruLaticM PkOTElIrt e 



ORF Name 



NTID 



NT AA 

_ _ _ — _ _ — Score Probability 
AAID Length Length *~ 



S1S675 ti § 



IT4" 



5.0e-3l 



Protein name 



Locus Name 



PanD protein 



|gp:WSAJ3u4S 



Acc# 



AJ003049 



Description 



wolineiia succmogenes nycLD, nyciE, panD and ispA genes; ort!02 anciorr.341. 



ORF Name 



NTID 



NT AA 
T — T — ^ Score Probability 
AAID Length Length 



B.2.2..7.5.a...t3....11.6.. 



3TT7" 



|1.3e-57 



Protein name 



Locus Name 



pantoate- - tie ta~ alanine ligase 



Acc# 



E72296 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



l&£5.£ul2L...cl..i££ I |33Tff 



Protein name 



Locus Name 



2.ue-u§ 



Acc# 



ToW 



gp:RLU40388 



Description 



U40388 



RnizoJaium leguminosarum positive regulator of po£>A (poJoRJ gene, complete 
cds, and 4-hydroxybenzoate hydroxylase (pobA) gene, partial cds . 



887 



ORF Name 



NTID 



10557812 c2 233 



JJTT 



Protein name 



hypothetical protein PAB0040 



Description 



NT 



AA 



AAID Length Length 
— 



323" 



Score Probability 




Locus Name 



pir :B75194 



3.6e-21 



Acc# 



B75194 



NT 



AA 



ORF Name 



NTID 



332u~ 



AAID Length Length 
— 



FITT 



Score Probability 
T3"5 



5.6e-07 



Protein name 



Description 



Locus Name 



sp:YA52_HAEIN 



Acc# 



P45008 



HYPOTHETICAL TRANS I PT I ONAL REGULATOR HI1052 



ORF Name 



NTID 



AAID 



NT AA 
, — L , , — ^ Score Probability 
Length Length 



iD.ai5.s.3.s...±i...i3.d I rmr 



T55" 



Protein name 

Description 
pSfO-HlT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 
^ — , ^ — L1 Score Probability 
Length Length lL ~ 



TTZT 



8544 



WITT 



Protein name 

Description 
K0-H1T 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 

— , — ^ Score Probabi lity 
Length Length J ~ 



7W 



1.3e-78 



Protein name 



Locus Name 



hypothetical protein HP0513 



pir :A64584 



Acc# 



A64584 



Description 



888 



NT 



AA 



ORF Name 



NT ID 



AAID 



12568877 tl 13 



Length Length 



EST 



Score Probability 

izra — 



2.1e-16 



Protein name 
Description 



Locus Name 



sp : FPQJACLC 



Acc# 



P42371 



NT 



AA 



ORF Name 



NTID 



i2§-?88u2 c2 218 



AAID Length Length 



3547 



TuTT 



Score Probability 
5U2 



S.Se-48 



Protein name 



Locus Name 



OprM 



gp:AB0ll^8l 



Acc# 



AB011381 



Description 



Pseudomonas aeruginosa gene tor OprM, complete cds . 



ORF Name 



NTID 



AAID 



NT AA 
T — . , T — , . Score Probability 
Length Length 



I17.1117.8....C1...18.3... 



1.2e-164 



Protein name 



Locus Name 



proJoattle pyrophosphate- -tructose 6 -phosphate 
1 -phosphotransferase, beta subunit 



foir:C713:L2 



Acc# 



C71312 



Description 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length dL 



Protein name 



Locus Name 



Acc# 



KIAA0738 protein 



gp:AB01828i 



AB018281 



Description 



Homo sapiens mRNA tor KIAA073 8 protexn, complete cds. 



889 



ORF Name 



14644032 tl 19 



Protein name 



NTID 



NT AA 

— , — - Score Probability 
AAID Length Length ^ 



arsenate reductase 



Description 



T7F" 



BIT" 



TITT 



|1.3e-09 



Locus Name 



pir :B70360 



Acc# 



B70360 



ORF Name 



|li&£.3&3.7....cl...^5... 



Protein name 



NTID 



SOT- 



MID 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



I5.5.3.D.5....c2l...23.I.. 



Protein name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



TFT 



5.1e-80 



Locus Name 



Acc# 



sp:TRKH_ECOLI 



Description 

TRK SYSTEM POTASSltJM WfAKfi PkOTfelti 



ORF Name 



15.6.3..7.6....C3....Z53... 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 



— Score Probability 



Locus Name 



Acc# 



Description 
KO-HIT 



ORF Name 



Protein name 



NTID 



±6.6.0.11B.D....k1...2±1 



13332 



Hypothetical protein 3hp0462 



Description 



NT 



AA 



AAID Length Length 




Score Probability 
1.2e-52 



Locus Name 



pir:C7l$2$ 



Acc# 



C71929 



890 



ORF Name 



NT ID 



±1 16 



JJJT 



Protein name 



hypothetical protein 



Description 



NT 



AA 



AAID Length Length 

— 



Score Probability 
|2.3e-177 



TTTT 



Locus Name 



pir : JQ1020 



Acc# 



JQ1020 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length 



20AA16.26....alJ2.&b. I 



31T 



Protein name 



Locus Name 



unknown 



gp:AF025662 



Acc# 



AF025662 



Description 



Vibrio cholerae lipoprotein (vipAj and unknown proteins genes, complete cds. 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



T7T 



2.6e-73 



Protein name 



Locus Name 



|sp:HIS6_BA0SU 



Acc# 



034727 



Description 
HISF PROTEIN ( CYCLASE) 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



T5T 



7 . 2e-16 



Protein name 



Locus Name 



RNA polymerase sigma tactor SigZ-like protein 



gp:AF137263 



Acc# 



AF137263 



Description 



Bacteroides thetaiotaomicron 30S ribosomal protein S16-liJteprotem, tucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds . 



891 



ORF Name 



NT 



AA 



NTID 



— Score Probability 
AAID Length Length — J ~ 



Protein name 



wr 



Locus Name 



Acc# 



Description 



| sp:SPR0 mihA 



P36378 



(OSTEONECTIN) (OH) (BASEMfiNT MEMBRANE f>&0T£lN BM-40) 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length J ~ 



cl 1S7 



Protein name 



194 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



2assaia5L.ii.„ia£ i pn? 



Protein name 



Locus Name 



Acc# 



hypothetical protein ydeA 



Description 



C69777 



ORF Name 



NTID 



NT AA 

— , — n Score Probability 
AAID Length Length 



1126 



13 .8e-73 



Protein name 



Locus Name 



Acc# 



115K outer membrane protein precursor : susc 
protein 



pir : JC602V 



JC6027 



Description 



892 



NT 



AA 



ORF Name 



NT ID 



24227211 ti 30 



AAID Length Length 
TUT" 



JUT 



Score Probability 
TI^ 



|4.8e-iS 



Protein name 



Locus Name 



sp : YGBA_ECOLI 



Acc# 



P25728 



Description 

HYPOTHETICAL 13.S K£) PROTElIN XlSf FHiA-MtJTS 1n?ERCe!N1C REGION 



NT 



AA 



ORF Name 



NT ID 



24257786 r2 114 



AAID Length Length 
E7u 



S3" 



Score Probability 
[51 



.6.0032 



Protein name 



Locus Name 



hypotnetical protein pXOl-90 



pir:H«l02 



Acc# 



B59102 



Description 



AAID 


NT 
Length 


AA 

— . , Score 
Length 


Probability 




8565 


96 


251 131 


1 . 2e-08 



ORF Name 



NTID 



13343 



Protein name 



Locus Name 



unknown 



gp:LLU80410 



Acc# 



U80410 



Description 



Lactococcus lactis cremoris pnospnopentomutase (cleoB) and purinenucieoside 
phosphorylase (deoD) genes, complete cds. 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



3344 



1.7e-4€ 



Protein name 



Locus Name 



Acc# 



sp:YEaX_t!COLl 



Description 

HYPOTHETICAL 32.0 KD PROTEIN IN FfiM4-THID IMTERSENIC kUGlON 



893 



ORF Name 



NT ID 



AAID 



NT AA 
— — Scors 
Length Length 



124547750 c3 287 



flTT 



TFT 



Probability 
|2.0e-ii 



Protein name 



Locus Name 



probable general stress protein 26 



pxr:D754M 



Acc# 



D75431 



Description 



ORF Name 



NT ID 



AAID 



\2A6A&5.Ll...z±..±&£ I 



8568 



Protein name 



hypothetical protein SC3A7.16C 



Description 



NT AA 

— , — , Score Probability 
Length Length 



IUT 



VUT 



TTT 



Locus Name 



pir :T29435 



0.00044 



Acc# 



T29435 



ORF Name 



NT ID 



26.25.!$.11.±1...±$. I 1^47 



Protein name 



AAID 



S5ZT 



NT 



AA 



Length Length 



Score Probability 



1083 



Locus Name 



Acc# 



Description 



ORF Name 



NTID 



AAID 



afiia^fifti..ci„.iaa I inn 



f^Tir 



Protein name 



hypothetical protein APE0900 



Description 



NT AA 

— , — , Score Probability 
Length Length 



TUT 



TIT 



Locus Name 



BTFTUTI^T 



1.9e-08 



Acc# 



D72685 



ORF Name 



\26A±11±1.±±..A0.., 



Protein name 



NTID 



AAID 



TIFT 



NT 



AA 



Length Length 
71 - 



Score Probability 



TIT 



Locus Name 



Acc# 



Description 



InO-iM 



894 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length 



25557066 ci 214 



7TJT 



6.ie-42 



Protein name 



Locus Name 



Acc# 



ceil division ATP -binding protein ttsK 



pir :E709iy 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID 



3S7T 



Length Length 
2TS 



57 



Score Probability 
1.4e-05 



Protein name 



Locus Name 



hypothetical protein 



gp:SSU18330 



Acc# 



Y18930 



Description 



Sulrolobus soltataricus 2bl Kb genomic DNA rragment, strain vz . 



ORF Name 



NTID 



NT AA 

— — „ Score Probability 
AAID Length Length 



F5TT 



T7W 



TFT 



0.00048 



Protein name 



Locus Name 



TRK potassium uptake system protein (trKHj 
homo log 



bir:G^B4 



Acc# 



G69354 



Description 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID 



Length Length 
773 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



895 



ORF Name 



NT ID 



NT 



AA 



AAID Length Length 



Score Probability 



32125280 t% 154 



Protein name 



3354 




8575 


61 186 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



11163A±l...c±...2&b.. 



Protein name 



TTT 



2.Se-0S 



Locus Name 



Acc# 



proba&le araC tamily transcription regulator I Ipir :T35902 



T35902 



Description 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



maftm...c:L..m i msi 



7JT 



5.4e-43 



Protein name 



Locus Name 



Acc# 



phosphoribosyltormimino- 5 ammo imidazole 



gp:AB008676 



AB008676 



Description 



Escherichia coix 0157 DNA, map position at 46 mm., complete cds . 



896 



ORF Name 



34084407 t3 IBS 



Protein name 



NT ID 



NT 



AA 



AAID Length Length 
1ITS3 



Score Probability 



[STT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



3.3.28.3.^5.^1^1:/^. I 



AAID 



NT AA 
— — , Score 
Length Length 



Locus Name 



Probability 



Acc# 



Description 



KO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



4.0.5.M3.a..±1...3.b... 



T3W 



Length Length 




Score Probability 
|2.7e-64 



636 



Protein name 



Locus Name 



putative membrane transport protein. 



gp:SC!C7BA 



Acc# 



AL133220 



Description 



Streptomyces coelicolor cosmid C75A. 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Prob ability 
Length Length 



ilfltlSafl...ai...iflL2 1 



TUT 



5.0e-l7 



Protein name 



Locus Name 



hypothetical protein ysdA 



bir:G6^83 



Acc# 



G69983 



Description 



897 



ORF Name 



1417530b i2 l /2 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



TIFT 



Length Length 
Si- 



Score Probability 



Locus Name 



Acc# 



MO-HIT 



ORF Name 



427.M..±1...17. 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



Length Length 
7^ 



Score Probability 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



15364 



Length Length 



Score Probability 
E.3e-52 



Locus Name 



sp:HiS2_KLEM 



Acc# 



024714 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 
TTZZ — 



Score Probability 
3.5e-07 



152 



Locus Name 



chromosome assembly protein homo log 



prr :B703b6 



Acc# 



B70356 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



-TUT 



Score Probability 
0.017 



7^ 



Protein name 



Locus Name 



hypothetical protein YHR16 7w 



pir :S526oy 



Acc# 



S52609 



Description 



898 



ORF Name 



4875287 Cl 211 



Protein name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



Locus Name 



|2.ie-4I 



Acc# 



Description 



sp:H!Sb_ECOLl 



P10375 



ORF Name 



NTID 



I4S7SX26 c2 251 



Protein name 



NT 



AA 



AAID Length Length 



Score Probability 



TT7T 



P7uTT 



Locus Name 



S.8e-6S 



Acc# 



diaminopimelate decarboxylase, 



Description 



pir ;C70404 



C70404 



ORF Name 



Protein name 



NTID 



8591 



NT 



AA 



AAID Length Length 



Score Probability 



TTT 



1134 



Locus Name 



Acc# 



precursor monotunctional aspartokmase 



|gp:AF13Sa62 



Description 



AF135862 



Glycine max precursor monotunctional aspartoJcmase mRNA, completecds . 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



&S.0.X5.6.2...C.1...2.14. I [3Trt7 



1341 



|l.Se-35 



Protein name 



Locus Name 



Acc# 



cell division inhibitor : protean 
slrl223 :protein slrl223 



pir :S77404 



S77404 



Description 



899 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



5177305 ti 16 



TT7T 



731T 



5.5e-73 



Protein name 



Locus Name 



sp;VBIN_EC!OLl 



Acc# 



P75782 



Description 

riYfrOTHflT^AL U.'l Ku £R0T2lfri IN DING-GLI^ O InT£rgWNIC RElGIOfr 



ORF Name 



NTID 



AAID 



NT AA ^ 

— — , Score Probabi lity 
Length Length 



3T7T 



303 



|2.Se-l3 



Protein name 



Locus Name 



transcription regulator AraC/XylS tamiiy 
homo log ydeE 



pir:G6$777 



Acc# 



G69777 



Description 



NT 



AA 



ORF Name 



NTID 



S£15.&tl...a2Jl&& I |3T77 



AAID Length Length 



15535 



77T" 



Score Probability 
|6.5e-12 



Protein name 



Locus Name 



mutator protein mutT:nypotneticai protein 
S111045 :hypothetical protein slll045 



pir :S74bOS 



Acc# 



S74508 



Description 



ORF Name 



Protein name 



NTID 



AAID 



TTTT 



F5^~ 



Description 

POTATlVfi OX t DOREDtfCTAS S kv0$4 5, 



NT AA rt _ -i i_ ■ t ■ j_ 
— — Score Probabil ity 
Length Length 



TTT 



$.3e-i5 



Locus Name 



S p;vra<>_Mtferu 



Acc# 



P71564 



900 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



1800677 cl lb 4 



Protein name 



8597 



T7T 



[T7TT 



Locus Name 



Acc# 



Description 



INO-HTT 



ORF Name 



NTID 



NT AA 

— — Score Prob ability 
AAID Length Length 



.aisti?is...ci...nt. 



Protein name 



Locus Name 



2.7e-25 



Acc# 



chromosome assembly protein Jnomolog 



Description 



pir:B70356 



B70356 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID Length Length 
S3 - 



Score Probability 



1UT 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



2578 



AAID Length Length 
TTT~ 



Score Probability 
5.0e-i0 



Locus Name 



bp:AB006709 



ACC# 



AB006709 



Vibrio alginolyticus rpoN gene tor RNA polymerase sigma t actor N, partial 
and complete cds . 



901 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



1450a4i:i ci 77 



JTTT 



T7T" 



I.6e-il 



Protein name 



Locus Name 



sp:RLiO_BA(!aU 



Acc# 



P42923 



Description 

(VfiflBTATlVfi PftOl'KlJrf 300) (Vl^OO) 



ORF Name 



NTID 



AAID 



15527151 ±1 16 



Protein name 



nypotnetical protein RP516 



Description 



NT 



AA 



Length Length 



Score Probability 
|2.3e-3S 



Locus Name 



bir:P7l6b!D 



Acc# 



F71655 



ORF Name 



NTID 



NT AA 

— — Score Probab ility 
AAID Length Length 



22&6.216.5....L1..AZ „..i pi 



TTWT 



TU7T 



6.6e-10^ 



Protein name 



Locus Name 



aramopeptiaase P 



gpiDMEiiii^O 



Acc# 



AJ131920 



Description 



Drosopnila meianogaster Dammopep-p gene, partial. 



NT 



AA 



ORF Name 



NTID 



AAID 



\211x0.10.2...z2...b.b. I 



Length Length 



1200 



Score Probability 
IS .5e-2l2 



204$ 



Protein name 



Description 



Locus Name 



sp:EPTU_BAdt ! k 



Acc# 



P33165 



ELONGATION 1-ACTOk T P (EF-TU) 



902 



ORF Name 



24407752 c2 67 



Protein name 



NT ID 



TJZT 



AAID 



I55U5" 



NT AA 

— — , Score Probability 
Length Length 



Locus Name 



i.3e-41 



Acc# 



Description 

505 RlBO^OMAL PROTEIN Lll 



sp:RLii_MV(JTU 



P96931 



ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


274030_rl_l2 


3384 




860£ 


$5 


288 


108 


3.2e-06 



Protein name 



Locus Name 



Acc# 



hypothetical protein PH1485 



Description 



pir:H7l023 



H71023 



ORF Name 



Protein name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



TT1" 



ITT 



Locus Name 



0.00012 



Acc# 



SecE protein 



Description 



pir:JE0331 



JE0331 



ORF Name 



|3.412.0.2a7....c3....7.b.., 



Protein name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



7F" 



Locus Name 



1.4e-12 



Acc# 



Description 

505 RIBOSOMAL prOtAIN Ll 



sp:kLi_HAlillN 



P44342 



903 



ORF Name 



NT ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



55222558 c2 60 



|4.1e-36 



Protein name 



Locus Name 



3-deoxy-manno-octulosomc acid transterase 



|gp:SMU52&44 



Acc# 



U52844 



Description 



Serratia marcescens putative glycosyltranslerase, 
putativeglycosyltransf erase, putative heptosyllll transferase 
(waaQ) , 3-deoxy-manno-octulosonic acid transferase (waaA) , 
glucosyltransf erase (waaE) , and KdtB (kdtB) genes, complete cds; and 
Fpg(fpcr) gene, partial cds. 



ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


.423.7.Q.Q2....C2...6.3 


|3388 


8610 


308 


527 


507 


1.7e-48 



Protein name 



Locus Name 



tyrosine recomPmase xerD 



gp:AF093548 



Acc# 



AF093548 



Description 

Staphylococcus aureus tyrosine recombinase XerD (xerD) gene , complete cds. 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



lU22l25...z2...££ 



57T 



384 



l.fie-3S 



Protein name 



Locus Name 



99% identity over 181 amino acids witn E. 
coli 



gp : STYSTMF1 



Acc# 



AF170176 



Description 

Salmonella typhimurium fragment STMFl. 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
533 



Score Probability 
[7.6e-44 



Protein name 
Description 

50S rIbOSOmal ^kOTSltf Ll 



Locus Name 



sp:Rlil_STRSy 



ACC# 



Q07976 



904 



NT 



AA 



ORF Name 



NT ID 



AAID 



5855052 ci 56 



Length Length 
F7T3 



Score Probability 
|2.ie-09 



Protein name 



Description 



Locus Name 



sp:RS2i_li0kbU 



Acc# 



051271 



3 OS klGO&OMAL PftciTfilW S21 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 
F4T7 



8614 



TUT 



Score Probability 
BT5 



1.0e-30 



Protein name 



Description 



Locus Name 



sp:ftL7_HAfilN 



Acc# 



P44348 



505 RIBOSOMAL PROT EIN L7/L12 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



TTTT 



|1.5e-22a 



Protein name 



Locus Name 



RNA polymerase B-subunit 



Acc# 



AF087812 



Description 



Legionella pneumophila RNA polymerase B-suJDunit (rpoB) gene , complete cds; 
and RNA polymerase B'-subunit (rpoC) gene, partialcds. 



ORF Name 



NTID 



NT AA „ 

— , — , Score P robability 
AAID Length Length 



3 3 94 



1788 



TUT 



l.Se-07 



Protein name 



Locus Name 



unknown 



gp:U$677l 



Acc# 



U96771 



Description 



Prevotella bryantu putative polygalacturonase, B-i, 4- enaogiucanase, ana 
mannanase genes, complete cds; and unknowngenes . 



905 



ORF Name 



NT ID 



NT AA 

— — , Score Probability 
AAID Length Length 



24415717 t3 4 



7T 



Protein name 



Locus Name 



spisuovrum 



Acc# 



P25126 



Description 

SUCCTOVL-COa SYNTHETASE Bfl fA CHAIN, (SCS-B^iTA) 



ORF Name 


NT ID AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


333242l6_r2_l 


33$6 8618 


668 2007 473 


2 . 0e-42 


Protein name 






Locus Name 


Acc# 


115K outer membrane protein precursor : susC 


pir:JC60i7 


JC6027 


protein 










Description 










ORF Name 


NTID AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


23.mm...Gi...25. 


3397 8619 


405 1218 734 


i.5e-7^ 












Protein name 






Locus Name 


Acc# 


putative Hydrolase 


gp:SCMll 


AL133278 


Description 




Streptomyces coeiicoior cosmia Mil. 














ORF Name 


NTID AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 



T3W 



I2.2e-l8 



Protein name 



Locus Name 



hypothetical protein TM02 8U 



|pir:P72iyi> 



Acc# 



F72395 



Description 



906 



NT 



AA 



ORF Name 



NTID 



TT55" 



AAID Length Length 
T5TT7 — 



Score Probability 
5.5e-09 



Protein name 



Description 



Locus Name 



bpiBOUiSiVy 



Acc# 



U15179 



Sacteroides ovatus arabmosidase (asdll) gene, complete cds andputative 
transketolase, partial cds . 









NT 


AA 


ORF Name 


NTID 


AAID 


Length 


Length 


3i4377$2_rl_$ 


3400 




244 


735 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



ORF Name 



Protein name 



NTID 



3401 



AAID 



NT 



AA 



— — Score Probability 
Length Length 

TB3 



5ff 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



NT AA „ „ , , . -. . 
— — Score Probabil ity 
AAID Length Length 



TTT 



1116 



TUT 



0.017 



Locus Name 



endo-al, 5-arabmanase 



gp : PFAKBA 



Acc# 



Y10458 



Description 

P . rluorescens arbA gene tor encio-al , 5-arabmanase . 



907 



ORF Name 



NT ID 



NT AA 

— , — , Score Pr obability 
AAID Length Length 



7Su~ 



|2.2e-89 



Protein name 



Locus Name 



Acc# 



P10089 



Description 

flfiMOLVSIM SEORfiTlOrt AT2-B1ND1N G frROTBlrt, CHROMOSOMAL 



ORF Name 



23M5937 tl 2 



Protein name 



NT ID 



NT AA 

— — „ Score Prob ability 
AAID Length Length 



JUT 



Locus Name 



Acc# 



Description 



IN0-H1T 



ORF Name 



|2l4^4i1Sl..±1...1 



Protein name 



NTID 



AAID 



15405 



NT AA 
— — Score 
Length Length 



WTT 



Locus Name 



Probability 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



13406 



AAID Length Length 



Score Probability 



72u~ 



4 . 8e-08 



Locus Name 



glycosyltransf erase 



[gp:AF146bJ2 



Acc# 



AF146532 



Description 
Klebsiella pneumoniae waa gene cluster. 



908 



ORF Name 



133442593 i'2 6 



Protein name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



nm — I wn 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



UA1L^L±'L± 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



T3T31T 



T5W 



337" 



8.3e-54 



Locus Name 



sp:ATCSJJYNY3 



Acc# 



P73241 



Description 



ORF Name 



NTID 



Aaaiaii^,.ci...fi I 



F^T- 



Pro te in name 



hypothetical protein PFB022bc 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



FT 



T73~ 



Locus Name 



bir:E7l^20 



S.6e-l3 



Acc# 



E71620 



ORF Name 



Protein name 



NTID 



3410 



S£32 



hypothetical protein PFB022bc 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



FT" 



TFF" 



Locus Name 



bir:fi7l62t> 



i.2e-ii 



Acc# 



E71620 



909 



NT 



AA 



ORF Name 



NTID 



148828124 10 



AAID Length Length 
T35 



5T" 



Score Probability 
0.0053 



FIT 



Protein name 



Locus Name 



translation initiation tactor eIF-2 beta 
chain 



pir:T17104 



Description 



Acc# 



T17104 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



1 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



bin JdeOiiV 



Description 



2.2e-ii 



Acc# 



JC6 027 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



Length Length 



F7T 



Score Probability 
2.7e-l6 



Locus Name 



sp:YM67_MCt'U 



Acc# 



028017 



(EC i. -.-.-) 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— — Score Prob ability 
Length Length 



72TT 



Locus Name 



Acc# 



Description 



NO-HIT 



910 



NT 



AA 



ORF Name 



NT ID 



20326010 ±2 bO 



AAID Length Length 



Score Probability 
TT2 



9.Se-06 



Protein name 



Locus Name 



encio-l, 4-beta-xylanase homolog yjeA 



jpirT 



Acc# 



G69849 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



2l3.£1£S.&2..±2....3.8. 



ETJT5" 



|i.6e-169 



Protein name 



Locus Name 



hypothetical protein 



gp:PAL24^61 



Acc# 



AJ243361 



Description 
Prevotella al&ensis ORFl, isolate M384. 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



3417 



S635 



12457 



3.3e-256 



Protein name 



Locus Name 



hypotnetical protein 



|pir:S76^b7 



Acc# 



S76257 



Description 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



Protein name 



7TT 



|9.0e-161 



Locus Name 



probable copper- transporting ATPase, yvgX 



pir :E70041 



Acc# 



E70041 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID 



124475400 tl '1 



Length Length 
TUT 



Score Probability 
6.4e-08 



OS 



Protein name 



Locus Name 



response regulator 



Acc# 



Y18245 



Description 



Pseudomonas putida todX, todF, toctCl, todC2 , toclB, toOA, tocLD,tociE, toclG, 
todl, todH, todS, todT genes. 



ORF Name 



Protein name 



NT AA 

— — , Score Probability 
NTID AAID Length Length 



m^i 1 [m 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



\l$.5.2$Al.6...±X..±£.. 



NTID 



AAID 



NT AA „ _ , , . _ . . 
— — , Score Probab ility 
Length Length 



VST 



§.7e-24 



Protein name 

Description 
D- XYLAN X YLANOH V DkOLA^ E fi) 



Locus Name 



spTXYNE^BUTFr 



Acc# 



P26223 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



T45 



438 



ITT 



|5.7e-07 



Protein name 



Locus Name 



mercury reductase homolog 



pir:I64109 



Acc# 



164109 



Description 



912 



ORF Name 



26231303 ±1 lb 



Protein name 



NTID 



AAID 



— — , Score Probability 
Length Length 



TIT 



Locus Name 



Acc# 



Description 



MO -HIT 



ORF Name 



NTID 



NT AA 
— — , Score 
AAID Length Length 



aafi2a&a2„±2L...aa t 



Probability 
0.025 



Protein name 



Description 



Locus Name 



|gp:AF0^^6 



Acc# 



AF025396 



Vibrio anguillarum rtb region, partial sequence. 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



Score 



T4uT" 



Probability 
S.6e-li 



Protein name 



Description 



Locus Name 



sp : &E£E_SACS(J 



Acc# 



P35164 



SENSOR MOTE IN kklSE, 



ORF Name 



NTID 



AAID 



NT AA 
— , — , Score 
Length Length 



\115Al±16....z±.Al 



Probability 
3.5e-39 



Protein name 



Locus Name 



lipoate-protem ligase B 



gp:APib367B 



Acc# 



AF153678 



Description 



Myxococcus xanthus lipoic acid synthetase precursor, 
lipoamideacyltransf erase, and lipoate-protein ligase B genes, 
cds;and unknown genes. 



complete 



913 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



TTT 



5.4e-34 



Protein name 



Locus Name 



sp:VCBL_k(JoLl 



Acc# 



P75849 



Description 

HVf>0?HSTlCAL 23,8 KD PfeOTElN llSt MUKk-AS frC ljStf&i(i&tlC REGION 



ORF Name 



NTID 



AAID 



33603128 ci 100 



Protem name 



hypotnetical protein PH0362 



Description 



NT AA 
— — Score 
Length Length 



TTT 



Locus Name 



pir :G7ll43 



Probability 
|4.3e-05 



Acc# 



G71143 



ORF Name 



NTID 



NT AA „ _ , , . _ . 
— — , Score Probability 
AAID Length Length 



3425 



\5&5T 



TTT 



9.ie-ii 



Protein name 



Locus Name 



regulatory protein 



gp:AF036i>44 



Acc# 



AF036244 



Description 



Azotobacter chroococcum 4-nydroxyoenzoate hydroxylase (pooA) gene , partial 
cds; and regulatory protein (pobR) gene, complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



3.aiaas.t)...±2...b.i.. 



I4^u" 



EZ5T 



Length Length 
TTT 



TTT 



Score Probability 
B.7e-13 



Protein name 
Description 

GLUCOSE! INH I BITED D I VISION MOTE IN H 



Locus Name 



.spiGIDUJSAcUU 



Acc# 



P25813 



914 



NT 



AA 



ORF Name 



504010 ±2 44 



NT ID AAID Length Length 

5555 — 



405 



Score Probability 
3.1e-08 



TT7 



Protein name 



Locus Name 



nypotneticai protein APKi4bb 



bir:G72S24 



Acc# 



G72624 



Description 



ORF Name 



NTID 



£3.5.1i£D...±2...ibL„ 



5^5¥ 



Protein name 



transcription regulator NtrC tamily 



Description 



NT 



AA 



AAID Length Length 
TUT? 



555" 



Score Probability 
i.2e-63 



55U 



Locus Name 



pir :C70396 



Acc# 



C70396 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



5I5T 



T55" 



477 



TuT - 



0.0015 



Protein name 



Locus Name 



chitmase IV precursor 



|gp:Agll2555" 



Acc# 



AF112966 



Description 



Triticum aestivum cmtmase IV precursor (Cnt4j ttiKNA, complete cas . 



ORF Name 



NT AA 

— — Score Probability 
NTID AAID Length Length 



!iaiS.5.0.6.3...±l...M 



5555" 



TJUT 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



NT 



AA 



ORF Name 



NTID 



5355" 



AAID Length Length 
5557 — 



255" 



Score Probability 
|5.1e-06 



Protein name 



Locus Name 



nypotneticai protein AF22yy 



pir7T^5TT 



Acc# 



C69537 



Description 



915 



ORF Name 



NTID 



10546880 c2 519 



Protein name 



hypothetical protein PAHO'/yu 



Description 



NT 



AA 



AAID Length Length 



Score Probability 
3.8e-07 



Locus Name 



bir:H75098 



Acc# 



H75098 



ORF Name 



\±05M5.6.1.±2..±92.. 



Protein name 



Description 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



8659 



[JuT" 



0.0014 



Locus Name 



gp:D42067 



Acc# 



D42067 



Porphyromonas gingivalis DNA tor FimJorilm, ORFl-4, complete cds . 



ORF Name 



Ift&5ft7.ai..±I...5i 



Protein name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



Locus Name 



Acc# 



Description 



ORF Name 



Protein name 



NTID 



NT AA rt _ -i -i • -I • , 
— , — , Score P robability 
AAID Length Length 



10£6.5.9uS...±I...<i6. I m?5 



0.023 



Locus Name 



TT! lactis predicted coding region ORF00061 



gp:AE001272 



ACC# 



AE001272 



Description 

Lactococcus lactis DPC3147 plasmid pMRCOl, complete plasmidsequence . 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



117142 c2 481 



2440 



Protein name 



§552 



'JUT 



Locus Name 



5.4e-27 



Acc# 



hypothetical protein 



Description 



pir:S7692S 



S76925 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 
T5 - 



Score Probability 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— — Score Probabi lity 
Length Length 



FTT" 



Locus Name 



7.0e-121 



Acc# 



conserved hypothetical protein ymdA 



Description 



pir :F69884 



F69884 



ORF Name 



HSLllO.S.6.a...Cl..AlS. I 



Protein name 



NT 



AA 



NTID AAID Length Length 

— 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



3444 



hypothetical protein PAB1224 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



TTT 



Locus Name 



pir :A75022 



|3.le-06 



ACC# 



A75022 



917 



ORF Name 



NT ID 



— — , Score Probability 
AAID Length Length 



Protein name 



DNA heiicase relatea protein 



Description 



WIT 



Locus Name 



pir :H6yifei 



|2.5e-79 



Acc# 



H69163 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



i.6e-07 



Protein name 



Locus Name 



transmembrane sensor 



|gp:AFubi69i 



Acc# 



AF051691 



Description 



Pseudomonas aeruginosa stress tactor A ipsrA) , EUF Sigma tactor tnuij , 
transmembrane sensor (f iuR) , and hydroxamate-typef errisiderophore receptor 
(fiuA) genes, complete cds . 



ORF Name 



12S.a.7..7.0.2.,±i...i:/. 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 



Score Probability 



T5T 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



TZT 



37u~ 



1.0e-05 



Locus Name 



hypotnetical protein 



gp:f>S'J24536S 



Acc# 



AJ249385 



Description 

Pseudomonas stutze ri piiT, pilU, ORFi (partial) and ORF2 ipartiai) genes . 



918 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
TTT2 



373 



Score Probability 

Tnrm 



TUU 



Protein name 



Locus Name 



neurotilament protein H torm H2 (repetitive 
region) 



Description 



bir:B424il7 



ACC# 



B43427 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



T7S" 



1.7e-21 



Protein name 



Description 



Locus Name 



sp:TRA2_BACPk 



Acc# 



Q45119 



TRANSP05A5E FOR IMSERTION gKQPENCU ELE MENT l&ii-LlKfcl 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



JZ5T 



JUT 



2.0e-59 



Protein name 



Locus Name 



probable ion transporter 



pir :E7b47U 



Acc# 



E75470 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



W7T 



IT 



0.040 



Protein name 



Locus Name 



Acc# 



sp : TXMA_DENPO 



P80494 



Description 
MOSCARllHC! TOXl^ ALPHA (Mt-ALPkA) 



ORF Name 



NT ID 



13*53" 



Protein name 



hypothetical protein F13D12 . 3 



Description 



NT i-iH. 

— - — , Score Prob ability 

AAID Length Length 

15573 — 



AA 



Locus Name 



pir :T20S31 



3.4e-05 



Acc# 
T20831 



ORF Name 



Protein name 



Description 



NT 



NT ID 



AAID Length Length 



AA 

— , Score 



8676 



73" 



Probability 
I 10.012 



Locus Name 



Acc# 



P02259 



HI STONE Hb 



ORF Name 



\±5.±0£2B.l.±2..±:iA.. 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 
253 



Score Probability 



35 



Locus Name 



Acc# 



Description 



IW0-H1T 



ORF Name 



Protein name 



NTID 



AAID 



3T5F" 



8678 



— — Score Pro bability 
Length Length 



ITS" 



Locus Name 



Acc# 



Description 



[NO-HIT 



920 



ORF Name 



NT ID 



NT AA 

— — , Score Proba bility 
AAID Length Length ^ 



Protein name 



8679 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



16.S1.7.0.3.7...±2...1M.. 



Protein name 



NTID 



AAID 



SF5TT 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probab ility 
Length Length 



16.5.Z53.a3....tl...aa.. 



Protein name 



IZI 13455 



TSTT 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Pr obability 
Length Length 



\±&9.21&±1.±1J2.'±'L. 



Protein name 



FT 



T5T 



Locus Name 



Acc# 



Description 



ORF Name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



Protein name 



Locus Name 



9.3e-il0 



Acc# 



O-acetyinomoserme sultnydxylase 



Description 



pir :D72J24 



D72324 



921 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



TFT 



\T7W 



7 .7e-35 



Protein name 



Description 



Locus Name 



sptYHIDJiCOLl 



Acc# 



P26606 



ttYEOTHtiTtCAL 23. ^> Jfl) PkO'T^lN tit SLP-HD^ B INTERBANK? kBGloM (QftP-C) 



ORF Name 



1992011 cl 367 



Protein name 



NT ID 



NT 



AA 



AAID Length Length 

Tm — 



Score Probability 



Locus Name 



Acc# 



Description 



no-hit 



ORF Name 



Protein name 



NT ID 



NT AA 

— — Score Probability 
AAID Length Length 



3464 



1457 



T7T" 



II . 4e-76 



Locus Name 



spiYKSFjflCjOLl 



Acc# 



P77536 



Description 

HYPOTHETICAL b3.1 KB PROTElisi IN MAfc!H-B13 TA INTEkGENIC REGION 



ORF Name 



NTID 



NT AA „ ^ . . . 

— — Score Pro bability 
AAID Length Length 



203126&1...Z1..MA 



75" 



0.047 



Protein name 



Locus Name 



nypotnetical protein aq_JL6«o 



pir :F7044b 



Acc# 



F70445 



Description 



922 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 
10.0047 



Protein name 



Locus Name 



hypothetical protein Rv06 03 



pir :F"/oyoy 



Acc# 



F70909 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 

1 i^nr 



8689 



Score Probability 
'5.5e-18 



2TT 



Protein name 



Locus Name 



hypothetical protein PH0142 



pir:D71^b 



Acc# 



D71235 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



[3W 



10.0014 



Protein name 



Description 



Locus Name 



sp : £>QRA_provu 



Acc# 
Q52620 



REGULATORY PROTEIN V&kK 



NT 



AA 



ORF Name 



NTID 



AAID 



18551 



Length Length 




Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



923 



NT 



AA 



ORF Name 



NT ID 



23445887 t2 lh>'2 



AAID Length Length 




Score Probability 
0.0025 



Protein name 



Locus Name 



sensory transduction hist id me Kinase 
slll475:protein slll475 : protein sll!475 



pir :S76818 



Acc# 



S76818 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID 



3471 



8£53 



Length Length 




FIT" 



Score Probability 
|3.4e-17£ 



1712 



Protein name 



Locus Name 



propionyl-CoA carboxylase 



gp:AB007o00 



Acc# 



AB007000 



Description 

Myxococcus xanthus MxppcB gene tor propionyi-uoA carboxylase, complete cas. 





ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


215/X53Al...a±..A±B. 


|3472 


8654 


351 


1056 


54 


0.038 



Protein name 



Locus Name 



ORF188 



gp:AB00010y 



Acc# 



AB000109 



Description 

Dictyostelium discoiaeum mitocnondriai dna, complete sequence. 





ORF Name 


NTID 


AAID 


NT 

Length 


AA 

— , Score 
Length 


Probability 


ll&l&rXt^al^SAZ 


3473 


8655 


1054 


3285 300 




l . oe-47 



Protein name 



Locus Name 



receptor antigen (RagAJ 



p:PGI130872 



ACC# 



AJ130872 



Description 



Porphyromonas gmgivalis W50 receptor antigen (rag; locus encodmga ma] or 
immunodominant 55kDa antigen. 



924 



ORF Name 



241542 tl ^6 



Protein name 



NT ID 



AAID 



NT AA 

— — , Score Probabil ity- 
Length Length 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



NT AA ^ i u 1 1 1 f 
— — Score Pr obability 
AAID Length Length 



2±2±9ATL.cx'L.A±^ 



i.4e-S7 



Protein name 



Locus Name 



RNA polymerase sigma tactor sigZ-HJce protein | |gp : AF1372^ 



Acc# 



AF137263 



Description 



Bacteroides thetaiotaomicron 30S riJDOsomal protein sl6-±iJceprotem, tucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds . 



ORF Name 



NTID 



Protein name 



hypothetical protein £>2463 



Description 



NT 



AA 



AAID Length Length 
— 



Score Probability 
|l.2e-lS2 



\TT7T 



Locus Name 



pir:F6b02l 



Acc# 



F65021 



ORF Name 



NTID 



NT AA 

— — Score Pr obability 
AAID Length Length 



3ATT 



JTT 



551" 



7.2e-06 



Protein name 
Description 

EXOENZYME 5 SYN'MUSIg kl^ULA TORY PROTEIN EX^A 



Locus Name 



Acc# 



P26993 



925 



ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


24255332_ti_87 


3478 


§700 


770 2313 


504 




8.9e-48 



Protein name 



Locus Name 



Acc# 



|sp:YBAL_ECOLl 



Description 

HYPOTHETICAL 59.4 KD PkOfEilN IN gflK-FSR l^TjfiRGENl C £2GlON 



ORF Name 



NTID 



2425S3S5 cl 412 



TFTT 



Protein name 



arsenate reductase homoiog yusl 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



TTT 



5.5e-25 



Locus Name 



pir:fe7002l 



Acc# 



B70021 



NT 



AA 



ORF Name 



NTID 



2425.o.aa5....c3....^:/.D. 



Protein name 



Description 



AAID Length Length 
2U7 



Score Probability 



TTTTTZ 



Locus Name 



Acc# 



bJO-fltf 



ORF Name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



TTT 



ITTT 



0.00067 



Protein name 
Description 



Locus Name 



sp : EBA2_FLAME 



ACC# 



P36912 



926 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



24407150 ±2 170 



FTuT" 



T5T 



|4.0e-09 



Protein name 



Locus Name 



pobR regulator 



|gp:Ml!Yi^b27 



Acc# 



Y18527 



Description 

Pseudomonas sp. pobA, poJoR, pcaQ, pcaH and pcaG genes, 



ORF Name 



24413137 12 118 



Protein name 



Description 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



2 . Oe-12 



Locus Name 



Acc# 



032028 



ORF Name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



244I4.7.1:L.cl...m. I pm 



5.4e-S0 



Protein name 



Locus Name 



sp : PNCB_ECOL! 



ACC# 



P18133 



Description 

NICO T INATE PHOSPHOR I BOS YL T & kNSFERA^E , (NAPRTA^li) 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



2l4&13.u£2l..±3....2l3.1.. 


3485 


8707 


228 


£87 


124 

















!2.3e-06 



Protein name 



Locus Name 



isochorismatase nomolog ywoC 



pir :F7UU64 



ACC# 



F70064 



Description 



927 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



24494000 rl 20 



EST 



|2.2e-2i 



Protein name 
Description 

flVpOMflTlCAL PROTkl ti HlllfiS 



Locus Name 



sp:YGCF_HAEIN 



Acc# 



P45097 



ORF Name 



NTID 



AAID 



— — Score Proba bility- 
Length Length 



24£334£2 c3 630 



S7W 



Protein name 



Description 



Locus Name 



Acc# 



3p:VBS5_HAEiM | P45213 



HYPOTH E TICAL PROTEIN H114bb 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



8710 



7¥T 



1.7e-4S 



Protein name 



Locus Name 



hypothetical protein SCF43A.0b 



Acc# 



T36428 



Description 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



\2±6AXS±1..±Z..±&1 1 F£W 



F7TT 



Protein name 



TTTT 



Locus Name 



gp:YP102KB 



Acc# 



AL031866 



Description 

Yersinia pestis 102 Koases unstable region: trom i to Iiy44i, 



928 



ORF Name 



NT ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



W7TT 



1434 



5.2e-43 



Protein name 



Description 



Locus Name 



sp:YLCB_ECOLl 



Acc# 



P77211 



PRECURSOR 



ORF Name 



NT ID 



NT AA 

— — Score Probability 
AAID Length Length 



WTTT 



320 



4 . 9e-56 



Protein name 



Locus Name 



Acc# 



sp:YN£6_Ec!OLl 



Description 

HYPOTHETICAL 33.1 KB PROTEIN IN MAOC-ACPD INl'Ek OENKJ kEciioM 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



2i&5.0.IS..7...±2...L3.l I |355? 



TJJT 



TTTT 



|2.3e-ii3 



Protein name 



Locus Name 



sp:YICE_E<J0Ll 



Acc# 



P27432 



Description 

HYPOTHETICAL 46.3 Kb PROTEIN IN GLTS -SeLC 1NTERGSN1C REOlo^J 



ORF Name 



NTID 



— — Score Probability 
AAID Length Length 



\2±B.0£.B±2.±1..±1'.L 



3493 



18715 



4 .4e-bb 



Protein name 



Locus Name 



CGI -32 protein 



|gp:AFl^y66 



Acc# 



AF132966 



Description 

Homo sapiens CGI-32 protein mRWA, complete cas. 



929 



ORF Name 



NT ID 



24870957 £2 200 



Protein name 



AAID 



NT AA 

— — , Score Probability 
Length Length 



291 


576 




104 





Locus Name 



0.0042 



Acc# 



Description 



sp:VG77_BPMLS 



Q05292 



ORF Name 



|248$7S92 £2 120 



Protein name 



NT 



AA 



NTID AAID Length Length 



Score Probability 



mrr 



2W 



TIT 



Locus Name 



1.2e-05 



Acc# 



hypothetical protein HP0137 



Description 



[pir:A£4«7 



A64537 



ORF Name 



Protein name 



NTID 



\25A229^2..±±...9A 



NT AA 

— — , Score Probability 
AAID Length Length 



871S 



Locus Name 



Acc# 



Description 



ORF Name 



NTID 



\26253A8.1.±lJ2bA I 13557 



Protein name 



NT AA 

— — , Score Proba bility 
AAID Length Length 



WTTT 



TETW 



TUT 



Locus Name 



0.0052 



Acc# 



tiJorom heavy chain PG-2 ' 



Description 



pir:fi6l61S 



B61615 



ORF Name 
\2&2&£±&±...a±..A21.. 



NTID 



Protein name 



AAID 



NT 



AA 



Length Length 



Score Probability 



T3T 



Locus Name 



Acc# 



Description 



INO-HIT 



ORF Name 



26287817 ±2 14b 



Protein name 



NT ID 



NT AA 

— — , Score Probability 
AAID Length Length 



75" 



fZJT 



Locus Name 



Acc# 



Description 



[MO-HIT 



ORF Name 



NT ID 



AAID 



NT AA „ n , , . . . ^_ 
— — . Score Probabil ity 
Length Length 



Protein name 



WTZT 



Locus Name 



0.013 



Acc# 



putative integral membrane protein 



Description 

Streptomyces coelicoior cosmid 51A. 



gp:SCbiA 



AL121596 



ORF Name 



NT ID 



AAID 



NT AA 

— , — n Score Probability 
Length Length 



2^Al$.0.I.7....cl...3.10. 



Protein name 



wnr 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NT ID 



AAID 



NT AA 
— , — , Score 
Length Length 



Probability 



Protein name 



£&S.l5.$.3£..±l...£0. I 



TTT 



TTTT 



Locus Name 



Acc# 



Description 



[NO-HIT 



931 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length ™ 



2SS53S3& ±2 114 



Protein name 



vacuolar type ATP synthase subunit 



Description 



£TT 



0.00012 



Locus Name 



| gp:D^7^9 



Acc# 



D63799 



Thermus thermophilus genes, Operon ot Vacuolar type ATPsyntasesubunit , 
complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



2551553 12 126 



TTTT 



5.2e-l3 



Protein name 



Locus Name 



Acc# 



115K outer membrane protein precursor : SusC 
protein 



pir :JC6 02 7 



JC6027 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
535 



ITT" 



Score Probability 
0.00023 



TT7 



Protein name 



Locus Name 



unknown 



gp:APl752« 



Acc# 



AF175293 



Description 



Enterococcus taecium strain N97-330 vanD glycopeptide resistancegene 
cluster, complete cds; and unknown gene. 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length ■ 



i.2e-32 



Protein name 



Locus Name 



cation efflux system protein 



pir :C71§31 



Acc# 



C71831 



Description 



932 



NT 



AA 



ORF Name 



NT ID 



AAID 



30275287 c3 674 



J5UT 



W7ZT 



Length Length 
THT5 — 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



Score Probability 
— 



I6.4e-182 



Protein name 



Locus Name 



lsp:IMDHjrRI*'0 



Acc# 
P50097 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



|3.Da20a0.8,.±£...1.7.1 1 



Length Length 
TTTT 



Score Probability 
1.5e-lS 



Protein name 



Locus Name 



hypothetical protein SC5F2A . UBc 



pIFTTT^U" 



Acc# 



T35250 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



T5W 



W7TF 



Length Length 



TJUW 



Score Probability 
|2.0e-b2 



Protein name 



Locus Name 



inypothetical protein MTH1458 



pxr :B6yU61 



Acc# 



B69061 



Description 









NT 


AA 


ORF Name 


NTID 


AAID 


Length 


Length 


12t±t)±bA...QL.Al& 


2511 


8733 


82 


249 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



933 



ORF Name 



32422050 ±2 1^0 



Protein name 



NTID 



NT AA 

— — Score Probabi lity 
AAID Length Length 



TFT 



Locus Name 



Acc# 



Description 



NO -HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



8735 



Length Length 
7TJT 



fZTTT 



Score Probability 
i.5e-26 



Protein name 



Description 



Locus Name 



sp:Y634_METJA 



Acc# 



Q58051 



HYPOTHETICAL PROTEIN MJ0634 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



iaa&aafi2Lfi„.ci...42L& | psn 



[X3T 



I2TT 



|S.5e-l8 



Protein name 



Locus Name 



Acc# 



IsprYSM EdoLl 



Description 

HYPOTH E T I CAL 14 . 8 KB PROTEI N IN PRIC-APT INTERGENIO kEGluN 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



8737 



0.015 



Protein name 



Locus Name 



probable glycine- ricn secreted protein 



Acc# 



T36291 



Description 



934 



ORF Name 



NT ID 



AAID 



NT AA 

— — Score Probability 
Length Length 



33789067 &A 4^6 



YHT 



|4.6e-06 



Protein name 



Locus Name 



conserved Hypothetical protein AF1017 



jpirrA^VV 



ACC# 



A69377 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
551 



Score Probability 



ITT 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



NTID 



NT AA 

— — , Score Pro bability 
AAID Length Length 



a4miaa..-c3L.„5.7.3. i psxs 



] [ 



T7IT 



1 . 4e-l4 



Protein name 



Locus Name 



methylmalonyl-coa decarboxylase gamma cliain 
PAB1771 



pir:F75135 



Acc# 



F75135 



Description 



ORF Name 



NTID 



3Aimi2...al..A3.£ I |3"5T? 



NT 



AA 



AAID Length Length 
4AT 



Score Probability 
ll.6e-23l 



Protein name 
Description 

(NAD (E>) H-D E P E ND E N T GLUTAMA TE DEHYDROGENASE) 



Locus Name 



sp:DHU4_±WJTN 



Acc# 



P94598 



935 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 




Score Probability 
1.2e-05 



Protein name 



Locus Name 



microtilarial sheatn protein SHPi 



bp:LSU54t>b6 



Acc# 



U54556 



Description 



Litomosoides sigmoaontis microtilanal sneath protein SHP3a tshp3a) ana 
microfilaria! sheath protein SHP3 (shp3) genes, complete cds. 



NT 



AA 



ORF Name 



NT ID 



24495216 cl 355 



AAID Length Length 



2S11 



Score Probability 
1.4e-163 



Protein name 



Locus Name 



acritiavm resistance protein acrF: protein 
slr2131:protein slr2131 



bir:g7Sii0ri 



Acc# 



S75508 



Description 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



TFFF 



8744 



TUT 



TfJT 



0.0014 



Protein name 



Locus Name 



sp:Yfiffi_licJ0Ll 



Acc# 



P45580 



Description 

HYPOTHETICAL 12.6 KB PROTE I N IN P E PP -S5R INTERGENIC REGION (OiOS) 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



5745 



558 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



936 



ORF Name 



Protein name 



NTID 



W7TZ 



NT 



AA 



AAID Length Length 




Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



NT AA 

— — S core Probabi 1 1 ty 
AAID Length Length 



15747 



TIT 



Protein name 



Locus Name 



CeoB 



lgp:BOJ^7042 



ACC# 



U97042 



Description 



feurkhoideria cepacia CeoA (ceoA) ana ceoB (ceoB) genes, compietecas. 



ORF Name 



NTID 



"MT 1 AA 

— — Score Probability 
AAID Length Length 



J7T 



1125 



1352" 



|2.Se-3S 



Protein name 



Locus Name 



SmeA 



gp:Al?l7^26 



ACC# 



AF173226 



Description 



Stenotrophomonas maltophilia multidrug ettlux system smeR ismeRj f smea 
(smeS) , SmeA <smeA) , SmeB (smeB) , and SmeC (smeC) genes , complete cds. 



NT 



AA 



ORF Name 



NTID 



19A&i).h£..±±..3.s. I 



AAID Length Length 
TZZ 



FT" 



Score Probability 
7.5e-0B 



Protein name 



Locus Name 



histone Hi-like protein 



Ipir: JH06S^ 



Acc# 



JH0658 



Description 



937 



ORF Name 



3961641 til 4 Ob 



Protein name 



Description 



NT ID 



AAID 



NT AA 
— , — , Score 
Length Length 



Locus Name 



Probability 



Acc# 



MO-HIT 



ORF Name 



Protein name 



transposase 



Description 



NT 



AA 



NT ID 



13529 



AAID Length Length 
TZTZ 



8751 



TIT 



Score Probability 
12 .4e-08 



Locus Name 



bp:AF03886fc 



Acc# 



AF038866 



Sacteroides tragilis transposon Tn5520 transposase (bipH) anamobilization 
protein BmpH (bmpH) genes, complete cds . 



NT 



AA 



ORF Name 



NTID AAID Length Length 



Score Probability 



T5W 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



hypothetical protein RP338 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



T5T 



TTST 



Locus Name 



bir:D716yO 



Ifl.7e-14 



Acc# 



D71690 



938 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



4103438 c2 465 



FTFT 



1314 



|5.0e-134 



Protein name 



Description 



Locus Name 



sp:ACCC_MKTJA 



Acc# 



Q58626 



CARBOXYLASE J (ACC) 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— — Score Probabi lity- 
Length Length 



TTTT 



Locus Name 



Acc# 



Description 



ttsfO-HIT 



ORF Name 



NT 



AA 



NTID 



AAID 



|4ll£3.25....cl..A&fi I [^4 



Length Length 
— 



1ST 



Score Probability 



Protein name 



Locus Name 



sp:ALST_BACStf 



ACC# 



Q45068 



Description 
AMINO ACID CARRIER PROTEIN ALST 



ORF Name 



NTID 



AAID 



NT AA ^ ^ , , . n . , 

— — , Score Probability 
Length Length 



T5T 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



939 



ORF Name 



NT ID 



AAID 



NT AA 

— , — , Score Proba bility 
Length Length 



4144005 ti 44 



8758 



|2.Se-173 



Protein name 



Locus Name 



cation ettlux system lAcrB/AcrD/AcrF tamilyj 1 |pir :G70396 



Acc# 



G70396 



Description 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length 



8759 



1.2e-38 



Protein name 



Locus Name 



pyridoxal kinase tpclxK) nomolog 



bir:G701^b 



Acc# 



G70195 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
TTT 



Score Probability 
il.6e-0£ 



T4^ 



Protein name 



Locus Name 



N- ace tylmuramoyi - L - alanine amidase nomolog 



Description 



bir:H70l77 



Acc# 



H70177 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 



Score Probability 



TIT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 
TSS 



Score Probability 



Locus Name 



Acc# 



Description 



NO-SIT 



940 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 
|2.0e-27 



Protein name 



Locus Name 



conserved hypothetical protein aq_l42 0 



pir :D704i>3 



Acc# 



D70423 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 
3.4e-M 



Protein name 



Locus Name 



Acc# 



unknown 



gp:AP0888S7 



Description 

Zymomonas mofcilis cosmic clone 6 5G3, partial sequence. 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



I3TT" 



Protein name 



Description 



Locus Name 



Acc# 



MO-HM 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


±12S£Z6...±1../12± 


3544 




117 


354 


175 


" 2.0e-13 

















Protein name 



Locus Name 



conserved hypothetical protein aq_853 



pir:A70i74 



Acc# 



A70374 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



8767 



Length Length 



Score Probability 



1555 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



941 



ORF Name 



NTID 



AAID 



NT AA 
— , — , Score 
Length Length 



8768 



Protein name 



Locus Name 



2 - acylglycerophosphoetnanoiamme 
acyl transferase 



pir :E704V6 



Description 



Probability 
1 .4e-08 



Acc# 



E70476 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 
■231 



Score Probability 



75 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



DNA helicase reiatea protein 



Description 



NT AA 

— — Score Probability 
Length Length 



T5W 



Locus Name 



pir:H6W63 



|3.4e-E>2 



Acc# 



H69163 



as? 








NT 


AA 




ORF Name 


NTID 


AAID 


Length 


Length 




6.±±5A2.Z..±2....!B.b. 




S771 


6$ 


210 



Protein name 



Locus Name 



Acc# 



Description 



[NO-HIT 



942 



ORF Name 



NTID 



NT AA 

— — , Score Probabi lity 
AAID Length Length 



16845262 c2 b27 



WITT 



|2.3e-106 



Protein name 



Locus Name 



|sp:YFBK_ECOLl 



Acc# 
P76481 



Description 

HYPOTHETICAL 63.6 KB PROTBM llSf SLAD-NUoM fflffERtiEd lC RECjION 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
TLS 



TuT" 



Score Probability 

o.ooss 



FT 



Protein name 



Description 



Locus Name 



sp:ZN£u_HUMAN 



Acc# 



Q03938 



Z I NC FINDER PROTEIN 90 (kSlJsJC PlNGEk J^kOTKlM ( FRA^M^NT ) 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



\T5T 



[F7F" 



IT7T- 



|3.2e-i3 



Protein name 



Locus Name 



lsp:RKJH pyEAK 



Acc# 
Q06198 



" Description 

RrtA £OLYM£ftASE siciMA-H FACTO R (5I(jMA- ^0) 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



\10.11B5Al...cxlJ112 1 



8775 



TUT 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



943 



ORF Name 



1174451S t'A 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



8776 



Length Length 



Score Probability 



T7JT 



Locus Name 



Acc# 



MO -HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



W77T 



Length Length 



Score Probability 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



Length Length 



Score Probability 



Locus Name 



Acc# 



MO-HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



13557 



8779 



Length Length 
TZU3 



— , Score Probability 



[37KT 



Locus Name 



Acc# 



NO-SIT 



944 



ORF Name 



NT ID 



AAID 



NT AA 
— — Score 
Length Length 



TUT 



I32T 



|2uT" 



Probability 
14 .4e-16 



Protein name 



Locus Name 



Acc# 



|sp:DBHJiA(JfcJT 



Description 
t>NA-5lNf)lN(j £R0l'fc!lN' It (HH) (HU) 



ORF Name 



NT ID 



NT AA 

— — Score Probability 
AAID Length Length 



18851 t3 74 



F7ST" 



TT53" 



l.7e-ll7 



Protein name 



Locus Name 



sp : SY&Jl'khlPA 



Acc# 



083803 



Description 



ORF Name 



\±$£.11SM..±±..VA 



Protein name 



NT ID 



NT AA 

— — Score Probability 
AAID Length Length 



8782 



15TT 



TT5T 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



13561 



AAID 



8783 



NT 



AA 



Length Length 
27T7 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



945 



ORF Name 



NTID 



22853411 ci 127 



Protein name 



Hypothetical protein £>C6ElU.iyc 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



trmr 



7T 



Locus Name 



pir :T3bb0b 



0.037 



Acc# 



T35506 



ORF Name 



Z3.5l55l3.XZ...G2...16.SL 



Protein name 



NTID 



NT AA 

— — Score Pr obability 
AAID Length Length 



^5" 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



3564 



AAID Length Length 
ulT 



1827 



Score Probability 
|2.3e-168 



Locus Name 



hemolysin erythrocyte lysis protein 2 



gp:AF052=il£> 



Acc# 



AF052516 



Description 



£>revotella intermedia hemolysin hemolytic protein, hemolysmerythrocyte 
lysis protein 1, and hemolysin erythrocyte lysisprotein 2 genes, complete 
cds . 



NT 



AA 



ORF Name 



NTID 



\21£AL&l...a2...±15. 



3565 



AAID Length Length 

is* — I m$ — 



Score Probability 



8787 



] [ 



Protein name 



Locus Name 



Acc# 



Description 



INO-HIT 



946 



NT 



AA 



ORF Name 



NT ID 



25710877 c2 174 



AAID Length Length 

tzujz — 



TUTT 



Score Probability 
i.3e-94 



Protein name 



Locus Name 



protein-export membrane protein 



gp:AB022§65 



Acc# 



AB022865 



Description 



Prevotella ruminicoia genes tor polygalacturonase, 
xylosidase, protein- export membrane protein, complete cds. 



ORF Name 



Protein name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



Description 



NTID 



AAID 



NT AA 

— — , Score Pro bability 
Length Length 



F7W 



W5T 



or 



;5.2e-l3 



Locus Name 



.sp:Y907_METJA 



Acc# 



Q58317 



HYPOTHETICAL PRO T EI N MJ0907 



ORF Name 



NTID 



NT AA 

— — Score Pro bability 
AAID Length Length 



1±2$.83M.±2...5.6. I 



JET 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



947 



NT 



AA 



ORF Name 



NTID 



2439&412 ci ^07 



AAID Length Length 
— 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Prob ability 
Length Length 



|24il5a.7..7....ci...na.. 



3571 



TUUT 



\TUTT 



T5T 



2 .2e-07 



Protein name 



Locus Name 



conserved nypotnetical protein aq__i8yfo 



pir :K /U46jJ 



Acc# 



E70463 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 
|2.5e-i4I 



Protein name 



Locus Name 



topoisomerase I 



gp:AF088896 



Acc# 



AF088896 



Description 



Zymomonas moJoilis tosmid clone 42C11, complete sequence. 



NT 



AA 



ORF Name 



NTID AAID Length Length 

T7TT3 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



948 



ORF Name 



NT ID 



NT AA 

— — , Score Probability 
AAID Length Length 



24650377 ±1 Jl 



Protein name 



TTTT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT 



Length Length 



AA 

— Score Probability 



S35" 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



NTID 



NT AA 

— — Score Prob ability 
AAID Length Length 



3TT" 



Tu3~ 



Locus Name 



Acc# 



rkNA methylase {SpoU tamiiy) (OO, TP) 
PFB0855C 



pir:B71604 



B71604 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— — Score P robability 
Length Length 



Protein name 



8799 



Locus Name 



Acc# 



Description 



[MO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 
TZUQ 



Score Probability 



Locus Name 



Acc# 



Description 



[NO-HIT 



949 



ORF Name 



30084687 ±2 by 



Protein name 



NTID 



NT AA 

— — , Score Proba bility 
AAID Length Length 



VIST 



Locus Name 



Acc# 



Description 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



aflSftma..±i...2L | 



TTT 



0.038 



Protein name 



Locus Name 



submaxillary mucin l 



Acc# 



T42233 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



Protein name 



Locus Name 



probable gipG protein 



|pir:D71i>b& 



Acc# 



D71258 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 




309 



Score Probability 
'3.2e-l3 



177 



Protein name 



Locus Name 



probable gipG protein 



pir:£>7l25a 



Acc# 



D71258 



Description 



ORF Name 



NTID 



Protein name 



AAID 



NT 



AA 



Length Length 
231 



Score Probability 



77 



Locus Name 



Acc# 



Description 



950 



NT 



AA 



ORF Name 



NT ID 



36207937 ti I 



AAID Length Length 

firm — 



Score Probability 



muz 



Protein name 



Description 



Locus Name 



Acc# 



1N0-H1T 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 
|8.8e-li 



T5T 



Protein name 



Locus Name 



mutator protein mutT 



pir :D6444ji 



Acc# 



D64443 



Description 



ORF Name 



NT ID 



AAID 



|4aa33..7..7....t3....28. 



Protein name 



lic-l protein D 



Description 



8808 



NT 



AA 



Length Length 



Score Probability 
|4.6e-20 



TT7 



Locus Name 



pir:E641^^ 



Acc# 



E64128 



ORF Name 



Protein name 



Description 



NT ID 



AAID 



NT AA 

— — Score Pro bability 
Length Length 



77" 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



^.9.40.11..±i...7.i.. 



Protein name 



Description 



NT ID 



AAID 



NT AA 

— — Score Proba bility 
Length Length 



155" 



|T7T 



|l.3e-23 



Locus Name 



Acc# 



sp:DBH_liACaT 



DNA-BINDING PkOTklM II (HB) (MU) 



951 



Protein name 



NT 



AA 



ORF Name 


NT ID 


AAID 


Length 


Length 


5260952_11_32 


3589 




8811 


425 


1281 



Score Probability 



Locus Name 



Acc# 



Description 



[NO -HIT 



ORF Name 


NTID 


AAID 


NT 

Length 


AA 

— , Score 
Length 


Probability 


£8^0.7.^11^7. 


3590 


8812 


100 


303 73 


0.023 


Protein name 








Locus Name 


Acc# 










sp:YM25_Y±i!A:y i r 


P40219 


Description 














HYPOTHETICAL 16 


4 KD £>kOT21N 


IN T1F34-SWP1 INTKRGKNIC REGION 


















ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


aamis-.tija 


3591 


8813 


107 


324 




Protein name 








Locus Name 


Acc# 


Description 














NO-HIT 
















ORF Name 


NTID 


AAID 


NT 
Length 


— , Score 
Length 


Probability 


£aua3Lfl...ci..iflii 


3592 


8814 




210 




Protein name 








Locus Name 


Acc# 


Description 















[MO-HIT 



952 



ORF Name 



7134587 ±2 36 



Protein name 



NTID 



NT 



AA 



AAID Length Length 

— 



Score Probability 



TUZT 



Locus Name 



Acc# 



Description 



[NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
T52 



TO 



Score Probability 
B.le-16 



Protein name 



Locus Name 



glycosyl transferase 



gp:SPAJ6^6 



Acc# 



AJ006986 



Description 



Streptococcus pneumoniae type 33F DNA, capsular gene cluster. 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



Protein name 



Description 



Locus Name 



Acc# 



[NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



ISSlS 



Length Length 




Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



953 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



10740936 ti 2iJ 



Z7T 



7.6e-24 



Protein name 



Locus Name 



serine/ threonine protexn kinase related 
protein 



Description 



pir:H690<54 



Acc# 



H69064 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



JUT 



Protein name 



Description 



Locus Name 



Acc# 



INO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



TTT 



i.6e-4^ 



Protein name 



Locus Name 



hypothetical protein 



Igp : SAUkkD 



Acc# 



Y09927 



Description 



Staphylococcus aureus glmM gene cluster. 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 
2.7e-i6 



sin 



Protein name 



Locus Name 



sp:YO&J_EdoLl 



Acc# 



P33372 



Description 

HYPO T H E T I CAL 14.6 KB PROTE IN IN PBPG-CDD INTERGENIC REGION 



954 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 
I5U2 — 



Score Probability 
6.8e-43 



Protein name 



Locus Name 



spiYyUWJUOoLl 



Acc# 



P77619 



Description 

HYPOTHETICAL 47. £ K& PROTiiilM lit UCPA-A MlA iNftiKGENlCi RKCiXOftf PRiilCiURSUR 



ORF Name 



l2£7375l 12 171 



Protein name 



Description 



NT ID 



NT AA 

— — , Score Prob ability 
AAID Length Length 



TUT 



Locus Name 



Acc# 



IWO-HtT 



ORF Name 



Protein name 



Description 



NT 



AA 



NT ID 



AAID 



— — Score P robability 
Length Length 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



|X3..7.a3.^3.1...aZ..Al&.., 



Protein name 



NT 



AA 



NT ID 



AAID Length Length 



Score Probability 



TTT 



4TT" 



1. le-lfl 



Locus Name 



nypotnetical protein 



pir :S76920 



Acc# 



S76920 



Description 



ORF Name 



Protein name 



NT ID 



mrr 



comEA protein- related protein 



Description 



NT 



AA 



AAID Length Length 



Score Probability 
3.1e-13 



TT7 



Locus Name 



pir :F723UI 



Acc# 



F72301 



955 



ORF Name 



NTID 



14257802 £3 240 



13506 



Protein name 



hypothetical protein 



Description 



NT 



AA 



AAID Length Length 
TT75 



Score Probability 
|2.Ie-13 



TTJ 



Locus Name 



Acc# 



G75375 



ORF Name 



NTID 



AAID 



NT AA rt ^ , , . . . 
— — , Score Probability 
Length Length 



[JSTJT 



I2TT 



[7TT" 



10.0065 



Protein name 



Locus Name 



triadm isotorm i 



|gp:AF16B917 



Acc# 



AF165917 



Description 



Cams tamiliaris triadm isotorm 3 mRNA, complete eels . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



T2W 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



NTID 



NT AA „ _ , , . -. . . 
— — Score Probability 
AAID Length Length 



13603 



SS31 



5.7e-3i 



Protein name 
Description 

CYTOCHROME C BIOGEN E 5I5 PRO TEIN CC&K 



Locus Name 



sp:CCSA_TOHA(j 



Acc# 



P12216 



956 



ORF Name 



NTID 



— — Score Probability 
AAID Length Length 



Protein name 



7T 



Locus Name 



Acc# 



Description 



NO -HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— — Score Pr obability- 
Length Length 



1WT 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



\±52MMA.±1...2h& I \^T2 



|3.4e-0b 



Locus Name 



mterpnotoreceptor retinoia-iDinaing protein | igp : DRRNA1RBP 



Acc# 
X85957 



Description 



Danio rerio mRNA tor mterpnotoreceptor retinoid- £>mamg protein. 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
£415 



Score Probability 



79 



Protein name 



Locus Name 



Acc# 



Description 



[NO-HIT 



957 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



19S47890 ti b 



|370 | 



8.2e-17 



Protein name 



Description 



Locus Name 



sp:0TC_AR<3FU 



Acc# 



029013 



Oft^imtNS C ARBAMO V LT.kAN'fi ffERAd hi , (OfCASa) 



ORF Name 



19727^3 ci 4yi 



Protein name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



ORF Name 



Protein name 



NTID 



351?" 



AAID 



NT 



Length Length 



AA 

— Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NT AA 

— — Score Probability 
NTID AAID Length Length 



\10.!±1Z21..±2..X6.S. I [3CT7 



Protein name 



TTf 1 [55* 



Locus Name 



Acc# 



Description 
NO-HIT 



958 



ORF Name 



NTID 



21913177 c3 446 



Protein name 



transcription regulator, crp tamily 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



75TT 



7.6e-il 



Locus Name 



pir :F7228b 



Acc# 



F72285 



ORF Name 



Protein name 



Description 



NTID 



AAID 



NT AA 

— — Score Pro bability 
Length Length 



[T5T5" 



CCT7T" 



S.le-56 



Locus Name 



spTYn^rmcsir 



Acc# 



P40407 



ORF Name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



|2.2e-£0 



Protein name 
Description 

(D-ALANYL-D- ALPINE-ADD I NG ENZYME) 



Locus Name 



sp:MURFJWJaU 



Acc# 



P96613 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



|l.2e-38 



Protein name 



Locus Name 



glutamate racemase 



pir:^7tmy 



Acc# 



B70329 



Description 



959 



ORF Name 



NT ID 



NT AA 

— — , Score Probability 
AAID Length Length 



22136 t2 129 



8S44 



359" 



Protein name 



Locus Name 



3.1e-95 



Acc# 



sodium- dependent transporter homolog yock 



Description 



pir:D6yyu2 



D69902 



ORF Name 



Protein name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



TO" 



1121 I [JIFF 



EFT" 



Locus Name 



1.5e-66 



Acc# 



115K outer membrane protein precursor : Susc 
protein 



bir:JC6027 



Description 



JC6027 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



Protein name 



aiaciaifi.7....ci...3Lia i pssi 



[T75" 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



AAID 



NT AA 

— — Score P robability 
Length Length 



Protein name 



1179 



Locus Name 



l.Oe-lld 



Acc# 



Description 



spiPARCJiOkliU 



051066 



TOPOiaOMLift&SK IV aUBUNIT A, 



960 



NT 



AA 



ORF Name 



NTID 



23721906 c3 472 



AAID Length Length 
ITT 



Score Probability 



7JT 



Protein name 



Description 



Locus Name 



Acc# 



INO-HIT 



NT 



AA 



ORF Name 



2aaiaaia...ta...m 



NTID AAID Length Length 

2520 



Score Probability 
l.ye-38 



^77 



Protein name 



Locus Name 



outer membrane protein omp85 



[gp:AF02l245 



Acc# 



AF021245 



Description 



Neisseria meningitidis outer membrane protein Omp85 tompbbj gene, complete 
cds . 



ORF Name 



NT AA 

— — Score Probab ility 
NTID AAID Length Length 



Protein name 



Description 



Locus Name 



Acc# 



[NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



msiaaft..±i...A&. 



Length Length 
JUT 



Score Probability 
1. Oe-28 



357 



Protein name 



Locus Name 



Acc# 



two component sensor 



gp:AP0J0Jb2 



AF030352 



Description 

PseucLomonas aeruginosa two component sensor (lemA) gene, partialcas. 



961 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



|l.ie-12 



Protein name 



Locus Name 



DNA repair protein 



h?ir:H72^ 



Acc# 



H72239 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 

wzz — 



Score Probability- 
lS. 2e-iB 



Protein name 



Locus Name 



Acc# 



sp :£ADC J5AC5U | Q021 



70 



Description 
DNA REPA I R PROTEIN RADC HOM OLOG (ORFB) 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 
li.ie-15 — 



TFT 



Protein name 



Locus Name 



immunoreactive 3UKD antigen PG44 



gp:APiVb717 



Acc# 



AF175717 



Description 



Rorphyromonas gingivalis strain VibO immunoreactive 30KD antigenPG44 gene, 
complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



Score Probability 
|4.2e-5$ 



Protein name 



Description 



Locus Name 



sprYFEUJIAEiN 



Acc# 



P44862 



HYPOTH E TICAL RROT EIN HI 07 54 



962 



NT 



AA 



ORF Name 



NT ID 



24270327 c3 



AAID Length Length 




Score Probability 

0.015 — | 



Protein name 



Locus Name 



unknown 



gp:AF0442^ 



Acc# 



AF049236 



Description 



Arabidopsis thaliana putative transmembrane protein Glp {AtGl} , putative 
nuclear DNA-binding protein G2p (AtG2) , Eml protein (ATEM1) , putative 
chlorophyll synthetase (AtG4) , putative transmembrane protein G5p (AtG5) , 
putative acyl-coA dehydrogenase (AtG6) , and calcium dependent protein kinase 
genes, complete cds;and unknown genes , 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



\2£A±BM2....cl..AM I 



Length Length 
2TT — 



Score Probability 



1Z 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



L2.4.4.7.0^.0.0....cJ...^.0.0.. 



Protein name 



acetate kinase 



Description 



NT 



AA 



NTID 



AAID 



8858 



Length Length 



Score Probability 



Locus Name 



Acc# 



H72397 



ORF Name 



NTID 



\2AL4±$Z1.±2..±42.„ I RETT 



Protein name 



glycosyl transterase PAB07 72 



Description 



NT 



AA 



AAID Length Length 



TT7T 



Score Probability 
9.Se-iO 



T£7 



Locus Name 



pir :B7b0y6 



ACC# 



B75096 



963 



ORF Name 



NT ID 



NT AA 

— — Score Probability 
AAID Length Length 



I2464S257 ti ^Ob 



TUT 



Protein name 



Locus Name 



ACC# 



IsptYOHKJicJoLl 



P33373 



Description 



ORF Name 



NT ID 



NT AA 

— — , Score Probabil ity 
AAID Length Length 



I24S04562 ti 229 



341 



1055 



!2.3e-47 



Protein name 



Locus Name 



a spartate - semiaidenyae aenyarogenase , 



pir :B7U4bl 



Acc# 



B70461 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
W7W 



Score 



1413 



Probability 
|2.1e-24 



Protein name 



Locus Name 



amidase enhancer 



gp:Ab0171y4 



Acc# 



AB017194 



Description 



Plectonema boryanum 0kF270, proline immopeptiaase , terredoxm andamidase 
enhancer genes, complete and partial cds . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



1AT 



TIT 



1 . 8e-33 



Protein name 



Locus Name 



sp :YGDL_E C0L1 



Acc# 



Q46927 



Description 

HYPOTHE T ICAL 28.6 KB PROTEIN IN GCV A-MLTA INTERGENIC REGION 



964 



NT 



AA 



ORF Name 



NT ID 



26364591 c3 512 



AAID Length Length 




Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



NT 



AA 



ORF Name 



NT ID 



AAID 



Length Length 
TTSTS 



Score Probability 
4.2e-122 



Protein name 
Description 

CYTOCHROME C$$2 £>££CURSOft 



Locus Name 



|sp:NRFA HALI1N 



Acc# 



P45017 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



WTT 



l.le-6S 



Protein name 



Locus Name 



sp:YBAS_t!doLl 



Acc# 



P77454 



Description 

HYPO T HE T ICAL 32 . 9 KB PROTEIN llsi UdMA-TESA HJTEkcjB HIC REGION 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



Score Probability 
|2.2e-53 



557 



Protein name 



Locus Name 



dihyciropteroate synthase 



pir:E7242b 



Acc# 



E72425 



Description 



965 



ORF Name 



125414003 t2 118 



Protein name 

Description 
NO-HIT 



NT 



AA 



NT ID 



AAID 



Length Length 
7T3 - 



Score Probability 



Locus Name 



Acc# 



ORF Name 



Protein name 



NT 



AA 



NT ID 



AAID 



Length Length 
ITT 



TUT 



Score Probability 
3.0e-7$ 



7TTJ 



Locus Name 



hypothetical protein 



pir:S76532 



Acc# 



S76532 



Description 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length -L 



Protein name 



Locus Name 



.3.8e-i68 



Acc# 



d- lactate dehydrogenase 



Description 



pir:A7i843 



A71843 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



3.U12Q2.6.7....C.1...3.16... I 13649 



Protein name 



1158 



Locus Name 



Acc# 



Description 
MO-HIT 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



TS5TT 



8872 



35" 



i.3e-05 



Protein name 



Locus Name 



unknown 



gp:U3677i 



Acc# 



U96771 



Description 



Prevotella JDryantii putative polygalacturonase, B-l , 4 -endoglucanase, and 
mannanase genes, complete cds ; and unknowngenes . 



ORF Name 



NTID 



NT AA 

— — , Score P robability 
AAID Length Length 



13155527 c3 499 



J4T 



6.3e-S6 



Protein name 



Locus Name 



phospnotransacetylase 



gp:TTAJ4 87 0 



Acc# 



AJ004870 



Description 



Thermoanaerobacterium tnermosaccnarolyticum ptaA ana acJcA genes, orti, ort'A , 
orf3, orf 4 . 



NT 



AA 



ORF Name 



NTID 



355T 



AAID Length Length 



2TT 



— ^ Score Probability 
531 



|4.ie-52 



Protein name 



Locus Name 



ABC transporter 



plr :B70327 



Acc# 



B70327 



Description 



ORF Name 



Protein name 



Description 



NTID 



3£S3 



3T75 



NT 



AA 



AAID Length Length 




Cm- 



Score Probability 
S.3e-45 



Locus Name 



sp:DDL_HAE!N 



D- ALANINE --B- ALANIN E LlflASM, (D-AhAls l VL ALAlsilNL] aYNTHETAiSLl) 



Acc# 



P44405 



967 



ORF Name 



NT ID 



AAID 



NT AA 
To^t-i. Score Probability 
Length Length 



33406262 c3 440 



TTT 



5.3e-20 



Protein name 



Locus Name 



Acc# 



nimB protein 



pir:I40183 



Description 



NT 



AA 



ORF Name 



NTID 



I3.3.6.7.S25.3....JL2...127.., 



AAID Length Length 
1751 



rtbtt 



Score Probability 




i.4e-il 



Protein name 



Locus Name 



NorA 



lgp:AB019536 



Acc# 



AB019536 



Description 



Staphylococcus aureus norA23 gene tor NorA, complete cds. 



ORF Name 



NTID 



AAID 



3.4D.23A2.5...C.3....48.6. I 



Protein name 



glutamate decarboxylase : protein 
S111641 rprotein slll641 



Description 



NT 



AA 



Length Length 
T¥51 



Score Probability 
|1.6e-ii4 



TOTT 



Locus Name 



|pir:S7!>lbO 



Acc# 



S75150 



ORF Name 



3.40.6.3.0.6..7...±2...3.8.... 



NTID 



Protein name 

Description 
ELONGATION FACTOR P (fif-P) 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



K5T 



S7T 



Locus Name 



sp:EFP_BACFR 



3.2e-M 



Acc# 



P70889 



968 



ORF Name 



NT ID 



AAID 



NT AA 

— — Score Probability 
Length Length 



340S5887 c3 SOi 



\TTTT 



5.7e-S8 



Protein name 



Locus Name 



alJcalme pnosphatase 



pir:B72410 



Acc# 



B72410 



Description 



ORF Name 



NT ID 



NT AA n ^ - . . - . . 
— — , Score Probability 
AAID Length Length 



TTTT" 



0.038 



Protein name 



Locus Name 



hypothetical protein S110670 



bxr:S770b4 



Acc# 



S77054 



Description 



NT 



AA 



ORF Name 



NT ID 



asi±aa:La..±i...ii£ 



AAID Length Length 
— 



mm 



Score Probability 
|5.3e-S3 



Protein name 



Locus Name 



glutamate 5 -kinase pro J 



pir :F69682 



ACC# 



F69682 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



3£3A5.2S.2...cl...3.3.a.. 



T7S~ 



2.4e-33 



Protein name 
Description 

HYPOTH ET ICAL 3£ . 7 KB PROTEIN IN BCHI 5 ' REGION 



Locus Name 



sp:YB05__0HLVl 



Acc# 



050310 



NT 



ORF Name 



NT ID 



AAID 



Length Length 
^7" 



AA 

— Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



[NO-HIT 



969 



NT 



AA 



ORF Name 



NTID 



3923751 ci VIA 



AAID Length Length 
F51T5 



TTT* 



Score Probability 
2T7 



6.8e-20 



Protein name 



Locus Name 



conserved hypothetical protein 



pir:C75306 



Acc# 



C75306 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 




0.0015 



Protein name 



Locus Name 



elastic titm 



PirTTJ^^" 



Acc# 



138346 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
T& 



Score Probability 



TUT 



Protein name 



Description 



Locus Name 



Acc# 



[MO-HIT 



NT 



AA 



ORF Name 



NTID 



aa£i4i2L.ca...isa I psss 



AAID Length Length 

msz — 



4JT 



TT75" 



Score Probability 
UTTJT3 



Protein name 



Locus Name 



polyprotein 



gp:AF20£441 



Acc# 



AF206441 



Description 



Hepatitis C virus isolate 28B polyprotein gene, E1/E2 region, partial cds. 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
TO - 



Score Probability 
PB 



0.047 



Protein name 



Locus Name 



conserved hypothetical protein aq_34 0 



pir:C70330 



Acc# 



C70330 



Description 



970 



ORF Name 



4101502 c3 454 



Protein name 

Description 
NO-HIT 



NT 



AA 



NT ID 



AAID 



Length Length 



Score Probability 



TTTT 



Locus Name 



Acc# 



ORF Name 



Protein name 



NT 



AA 



NT ID 



AAID 



8891 



Length Length 



Score Probability 
0.0082 



Locus Name 



hypothetical protein W06B4.2 



pir :T34482 



Acc# 



T34482 



Description 



ORF Name 



NT ID 



AAID 



NT AA 
T — ^ Score Probability 

Length Length 



3670 



8892 



F7F" 



1.5e-09 



Protein name 



Locus Name 



Acc# 



hypothetical protein 



pir:S7599i 



S75991 



Description 



NT 



AA 



ORF Name 



NT ID 



A41QaQ.7....t3L...2L3l& I 13671 



AAID Length Length 
— 



OTT" 



Score Probability 
1352 



Protein name 



Description 



Locus Name 



gp iriiLlCS 



4 .le-19 



Acc# 



X57315 



Haemophilus influenzae l±c3 locus, containing gaiE and adJc genestor 
UDP-galactose-4-epimerase and adenylate kinase. 



971 



NT 



AA 



ORF Name 



NTID 



AAID 



453545? t3 177 



Length Length 



Score Probability 




Protein name 



Locus Name 



gamma -glutamyl phosphate reductase 



gp : STPROBA 



ACC# 



X92418 



Description 

S . thermophilus proB and proA genes. 



NT 



AA 



ORF Name 



cl 32$ 



NTID AAID Length Length 

— 



HT4~ 



Score Probability 
|2.$e-3S 



Protein name 
Description 

HYPOTHETICAL 35.7 KD PROTEIN IN BCHI 5 ' REGION 



Locus Name 



sp:YBC5_CULVl 



Acc# 



050310 



ORF Name 



NTID 



NT AA 
TT> — T — _ Score Probability 
AAID Length Length 



ITS" 



i.3e-20 



Protein name 



Locus Name 



small subunit ot cytochrome c nitrite 
reductase 



E 



p:WSU245540 



Acc# 



AJ245540 



Description 



Wolinella succinogenes mreB gene (partxai) , nr±H, nr±A, nrf I, andnrtJ 
genes . 



NT 



AA 



ORF Name 



46.5.3.3....c3...A3.5. I 1357? 



NTID AAID Length Length 

— 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



972 



ORF Name 



NT ID 



Protein name 



NT AA 

^ _ — _ _ — Score Probability 
AAID Length Length • L ~ 



protein kinase homolog 



Description 



T5T 



Locus Name 



pir :T42077 



i.3e-0fl 



Acc# 



T42077 



ORF Name 



Protein name 



NT ID 



AAID 



NT AA score 
Length Length 



TUTT 



Locus Name 



Probability 



Acc# 



Description 
[MO-HIT 



ORF Name 



Protein name 



NT ID 



AAID 



NT 



AA 



Length Length 
FT" 



Score Probability 



Locus Name 



Acc# 



Description 
NO-HIT 



ORF Name 



Protein name 



NT ID 



AAID 



hypothetical protein PAB0896 



Description 



NT AA 

— , — , Score Probability 
Length Length ^ 



ITT 



Locus Name 



pir:O75045 



0.0020 



Acc# 



G75045 



ORF Name 



48..7.6.5.6.3....C.2..AQ.3.. 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 
JIB 



Score Probability 



TUT 



Locus Name 



Acc# 



Description 
NO-HIT 



973 



ORF Name 



5272S13 £1 63 



Protein name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 





3681 




8903 




313 


942 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



5.9.0..7.3.12...C1...3.3.2 1 



Protein name 



AAID 



NT AA 

to^>, T^i-in Score Probability 
Length Length 



Locus Name 



Acc# 



Description 
INO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



hypothetical protein SPAC11E3.10 



Description 



NT AA 
— , — , Score 
Length Length 



1ST" 



Locus Name 



pir:T37SJB 



Probability 
10.0068 



Acc# 



T37538 



ORF Name 



Protein name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length J ~ 



FT 



T5T 



Locus Name 



Acc# 



Description 
NO- HIT 



ORF Name 



Protein name 



NTID 



AAID 



hypothetical protein :jhp0277 



Description 



NT 



AA 



Length Length 
TFT 



TIT 



Score Probability 
FTCT 



Locus Name 



pir:H71450 



1.4e-42 



Acc# 



H71950 



974 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



Score Probability 



Protein name 

Description 
IN^TITT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



|S..7..7.0.8.&...g2l...412 1 



Length Length 
TTT 



Score Probability 



\JTT 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID AAID Length Length 

mm — 



ST" 



Score Probability 
S3 



0.0057 



Protein name 



Locus Name 



NADH dehydrogenase 1 



gp:AF0S9183 



Acc# 



AF069183 



Description 



Lipolexis gracilis NADH dehydrogenase 1 gene, mitochondrial geneencodmg 
mitochondrial protein, partial cds . 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



975 



NT 



AA 



ORF Name 



NT ID 



AAID 



/y/2bb ±1 173 



t — ^ t — , -i Score Probability 
Length Length ^~ 



9.7e-12 



Protein name 



Locus Name 



automembrane protein H 



gp : VEOMPH 



Acc# 



Y12468 



Description 
Y . enterocolitica ompH gene. 



ORF Name 



NTID 



AAID 



NT AA 
_ — ^ — . Score Probability 
Length Length ^ 



815626 12 104 



447 



T34T 



Protein name 

Description 
WO-fll* 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



ftAaii..±i...fi&. i 



IB5TT 



Length Length 



Score Probability 
532 



9.4e-62 



Protein name 

Description 
(ORF2 J 



Locus Name 



sp:Y££C_>AC£tf 



Acc# 



P40407 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



Score Probability 
HIT 



|2.5e-3f 



Protein name 



Locus Name 



hypothetical protein 



pir : S75887 



ACC# 



S75887 



Description 



976 



NT 



AA 



ORF Name 



NTID 



riWOOO ta 174 



, , xr , T — _ — Score Prob ability 
AAID Length Length JL 





TIT 



i.Se-05 



Protein name 



Locus Name 



periplasmic protein 



gp:PLU236y20 



Acc# 



AJ236920 



Description 



Pnotorhabcius iummescens yaeL (partial), rirA (partial), oma andompH genes. 



NT 



AA 



ORF Name 



NTID AAID Length Length 

— 



TIT 



Score Probability 

wn — 



4 . 8e-l21 



Protein name 



Locus Name 



glycine- -tKNA ligase, glys :giycyi-tRNA 
synthetase :glycyl-tRNA synthetase 



Description 



pir :B70146 



Acc# 



B70146 



ORF Name 



NTID 



AAID Lenjth Length Probability 



£lBAll...a±.„l±l I 



Protein name 

Description 
INO-fil? 



1074 



Locus Name 



Acc# 



ORF Name 



Protein name 
Description 

ino-mt 



NT 



AA 



NTID 



AAID 



Length Length 
T35" 



Score Probability 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA 

AAID Lenjth Length Probability 



9540957 ci 325 



m7W 



73T 



1.8e-i9 



Protein name 



Description 



Locus Name 



Acc# 



P44046 



HtfMfftfiflCAL PROTEIN S10735 



ORF Name 



Protein name 



Description 



NTID 



AAID 



NT 



AA 



Length Length 
TIT 



T7T 



Score Probability 
TT5 



7 . 8e~06 



Locus Name 



sp ; FAS_PlflgCA 



Acc# 



P29251 



ORF Name 



NT 



AA 



NTID 



10.6.2b.0.iA...ci...2S.D. I ITTTJu" 



AAID Length Length 
3322 



TUTT 



Score Probability 




|i.2e-29 



Protein name 



Locus Name 



Jaemin permease 



gp : VEHEMSTUV 



Acc# 



X77867 



Description 



Y.enterocolitica nems, nemT, nemu ana nemv genes. 



NT 



AA 



ORF Name 



12L&l&.7.aQ...C2L...2L5a. 



™™ _ _ -pi-v T — , , r — ^, Score Probability 
NTID AAID Length Length JL 

— 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



978 



ORF Name 



126i>%6=; ci 202 



Protein name 

Description 
MO-HIT 



NT 



AA 



NT ID 



TTUT 



— _ — Score Probability 
AAID Length Length 2 - 

"$57% — 



Locus Name 



Acc# 



ORF Name 



Protein name 

Description 
INO^ITTT 



NT ID 



NT AA 

_ _ _ - _ _ — _ — ^, Score Proba bility 
AAID Length Length z - 



3703 



Locus Name 



Acc# 



ORF Name 



Iilu21iy....cl...l72... 



Protein name 



NT 



AA 



NT ID 



TTUF 



AAID Length Length 
— 



Score Probability 
1521 



lii.le-39 



Locus Name 



sp : VLC!A_EC!OLI 



Acc# 



P77380 



Description 

PROBABLE TRANSCRIPTIONAL RE GULA T ORY PROT E IN YLCA 



NT 



AA 



ORF Name 



NTID 



AAID 



13.6.D.9A2L..±1...2Cl I 15705 



Length Length 
[273 



Score Probability 



5TT 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



979 



ORF Name 



14179057 tl 44 



Protein name 

Description 
NO-HIT 



NT 



AA 



NTID 



AAID Length Length 

mzz — 



Score Probability 



£1T 



Locus Name 



Acc# 



ORF Name 



144M0A1..±3....1&£., 



Protein name 



NT 



AA 



NTID 



JTUT 



AAID Length Length 
— 



7TT 



Score Probability 
£23 



It.ie-id 



Locus Name 



precorrin-2 metnyl transferase, : protein 
slr!879 :protein slr!879 



Description 



bir:S77131 



Acc# 



S77131 



NT 



AA 



ORF Name 



NTID 



AAID 



T7W 



_ — ^ — ^, Score Probability 
Length Length -L 



0.045 



Protein name 

Description 
HYPOTHETICAL PROTEIN TP0031 



Locus Name 



|sp:Y0^1_TREPA 



Acc# 



083074 



NT 



AA 



ORF Name 



NTID 



T7UT 



_ -p-p>, T — _ — ^, Score Proba bility 
AAID Length Length jL 

— 



per 



|1.2e-82 



Protein name 



Locus Name 



putative ettlux pump component MtrF 



gp:AF176820 



ACC# 



AF176820 



Description 



Neisseria gonorrnoeae strain FA19 putative ettlux pump componentMtrP (mtrP) 
gene, complete cds. 



980 



NT 



AA 



ORF Name 



NT ID 



AAID 



l6ttiaS02 c2 250 



T7TTT 



Length Length 



TZT 



Score Probability 
B5I 



|2.2e-21 



Protein name 



Locus Name 



Hypothetical protein PAB0910 



pir :B75048 



Acc# 



B75048 



Description 



NT 



AA 



ORF Name 



NTID 



TTTT 



AAID Length Length 




Score Probability 



7T 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



T7TT 



Length Length 



T5T 



Score Probability 
[376 



1.3e-34 



Protein name 



Description 



Locus Name 



sp : P YR I_P YRAB 



ACC# 



P77919 



ASPARTATE C ARB AMOYLT RAWS tf£ RASE REGULATORY CtiAlKf 



NT 



AA 



ORF Name 



NTID 



AAID 



ltiA2&526..±±..±S2. 1 [T7T7 



Length Length 



Score Probability 
TZZ 



1.7e-14 



Protein name 



Locus Name 



hypothetical protein PH0856 



pir:D71136 



Acc# 



D71136 



Description 



ORF Name 



NTID 



NT AA 

, , „ _ — — Scor e Probability 
AAID Length Length JL 



T7TT" 



Protein name 



Locus Name 



nisticLme Kinase sensor protein (barA) RP229 



pir :B71677 



Description 



7.4e-25 



Acc# 



B71677 



981 



ORF Name 



NT ID 



NT AA 
AAID Length Length Sc ° re 



'2±3 l !M2h ti ISO 



Probability 
0.000&7 



Protein name 



Description 



Locus Name 



gp:D42 06 7 



Acc# 



D42067 



Porpnyromonas gmgivalis DNA tor Fimbriim, ORFl-4, complete cds . 



NT 



AA 



ORF Name 



22^7^061 15 131 



TTTZ 



NT ID AAID Length Length Probability 




TTT 



7FJ" 



9.4e-78 



Protein name 
Description 

PYROPHOSPHATE SYNTHETASE) 



Locus Name 



sp : KpR&JJfiLPY 



Acc# 



P56184 



NT 



AA 



ORF Name 



NTID 



AAID Length Length Probability 
— 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



Z2b.i.y.U0.2...al...l.7.7.. 



T7HT 



AAID Length Length 
S^u" 



OT" 



Score Probability 




3.3e-22 



Protein name 

Description 
HYPOTHETI CAL PROTEIN 



Locus Name 



sp:YS78_METJA 



Acc# 



Q58288 



982 



NT 



AA 



ORF Name 



NT ID 



22656925 cl 218 



T7TT 



AAID Length Length 

mzi — 



Score 



73T 



Probability 
(5.7e-76 



Protein name 



Locus Name 



penicillin binding protein 1A 



pir : F70355 



Acc# 



F70355 



Description 



NT 



AA 



ORF Name 



NTID 



3720 



AAID Length Length 




Score Probability 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA r, , , ■ , . 

,, Tn T — _ — _ Score Probability 
AAID Length Length JL 



2.A&JJA8J...XtJ± I 



\TTIT 



Protein name 



n YP°thetxcal protein BBI16 



Description 



TT5~ 



0.00025 



Locus Name 



pir:G70241 



Acc# 



G70241 



ORF Name 



23.b.S.Z25.Q....a;3....3.ai.. 



Protein name 

Description 
IWO-HIT 



NT 



AA 



NTID 



TTZT 



AAID Length Length 

mzz — 



Score Probability 



Locus Name 



Acc# 



ORF Name 



i44l0.S3..7...±i...ll&.. 



Protein name 



NT 



AA 



NTID 



AAID 



Length Length 
TTT 



4T¥" 



Score Probability 




l.le-12 



Locus Name 



structural protein P5 



gp:AF155037 



Acc# 



AF155037 



Description 
Aiteromonas phage, complete genome. 



983 



ORF Name 



NTID 



NT AA 

, , ™ T — , , _ — _ Score Probability 
AAID Length Length z - 



24495875 t2 93 



Protein name 

Description 
ISO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



SWT 



Length Length 

1496 



Score Probability 
|5.2e-0S 



T5T 



Protein name 



Locus Name 



putative histiame Kinase 



gp:AF0369S4 



Acc# 



AF036964 



Description 



Lactobacillus sake putatxve response regulator (rrpl) and putatxvehistidine 
kinase (hpkl) genes, complete cds . 



ORF Name 



NTID 



AAID Length Length Probability 



T72T 



5W 



234" 



S.4e-i3 



Protein name 



Locus Name 



processing proteinase S112009 : protein 
S112009 :protein sll2009 



pir :S77156 



Acc# 



S77156 



Description 



ORF Name 



Protein name 



NTID 



MbAD.<m..±2...B.b. I [T7T7 



iiypotnetical protein 



NT 



AA 



AAID Length Length 
^ 



TUT 



5TT 



Score Probability 
525 



Locus Name 



gp:A?0S§§57 



Description 

Zymomonas mobilis cosmid clone 65G3, partial sequence. 



|i.2e-82 



Acc# 



984 



ORF Name 



NTID 



NT AA , , 
_ — _ — _ Score Probability 
AAID Length Length JL 



24642257 tl 35 



[7X7" 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



NTID 



NT AA 
— „ — Score 
AAID Length Length 



Z4,6A^U6..7....Cl...iaQ.. 



TTTT 



Probability 
1 . 8e-44 



Protein name 



Locus Name 



Acc# 



rerric enteromctm transport ATP-oinding 



gp:U6753i 



Description 



Metnanococcus jannaschu section 73 ot 150 ot tne complete genome. 



NT 



AA 



ORF Name 



NTID AAID Length Length 




Score Probability 



TFTT 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



TTTT 



J 



* — T — _ — . Score Probability 
AAID Length Length 2 - 

— 



TIZT 



ITT^" 



1.3e-ll7 



Protein name 

Description 
"TSEMTT 



Locus Name 



sp : GLYA_ECOLI 



Acc# 



P00477 



985 



ORF Name 



NTID 



NT AA 
r — . u T — Score Probability 
AAID Length Length z - 



:iSi&010 ti 31 



TTJT 



1404 



0.0040 



Protein name 



Locus Name 



53KDa major outer membrane protein 



D31835 



Acc# 



D31835 



Description 



Porpnyromonas gmgivalis DNA tor 53kDa major outer membraneprotein, 
complete cds . 



ORF Name 



NT AA 

NTID AAID Length Length Probability 



2S429811 c3 274 



T4uT" 



1.5e-3l 



Protein name 



Locus Name 



LisK 



gp:AF139$08 



Acc# 



AF139908 



Description 



Listeria monocytogenes lisR/iisK gene locus, complete sequence. 



NT 



AA 



ORF Name 



NTID 



AAID 



££&7u0.5.7..±1...3.3... 



3734 



8956 



Length Length 



Score Probability 
333 



1.4e-83 



Protein name 
Description 

T&ANSCARBAMYLASE) (ATCASfi) 



Locus Name 



Acc# 



P96174 



ORF Name 



NTID 



NT AA n 
T — _ — Score Probabil ity 
AAID Length Length JL 



3735 



3TJu~ 



34S~ 



2 .4e-31 



Protein name 



Locus Name 



prooajDie aTDP-4-aenyarornamnose reductase 
IAPE1179 



pir:G72588 



Acc# 



G72588 



Description 



986 



NT 



AA 



ORF Name 



26595887 cl 224 



T73F 



T, TrT1 „ tvtv-t^ t — ^ T — Sco re Probability 
NT ID AAID Length Length L 

8958 



7JT 



IFF" 



Protein name 



Locus Name 



probable two- component system response 
transcription regulator 



Description 



pir :T36499 



|2.3e-12 



Acc# 



T36499 



ORF Name 



Protein name 

Description 
INO-HIT 



NT AA , , . . 
™„ ^^-r^ t — — — ^ Score Probability 
NTID AAID Length Length x - 



\rnr 



8959 



Locus Name 



Acc# 



ORF Name 



Protein name 

Description 
INO-HIT 



NT 



AA 



NTID 



AAID 



3738 



8960 



Length Length 




Score Probability 



Locus Name 



Acc# 



ORF Name 



Z&lb±&..±2..£l 



Protein name 
Description 

NO-HIT 



NT 



AA 



NTID 



AAID 



Length Length 
7F~ 



Score Probability 



Locus Name 



Acc# 



ORF Name 



|^y.:/.ay.2ll..±3....U6.., 



Protein name 

Description 
NO-HIT 



NT 



AA 



™ Tr . 7\7\t^ t — *-u r — ^ Score Pro bability 
NTID AAID Length Length JL 

m&z — 



IT 



Locus Name 



Acc# 



987 



NT 



AA 



ORF Name 



NT ID 



14008468 t3 116 



AAID Length Length 




Score Probability 



3bT 



Protein name 

Description 
[NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



3742 



„ m ™ _ _ — J _ 1 _ — ^. Score Probability 
NT ID AAID Length Length z - 

SSSI — 



0.010 



Protein name 



Locus Name 



VirM 



gp:ATTIA6NCl 



Acc# 



AF039888 



Description 



Agrobactenum tumetaciens plasmici pTiA6NC VirM (virMJ and VirL(virL) genes, 
complete cds. 



NT 



AA 



ORF Name 



T743 



NT ID AAID Length Length 

vws — 



Score Probability 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



\lZ22&All...alJ2&£. I [T74T 



AAID Length Length 

mzz — 



\TT2T 



Score Probability 
T7I 



2 .4e-15 



Protein name 



Locus Name 



unknown 



|gp:AF007381 



Acc# 



AF007381 



Description 



Flavo&actenum jonnsomae gliding motility protein (glctA) gene, complete 
cds ; and unknown genes . 



988 



NT 



AA 



ORF Name 



NT ID 



32229687 Cl 217 



AAID Length Length 

mzi — 



T5T 



Score Probability 
PI 



0.00022 



Protein name 



Description 



Locus Name 



Acc# 



P43723 



INTEGRATION HOS? FACTOR ALPHA-SxjfeUNll 1 drib 1 - ALPHA) 



NT 



AA 



ORF Name 



NTID 



c3 275 



AAID Length Length Probability 




T3T 



1.3e-l4 



Protein name 



Locus Name 



sp : YVBG_BACSU 



Acc# 



032244 



Description 

HYPOTHETICAL 22.6 KB MO T EIN IN 0MJC&-KN0 IMTBRSBHIC REGION 



NT 



AA 



ORF Name 



NTID 



T7TT 



AAID Length Length 

mzv — 



Score Probability 
^5 



2.1e-4i 



Protein name 



Locus Name 



RprX 



|gp:SB5000 



Acc# 



S59000 



Description 



NT 



AA 



ORF Name 



M2.7.5.7.£.0....ci..J.10. I 



T — ^, _ — ^. Score Probability 
NTID AAID Length Length JL 

mrv — 



2.2e-09 



Protein name 



Locus Name 



processing proteinase rprotein slrl331 :protein 
slr!331 



pir :S75528 



Acc# 



S75528 



Description 



989 



ORF Name 



NT ID 



AAID 



NT AA 
t — t — ^ Score Probability 
Length Length JL 



36040712 c3 232 



mrr 



TUT 



TIT 



TUT 



8.be-05 



Protein name 



Locus Name 



Acc# 



Hypothetical protein HI1452 



D90724 



Description 

Escherichia coll genomic DNA. - 19.8 mm) 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 

T — ^, Score 
Length 


Probability 


^7575_c2_244 






570 




L713 1575 


| 4.2e-l62 


Protein name 










Locus Name 


Acc# 












sp:mS_CL0AC 


P13419 


Description 














SYNTHETASE ) ( FHS ) ( FTHFS ) 


















ORF Name 


NTID 


AAID 


NT 
Length 


— Score 
Length 


Probability 


^OAlllL^tl^A 


3751 


8<m 


1095 


3288 475 


9.6e-78 | 


Protein name 










Locus Name 


Acc# 


11 5K outer membrane 
protein 


protein precursor 


SusC 




pir : JC6027 


JC6027 














Description 














ORF Name 


NTID 


AAID 


NT 
Length 


AA 

„ — ^ Score 
Length 


Probability 


A4inaaafli™a^2L4a 


3752 


8374 


74 


225 125 


| b.oe-08 


Protein name 










Locus Name 


Acc# 


nypotnetical protein PH0719 


pir:H71118 


H71118 



Description 



990 



NT 



AA 



ORF Name 



NT ID 



4484812 c2 261 



, , T — _ — _ Score Prob ability 
AAID Length Length x ~ 

— 



Protein name 



Locus Name 



hypothetical protein slrl485 



pir :S74454 



Acc# 



S74454 



Description 



ORF Name 



NT ID 



NT AA 
AAID Length Length Sc ° re 



47.2.7126L±3l...11.7. I 15754 



TT2~ 



Protein name 

Description 
NO-HIT 



Locus Name 



Probability 



Acc# 



ORF Name 



NT ID 



AAID 



NT AA 

T — 4-x, t — ^ Score 
Length Length 



Protein name 

Description 
INO-HIT 



Locus Name 



Probability 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



\&M2l&L.cxl...±±6. I 



Length Length 
350" 



TT7T 



Score Probability 
£75 



7.3e-23 



Protein name 



Locus Name 



transposase 



gp:AF038866 



Acc# 



AF038866 



Description 



Bacteroides tragxlis transposon Tn552 0 transposase (bipM) andmobilization 
protein BmpH (bmpH) genes, complete cds . 



991 



ORF Name 



14557577 13 150 







NT 


AA 


NTID 


AAID 


Length 


Length 


3757 


8979 


266 i 


501 



Protein name 

Description 
NO-HIT 



Locus Name 



Probability 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 
_ — ^ — , Score Probability 
Length Length L - 



2 . Be-21 



Protein name 



Locus Name 



sp : PA1 G__HUMAN 



ACC# 



Q15102 



Description 
StMriJIT) (PAP- AH GAMMA 



NT 



AA 



ORF Name 



NTID 



AAID 



375$ 



Length Length 
75— 



Score Probability 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID Length Length Probability 

— 



TUT 



TIT 



8.2e-l4 



Protein name 



Locus Name 



Acc# 



|sp:M'U0_EC0LI 



Description 

VITAMIN B12 TRANSPORT SYSTEM PERME ASE PROTEIN B T UC 



992 



NT 



AA 



ORF Name 



NT ID 



66796b2 t2 78 



R7FT" 



■j. _ -p-p>. T — T — Score Probability 
AAID Length Length 2 - 

3351 — 



3^7" 



3T3" 



l.3e-30 



Protein name 



Description 



Locus Name 



sp:KD^_HAEIN 



Acc# 



P44490 



SYNTHETASE) (CM^-^-KfiTO-^-DEOXYOG^trLOSONl^ ACID SYNTHETASE) — (CKS) 



NT 



AA 



ORF Name 



t3 145 



3752 



™™ 7V7s-rr> t- — _ — _ Score Probability 
NT ID AAID Length Length JL 

mwi — 



7U4 - 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NT ID 



X&±$.$.2..±±..:2. I [3753 



AAID Length Length 
3335 



T5T 



WTT 



Score Probability 
TT1 



1.2e-l7 



Protein name 



Locus Name 



probable phospno- sugar mutase 2 



Acc# 



E71082 



Description 



ORF Name 



NTID 



AAID 



±$.5±25±2„.a±..A I 13757 



Protein name 

Description 
INO-HiT 



NT 



AA 



Length Length 
T35" 



Score Probability 



Locus Name 



Acc# 



993 



ORF Name 



3957142 tl 1 



Protein name 



NT ID 



NT 



AA 



AAID Length Length 
1231 



Score Probability 



7F - 



Locus Name 



Acc# 



Description 



NO-HIT 









NT 


AA 


ORF Name 


NTID 


AAID 


Length Length 


±o:±2i±&:l±i..m. 


3766 


8388 


219 


660 









Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



TTZT 



AAID Length Length 




11083 



Score Probability 
|8.3e-34 



Locus Name 



conserved hypothetical protein ag_163 0 



pir :F70440 



Acc# 



F70440 



Description 



ORF Name 



NTID 



AAID 



NT AA 
— — Score 
Length Length 



1W 



Protein name 



Description 



Locus Name 



Probability 



Acc# 



NO-HIT 



994 



ORF Name 



NTID 



AAID 



NT AA 

Score 

Length Length 



11726391 c3 B47 



T5F 



Probability 
8.6e-52 



Protein name 



Locus Name 



putative UDP-GicNAc :undecaprenylphospnate 



gp:AP04S749 



Acc# 



AF048749 



Description 



Bacteroides fragilis capsular polysaccharide biosynthesis operon, complete 
sequence . 



NT 



AA 



ORF Name 



NTID 



AAID 



1175127 ±1 $ 



fTTTJT 



Length Length 
TT35 — 



T74" 



Score Probability 
WT2 



|$.§e-5S 



Protein name 



Description 



Locus Name 



sp:AR6C_SYtiY3 



Acc# 



P23353 



PH0SPH0LYA5E) 



NT 



AA 



ORF Name 



NTID 



AAID 



I226.412..±2....17.0. I 



3771 



S553 



Length Length 
T7TT 



11113 



Score Probability 
6.6e-09 



Protein name 



Locus Name 



GumF protein 



|pir:S67^5^ 



Acc# 



S67855 



Description 



NT 



ORF Name 



NTID 



AAID 



m&3A^...±2...1$.£..„ I ITTTZ 



Length Length 



AA 

— Score Probability 



FT7T 



Protein name 



Description 



Locus Name 



Acc# 



[NO-HIT 



995 



ORF Name 



13009675 c3 590 



Protein name 



NTID 



3773 



NT AA 

— — , Score Prob ability 
AAID Length Length 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID 



T77¥" 



Length Length 



urn" 



Score Probability 
OS 



|2.0e-0a 



Locus Name 



|sp:VAPI_BA0N0 



Acc# 
Q46560 



Description 
VlfeULfiUgE-AS50CIATg£) frftOTEilfri 1 



ORF Name 



Protein name 



NTID 



T775" 



AAID 



NT AA rt „ , , . . . ^ 
— — , Score Pr obability 
Length Length 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



AAID 



NT AA _ _ m -i ■ i . 
— — Score Pro bability 
Length Length 



XA.12&&&±±l...lXl I 



2.9e-§3 



Protein name 



Locus Name 



conserved hypothetical protein 



foir:H7^77 



Acc# 



H72377 



Description 



ORF Name 



NT ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



14938777 c2 427 



Protein name 





3777 




8955 




213 


542 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



NT ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



i5&aaa...c±.„a7.2 1 



Protein name 



TZT 



i.7e-16 



Locus Name 



Acc# 



riJDosomal protein S06 



Description 



pir:G70305 



G70305 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



TTTT 



|4.2e-36 



Protein name 



Locus Name 



Acc# 



Jiypotiietical protein TM1421 



pir:B72256 



B72256 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID 



16A46.Q.Q.2...a3....5A5. I 13780 



— , — , Score 
Length Length 

12406 



SuT" 



Probability 
ll.Oe-lOl 



Protein name 



Locus Name 



hypothetical protein Rv0584 



pir :G70934 



ACC# 



G70934 



Description 



NT 



AA 



ORF Name 



±660r±&& c2 447 



NTID AAID Length Length 

mvi — 



TFT" 



Score Probability 
|2.2e-37 



mi 



Protein name 



Locus Name 



BsaA 



bp:AB0i3377 



Acc# 



AB013377 



Description 



Bacillus halodurans c-125 comGB and bsaA genes anci tRNA-His , Ala,Arg, Giy 
and Tyr genes, complete and partial cds. 



NT 



AA 



ORF Name 



NTID 



17010542 cl 337 



TTWF 



AAID Length Length 
] |787 | p^4 — 



Score Probability 
|l.Se-ll2 



TTTT" 



Protein name 



Locus Name 



beta-galactosidase, : lactase 



pir: JC5£i§ 



Acc# 



JC5618 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



±16.5.6±...a!..A±6. .1 PTO 



Protein name 



2.3e-5S 



Locus Name 



Acc# 



hypothetical protein slrl772 



Description 



pxr:S74628 



S74628 



ORF Name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



iai552L...ca...512 I 



Protein name 



i.4e-67 



Locus Name 



Acc# 



probable piiospho- sugar mutase 2 



Description 



pir:E710S2 



E71082 



998 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


20192257_tl_10b 


3785 




9007 




304 




915 


281 




1.5e-24 



Protein name 



Locus Name 



gp:STALYT<3 



Acc# 



L42945 



Description 

Staphylococcus aureus lytS and lytR genes, complete cds . 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



2031317 tl 99 



5uW 



33- 



Protein name 



Description 



Locus Name 



Acc# 



K0-H1T 



NT 



AA 



ORF Name 



NTID 



T7FT 



AAID Length Length 



Score Probability 
l.3e-09 



TU7 



Protein name 



Locus Name 



transmembrane sensor 



gp:AF05l691 



Acc# 



AF051691 



Description 



Pseudomonas aerugxnosa stress tactor A (pstAJ , ECF sigma tactoritiui) , 
transmembrane sensor (fiuR) , and hydroxamate- typef errisiderophore receptor 
(fiuA) genes, complete cds. 



ORF Name 



NTID 



— , — , Score Probability 
AAID Length Length 



1110 



CTu" 



S.Se-18 



Protein name 



Locus Name 



Acc# 



sp:YEHU_ECOLl 



Description 

HYPOTHETICAL 62,1 KD PROT^lJSf IN MOLfe-BOLx Iti'l'flft fiBtttC REGION £RfiOURSOk 



999 





NTID 


AAID 


NT 
Length 




AA 
Length 


Score 


Probability 


20704802_ci_402 


37fc9 


9011 


465 




1398 


465 




4 . 7e-44 


Protein name 














Locus 


Name 




Acc# 


conserved Hypothetical protein 


pir :G72220 


G72220 


Description 






















ORF Name 


NTID 


AAID 


NT 
Length 




AA 
Length 


Score 


Probability 


aiictm^ci^isa 




9012 


118 




357 








Protein name 














Locus 


Name 




Acc# 


Description 
























MO-HIT 


























ORF Name 


NTID 


AAID 


NT 
Length 




AA 
Length 


Score 


Probability 


a21147.fi^tl™3Ll 


5791 


9015 


532 




1599 


512 


4 . 9e-49 


Protein name 














Locus 


Name 




Acc# 


nypotnetical protein jhplllO 


pir :A71849 




A71849 


Description 






















ORF Name 


NTID 


AAID 


NT 
Length 




AA 
Length 


Score 


Probability 


maaatt^ti^ia 


5792 


9014 


169 




510 


Si 




0 . 0040 


Protein name 














Locus 


Name 




ACC# 


DbhB 




gp:AFll0l85 


AF110185 



Description 



Burkhoideria pseudomallei strain 1026P DPnB icUonB) , generaisecretory 
pathway protein D (gspD) , general secretory pathwayprotein E (gspE) , general 
secretory pathway protein F (gspF) , GspC (gspC) , general secretory pathway 
protein G (gspG) , generaisecretory pathway protein H (gspH) , general 
secretory pathwayprotein I (gspl) , general secretory pathway protein J 



1000 



ORF Name 



i>2$10052 t3 310 



Protein name 

Description 
&ELA i>R0TEl]Sf 



NT ID 



NT AA 

n ,„ T — _ — _ Score Probability 
AAID Length Length L 



Locus Name 



;Sp : HELA LEGPN 



Acc# 



Q48815 



ORF Name 



22915938 Cl 405 



Protein name 



NTID 



NT AA 

AAID Length Length Probability 



3794 



hypotheticai protein APE0978 



Description 



P4TT 



0.0040 



Locus Name 



pir :B72£95 



Acc# 



B72695 



ORF Name 



Protein name 
Description 

( BETA- NAHAS E ) 



NT 



AA 



NTID 



AAID 



WIT 



Length Length 
TTT 



Score Probability 

— 



7.6e-10S 



Locus Name 



sp : HEXA_PORGI 



Acc# 



P49008 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 
T — ^ _ — Score 
Length Length 



Probability 
|l.4e-« 



Locus Name 



long-cnam-tatty-acxd CoA ligase 



pir :D70386 



Acc# 



D70386 



Description 



1001 



ORF Name 



NT ID 



NT AA , . 

AAID Length Length Probability 



23601701 13 289 



TTTT 



THT 



Protein name 
Description 

im^iTT 



Locus Name 



Acc# 



ORF Name 



NT ID 



NT AA n , , . , . 
, , ™ T — T — Score Probab ility 
AAID Length Length L - 



Z3.&.2.5212L...13....28.CL 



TTW 



WTT 



TZT 



|2.8e-07 



Protein name 



Description 



Locus Name 



Acc# 



AL031866 



Yersinia pestis 102 jcbases unstable region: trom 1 to 119443 . 



NT 



AA 



ORF Name 



NTID 



\116.11±&±...al...Al(). I |T7^ 



AAID Length Length 
¥U71 



423 



TTTT 



Score Probability 




Protein name 

Description 
h'RCk PROTEIN 



Locus Name 



|sp:PfiCft_SCOLI 



Acc# 



P23485 



ORF Name 



NTID 



AAID 



NT AA 

T — ^ T — x.- Score 
Length Length 



| [IFDU 



9022 



7¥T" 



Probability 
|1.0e-27 



Protein name 



Locus Name 



conserved nypotnetical protein ylbK 



pir:H69874 



Acc# 



H69874 



Description 



1002 



NT 



AA 



ORF Name 



NTID 



AAID 



24072712 c3 613 



Length Length 



Score Probability 
OS — 



|2.ie-0S 



Protein name 



Locus Name 



hypothetical protein sJ.10687 



|pir:S744i6 



Acc# 



S74416 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 




Score Probability 
[JT7 



|2.0e-2£> 



Protein name 



Locus Name 



N- ace tylmuramoyi - L- alanine amidase 



bir:G70445 



Acc# 



G70445 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
S3 - 



Score Probability 
WZZ 



|l.3e-lfi 



Protein name 

Description 

RIBO^OMAL PROTEIN S18 (5^21) 



Locus Name 



sp:RSi8_fiAGS'T 



Acc# 



P10806 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 

Ttrzz — 



Score Probability 

m 



1.0e-08 



Protein name 
Description 

SlAL:£t)ASE fftfiCtJftSOft, (NEURAMINIDASE ) 



Locus Name 



sp:NANH_MICVI 



Acc# 
Q02834 



1003 



NT 



AA 



ORF Name 



NT ID 



24615811 ti 10 



AAID Length Length 

mn — 



TTTT 



Score Probability 

fnm — 



1.0e-103 



Protein name 



Locus Name 



ArgE/DapE/Acyl tamiiy protein 



|pir:E75324 



Acc# 



E75324 



Description 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length J ~ 



3805 



[2W 



IS.Se-08 



Protein name 



Locus Name 



hypothetical protein aq_1533 



pir :A70433 



Acc# 



A70433 



Description 



ORF Name 



NTID 



±±6A&aii..±±..±o.i I imrr 



Protein name 



acritlavm resistance protein AcrE 



Description 



NT 



AA 



AAID Length Length 
— 



Score Probability 
l.Oe-lS 



Locus Name 



pir :A703£l 



Acc# 



A70361 



NT 



AA 



ORF Name 



NTID 



AAID 



|246.S.0.2Si7....c.3....£3.4 1 



Protein name 

Description 
BSfO-HlT 



Length Length 



Score Probability 



TTUT 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

T -,u Score Probability 
Length Length 



3809 



|2.6e-59 



Locus Name 



hypothetical protein TM1269 



pir :D72274 



Acc# 



D72274 



Description 



1004 



ORF Name 



24706b7b c2 496 



Protein name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length 



Locus Name 



Acc# 



Description 



MO-HIT 



NT 



AA 



ORF Name 



NTID 



247.U7.2L8.7....G1...3.7.6. 



AAID Length Length 
TUSu 



Score Probability 
i.£e-39 



Protein name 



Description 



Locus Name 



|gp:PflU6020a 



Acc# 



U60208 



Porphyromonas gmgivalis orti, ort2 and or±3 genes, complete cds. 



NT 



AA 



ORF Name 



NTID 



\lA&bAi.&.l..±l...xl$. I praT3 



AAID Length Length 



Score Probability 
O.Odll 



ST 



Protein name 



Locus Name 



sodium cnannel protein 



gpTWU^TIF" 



Acc# 



U26718 



Description 



Drosopnila virilis sodium cnannel protein (para) gene, exonsl, 2 , 3 , 4 , and 
optional segment i, partial cds. 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



TTTTTW 



Protein name 



Locus Name 



hypothetical protein BBA32 



pir:H7u210 



Acc# 



H70210 



Description 



1005 



NT 



AA 



ORF Name 



2535530 12 196 



NTID AAID Length Length 

?U3Z — 



Score Probability 




0.034 



Protein name 



Locus Name 



cellulose syntnase 



Description 



pir:I39714 



Acc# 



139714 



ORF Name 



Protein name 



rprY protein 



Description 



NTID 



2.^3.156.2L...Cl...3..7.L 



TSTF" 



NT 



AA 



AAID Length Length 



Score Probability 



TZTT 



Locus Name 



pir :S33662 



3.2e-123 



Acc# 



S33662 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



2££2ft£afi-.cl...l7.£ I 



\44T 



T75u" 



77T 



5.2e-77 



Protein name 



Locus Name 



sp:YQ2V_£AC£U 



Acc# 



P54462 



Description 

HYPOTHE T ICAL 51.7 KL PROTEIN IN DNAJ-RPaU IMTLikkflBHIC VLUdlbU 



NT 



AA 



ORF Name 



NTID 



AAID 



25£7.A15.7....gI...17.&.. 



Length Length 
1M> 



Score Probability 



5T 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length ^~ 



2.53J3.Q.X2...&X..3.9.3. 



3S18 



4.5e-05 



Protein name 



Locus Name 



sodium- dependent transporter homolog yocS 



pxr :E69902 



ACC# 



E69902 



Description 



1006 



ORF Name 



Protein name 



NT 



AA 



NTID AAID Length Length 

SMI — 



Score Probability 



Locus Name 



Acc# 



Description 



[NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



16.1C)ABAl...al...6Al I 



T5UT 



2.5e-iS7 



Protein name 



Description 



Locus Name 



sp:MUTB_PORGI 



Acc# 



Q59676 



Mfif H VLMAL6&VL - G&A MUTA5E ALPHA- StJBWl? , (MCM-ALMA) 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



Protein name 



hypothetical protein TM12 6 7 



Description 



TTTT 



Locus Name 



pir:fi72274 



|1.7e-ll5 



Acc# 



B72274 



ORF Name 



Protein name 



Description 



NTID 



9044 



NT 



AA 



AAID Length Length 



Score 



TT5TT 



Locus Name 



sp:G6PA_BAC£T 



Probability 
|2.5e-iab 



Acc# 



P13375 



I30MESASE A) 



1007 



ORF Name 



26642912 c2 4b0 



Protein name 



NTID 



WIT 



SMS 



NT 



AA 



AAID Length Length 
133 



Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



TTZT 



1.5e-62 



Locus Name 



sprQUEAJiOOLl 



Acc# 



P21516 



(OUSUOSINE BIOSYNTHESIS f>ftOT EIM QtJEA) 



ORF Name 



Protein name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



lXl6Akl&.±l.A6x I 



T77T 



1242 



2 .le-126 



Protein name 



Locus Name 



|sp:3«K_HAcJaU 



Acc# 



P37477 



Description 

LY5YL-TRNA SYNTHETASE, (LYSINE- -T&NA LIGASE) (LYSRS) 



1008 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length JL 



312^536 i'2 164 



TTTT 



0.0014 



Protein name 



Locus Name 



cytocnrome oxidase I 



lgp:AF072662 



Acc# 



AF072662 



Description 



Exoneurella eremopniia cytochrome oxidase I gene, mitochondriaigene 
encoding mitochondrial protein, partial cds . 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



3l42S54l t3 2^2 



KEY 



5.7e-0£ 



Protein name 
Description 

HYPOTHETICAL 66.3 KT) PROTEIN IN S 'RUOION 



Locus Name 



sp:YHA2_31&L 4 0 



Acc# 



P35649 



NT 



AA 



ORF Name 



NTID 



AAID 



3.1.7.5.5.0.0.0...±1...29... 



3329 



9051 



Length Length 




Score Probability 
|4.0e-109 



1079 



Protein name 
Description 

HYPOTHETICAL PkOTfiltf H1003S 



Locus Name 



sp:YTDE_HAEIN 



Acc# 



P44472 



NT 



AA 



ORF Name 



NTID 



iiftaa5&.7....ci„.a&ft I 



AAID Length Length 
— 



3TT 



Score Probability 
TSB 



3.1e-09 



Protein name 



Locus Name 



probable lipid A biosynthesis acyitransterase 



pir:H7l954 



Acc# 



H71954 



Description 



1009 



ORF Name 



55557175 c2 t>aa 



Protein name 



NT ID 



SETT - 



NT AA 

— — , Score Probability 
AAID Length Length 



Su"S3~ 



Locus Name 



Acc# 



Description 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



3.3.7.8^.&7.7....cl...3.b^.. 



AAID Length Length 



Score Probability 
0.00032 



172 



Protein name 



Locus Name 



MutS-liJce protein 



gp : SATRXA 



Acc# 



AJ223480 



Description 



Staphylococcus aureus trxA and uvrC genes and partial muts and dhscgenes . 



ORF Name 



NTID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



3.3.&5.&$A..c3....h&:L 



JZJT 



0.061$ 



Protein name 



Locus Name 



hypothetical protein C56G2.lb 



pir :T15873 



Acc# 



T15873 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



3834 



$056 



Length Length 
TTT 



Score Probability 
2.5e-13 



TT5 



Protein name 



Locus Name 



probable isomerase 



pir :B70986 



ACC# 



B70986 



Description 



ORF Name 



3.44D.£3.u3...±l...&7. 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 




Score Probability 



Locus Name 



Acc# 



Description 



[NO-HIT 



1010 



NT 



AA 



ORF Name 



NTID 



34411051 t3 ^00 



AAID Length Length 



Score Probability 
133 



5.3e-07 



Protein name 



Description 



Locus Name 



gp:PGU6020b 



Acc# 



U60208 



Porphyromonas gingival is ortl, or £2 and ort3 genes, complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



34417142 cl 414 



nrr 



TOT 



Protein name 



Locus Name 



sp:MUTA_PORGl 



ACC# 



Q59677 



Description 

M ET HYLMALONYL - COA MUTA5E BETA-SUBUNIT, (MOB-BETA) 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
72 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



TUT 



i.4e-26 



Protein name 



Locus Name 



hypothetical protein PAB0910 



pxr :B75048 



Acc# 



B75048 



Description 



1011 



ORF Name 



3840 




9062 




912 


2736 




506 





5.4e-56 



Protein name 



Locus Name 



Acc# 



115K outer memJorane protein precursor : SusC 
protein 



|pir:JC6(«7 



JC6027 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID 



— Score Probability 
Length Length 



EST 



lO.OOOSi 



Protein name 



Locus Name 



hypothetical protein SCE3 9.3 0 



pir :T36240 



Acc# 



T36240 



Description 



ORF Name 



NTID 



AAID 



Protein name 



hypothetical protein cOllS 



Description 



NT AA 

— , — , Score Probabi lity 
Length Length 



TTTT 



Locus Name 



pir :S74051 



|4.&e-l0 



Acc# 



S74051 



ORF Name 



NTID 



iD.6.8.7.:z:L..c2...46.i I 133*3 



Protein name 



Description 



AAID 



NT 



AA 



Length Length 
T7T7 



Score Probability 
6.8e-^4 



mi 



Locus Name 



spTEHTjTHETH" 



Acc# 



P13551 



ELONGATION FACTOR G (EF-G) 



1012 



ORF Name 



NT ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



14078910 c3 614 



TTTT 



8.8e-60 



Protein name 



Locus Name 



receptor antigen (RagA) 



Acc# 



AJ130872 



Description 



Porphyromonas gmgivalis W50 receptor antigen (rag) locus encodmga ma^or 
immunodominant 55kDa antigen. 



ORF Name 



41562 c3 571 



Protein name 



NT ID 



3845 



NT 



AA 



AAID Length Length 
PS- 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length 



TS4S" 



Locus Name 



pr ol me/pyrroi me - 5 - carboxylate dehydrogenase I bir :B71^80 



Description 



8.6e-22 



Acc# 



B71980 



ORF Name 



NTID 



AAID 



43.2M2Sl..±3....26.5. .......J PM7 



Protein name 



NT 



AA 



Length Length 
213" 



Score Probability 
8.4e-22 



Locus Name 



sp:YFfif_SCOLl 



Description 

HYPOTHE T ICAL ^.7 KB PROTEIN IN LkHA-ACKA INTiiik aJ&UsJIC kUciluM 



Acc# 



P77625 



1013 



ORF Name 



NT ID 



14378530 tl 21 



Protein name 



AAID 



MTU" 



probable glycosyl Hydrolase 



Description 



NT AA 

— , — , Score Probab ility 
Length Length 



1353" 



Locus Name 



(pir:T36467 



2.2e-44 



Acc# 



T36467 



ORF Name 



NT ID 



NT AA 

— — , Score Probability 
AAID Length Length 



TTT 



0.0057 



Protein name 



Locus Name 



Acc# 



putative outer surtace protein 



gp:BBU«0<>60 



Description 



Borrelia burgdorferi strain CAI2 putative outer membrane protein (ospE) 
gene, complete cds and putative outer surface protein (ospF)gene, partial 
cds . 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



4,5.5l16.3.5l..X1...1QZ 


3850 


9072 


1285 


3858 1850 















i.7e-192 



Protein name 



Locus Name 



czrA protein 



gp : PACZR 



Acc# 



Y14018 



Description 

Pseudomonas aeruginosa czrR, czrC, czrB, czrA genes, ORF5 andpartial ORF6 . 



ORF Name 



NT ID 



NT AA 

— — , Score Probability 
AAID Length Length 



4&aaai(L±A...m 



T71T 



3.4e-78 



Protein name 



Locus Name 



ribonucleoside-dipnosphate reductase, large 
chain nrd 



pir:U694b7 



Acc# 



G69457 



Description 



1014 



NT 



AA 



ORF Name 



NT ID 



1470467b i2 ifey 



AAID Length Length 



Protein name 



4-alpna-g±ucanotransterase nomolog T20B5.4 



Description 



Score Probability 
|2.5e-145 



Locus Name 



pir :TUUV48 



ACC# 



T00748 



ORF Name 



NTID 



AAID 



NT AA 
— — , Score 
Length Length 



uss — 1 



ITST 



Probability 
|i.2e-ib 



Protein name 



Description 



Locus Name 



sp:FOLlijAA(j£iU 



Acc# 



P28823 



ORF Name 



NTID 



AAID 



NT AA 

— — , Score Pr obability 
Length Length 



[7TT 



Protein name 



Description 



Locus Name 



|sp:T0P3_HAElN 



Acc# 



P43704 



DNA T0P01S0MERA&L! 111, 



NT 



AA 



ORF Name 



NTID 



AAID 



mrr 



Length Length 
TTS5 



355" 



Score Probability 
|2.$e-5S 



5^ 



Protein name 



Locus Name 



coproporphyrmogen oxidase, III , 
oxygen- independent hemN 



pir :B6y64U 



ACC# 



B69640 



Description 



ORF Name 



NTID 



4S92261 c2 466 



3S56 



Protein name 



— , — , Score Probability 
AAID Length Length 



ribosomal protein L09 



Description 



TTT 



Locus Name 



pir:B7047!> 



i.7e-25 



Acc# 



B70475 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
TuTO — 



SIT" 



Score Probability 
1.7e-37 



WU1 



Protein name 
Description 

DEPEND E NT D I HYDROXYACETON E - PHOSPHATE REDUCTASE!) 



Locus Name 



sp:GPDA_MAcJfcJU 



Acc# 



P46919 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



\±9.1B.6.2B...±1...9±..... 



T2F" 



TST" 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



5.117..7.5.3....Gl...3.8.a., 



Length Length 




Score Probability 
3.4e-25 



£F7 



Protein name 



Locus Name 



probable reductase APE 1044 



bir:E727M 



Acc# 



E72703 



Description 



1016 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length 



T7T 



3.4e-09 



Protein name 



Locus Name 



unitnown 



|gp:APQ55VTO" 



Acc# 



AF095748 



Description 



Burkholderia cepacia principal sigma tactor (sigA) , pntnaiatedioxygenase 
reductase (ophAl) , putative phthalate permeaseN- terminal region, putative 
phathalate permease C-terminal region (ophD) , 4 , 5-dihydroxyphthalate 
decarboxylase (ophC) , phthalate- inducible quinolinate phosphoribosyl 
transferase (ophE) , transposase (trip), phthalate dihvdrodiol dehyd rogenase 



ORF Name 



Protein name 



NT ID 



]S&£&5±2....q±..SZ± J CJ^T 



AAID 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



NT ID 



AAID 



LS.031717....C1..3.7.U J M 



Protein name 



translation elongation tactor o 



Description 



NT 



AA 



Length Length 
T5T 



Score Probability 
OF^ 



T2T 



Locus Name 



|pir:H7^227 



Acc# 



H72227 



NT 



AA 



ORF Name 



NTID 



AAID 



6JL3.6.6.35....C.3....5.6X. 



Length Length 



Score 



TF5T" 



Probability 
\T7Te-772 



Protein name 



Locus Name 



RprX 



IgpT^uW 



Acc# 



S59000 



Description 



NT 



AA 



ORF Name 



NT ID 



1625087 ti b± 



AAID Length Length 



Score Probability 
S.^e-78 



Protein name 



Locus Name 



sp:DNAA_EAC^U 



Acc# 



P05648 



Description 

CHROMOSOMAL REPLICATION INITIATOR PROTEIN DNaA 



NT 



AA 



ORF Name 



NTID 



6447131 tl 44 



AAID Length Length 
I22T 



M7 



Score Probability 
l.le-65 



Protein name 

Description 
URACIL-1>NA (^LYCO^LAi^ i>kU0i3 RS0R, (Ulxj) 



Locus Name 



:sp:tM5JlUMAls] 



Acc# 



P13051 



NT 



AA 



ORF Name 



NTID 



6ASAM.l...a2..All I mZZ 



AAID Length Length 



Score Probability 
!4.4e-32 



Protein name 



Locus Name 



methylgiyoxal synthase 



pir :G72^&4 



Acc# 



G72284 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
TW 



Score Probability 
2.0e-l3 



F7S 



Protein name 



Locus Name 



probable RNA polymerase sigma-24 tactor 
(rpoE) 



pir :E7±36& 



Acc# 



E71368 



Description 



1018 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



TTT 



Ii.8e-i2 



Protein name 

Description 
HYPOTHETICAL ££OTE!ltf MJ0778 



Locus Name 



sp:Y778_Mli!TJA 



Acc# 



Q58188 



— — Score Probability 
AAID Length Length 



AA 



ORF Name 



NTID 



1554787 tl 43 



11128 



:1.6e-82 



Protein name 



Description 



Locus Name 



sprAStfAJlAlilrt 



Acc# 



P44338 



ASPARTATE - -AMMONIA LIO^, ( A5PARAG1NE SYM'HMTA&jij A) 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probabil ity 
Length Length 



\TTT 



10.047 



Protein name 



Description 



Locus Name 



IsprYSC^JIAUlN 



Acc# 

P44053 



HY&OTHfil'ICAL tROTEitf HI0804 



NT 



AA 



ORF Name 



NTID 



11.7.2Q.Q&.3....C3....9.6.., 



AAID Length Length 
1560 



Score Probability 
2.0e-68 



Protein name 



Locus Name 



alpha galactosidase precursor 



gp:AF06i:i:il 



Acc# 



AF061331 



Description 



Saccharopolyspora erytnraea alpna galactosidase precursor (melA) gene, 
complete cds . 



1019 



NT 



AA 



ORF Name 



NT ID 



12688787 tJ bo 



AAID Length Length 
5TJ3 



Score 



BT5" 



Probability 
|1.4e-17 



Protein name 



Locus Name 



cytidme deaminase 



bp:BCA237S7a 



Acc# 



AJ237979 



Description 



Bacillus caldoiyticus cdd gene tor cytidine deaminase. 



NT 



AA 



ORF Name 



NTID 



112773337 ci 77 



AAID Length Length 
25" 



Score Probability 
l.le-06 



142 



Protein name 



Locus Name 



conserved Hypothetical protein yJcnz 



pir :E69858 



Acc# 



E69858 



Description 



NT 



AA 



ORF Name 



NTID 



\12A±5.21:L±1 211 5.1 1 



AAID Length Length 
— 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 
|2.8e-37 



TUT 



Protein name 



Locus Name 



unknown 



gp:Af083252 



Acc# 



AF083252 



Description 



Pseudomonas aeruginosa enoyi-CoA nydratase gene, partial cds; 
pilinbiosynthetic protein (fimL) gene, complete cds; and unknown gene. 



1020 



ORF Name 



NTID 



— — , Score Probability 
AAID Length Length 



121665630 f2 34 



3876 



30S8 



7TT 



3.0e-42 



Protein name 



Description 



Locus Name 



sp:YK<3B_MAIi!lN 



Acc# 



P44577 



ttXPOTti&VlCtiL £R0TElN mo2id 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



22144041 i J 2 33 



9099 



2FT 



PIT 



1. 8e-19 



Protein name 



Locus Name 



PobR protein 



gp:£>£'U25l'7^2 



Acc# 



AJ251792 



Description 



Pseudomonas putida pobk gene tor pojdk protein ana pooA gene torPoJDA 
protein . 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



3878 



478 



TZTT 



TTT 



0. 000^6 



Protein name 



Locus Name 



unknown 



|gp:U96771 



Acc# 



U96771 



Description 



£>revotella bryantn putative polygalacturonase, B-i, 4- enaogiucanase, ana 
mannanase genes, complete cds; and unknowngenes . 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
T4U4 



Score Probability 

wm — 



1.4e-97 



Protein name 



Description 



Locus Name 



Acc# 



P77212 



INTERflEMIC REGION 



1021 



ORF Name 



NT ID 



AAID 



NT AA 
— — score 
Length Length 



12409531*7 c^ 106 



1ST 



WIT 



Probability 
3.2e-61 



Protein name 



Locus Name 



hemagglutinin 



gp:AF01V4I7 



Acc# 



AF017417 



Description 



£>revotella intermedia hemagglutinin (phg) gene, complete cds . 



NT 



AA 



ORF Name 



NTID 



246426S7 cl « 



AAID Length Length 



Score Probability 
Oe^l 



Protein name 



Locus Name 



sp:f>eaS_(JL0Pl!l 



Acc# 



P04194 



Description 

H I 5TID1LNE DECAkijOX^LAS E l^kOENZVMK PkUC URSOk, (PI CHAIN) 



NT 



AA 



ORF Name 



NTID 



|M7.mi7....cJ....y.b. 



AAID Length Length 



Score Probability 



T7T~ 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



a5a2aaa.-.cA.„ifta i 



AAID Length Length 



Score Probability 
I4.2e-ii 



Protein name 



Locus Name 



YvrN protein 



bp:BS43KB]MA 



Acc# 



AJ223978 



Description 

Bacillus subtilis 4 2.7JcB DNA tragment trom yvsA to yvqA. 



1022 



ORF Name 



NTID 



Protein name 



hypothetical protein aq_2 94 



Description 



NT 



AA 



AAID Length Length 



Score Probability 
|i.7e-ii — 



Locus Name 



Acc# 



H70326 



ORF Name 



NTID 



AAID 



NT AA „ ^ 
— — Score Pro bability 
Length Length 



T5W 



[779 



|2.5e-77 



Protein name 



Locus Name 



putative ABC transporter ATP-JDinding protein I |gp : SCF56 



Acc# 



AL133424 



Description 



Streptomyces coelicolor cosmia F56 . 



ORF Name 



Protein name 



NTID 



AAID 



sots" 



NT AA 

— — Score Pro bability 
Length Length 



TIT 



Locus Name 



Acc# 



Description 



ORF Name 



Protein name 



NTID 



NT AA 

— — Score Pro bability 
AAID Length Length 



IT 



0.020 



Locus Name 



0£F MSV147 hypothetical protein 



|gp:AF06S866 



Acc# 



AF063866 



Description 

Melanoplus sanguinipes entomopoxvirus, complete genome. 



1023 



ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


675677_c3_9:i 


3888 


9110 




134 


585 




159 




3 .oe-iu 



Protein name 



Locus Name 



receptor antigen (RagA) 



gp:PGI130872 



Acc# 



AJ130872 



Description 



£>orphyromonas gmgivaiis W50 receptor antigen (rag) locus encodinga major 
immunodominant 55kDa antigen. 



ORF Name 



975050 cl 78 



Protein name 



NT ID 



3889 



9111 



NT 



AA 



AAID Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



iiaii24...ci...&a 



Protein name 



NTID 



NT AA 

— — , Score Prob ability 
AAID Length Length 



T5F" 



8.0e-23 



Locus Name 



encLo-JDeta-galactosidase 



gp:AF083896 



Acc# 



AF083896 



Description 

Flavobactenum keratolyticus enao-JDeta-galactosidase gene, compietecas . 



NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


3891 


3113 


71 


216 


230 


3 .7e-19 



ORF Name 



|lia7.51A.7....cl...lftA I |3ff5T 



Protein name 



Locus Name 



rubreaoxin 



Acc# 



H72348 



Description 



1024 



ORF Name 



11435420a iyb 



Protein name 



NT ID 



NT AA „ 

— — , Score Proba bility 
AAID Length Length 



WTT 



TUT 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



lS.7.126.a.I..±2....b.y... 



Protein name 



NT ID 



AAID 



NT 



AA 



Length Length 




Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NT ID 



AAID 



NT 



AA 



Length Length 

— 



Score Probability 



7TT 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



Description 



NT ID 



NT AA 

— — Score Pr obability 
AAID Length Length 



4.3e-10 



Locus Name 



gp:D782bV 



ACC# 



D78257 



Enterococcus taecaiis plasmid pYI17 genes tor BacA, BacB, ORF3,ORF4, ORFb, 
ORF6, ORF7, ORF8 , ORF9, ORF10, ORF11 , partial cds . 



1025 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



3896 



ITT" 



2.4e-0S 



Protein name 



Locus Name 



unknown 



gp:AF11646 6 



ACC# 



AF116463 



Description 



Streptomyces iincolnensis putative regulatory protein WdlA lwcL±A;gene, 
complete cds; and unknown gene. 



ORF Name 



2l7S55Si> i'l 67 



Protein name 



NTID 



3897 



NT 



AA 



AAID Length Length 




Score Probability 



Locus Name 



Acc# 



Description 



IN0-H1T 



ORF Name 



Protein name 



NTID 



NT AA 

— — Score Pro bability 
AAID Length Length 



nypotnetical protein slrl534 



Description 



Fir- 



Locus Name 



pir:S75^Sb 



5.§e-26 



Acc# 



S75855 



ORF Name 



NTID 



AAID 



Protein name 



NT 



AA 



Length Length 
T5S 



Score Probability 



Locus Name 



Acc# 



Description 



1026 



ORF Name 



2344605b cl lib 



Protein name 



NTID 



1W 



AAID 



9122 



NT 



AA 



Length Length 



Score Probability 



TZ3T 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 
T51T — 



Score Probability 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



NTID 



NT AA 

— — Score Proba bility 
AAID Length Length 



567 



Locus Name 



pho spho r 1 Jao sy 1 ammo lmiaazole c ar boxyi a s e 
(pure) PAB1077 



Description 



bir:B7b0ii 



5.8e-46 



Acc# 



B75013 



ORF Name 



\2AA0:ib.m..±l..A 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 



Score Probability 



4 74 



Locus Name 



Acc# 



Description 



NO-HIT 



1027 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



244093^3 ti 31 



TUT 



|i.4e-id 



Protein name 



Locus Name 



Acc# 



hypothetical 23. 5K protein (ginA-tanE 
intergenic region) : hypothetical protein o2 06 



pir :S4082y 



Description 



ORF Name 



NTID 



NT AA 

— — Score P robability 
AAID Length Length 



ii6±63±.±±..±& I mus 



TTZT 



TFT 



Protein name 



Locus Name 



hypothetical protein 12c 3«c!b.l2c 



Description 



pir :T3b4BJ 



1.6e-13 



Acc# 



T35483 



ORF Name 



NTID 



Protexn name 



NT 



AA 



AAID Length Length 



Score Probability 



Locus Name 



4.3e-12l 



Acc# 



uridine Kmase-reiated protein 



Description 



pir :£72341 



B72341 



ORF Name 



Protein name 



NTID 



TvFT AA 

— — Score Probabi lity 
AAID Length Length 



TUUT 



Locus Name 



i.3e-50 



Acc# 



riboflavin Kinase 



Description 



bir:D703l3 



D70313 



ORF Name 



NTID 



as&ftma^ti^i 



muz- 



Protein name 



nypotnetical protein 
Description 



NT AA „ ^ , , . ., . , 

— Score Probability 



AAID Length Length 
^T7T5 



IFF 



Locus Name 



pir :F72424 



1.2e-27 



ACC# 
F72424 



1028 



ORF Name 



NT ID 



NT AA 

— — Score Proba bility 
AAID Length Length 



2$&6±2h± ci lit 



ll.ie-15 



Protein name 



Locus Name 



sensor nistidine Kinase 



Acc# 



A72383 



Description 



ORF Name 



NT ID 



NT AA 

— — Score Prob ability 
AAID Length Length 



3310 



WTJT 



|2.4e-54 



Protein name 



Description 



Locus Name 



sp:ATCiJJl<Jbl 



Acc# 



P54678 



OA T ION - TRANS PORTING ATPA^ii! PAT1, 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 




Score Probability 
3.1e-54 



Protein name 



Locus Name 



sensory transduction nistidine Kinase 
slr2104 :protein slr2104 rprotein slr2104 



ACC# 



S75136 



Description 



ORF Name 



3.16..7.41E8...±L.i:L 



Protein name 



NT ID 



AAID 



Ttt%~ 



NT 



AA 



Length Length 



Score Probability 



\T7T 



Locus Name 



Acc# 



Description 



NO-HIT 



1029 



ORF Name 



NTID 



NT AA 
— — , Score 
AAID Length Length 



bit 



Probability 
l.le-bO 



Protein name 



Locus Name 



aminopeptidase 



AF04103i 



Acc# 



AF041033 



Description 



Shigella rlexneri aminopeptiaase tpepP) gene, complete cas. 



ORF Name 



NTID 



AAID 



NT AA 
— , — 1 S core 
Length Length 



Probability 
|2.4e-47 



Protein name 



Description 



Locus Name 



sp:ftf>54_A<JltJA | 



Acc# 



P33983 



RNA POLYM E RASE SI^ MA-54 FACTOk 



ORF Name 



NTID 



NT AA 
— — , S core 
AAID Length Length 



mftaai7„.±3L.„a5 



Probability 
|4.5e-23 



Protein name 



Locus Name 



transcription regulator nomolog yozci 



pir:C69<m 



Acc# 



C69931 



Description 



ORF Name 



Protein name 



gcpe protein 



Description 



NTID 



9138 



NT 



AA 



AAID Length Length 
— 



Score Probability 
5.0e-65 



Locus Name 



toir:E7U0tt7 



Acc# 



E72087 



1030 



NT 



AA 



ORF Name 



NTID 



JWTT 



AAID Length Length 




Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



[NO-HIT 



ORF Name 



NTID 



— — , Score Prob ability 
AAID Length Length 



iaEL4412!3LL.cl„.aa 



\T5V i fTTTTT 



|2.6e-i2 



Protein name 



Locus Name 



Acc# 



sp:QPCT_MUMAN 



Description 

( GLUTAMIC YL-TftrtA ciyCLQfRAMSif^fiASS) (GrL tJTAMl^VL CYCLASE) 



ORF Name 



NTID 



NT AA „ 

— — , Score Probability 
AAID Length Length 



Ilfi4111il.±l...lfil I 



19141 



3 



|2.7e-S4 



Protein name 



Locus Name 



calcium motive P-type ATPase 



gp:AFl45282 



Acc# 



AF145282 



Description 



Trichomonas vaginalis calcium motive P-type ATPase (CA-2) gene, partial cas . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
ITS 



145 



Score Probability 
|2.2e-2I 



Protein name 



Locus Name 



sp:YEiy_SYNYi 



ACC# 



P74523 



Description 

HYPOTHETICAL 17.7 KD PROTEIN &L£l4l9 



1031 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



TTTT 



T5T 



JJT 



Protein name 



Locus Name 



sp:GC^H_>!c 4 oLl 



Acc# 



P23884 



Description 
GLYCINE! OLElAVA^ SYSTEM PROTKIJJ 



ORF Name 



4415705 rl 14 



Protein name 



NTID 



3922 



AAID 



9144 



NT 



AA 



Length Length 
T57T 



Score Probability 



54T 



Locus Name 



Acc# 



Description 



KO-HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



5HT 



2.0e-ll 



Locus Name 



Acc# 



sp:V61A_M3BTdA I P81310 



HYPO T HETICAL ^koTUIN MJOfell.l 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



230 



|4.6e-J6 



Protein name 



Locus Name 



nypotneticai protein gcpK 



tpir:E7ibb2 



Acc# 



E71562 



Description 



1032 



ORF Name 



5782828 t2 b'2 



Protein name 



NTID 



NT 



AA 



AAID Length Length 
¥T8 



Score Probability 



T¥5~ 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



Locus Name 



conserved hypotnetical integral memorane 
protein TP0771 



Description 



k>ir:H7ili«J 



|4.3e-7^ 



Acc# 



H71283 



ORF Name 



Protein name 



Description 



NTID 



AAID 



— — Score Probability 
Length Length 



TWIT 



55T" 



T55T 



i.2e-27 



Locus Name 



Acc# 



sp:VicJl_EcJoLl 



HYPOTHEC J CAL tis.l H) PkOfKlN 1st <SLTs-Se!LC iN'l'^RGENic RhimuJN 



ORF Name 



NTID 



— — Score Pr obability 
AAID Length Length 



3T5u~ 



TUB" 



ITT 



Protein name 



Locus Name 



Acc# 



Description 



1033 



NT 



AA 



ORF Name 



NTID 



10188427 2bb 



AAID Length Length 
BT5 



IT 



Score Probability 
8.7e-0b 



TH3 



Protein name 



Locus Name 



transposase 



[gp:AF038^^~ 



Acc# 



AF038866 



Description 



Sacteroides tra gilis transposon Tnbb2U transposase tbipH) andmo^iiization 
protein BmpH (bmpH) genes, complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



tl 12 



3930 



Length Length 
1395 



4^4" 



Score Probability 
1.5e-75 



SOT 



Protein name 



Description 



Locus Name 



ACC# 



P74409 



HYPOTHETICAL 49.2 KB iPkoTKIN SLL026O 



ORF Name 



NTID 



— — Score Probability 
AAID Length Length 



13931 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



10..7.l9J.B,:A..±i....3.i.U. 



9154 



Length Length 
TSTu — 



^5" 



Score Probability 
2.1e-20 



2^3 



Protein name 



Locus Name 



transposase 



gp:AF0388£>6 



Acc# 



AF038866 



Description 



Bacteroides trag ilis transposon Tn5520 transposase IJoipH) andmoJoiiization 
protein BmpH (bmpH) genes, complete cds. 



1034 



ORF Name 



NT ID 



NT AA 

— — Score Probabil ity 
AAID Length Length 



ci 41b 



TOT 



7TT 



2.5e-ii0 



Protein name 



Locus Name 



alpha-glucosiaase 



bp:BTU66B97 



Acc# 



U66897 



Description 



feacteroides thetaiotaomicron neopuilulanase (susaj anaaipna-glucosiaase 
(susB) genes, complete cds . 



ORF Name 



NTID 



10S7567S rl 20 



Protein name 



probable purine NTPase paboziz 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



TIT" 



Locus Name 



pir :F75103 



|2.4e-09 



Acc# 



F75103 



ORF Name 



Protein name 



NTID 



15157 



NT 



AA 



AAID Length Length 



Score Probability 



7T" 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



51BS 



NT 



AA 



Length Length 
STQ 



Score Probability 



TUT 



Locus Name 



Acc# 



Description 



1035 



NT 



AA 



ORF Name 



NT ID 



1205436 ±2 'All 



AAID Length Length 
3?T 



TT7F" 



Score Probability 
5.0e-47 



mi 



Protein name 



Locus Name 



xmmunoreactxve 42JcD antigen PG33 



gprAFiVSVlb 



Acc# 



AF175715 



Description 



£>orphyromonas gingivaiis straxn W50 immunoreactive 42kD antigenPG33 gene, 
complete cds. 



ORF Name 



NT ID 



123163S2 c2 SS3 



Protein name 



O-acetyinomoserxne suitnydryiase 



Description 



NT 



AA 



AAID Length Length 



Score 



IFF 



Probability 
|2.5e-43 



Locus Name 



bir:D72^4 



Acc# 



D72324 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— — Score Pr obability 
Length Length 



PIT 



T3T 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



SIT 



Locus Name 



Acc# 



Description 



ttfO-HtT 



1036 



ORF Name 



NT ID 



NT AA 

— — Score Probability 
AAID Length Length 



112650527 cl bbb 



cm" 



[2W 



Protein name 



Locus Name 



Acc# 



hypothetical protein F19D11 . 16 : hypothetical 
protein F14M4 . 29 : hypothetical protein F14M4.29 



pir :T026«y 



Description 



ORF Name 



NT ID 



AAID 



i2amtii.-ci..Aii.- I 



Protein name 



O-acetylhomosenne suirhydryiase 



Description 



NT 



AA 



Length Length 



\TTT 



Score Probability 
1755 



Locus Name 



bir:D72;^4 



|4.8e-74 



Acc# 
D72324 



ORF Name 



Protein name 



Description 



NT 



AA 



NT ID 



AAID 



Length Length 




Score 



S4 



Locus Name 



Probability 



Acc# 



NO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID 



Length Length 
T53 



[5TF 



Score Probability 
1. de-OS 



Locus Name 



hypothetical protein PH1147 



| |pir:E710b6"~ 



Acc# 



E71056 



Description 



1037 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AMD Length Length 



14156287 13 'Zbb 



445 



TTJT 



168 



14 . 4e-oy 



Protein name 



Description 



Locus Name 



gp:K5U60iiOtt 



Acc# 



U60208 



f>orphyromonas gingivalis orti, ort2 and ort3 genes, complete cas. 



ORF Name 



NTID 



14463437 C3 bV2 



9168 



Protein name 



nypotneticai protein ycgf 



Description 



— — Score Probability 



AAID Length Length 



705 



5.6e-l6 



Locus Name 



pir:A697ba 



ACC# 



A69758 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



5947 



[TUT 



TT 



0.013 



Protein name 



Locus Name 



NADH ctenydrogenase subunit 'l 



bp:AFl<>0t*b4 



Acc# 
AF160864 



Description 



Tetrahymena pyritormis mitocnondriai DNA, complete genome, 



NT 



AA 



ORF Name 



NTID AAID Length Length 



Score Probability 



T245" 



$170 



321 



Protein name 



Locus Name 



Acc# 



Description 



MO-HIT 



1038 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



[3W 



|1.2e-33 



Protein name 



Description 



Locus Name 



|sp:VHCS_KC0Ll 



ACC# 
P45423 



HYPOTHETICAL 45.3 KB PROTElU IN <3LTJ? '-yrANT iKCrfifeGEilSft^ kEGIQN Wfb) 



NT 



AA 



ORF Name 



NTID 



AAID 



c2 bib 



Length Length 
Su7 



— Score Probability 
1. le-3b 



Protein name 



Locus Name 



transposase 



|gp:A£M8866 



Acc# 



AF038866 



Description 



Bacteroides tragi lis transposon TnSbau transposase IbipHj ancLmoaiiizatiion 
protein BmpH (bmpH) genes, complete cds . 



ORF Name 



Protein name 



NTID 



lS.22ai42...c2...b.O^. I 



TT7J 



— — Score Probability 



AAID Length Length 



7B" 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



Description 



NTID 



AAID 



"NTT AA 

— — Score Probability 
Length Length 



I33T" 



|3.4e-16 



Locus Name 



sp:XYNCjJAL^A 



Acc# 



P23553 



ACETYL SSTElkASE, — (ACElTYLXYLOSlDASfi) 



1039 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID Length Length 
HT5 



Score Probability 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



Protein name 



NTID 



wrnr 



nypotfretical protein 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



TTZT 



|2.3e-i77 



Locus Name 



brr:J0102U 



Acc# 
JQ1020 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



WTTT 



|6.9e-240 



Protein name 



Description 



Locus Name 



sp : PFL_CL0M 



Acc# 



Q46266 



FORMATS ACElfrLl'KANSgBftASii, (PYRUVATE Vo tmXti-DtAtitt) 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
TTTT 



Score Probability 
5.9e-55 



557 



Protein name 



Locus Name 



115K outer membrane protein precursor : Susc 
protein 



pir : 



Acc# 



JC602 7 



Description 



1040 



ORF Name 


NT ID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


1961720^_c3_559 


3957 


9179 68 207 




Protein name 








Locus Name 


Acc# 


Description 












NO-HIT 










1 


ORF Name 


IN 1 _LJJ 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


19.7.uk2b.&..xi2..A4I 




9180 


76 


231 143 


4 . Oe-uy 


Protein name 








Locus Name 


Acc# 


transposase 


gp:AF03^66 


AF038866 


Description 




Bacteroxdes rragii 
protein BmpH (bmpH) 


is transposon Tn552Q 
genes, complete cds 


transposase (brpH) anamoDiiizauion 




ORF Name 


NT ID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


ift&iaii...ci...aia 




9l8l 


79 






Protein name 








Locus Name 


Acc# 


Description 












NO-HIT 










i 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


2LaaiaflLia...c2L„.S3Li 


3950 




130 


393 




Protein name 








Locus Name 


Acc# 


Description 













BTO-HIT 



1041 



ORF Name 



NT ID 



NT AA 

— — Score Probability 
AAID Length Length 



20213304 c2 biy 



7.1e-i6 



Protein name 



Locus Name 



ATP - dependent activating enzyme 



Acc# 



Y09356 



Description 

Pseudomonas rluorescens tJDSC, tbsE, ejdsa and rt>sB genes. 



ORF Name 



2031290 tl 2b 



Protein name 



NT ID 



NT 



AA 



AAID Length Length 
BT3 



Score Probability 



7TT 



Locus Name 



Acc# 



Description 



NO-HIT 



NT 



AA 



ORF Name 



NT ID 



:mmi3ZicZ3Zi: 



AAID Length Length 




^4" 



Score Probability 
3.0e-l0 



Protein name 



Locus Name 



transmembrane sensor 



gp:At l 0bl6yi 



Acc# 



AF051691 



Description 



Pseudomonas aeruginosa stress tactor A ipstA) , sigma ractoritiui), 

transmembrane sensor (fiuR) , and hydroxamate- typef errisiderophore receptor 
(fiuA) genes, complete cds . 



ORF Name 



NT ID 



AAID 



— — Score Probability 
Length Length 



i.2e-7<* 



Protein name 



Description 



Locus Name 



sp:MUS2_M(jyU 



Acc# 



P94545 



MUTS2 PROTfilKf 



1042 



NT AA _ _ 
— — Score Probability 
ORF Name NT ID AAID Length Length 


20801930_c2_S0b 


3965 9187 512 : 


L539 467 i.ye-bl 


Protein name 


Locus Name Acc# 


amidophospnori£>osyl trans t erase 


pir:H69185 H69185 


Description 


— — Score Probability 
ORF Name NTID AAID Length Length 


mS2.71.tl...Ui 


|3966 9188 526 


1581 2616 b.^e-zfz 


Protein name 


Locus Name Acc# 


alky! hydroperoxide reductase summit F 


gp:AFI29406 AF129406 



Description 



Bacteroides tragilis alkyi hydroperoxide reductase suiDunit c (anpC) and 
alkyl hydroperoxide reductase subunit F (ahpF) genes, completecds . 





ORF Name 


NTID 


AAID 


NT 
Length 




AA 
Length 


Score 


Probability 




214^.2:L±l...li 


3967 


9189 


282 


849 


410 


3.1e-38 




Protein name 










Locus Name 


Acc# 


transcription regulator yggG 


pir:G6b078 




Description 
















ORF Name 


NTID 


AAID 


NT 
Length 




AA 
Length 


Score 


Probability 


2±6AZ£&£.±2..±±4. 


3968 


9190 


874 


2625 


93 


0.0019 


















Protein name 










Locus Name 


Acc# 


nypotnetical protein 62228 


pir :B64993 


B64993 



Description 



1043 



ORF Name 


NT ID 


7V 7\ Tfl 
AA1D 


NT 
ijengcn 


AA 


Score 


Probability 


2172292S_ci>_4y6 


3969 


9191 1 










Protein name 








Locus 


Name 


Acc# 


Description 
















NO -HIT 


















ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


2.ia.7.B.ab....c2...S0.7. 


3970 


9192 




195 


75 


" 0.019 


Protein name 








Locus 


Name 


Acc# 



putative transmembrane protein 



U96107 



Description 



Staphylococcus c arnosus , 3^lO-metnyienetetranydrometnanopterinreQuctase 
homolog, SceB precursor (sceB) and putative transmembraneprotein genes, 
complete cds, and putative Na+/H+ antiporter NhaC (nhaC) gene, partial cds . 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



^0.440.1..±1...10.i... 



3971 



19193 



TIT 



WW 



Score Probability 
TTTME 



WT 



Protein name 



Description 



Locus Name 



sp:PkIM_LKMO 



Acc# 



P47762 



DNA PRIMAL ill, 



NT 



AA 



ORF Name 



NT ID 



AAID 



TWTT 



Length Length 
TH 



Score Probability 



Protein name 



Locus Name 



Acc# 



Description 



[NO-HIT 



1044 



ORF Name 



Protein name 



NTID 



TT7T 



AAID 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



— — Score P robability 
Length Length 



IT 



Locus Name 



Acc# 



Description 



MO -HIT 



ORF Name 



Protein name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



FT 



252 



Locus Name 



Acc# 



Description 



sp:S&kC_XENLA | 



P36378 



( OS T EONECTIN) (ON) (RASlMfWT MBMbkAMK PROTEIN BM-4U) 



ORF Name 



NTID 



NT AA 

— — Score Probabil ity 
AAID Length Length 



iiaaiaaii...ci™44i | 



9198 



Protein name 



Locus Name 



Acc# 



Description 



MO-HIT 



1045 



NT 



AA 



ORF Name 



NT ID 



123445317 cl 4^8 



JTTT 



AAID Length Length 



Score Probability 
3.Se-71 



TIT 



Protein name 



Locus Name 



conserved nypotnetical protein BB0682 



Description 



pir :A7018b 



Acc# 



A70185 



NT 



AA 



ORF Name 



NT ID 



AAID 



Length Length 



TT77T 



Score Probability 
|1.3e-48 



5TO 



Protein name 



Description 



Locus Name 



gp:AUUU47 



ACC# 



A00047 



E.coli mor gene. 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



TTT 



3.4e-06 



Protein name 



Locus Name 



AmpG- signal transducer 



bpiECAMiXli 



Acc# 



X82159 



Description 



E.coli ampG3 gene. 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



TUT 



0.0025 



Protein name 



Locus Name 



hypothetical protein A2 08R 



pir :Ti76ya 



Acc# 



T17698 



Description 



1046 



ORF Name 



NT ID 



AAID 



NT AA 

— — Score Pro bability 
Length Length 



TWT 



TTF 



Protein name 



Description 



Locus Name 



sprRFSJ^OLl 



Acc# 
P33998 



frBPJIOE CHAIN Rl^ASE! FACTOR 3 ( RF - i ) 



ORF Name 



23345263 ±3 27b 



Protein name 



NT ID 



TWT 



NT 



AA 



AAID Length Length 
774 



Score Probability 



ET57" 



Locus Name 



Acc# 



Description 



ORF Name 



Protein name 



Description 



NT ID 



NT AA 

— — Score Pr obability 
AAID Length Length 



TWT 



TTT 



T^4" 



1298 



|2.5e-l32 



Locus Name 



ACC# 



P55179 



PEPT I DASE T, (AMlNoTRIPEPTlDAd E) (TRlPKmUA^) 



NT 



AA 



ORF Name 



NT ID 



AAID 



|2.422a.6.:/.7....cl...i.:Ab. I 



Length Length 

tuts — 



Score 



TTT 



TJTT 



Probability 
4 . 0e-i4i 



Protein name 



Locus Name 



class A beta- lactamase CFXA2 precursor 



bp;AFiiailU 



Acc# 



AF118110 



Description 



Prevotella intermedia class A beta- lactamase CFXA2 precursor (ctxA2) gene, 
complete cds . 



1047 



NT 



AA 



ORF Name 



NTID 



124397187 c3 b6 J A 



AAID Length Length 

mm — 



— Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



|5LiA14D.2k..±2...15D. I M 



AAID Length Length 

— 



Score Probability 
4.ie-36 



330 



Protein name 



Locus Name 



Dps 



jgp:AB025775" 



Acc# 



AB025779 



Description 



Porpiiyromonas gmgivalis gene tor Dps, complete cas . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score 



Probability 
l.de-43 



Protein name 



Description 



Locus Name 



Acc# 



sp:Y6AL_EdoLl 



HYPOTHETICAL S9.4 KB PRO T EIN IN gSK-F&fft INTEkGEmC REGION 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score 



3988 



335" 



Probability 
7.1e-10 



Protein name 



Locus Name 



nucleotide pyropnospnatase nomolog T16L4.21U I BTrTTM^T 



Acc# 



T09933 



Description 



1048 



ORF Name 



Protein name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



WITT 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



|248.M&17....cl...6.0..7. 



AAID Length Length 
7TI 



Score Probability 
3.2e-05 



Locus Name 



jsp:EBA2_FIAMIJ 



Acc# 



P36912 



(EifoOGLyoosiDASS ri) 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



Protein name 



Locus Name 



divalent cation transport- related protein 



pir:H-?«60 



Description 



$.$e-5i 



Acc# 



H72360 



NT 



AA 



ORF Name 



NTID 



AAID 



2&&22±±2.±2J22b... 



15214 



Length Length 

— 



Score Probability 



Protein name 



Locus Name 



Acc# 



Description 



1049 



ORF Name 



24848<^d cl 4by 



Protein name 



NT ID 



AAID 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— — - Score Pr obability- 
Length Length 



Locus Name 



Acc# 



Description 



IN0-M1T 



ORF Name 



Protein name 



NTID 



NT AA 

— — Score P robability 
AAID Length Length 



TIT 



TFTT 



T7F" 



Locus Name 



1.3e-34 



Acc# 



Description 



sp:YIDA_ECOLl 1 



HYPOTHETICAL 29.7 KB PROTEIN IN mJA-cJVk B INTBktiENIC REGION 



ORF Name 



NTID 



AAID 



"NTT AA 

— — Score P robability 
Length Length 



maaaas...c:2L.4fiLi 



1041 



262 



1. 5e-22 



Protein name 



Locus Name 



Acc# 



hypothetical protein F14F9.5 



|pir:T337V4 



Description 



T33774 



1050 









NT 


AA 


ORF Name 


NT ID 


AAID 


Length 


Length 




|3557 


5219 


278 


837 



Protein name 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



NTID 



AAID 



NT AA 

— — , Score Prob ability 
Length Length 



7F" 



1W 



0.024 



Protein name 



Locus Name 



HCG-i protein 



gp:Al'0442ly 



Acc# 



AF044219 



Description 



Drosophila melanogaster HCG-l protein 


(HCG-l) 


mRNA, complete 


cds . 
















ORF Name 


NTID AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


ISAblOAl^al^A 


3555 9221 


111 


336 


173 


4. ie-13 



Protein name 



Locus Name 



thioredoxm-liKe protein 



|gp:A¥ACj0107l8 



Acc# 



AC010718 



Description 

Aratudopsis thaliana chromosome l bac 
sequence . 



F2 8Q16 genomic sequence, complete 



NT 



AA 



ORF Name 



NTID 



AAID 



|2B.S.5.2.iS.7....Gl...b.9.4.. 



5222 



Length Length 
WTZ 



T5T 



Score Probability 
1. 3e-B7 



Protein name 



Locus Name 



probable dTDP-L-rnamnose synthase 



pir :Til087 



Acc# 



T31087 



Description 



1051 



ORF Name 



NTID 



AAID 



NT AA 

— — Score P robability 
Length Length 



TOT 



T7T 



|i.Se-24 



Protein name 



Locus Name 



sp:ENT(Mi!<JoLl 



Acc# 



P10377 



Description 

ISOCHOrISmateI SYNTHASE fiNTC, 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



5224* 



11740 



"454" 



16 . 8e-43 



Protein name 



Description 



Locus Name 



spiMElSfDJlAijIlW 



Acc# 
P44612 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



7.6e-60 



Protein name 



Locus Name 



probable zinc- containing aenydrogenase 



foiri'l'^bl 



Acc# 



T36961 



Description 



NT 



AA 



ORF Name 



NTID 



4004 



AAID Length Length 
13225 





145 


458 




111 





Score Probability 
1 . 5e-06 



Protein name 



Locus Name 



terric uptaite regulation protein 



bir:G721>i^ 



Acc# 



G72213 



Description 



1052 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



4005 



9227 



TIT 



WIT 



5.1e-i02 



Protein name 



Locus Name 



Acc# 



naphthoate synthase, menB : dhjma 
synthase : dihydroxynaphthoate 
synthase r dihydrox ynapthoic acid synthetase 
Description 



pir iFbVbbb 







NT 


AA 


NTID 


AAID 


Length 


Length 


4006 


$22§ 


134 


405 



ORF Name 



I266046S7 i2 lii 



Protein name 



Locus Name 



Acc# 



Description 



ORF Name 



Protein name 



transposase 



Description 



NT 



AA 



NTID 



AAID 



9229 



Length Length 
I5B5 



Score Probability 
l.le-19 



E3? 



Locus Name 



gp:Atfd3&d66 



Acc# 



AF038866 



Bacteroides tragilis transposon Tnbb2U transposase (OipH) anamonilization 
protein BmpH (bmpH) genes, complete cds . 



ORF Name 



NTID 



AAID 



"NTT AA 

— — Score Probability 
Length Length 



2£&2tti41...ca..AttI 



I400& 



140 



9.2e-07 



Protein name 



Locus Name 



Hypl protein 



|gp:HvHmi>kO 



Acc# 



Y09797 



Description 



H. vulgaris mRNA tor Hypl protein. 



1053 



ORF Name 



NT ID 



NT AA 

— — Score Probability 
AAID Length Length 



'29398290 U 2VI 



TO" 



WITT 



3T 



0.0053 



Protein name 



Locus Name 



asparagine-nch protein (clone 28C6J 



pir :S14470 



Acc# 



S14470 



Description 



ORF Name 



NT ID 



NT AA 

— — Score Probability 
AAID Length Length ^ 



14010 



2418 



l.Se-3i 



Protein name 



Locus Name 



Acc# 



Sensor protein RcsC (EC 2. 7. 3. -J 



gp:D908bU 



Description 



E.coli genomic DNA, Kohara clone #373 (4y . b-4y . y mm.; 



ORF Name 



NTID 



Protein name 



hypothetical protein 



Description 



NT 



AA 



AAID Length Length 
T775 



Score Probability 
5.8e-2£ 



J71 



Locus Name 



pir :C722&S 



Acc# 



C72285 



ORF Name 



Protein name 



NTID 



TUTT 



AAID 



NT AA 

— — Score Prob ability 
Length Length 



TUJT 



Locus Name 



Acc# 



Description 



NO- HIT 



ORF Name 



3.a5..7.2L3.3.X...Cl..A3.U. 



Protein name 



NTID 



AAID 



NT AA „ _ . , . . . . 
— — Score Proba bility 
Length Length 



Locus Name 



Acc# 



Description 



[NO-HIT 



1054 



ORF Name 



NT ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



32478803 c2 Sb2 



0.033 



Protein name 



Locus Name 



sp : CPE±_B0VIN 



Acc# 



018963 



Description 

CYTOCHROME P4S0 Sfil, (CYMIEI) 



ORF Name 



32523S7£ rl &2 



Protein name 



NTID 



AAID 



9237 



NT 



AA 



Length Length 
TSS 



Score Probability 



ST 



Locus Name 



Acc# 



Description 



fcTO-tilT 



ORF Name 



3.3.1M3..7....C.3...A2&., 



Protein name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length " L 



T7T 



Locus Name 



Acc# 



Description 



[MO-HIT 



ORF Name 



mi0.3.3.fL±l..A6.. 



Protein name 



NT 



AA 



NTID 



ffuTT 



AAID Length Length 



Score Probability 
|4.7e-l2 



TZ1 



Locus Name 



RNA polymerase sigma factor SigZ-like protein I |gp : AF137263 



Acc# 



AF137263 



Description 



Bacteroides tnetaiotaomicron 30S noosomai protein sie-iikeprotem, tucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds . 



1055 



ORF Name 



Protein name 



NTID 



— — Score Probability 
AAID Length Length 



TUTT 



Locus Name 



Acc# 



Description 



MO -HIT 



ORF Name 



Protein name 



NTID 



NT AA 

— — Score Probabi lity 
AAID Length Length 



14015 



5241 



NBU1 mobilization protein mo£) 



Description 



12054 



Locus Name 



pir :A4yyui 



i.3e-2ib 



Acc# 



A49901 



ORF Name 



a4ft2LfiSSft-.±l„.iex 



Protein name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



145.0.$.7.M...ci...Aii2.. 



Protein name 



NTID 



AAID 



5242 



NT 



AA 



Length Length 
TTT 



Score Probability 



414 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



5244 



nypotnetical protein PAB004U 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



TIT 



Locus Name 



|pir:B7Si54 



8.6e-20 



Acc# 



B75194 



1056 



ORF Name 



NT ID 



AAID 



NT AA 

— — Score Probability 
Length Length 



14023 



li.Oe-iO 



Protein name 

Description 
flypWHETlCAL PROTEIN HJI0350 (ORfr'3) 



Locus Name 



|sp:Y550_mgnr 



Acc# 



P24326 



ORF Name 



NTID 



AAID 



NT AA 
— — Score 
Length Length 



cl 389 



1ST - 



Probability 
|3.Se-33 



Protein name 



Description 



Locus Name 



sp:VZO$_MYcJTU 



Acc# 



Q10543 



HYPOTHE TI CAL TRNA/RRNA METH YLTRAWSFERA5E CY3i.O^, 



NT 



AA 



ORF Name 



NTID 



3£20.7.<m...c:L..4l3.. 



AAID Length Length 
TUZZ — 



Score Probability 
ITS 



i.ie-10 



Protein name 



Locus Name 



chloromuconate cycloisomerase homolog yKtB 



|pir:H69^Sb 



Acc# 



H69855 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— — Score P robability 
Length Length 



\16MXC)A1.±1..:2.£>:.L 



9248 



Protein name 



Locus Name 



Acc# 



Description 



MO-HIT 



1057 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
T5T 



576 



Score Probability 
S.0e-99 



757 



Protein name 



Locus Name 



alJcyl Hydroperoxide reductase summit c 



gp:AP12^406 



Acc# 



AF129406 



Description 



Bacteroides tragiiis alKyl ny ar ope r oxide reductase subunit c (anpc)and 
alkyl hydroperoxide reductase subunit F (ahpF) genes, completecds. 





ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


3947562_c2__«4 


402$ 


$250 


400 


1205 


758 


4.2e-75 



Protein name 



Locus Name 



transposase 



bp:A*038866 



Acc# 



AF038866 



Description 



Bacteroides tragiiis transposon Tnbb^o transposase ir>ipH; anamotaiization 
protein BmpH (bmpH) genes, complete cds . 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


£0.M563...±2l...13..7. 


4025 


9251 


445 


1338 


128 


5.1e-05 

















Protein name 



Locus Name 



nypotnetical protein PHoy22 



pxr:D710fcJ!i 



Acc# 



D71082 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Pr obability 
Length Length 



Protein name 



¥uT0~ 



Locus Name 



Acc# 



Description 



NO-HIT 



1058 



ORF Name 



NTID 



NT AA 

— , — , Score P robability 
AAID Length Length 



4054800 J44 



|4.8e-83 



Protein name 



Description 



Locus Name 



sp:SDHL_aTk(JO 



Acc# 



086564 



L-SBRlNE DfifttfDftAfASig, (L-SfcikiNg DSAMlNA^ a) (dDH) (LSD) 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



4S046S2 cl 402 



7HT 



1.4e-05 



Protein name 



Locus Name 



intracellular Hyaluronic acia .binding protein | |gp : AF032§6^ 



Acc# 



AF032862 



Description 

Homo sapiens intracellular hyaluronic acid binding protein { ihabp} itikjma, 
complete cds . 



ORF Name 



Protein name 



NTID 



NT AA 

— — , Score Pro bability 
AAID Length Length 



9255 



T7TT 



Locus Name 



Acc# 



Description 



[MO-HIT 



ORF Name 



Protein name 



Description 



NTID 



AAID 



NT AA 

— — i Score Probability 
Length Length 



5TT 



5.6e-3I 



Locus Name 



sp:GA6S_HUMAN 



Acc# 



P34059 



(CMOMMOtl'IlslA^K) 



1059 



NT 



AA 



ORF Name 



NT ID 



4939000 c2 b'A± 



AAID Length Length 



9257 



ran" 



Score Probability 
ll.le-104 



Protein name 



Locus Name 



conserved hypothetical protein 



pir:B7227tJ 



Acc# 



B72278 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



E.D.§.19.5l.7....c2...5.M.. 



2uW 



Protein name 



Locus Name 



cation- transporting atpase, p-type (pacs) 
PAB0626 



Description 



|pir:E7!>i4i 



Acc# 



E75141 



ORF Name 



3.lD.9.3..7....cl...i6.b... 



Protein name 



NT ID 



4uTT 



AAID 



NT AA 

— — Score Probabi lity 
Length Length 



T7T~ 



sir 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



AAID 



S.l±ll£l...al..£±6. I [4uT 



Protein name 



hypothetical protein F42G9.3 



Description 



NT AA 

— — , Score Proba bility 
Length Length 



TIT 



114 



0.00075 



Locus Name 



pir :T16348 



Acc# 



T16348 



ORF Name 



NTID 



AAID 



SlU.lli3....cI.£flifiL I 



Protein name 



NT AA 

— — Score Prob ability 
Length Length 



Locus Name 



Acc# 



Description 



INO-HIT 



1060 



ORF Name 



Protein name 



NTID 



NT AA 

— , — -, Score Probability 
AAID Length Length ^ 



9262 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



4041 



NT AA 

— — Score Probability 
AAID Length Length 



1143 



TUT" 



Locus Name 



0 .0047 



Acc# 



Description 



gp:PFMAL^P7 



Plasmodium talclparum MAL3P7, complete sequence. 



ORF Name 



Protein name 



NTID 



NT 



AA 



AAID Length Length 
T73 



Score Probability 



50" 



Locus Name 



Acc# 



Description 



ttJO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— — Score Proba bility 
Length Length 



Locus Name 



6.5e-l5 



Acc# 



sp:RP0KJiAU114 



Description 

RNA P0LVM E RASL1 ^Ic^MA-E VAc^TO R (<U(jMA-24) 



P44790 



1061 



ORF Name 



15444077 ri T) 



Protein name 



NT ID 



AAID 



NT AA 

— — , Score Probability- 
Length Length 



Locus Name 



i.4e-53 



Acc# 



Description 



lsp:PFLA_E(JOLl 



P09374 



ACTIVATING EN2YME) 



ORF Name 



7£>§4£75 rl 19 



Protein name 



NTID 



NT AA n _ , , . . . ^ 

— • — Score Pro bability 
AAID Length Length 



9267 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



Description 



NTID 



— — Score Pro bability 
AAID Length Length 



l.4e-2l 



Locus Name 



sp:Y2AO_E<JOLl 



Acc# 



P76243 



HYPOTH E TICAL 14. 'A Kb PROTEIN IN GAPA-BMB INTEkCj ENIC REGION 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



1.4e-65 



Protein name 



Locus Name 



Acc# 



sp:YEIH_ECOLl 



Description 

HYPOTHETICAL 56.$ KD frfrOTEXN IN LYSP -NEO INTErGENIC REGION 



P33019 



1062 



ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


975780_t3_333 


404§ 


9270 


351 


1056 


93 


0.0034 



Protein name 



Locus Name 



Acc# 



troponin T, cardiac muscle : troponin 



biriTPHUTi; 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



9.9.2.7.3.3.CL..C.3....6.2.7.., 



Length Length 



Score Probability 



T5T 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



Score Probability 



57TT 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



i aa£aAM...ci.Jwa.. 



AAID Length Length 



TU1 



Score Probability 
3.§e-&l 



SIS 



Protein name 



Description 



Locus Name 



spr^'lVbAt^U 



Acc# 



P54378 



1063 



ORF Name 



22438202 ±2 1 



Protein name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length 



7T" 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length 



inner membrane protein homoiog 



Description 



5.6e-3a 



Locus Name 



pir :A701bb 



Acc# 



A70155 



NT 



AA 



ORF Name 



NT ID 



im^.i..±i...ii6. I msz 



AAID Length Length 

rrz — 



Score Probability 
|2.7e-09 



137 



Protein name 



Locus Name 



Acc# 



transcriptional regulator 



gp:BSUB00I7 



Description 



Bacillus subtilis complete genome (section 17 or 21) : from 3l97001to 
3414420. 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 
2.6e-18 



Protein name 



Locus Name 



neat shock protein, class I 



pir:D7238S 



Acc# 



D72385 



Description 



1064 



NT 



AA 



ORF Name 



NT ID 



TOT 



AAID Length Length 
TIT 



Score Probability 
0.0043 



Protein name 



Locus Name 



conserved Hypothetical protein yulD 



pir :F7UU14 



Acc# 



F70014 



Description 



ORF Name 



NTID 



Protein name 



coenzyme F390 synthetase II 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



WIT 



1317 



11171 



Locus Name 



pir:B6£115 



7.i>e-119 



Acc# 



B69115 



NT 



AA 



ORF Name 



NTID 



i^asftL.ca„iafi I 



AAID Length Length 

wm> — 



i033 I rnm 



Score Probability 
|l.2e-50 



Protein name 



Locus Name 



sensory transduction iiistidme kinase 
slr2098 :protein slr2098 :protein slr2098 



pir:S«l30 



Acc# 



S75130 



Description 



ORF Name 



NTID 



NT AA „ _ "u. ■ t • 4- 
— — , Score P robability 
AAID Length Length 



|liS.&7.1St2..±2L..9.0 1 



UTT 



WIT 



|4.4e-72 



Protein name 



Locus Name 



Tri r 4 allergen 



|gp:Aff0^brr 



Acc# 



AF082514 



Description 

Trichopnyton rubrum Tri r 4 allergen mRNA, complete ccts , 



1065 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
— 



Score Probability 
i.0e-35 



EST 



Protein name 



Locus Name 



sp:YCLF_BAC!^U 



Acc# 



P94408 



Description 

HYPOTHETICAL 55.^ KD frROTSiN M SE'P-GE RKA ftifffiR(3ENie REGlOti 



ORF Name 



NTID 



AAID 



NT AA 

— , — „ Score Probability 
Length Length 



1$70$6§2 cl 235 



4061 



WIST 



TUT 



32T 



549 



Protein name 



Locus Name 



CDP - 4 - Jceto - 6 - deoxy-D- glucose - 3 - defrycLrase 



Description 



pir :E47070 



5.$e-53 



ACC# 



E47070 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



Length Length 



Score Probability 



Locus Name 



Acc# 



ORF Name 



NT 



AA 



NTID 



±$:mb.9Al...c2Jlll I ffuTI 



AAID Length Length 
92^5 



Score Probability 
|2.6e-103 



Protein name 



Locus Name 



|sp:YfifiF_fiCOLl 



Acc# 



P75776 



Description 

HYPOTHETICAL ABC TRANSPORTED ATP- BINDING PROTEIN YBHF 



1066 



ORF Name 



1592182 t2 107 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 
TIT" 



Score Probability 



Locus Name 



Acc# 



Description 



NO- HIT 



ORF Name 



Protein name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



4065 



9237 



ST 



0.015 



Locus Name 



Acc# 



P36158 



Description 

HYEOTHBTICAL 55.5 Kb MOTElISf IN SlSa-M TOl INTEikGSafIC Rfi^loN 



ORF Name 



Protein name 



NTID 



14066 



NT AA 

— , — , Score Probability 
AAID Length Length 



ST" 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



NTID 



AAID 



20.i^0.3.0....c3....3.2l I [4uT7 



Protein name 



hypothetical protein C26D10.4 



Description 



NT 



AA 



Length Length 
— 



Score Probability 
|1.2e-25 



ITT 



Locus Name 



|pir:T194^6 



Acc# 



T19486 



ORF Name 



NTID 



AAID 



|2fl£a£fl5a...a2.J&:L. 



Protein name 



NT AA 

— — , Score Probability 
Length Length 



Locus Name 



hypothetical protein sc5C7.ua SC5C7.08 



pir:MS2lS 



Description 



1.4e-3S 



Acc# 



T35215 



1067 



ORF Name 



Protein name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



-VIST 



Locus Name 



Acc# 



Description 



KO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



WU7TT 



T7T 



7.4e-14 



Protein name 



Locus Name 



unknown 



gp:AP048749 



Acc# 



AF048749 



Description 



Bacteroides tragilis capsular polysaccnaricte biosynthesis ope ron, complete 
sequence . 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



\224.6.216A...c2...21& I [4TT7T 



TTuT" 



l.4e-44 



Protein name 



Locus Name 



sp : YBHS_EC0L1 



Acc# 



P75775 



Description 

HYPOTHET I CAL 42.1 KB PRO T E I N I N MOAE-RHL E INT ERGENIC! RUC10N 



NT 



AA 



ORF Name 



NTID 



AAID 



22$.10AS.0....a2J£16.. 



Length Length 
|179 | |540 



Score Probability 
i.4e-09 



Protein name 



Description 



Locus Name 



sp:YHII_EOTLl 



Acc# 



P37626 



1068 



ORF Name 



|2345%^6 160 



Protein name 



NT ID 



NT 



AA 



AAID Length Length 
755 



Score Probability 



Locus Name 



Acc# 



Description 



MO- HIT 









NT 


AA 


ORF Name 


NTID 


AAID 


Length 


Length 


23.5.16.Q.UU...C1...2.13. 


4074 


5296 


93 


282 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



T5T" 



0.00026 



Locus Name 



sp:YlSli_BPT4 



Acc# 



P17308 



Description 

HYPO T HETICAL ii.5 KB PROTEIN I N GP^1-0D INTERGUNIC REGION (OkP b) 



NT 



AA 



ORF Name 



NTID 



AAID 



116A1±15...±±..±± I 14IF7S 



Length Length 
TT75 — 



Score Probability 



Protein name 



Locus Name 



Acc# 



Description 



[NO-HIT 



1069 



ORF Name 



23926&77 ci 214 



Protein name 



NTID 



4077 



AAID 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



INO-HIT 



ORF Name 



Protein name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



6.7e-36 



Locus Name 



Acc# 



damage- inducible protein PAB024 3 



Description 



pxr:A75ibi 



A75151 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



\2.&.22Z2±L..c±..±±L 



Protein name 



i3TT 



7.1e-25 



Locus Name 



Acc# 



nypotnetical protein MTH18 54 



Description 



(pir:A6911b 



A69115 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


2±15.1^11...al...20A 


40S0 


£302 




1052 


117 


2.le-06 

















Protein name 



Locus Name 



Acc# 



nypotnetical protein PAB06 03 



Description 



E75137 



ORF Name 



NTID 



NT AA 

— , — , Score Proba bility 
AAID Length Length 



!2440.0.£8.5...±3....1^. I FTOT 



Protein name 



\TJT 



|2.1e-06 



Locus Name 



Acc# 



conserved nypotnetical protein 



Description 



bir:F7532a 



F75328 



1070 



ORF Name 



NT ID 



AAID 



NT AA 
— — Score 
Length Length 



Probability 



24648876 t'A 100 



Protein name 



fZTT 



TUT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NT ID 



AAID 



— , — , Score Probability 
Length Length 



2.48.20.3.0.0....c2...Mi.. 



Protein name 



FuT" 



TIT 



Locus Name 



i.4e-05 



Acc# 



Description 



sp:XE£C_SALTY 



P55888 



HWfieRASfiyftfiCOMfeMASE 



ORF Name 



NT ID 



Protein name 



AAID 



NT AA 

— — , Score Probability 
Length Length 



5.8e-li2 



Locus Name 



Acc# 



nelicase 



Description 



|gp : RNDI^B" 



Y13813 



Rhodothermus mannus dnaB gene. 



ORF Name 



NTID 



AAID 



25£A±££2...z±..±&$. I 



Protein name 



NT 



AA 



Length Length 
Tu3~ 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



1071 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



26574061 t2 106 



4086 


9308 




244 


735 




472 



Score Probability 
8.5e-45 



Protein name 



Locus Name 



sanA protein 



pir:D7554S> 



Acc# 



D75549 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



2.&5ta^aa2L...CL...2L2L2L.. 



14087 



Protein name 



Locus Name 



Na- translocating NADH-qumone reductase, Nqr5 
subunit 



Description 



5.1e-54 



Acc# 



A72399 



NT 



AA 



ORF Name 



NTID 



AAID 



2l6..7a^0.i2...c0-...20.0... 



Length Length 

jwti — 



34T 



Score Probability 
JZTB — 



5.4e-272 



Protein name 



Description 



Locus Name 



sp:OTkA_£AC^lJ 



Acc# 



034863 



EXCIKDCLEASE ABC SUBUNIT A 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



|2L6..7.5.S.i2I...cl...3.21 1 pnTSS 



JUT 



i.4e-53 



Protein name 



Locus Name 



Na- translocating NADH-qumone reductase, Nqr4 
subunit 



pTr7TT721W 



Acc# 



H72398 



Description 



1072 



ORF Name 



NT ID 



NT AA 
T — _ _ — Score Probability 
AAID Length Length 



2544087 cl 203 



i.3e-i8 



Protein name 



Locus Name 



conserved hypothetical protein SCE20.33C. 



:SCE20 



ACC# 



AL136058 



Description 



Streptomyces coelicoior cosmid E20. 



ORF Name 



NTID 



NT AA 
T — _ T — Score Probability 
AAID Length Length 



3025037 c3 287 



4M" 



53TT 



S.Se-2l 



Protein name 



Locus Name 



site-specitic recomJDinase 



gp:D86934 



ACC# 



D86934 



Description 



Staphylococcus aureus genes, mec region, partial and. complete cds . 



NT 



AA 



ORF Name 



mam.7....c2...254 i wuwz 



NTID AAID Length Length 

mrz — 



Score Probability 



TT5 IP5T 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



3AU2izxa.7....cz...z:y.z 



AAID Length Length 
TuUS 



Score Probability 




|5.9e-76 



Protein name 



Locus Name 



Na- translocating NADH-qumone reductase, Nqr2 
subunit 



pir:F72398 



Acc# 



F72398 



Description 



1073 



ORF Name 



NT ID 



AAID 



NT AA 

— ■ T — ^ Score Probability 
Length Length J - 



34179712 ±3 154 



|6.2e-200 



Protein name 



Description 



Locus Name 



Acc# 



EXCiisfUCLEASE ASC StrfeUNlT 6 (DMA £>&OTElN) 



NT 



AA 



ORF Name 



NT ID 



35344626 cl 201 



AAID Length Length 
T75~ 



BIT" 



Score Probability 




2.le-32 



Protein name 

Description 
EBSC PROTEIN 



Locus Name 



Acc# 



P36922 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 




T75 - 



Score Probability 
JI2 



7.6e-28 



Protein name 



Locus Name 



sp:YBHR_ECOLl 



Acc# 



P75774 



Description 

HYPMttfifiCAL 4l t S K£) t>&0TEltf IN MOAE-rhLE INTEkGEtfiC REGION 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



\16.1SM±1.±±...2S. I WUTT 



Protein name 

Description 
WJ^KTT 



Locus Name 



Acc# 



1074 



ORF Name 



NTID 



I39406S8 ti 45 



14038 



Protein name 



conserved hypothetical protein ylbK 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



657 


1374 




237 





Locus Name 



bxr:H53874 



Acc# 



H69874 



NT 



AA 



ORF Name 



3.M5ia.7....tl...m.. 



NTID AAID Length Length 

mzi — 



Score Probability 
— 



|4,8e-iSi 



Protein name 



Locus Name 



: sp:PYRG_BMSU 



Acc# 



P13242 



Description 

OTP SYNTHASE , (UTP-- AMMONIA LIGASE) (CTP SYNTHETASE) 



NT 



AA 



ORF Name 



NTID 



19AS.1D.1...C.1...119. I 



AAID Length Length 

wm — 



Score Probability 

fui — 



Protein name 



Locus Name 



sp:YDGM_ECOLT 



Acc# 



P77223 



Description 

PUTATIVE FERREDOX IN - L I KE PROTEIN IN ADD -NTH INTERGENIC REGION 



ORF Name 



NTID 



NT AA 

^ ^ T — _ — _ Score Probability 
AAID Length Length J ~ 



6.7e-§4 



Protein name 



Locus Name 



cLTDP-6-deoxy-D-glucose-3 , 5 epimerase 



gp:AP04§74^ 



Acc# 



AF048749 



Description 



Bacteroictes Iragilis capsular polysaccharide biosynthesis operon, complete 
sequence . 



1075 



NT 



AA 



ORF Name 



NT ID 



4065500 c2 246 



AAID Length Length 
3323 — 



WT 



Score Probability 




2.2e-05 



Protein name 
Description 



Locus Name 



Acc# 



sp:M>C_BK>Hi 



NT 



AA 



ORF Name 



NT ID 



AAID 



424602 tS 165 



Length Length 
TSTT 



F4T 



Score Probability 
ITS 



4.?e-l3 



Protein name 



Locus Name 



alanine- -tRNA ligase, 



P ir:fi722l6 



Acc# 



E72216 



Description 



NT 



AA 



ORF Name 



NTID 



14104 



AAID Length Length 
STZZ 



F37" 



Score Probability 




|6.3e-143 



Protein name 



Locus Name 



glucose- 1-phosphate thymidyi transterase 



gp:AF045745 



Acc# 



AF048749 



Description 



Bacteroides tragilis capsular polysaccharide biosynthesis operon, complete 
sequence . 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



\&l5SA0A...c±...221.. I I4T0? 



^4" 



IMF" 



2.4e-S6 



Protein name 

Description 
GALACTOSE 4-EPIMERASE) 



Locus Name 



sp:GALE_BACSU 



Acc# 



P55180 



1076 



ORF Name 



NT ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length — 



45077&1 12 11 



Protein name 



"j |277 | |i 



TFT 



|5.5e-25> 



Locus Name 



Acc# 



Description 



sp:YABH_BAC3U 



P37550 



fiVKyHifiTICAL 31.7 Kb MOTfilM Itf SSPP-PUR& iNxERGENiC REGION (0RF1) 



ORF Name 



NT ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



4$£56S7 13 ISO 



WTUT 



Protein name 



nrrrr 



1^~ 



|2.2e-0fi 



Locus Name 



Acc# 



cell wail -binding protein nomolog yocH 



Description 



pir :E69901 



E69901 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length -L 



Protein name 



2.5e-53 



Locus Name 



Acc# 



hypothetical protein TM0244 



Description 



pir:E72358 



E72398 



ORF Name 



Protein name 



NTID 



14109 



AAID 



NT 



AA 



Length Length 
TTF2 — 



Score Probability 



Locus Name 



Acc# 



Description 
BTO-HIT 



1077 



ORF Name 



NT ID 



NT AA , , . , . 
T — _ — Score Probability 
AAID Length Length J - 



I60271S7 cl 220 



TUT 



\ZTT 



|3.3e-Ii 



Protein name 



Description 



Locus Name 



sp:YDGP_ECOLI 



Acc# 



P77285 



HYSOTHSMCaL 21. $ Kd PROTEIN itf ADt) -isfTH ItffERGfiNlC RtfGlOtf 



NT 



AA 



ORF Name 



7§Sl25 t3 157 



NT ID AAID Length Length 

srn — 



Score Probability 




Protein name 



Locus Name 



ORF MSV198 MTG motit gene lamily protein 



gp:AFu£3S££ 



Acc# 



AF063866 



Description 



Meianoplus sangumipes entomopoxvirus, complete genome. 



NT 



AA 



ORF Name 



NTID 



i?. 1. 1 2. 5i !• • . . Ci 2. . . . 2. 3. 



14112 



_ _ _ _ — _ — ^, Score Probability 
AAID Length Length J - 

^TT3 



6.0e-09 



Protein name 



Locus Name 



hypothetical protein aq_12 73 



pir :C70410 



Acc# 



C70410 



Description 



ORF Name 



NTID 



AAID 



a&aCL.ta...l£l I 14113 



Protein name 

Description 
NO-HIT 



NT 



AA 



Length Length 
71 



Score Probability 



Locus Name 



Acc# 



1078 



NT 



AA 



ORF Name 



9865837 c3 326 



NT ID AAID Length Length 
^ 



T7T 



Score Probability 
£33 



I.8e-19 



Protein name 



Locus Name 



unknown 



gp;AF048749 



Acc# 



AF048749 



Description 



Bacteroides tragilis capsular polysaccharide biosynthesis operon, complete 
sequence . 



NT 



AA 



ORF Name 



NT ID 



AAID 



$$21876 c3 



|41lS 



Length Length 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



29.3.26A2...C1...18.3. I 14116 



Length Length 



TIT 



Score Probability 
52 



0.00616 



Protein name 



Description 



Locus Name 



Acc# 



gp:D90716 



Escherichia coli genomic DNA. (17.6 - 18.0 min) 



NT 



AA 



ORF Name 



NTID 



AAID 



J,X±3.-JL5.2.B....a r 2....b.2. I 14117 



Length Length 



Score Probability 



TUT 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



1079 



NT 



AA 



ORF Name 



NTID 



111^34^7 ±1 2 



AAID Length Length 
5T£T5 — 



1168 j [T5TT7 



Score Probability 
P&V 



6.8e-42 



Protein name 



Locus Name 



receptor antxgen (RagA) 



gp:PGT150S72 



Acc# 



AJ130872 



Description 



Porpnyromonas gmgivalis W50 receptor antigen (rag) locus encodinga major 
immunodominant 55kDa antigen. 



NT 



AA 



ORF Name 



NTID 



11719042 tl 1 



1411$ 



AAID Length Length 
5141 — 



wnr 



Score Probability 
PTSTT7 — 



|4.5e-l65 



Protein name 



Locus Name 



probable polyribonucleotide 
nucleotidyltransferase (pnp} 



pir :C71269 



Acc# 



C71269 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



r±&£MAl..±l...l± I KTTU 



Length Length 



Score Probability 
TJS 



|2.9e-05 



Protein name 



Locus Name 



cell surtace antigen-like protein A29L 



pir:?l7SlS 



Acc# 



T17519 



Description 



ORF Name 



14.6.5.a2L.7.I...r.l...7.., 



Protein name 

Description 
MO-HIT 



NT 



AA 



NTID 



AAID 



Length Length 



Score Probability 



Locus Name 



Acc# 



1080 



NT 



AA 



ORF Name 



NTID 



■26364040 ri 5 



AAID Length Length 
31^3 — 



Score Probability 



Protein name 

Description 
INO-HTT 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA 
T — _ — _ Score Probab ility 
AAID Length Length 2 ~ 



±$£16&1.±1J1& I wnn 



0.00025 



Protein name 



Locus Name 



transmembrane sensor 



|gp:AF05169i 



Acc# 



AF051691 



Description 



Pseudomonas aeruginosa stress factor A (psf A) , ECF sigma f actor (tiul) , 
transmembrane sensor (f iuR) , and hydroxamate-typef errisiderophore receptor 
(fiuA) genes, complete cds . 



NT 



AA 



ORF Name 



NTID 



|4lfi££3.2...c2...££ I 



AAID Length Length 

mzz — 



TTT 



TTZE~ 



Score Probability 
973 



5 .4e-98 



Protein name 



Locus Name 



butyrate kinase 



|gp:AB016T75~ 



Acc# 



AB016775 



Description 



Clostridium pertringens DNA tor butyrate kinase and hydrogenase, complete 
cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



A2.D3.2.6.2....L2...11 I \£TZ5 



Length Length 



Score Probability 




1.5e-i5 



Protein name 



Locus Name 



RNA polymerase sigma tactor SlgZ-like protein 



gp:AP137263 



Acc# 



AF137263 



Description 



Bacteroides thetaiotaomicron 3 OS ribosomal protein Sl6-likeprotein, tucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds . 



1081 



ORF Name 



4305327 ±1 4 



Protein name 

Description 
MO-HIT 



NT 



AA 



NTID 



AAID Length Length 
PITS 



Score Probability 



Locus Name 



Acc# 



ORF Name 



l47.6.£S.2>£...c2>...7.3... 



Protein name 

Description 
INO-HIT 



NTID 



NT AA 

, , „ _ — ^ _ — ^. Score Probability 
AAID Length Length 1 ~ 



Locus Name 



Acc# 



ORF Name 



Protein name 

Description 
INO-HIT 



NT 



AA 



NTID 



AAID Length Length 
!J33T5 



Score Probability 



TTT7~ 



TIT 



Locus Name 



Acc# 



ORF Name 



NT 



AA 



NTID 



fi.Aaaia7....ci...jia 



14125 



AAID Length Length 
3151 



Score Probability 

wiz — 



3 . 2e-45 



Protein name 



Locus Name 



sp : PTB_CL0AB 



Acc# 



Q05624 



Description 

PHOSPHATE BUTYRYLTRANSFESASE, (PHOSPHOTI^SBUTYRYLASE) 



1082 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



9323505_ti_6 


4130 






54S 


1647 















Locus Name 



Acc# 



Description 
[NO-HIT 



ORF Name 



NTID 



I5AM£5..7...±i...l I 



Protein name 



AAID 



NT AA 
— — Score 
Length Length 



Locus Name 



Probability 



Acc# 



Description 
F^TTTT 



ORF Name 



&±±h.LD...±±Jl 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



4132 






388 1154 




452 





Locus Name 



115K outer membrane protein precursor : SusC 
protein 



pir : JC6027 



Description 



1.4e-41 



Acc# 



JC6 027 



NT 



AA 



ORF Name 



NTID 



125.3,3..7.7....t2....2 



14123 



AAID Length Length 
^J^E 



TT 



Score Probability 

m 



0.0021 



Protein name 



Locus Name 



maturase-liJte protein 



|gp:CPE3PSBC 



Acc# 



AJ222583 



Description 



Euglena spxrogyra chloropiast partial psJoC gene & complete internalmat2 
gene . 



1083 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



124736386 c3 3 





4134 




9356 




62 


189 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



1&2A15l1&...c:L..6.3..... 



AAID Length Length 
— 



7T 



Score Probability 
TTE 



4.5e-07 



Protein name 



Locus Name 



iron (II) transport protein A 



pir :C72423 



Acc# 



C72423 



Description 



NT 



AA 



ORF Name 



NTID 



15.0.17.5.I7....13....3.7. 



AAID Length Length 




Score Probability 

wn — 



iLOe-91 



Protein name 



Locus Name 



sp:PBPC_ECOLI 



Acc# 



P76577 



Description 

SIPtJNOTlOMAL PENIClLLM-fitNDiNG PROTEIN lC Pk£OT£SO£ (PBE>-lC) 



NT 



AA 



ORF Name 



NTID 



19A9.2il8A..±1...3. I RTT7 



AAID Length Length 




TTZT 



Score Probability 




l.le-43 



Protein name 



Locus Name 



cell cycle protein nomolog mesJ 



pir:f3l465 



Acc# 



T31465 



Description 



NT 



AA 



ORF Name 



NTID 



14138 



AAID Length Length 

9360 



T3T 



Score Probability 
TT7 



|2.7e-09 



Protein name 



Locus Name 



vsrD protein 



pir : 140540 



Acc# 



140540 



Description 



1084 



ORF Name 



NT ID 



AAID 



NT AA 
— — Score 
Length Length 



Protein name 



Locus Name 



conserved hypothetical protein 



pir:H7JS370 



Description 



Probability 
|4.5e-62 



Acc# 



H72370 



ORF Name 



3.z:l:lz:/.8.5l...g3....9.3... 



Protein name 



Description 



MO-HIT 



NT 



AA 



NT ID 



AAID 



4140 



Length Length 



Score Probability 



Locus Name 



Acc# 



ORF Name 



Protein name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length JL 



T7T 



i.7e-40 



Locus Name 



sp : FE0B_MET JA 



Acc# 



Q57986 



Description 
FERROUS IRON TRANSPORT PROTEIN B HOMOLOG 



ORF Name 



NTID 



14142 



Protein name 



NT 



AA 



AAID Length Length 
TTtt 



Score Probability 



Locus Name 



Acc# 



Description 
NO-tilT 



ORF Name 



NTID 



NT AA 
_ — _ T — . Score Probabili ty 
AAID Length Length -L 



Ml&3.43.7...±2...23. I f¥T¥T 



Protein name 



Na+/H+ antiporter nomolog yneL 



Description 



2.3e-74 



Locus Name 



pir :D69829 



Acc# 



D69829 



1085 



NT 



AA 



ORF Name 



NT ID AMD Length Length Probability 




ST 



T5T 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



ORF Name 



NT ID 



NT AA 

_ _ _ _ — ^ — ^, Score Probability 
AAID Length Length JL 



AA2!15±..±1J±& I [¥T^ 



2.1e-55 



Protein name 



Locus Name 



antibiotic resistance protein homolog ydeR 



pir:D65775 



Acc# 



D69779 



Description 



NT 



AA 



ORF Name 



NT ID AAID Length Length 




1870 



Score Probability 
F¥5 



l.Se-60 



Protein name 



Locus Name 



hypothetical protein JD2 52 0 



pir:G65028 



Acc# 



G65028 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



1D.6.Q3.1,3.5...±1...6. 



14147 



Length Length 
TFT 



Score Probability 




6 .Oe-12 



Protein name 



Locus Name 



hypothetical protein CT276 



bir:A7lS55 



Acc# 



A71535 



Description 



ORF Name 



NTID 



117.9.7.&„7....tl...7. I WTZS 



Protein name 

Description 
NO-fllT 



AAID 



5370 



NT 



AA 



Length Length 
WTT 



Score Probability 



Locus Name 



Acc# 



1086 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


125i3i3_cl_2S 


414$ 






1161 


ms 


6.Se-125 


Protein name 








Locus 


Name 


Acc# 










sp : SVW_ 




Q46127 


Description 














(TRPRS ) 


















ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


20$7505l_£l_l 


4150 




3$$ 


1200 


336 


S.6e-37 



Protein name 



Locus Name 



immunoreactive 42kD antigen PG33 



gp:APl757l5 



Acc# 



AF175715 



Description 



Porpnyromonas gmgivaiis strain W5 0 immunoreactive 42KD antigenPG3 3 gene, 
complete cds . 



NT 



AA 



ORF Name 
2l3.5.9.5.25.3....c1...22.. 



NTID 



AAID 



Length Length 



Score Probability 



T3T 



Protein name 
Description 

NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



I 14152 



Length Length 
TUT 



Score Probability 



Protein name 



Locus Name 



Acc# 



Description 
MO-HIT 



1087 



NT 



AA 



ORF Name 



NTID 



25516062 ±2 10 



TTST 



AAID Length Length 




T5ST 



Score Probability 




l.le-14 



Protein name 



Locus Name 



immunogenic 7 5 KDa protein PG4 



gp:AF145800 



Acc# 



AF145800 



Description 



Porphyromonas gingivalis strain W50 immunogenic 75 JcDa protein PG4gene, 
complete cds . 



NT 



AA 



ORF Name 



NTID 



4154 



AAID Length Length 

wm — 



Score Probability 
|4.2e-07 



144 



Protein name 



Locus Name 



transposase 



|gp:AF038S66 



Acc# 



AF038866 



Description 



Bacteroides tragilis transposon Tn552 0 transposase (JDipH) andmotnlization 
protein BmpH (bmpH) genes, complete cds. 



NT 



AA 



ORF Name 



NTID 



AAID 



14155 



19377 



Length Length 



Score Probability 



Protein name 

Description 
PT^TTTT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



±1±B$M.„±1..±2... I |¥T5F 



AAID Length Length 

wrm — 



1081 1 WZZZ 



Score Probability 

— 



uTTT 



Protein name 



Description 



Locus Name 



sp:PYRi_DICDI 



Acc# 



P20054 



1088 



NT 



AA 



ORF Name 



NT ID 



71406b2 c3 50 



f¥TF7" 



„ _ — T — ^, Score Probability 
AAID Length Length JL 

— 



Protein name 



Locus Name 



EntT 



gp:AF099088 



ACC# 



AF099088 



Description 



Enterococcus raecium enterocin A (entAJ , EntI (entl) , EntF (entF) , EntK 
(entK), EntR (entR) , bacteriocin-like protein, EntT (entT),EntD (entD) , and 
protease IV homolog genes, complete cds; andunknown genes. 



NT 



AA 



ORF Name 



NT ID 



AAID 



10741300 £3 30 



Length Length 
1ST 



Score Probability 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



ORF Name 



NT ID 



NT AA n _ 
_ _ _ _ — ^, — _ Score Probability 
AAID Length Length JL 



lil<Xftl£L±2...1& I 



TT7TT 



§.5e-23 



Protein name 



Locus Name 



proline dipeptictase 



pir :D75419 



Acc# 



D75419 



Description 



ORF Name 



NTID 



X35.Q5.Q....£3....±2 1 [STFtf 



Protein name 

Description 
NO-HIT 



AAID 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



1089 



NT 



AA 



ORF Name 



NTID 



AAID 



14651386 t3 29 



WIST 



Length Length 
FT7TT 



Score Probability 
— 



4 . 6e-243 



Protein name 
Description 



Locus Name 



|sp:DHE4JBACFR 



Acc# 



P94316 



NT 



AA 



ORF Name 


NTID 


AAID 


Length 


Length 


2425lS£3_c2_46 


4162 


S3§4 


248 


747 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA 
Length Length 



Score Probability 



Protein name 

Description 
IKTCFTITT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



2Liaftafliij....Gi„.ia i ets* 



AAID Length Length 
WTZZ — 



Score Probability 
T52 



|8.$e-06 



Protein name 



Locus Name 



probable phosphoenolpyruvate synthase APE0026 



pir:E72754 



Acc# 



E72754 



Description 



1090 



ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


^88i37_ti_7 


4165 


9387 


81 


246 


114 



Protein name 
Description 

im 



|i.2e-05 



Locus Name 



sp : HPCEJKIMAH 



Acc# 



P48147 



ORF Name 



3355800 c3 48 



Protein name 

Description 
MO-HIT 



NT 



AA 



NT ID 



AAID 



4166 



Length Length 
T7T" 



Score Probability 



WIT 



Locus Name 



Acc# 



ORF Name 



Protein name 

Description 
INO-HIT 



NT 



AA 



NT ID 



AAID 



Length Length 



Score Probability 



TUT 



Locus Name 



Acc# 



ORF Name 



NT 



AA 



NTID 



AAID 



Length Length 



Score Probability 



T5T 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



1091 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
^JWl — 



TUTT 



Score Probability 
TT7 



0.00030 



Protein name 



Locus Name 



transmembrane sensor 



|gp:AF0516&i 



Acc# 



AF051691 



Description 



Pseuciomonas aeruginosa stress factor A (pslA) , ECF sigma tactor ( txul) , 
transmembrane sensor (fiuR) , and hydroxamate-typef errisiderophore receptor 
(fiuA) genes, complete cds. 



NT 



AA 



ORF Name 



NTID 



AAID 



^4i545£2 c3 



WTVT 



Length Length 



Score Probability 



Protein name 

Description 
NO -HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



2L46.42L18.6....C.2...5.5. I 14171 



Length Length 



Score Probability 
T5S 



3 .2e-07 



Protein name 



unknown 



Locus Name 
jgp:U9677T 



Acc# 



U96771 



Description 



Prevotelia bryantn putative polygalacturonase, B-l, 4- endoglucanase, and 
mannanase genes, complete cds; and unknowngenes . 



NT 



AA 



ORF Name 



NTID 



AAID 



WT7T 



Length Length 



Score Probability 



or 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



1092 



NT 



AA 



ORF Name 



NTID 



126376562 c2 54 



WTTT 



AAID Length Length 
— 



Score Probability 

wn — 



|2.^e-44 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



pir : JC6027 



Acc# 



JC6027 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
— 



fTUT 



Score Probability 
4.5e-i4 



Protein name 



Locus Name 



RNA polymerase sigma factor SigZ-like protein 



gp:AF137263 



Acc# 



AF137263 



Description 



Bacteroides tnetaiotaomicron 3 OS ri£>osomal protein Sl6-±ikeprotem, tucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



IM17.1&8.&...C2...5.1 1 



Length Length 



TT5T 



Score Probability 
TTu 



1.2e-07 



Protein name 



Locus Name 



unknown 



gp:U9677l 



Acc# 



U96771 



Description 



Prevotella bryantu putative polygalacturonase, B-l, 4- endoglucanase, and 
mannanase genes, complete cds; and unknowngenes . 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



Score Probability 
551 



|2.ie-fl6 



Protein name 



Locus Name 



receptor antigen (RagA) 



bp:PGII30872 



Acc# 



AJ130872 



Description 



Porphyromonas gmgivaiis W50 receptor antigen (rag) locus encodmga major 
immunodominant 55kDa antigen. 



1093 



ORF Name 



3929183 cl 47 



Protein name 

Description 
MO-HIT 



NT 



AA 



NT ID 



wrrr 



AAID Length Length 
~5JW5 — 



Score Probability 



Locus Name 



Acc# 



ORF Name 



Protein name 



NT 



AA 



NTID 



14173 



AAID Length Length 
Mu"0 — 



Score Probability 




3.ie-56 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



pir : JC6027 



Acc# 



JC6027 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



1140 



5.2e-S5 



Protein name 



Locus Name 



receptor antigen (RagA) 



gp :PGI130872 



ACC# 



AJ130872 



Description 



Porphyromonas gmgivalis W50 receptor antigen (rag J locus encodmga major 
immunodominant 55kDa antigen. 



NT 



AA 



ORF Name 



NTID 



&9A9.11...a±„A%. I RTTO 



AAID Length Length 




11775 



Score Probability 
TZ1 



0 .00017 



Protein name 



Locus Name 



outer membrane protein 



gpiSNRCMPB 



Acc# 



L77614 



Description 



Bacteroides thetaiotaomicron outer membrane protein (susD) gene, complete 
cds . 



1094 



NT 



AA 



ORF Name 



1208550 ±2 43 



NTID AAID Length Length 

5?U3 — 



Score Probability 



5TTT 



Protein name 

Description 
IFrcmTT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



129.47.2&7...11..6.5 



AAID Length Length 

wm — 



Score Probability 



FT 



VIST 



Protein name 

Description 
WO -HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



iaftfiaaaa..±i„.i5 1 ftst 



NTID AAID Length Length 

mrr=> — 



Score Probability 
fZTZ 



i.4e-17 



Protein name 



Locus Name 



unknown 



AF04S74^ 



Acc# 



AF048749 



Description 



Bacteroides tragiiis capsular polysaccharide biosynthesis operon, complete 
sequence . 



NT 



AA 



ORF Name 



U43.aaai„.ci...ia2L I wtm 



NTID AAID Length Length 

1^ 406 | [77~ 



Score Probability 



&3T 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



1095 



NT 



AA 



ORF Name 



NTID 



15103800 ti 19 



AAID Length Length 




Score Probability 
73 1 [uTTTTS 



Protein name 



Locus Name 



response regulator 



gp:AP130997 



Acc# 



AF130997 



Description 



Enterococcus taecium strain BM4339 vanD glycopeptide resistancegene 
cluster, complete sequence. 



NT 



AA 



ORF Name 



NTID 



4186 



AAID Length Length 
MS — 



7T 



Score Probability 




5 . le-15 



Protein name 



Locus Name 



conserved nypotnetical protein yisQ 



pir :H69837 



ACC# 



H69837 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



16.b.u:L&.3.3....G2...:L2U I 14187 



9409 



Length Length 
ITS 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



Score Probability 
— 



1 . 2e-187 



Protein name 



Locus Name 



UDP-ManNAc denydrogenase 



gp:AFl25l64 



ACC# 



AF125164 



Description 



Bacteroides tragilis 638R polysaccnaride B (PS B2) tuosynthesisiocus, 
complete sequence; and unknown genes. 



1096 



ORF Name 



NTID 



168328S5 c2 101 



Protein name 



NT AA 
— — Score 
AAID Length Length 



hypothetical protein 



Description 



TTTT 



Locus Name 



pir : JQ1020 



Probability 
|2.3e-177 



Acc# 
JQ102 0 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



4190 




9412 




81 


246 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 
2L16..7.2.1S.1...C3....13..7.., 



NTID 



AAID 



14191 



Length Length 
T71T 



Score Probability 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



I 14192 



Length Length 



Score Probability 
F73 



3.2e-13 



Protein name 



Locus Name 



unknown 



gp : AF048749 



Acc# 



AF048749 



Description 



Bacteroicies tragilis capsular polysaccharide biosynthesis operon, complete 
sequence . 



1097 



ORF Name 



NTID 



22657551 ci 54 



Protein name 



long -chain- tatty- acid CoA ixgase 



Description 



NT 



AA 



AAID Length Length Probability 
MT5 — 



565 



|7.9e-58 



Locus Name 



pir:D703S6 



Acc# 



D70386 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
MTS — 



TT 



TTT 



Score Probability 
T5B 



|2.7e-10 



Protein name 



Locus Name 



aryisuirotransr erase 



gp:AF126201 



Acc# 



AF126201 



Description 



Pseudomonas putida strain S-313 sultate ester ciesulturization genelocus, 
complete sequence. 



NT 



AA 



ORF Name 



NTID 



AAID 



ZZ.7.3.6.3.3.6....C2L...aa„ 



Length Length 



Score Probability 



TFF 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
3¥T3 



1ST 



Score Probability 

&z 



0.031 



Protein name 



Locus Name 



sp:SPRC_XENLA 



ACC# 



P36378 



Description 

I osteonectin) [UIT] (BASEMENT MEMBRANE PROTEIN BM-40) 



1098 



NT 



AA 



ORF Name 



NTID 



AAID 



ti 17 



Length Length 



Score 



Probability 
10.0024 



Protein name 



Locus Name 



retinoid. X receptor alpha homolog 



gp:UPU31832 



Acc# 



U31832 



Description 



Uca pugilator retinoid X receptor alpha homolog mRNA, DNA bmdingdomain 
region, partial cds . 



ORF Name 



24035212 C2 110 



Protein name 



NTID 



4198 



AAID 



NT 



AA 



Length Length 
TIT 



Score Probability 



Locus Name 



Acc# 



Description 
(NO-HIT 



ORF Name 



Protein name 



NTID 



NT AA 

^ „ ^ — ^, — _ Score Probability 
AAID Length Length J ~ 



$421 



5^" 



6.8e-86 



Locus Name 



GDP-L-tucose pathway enzyme 



gp:AB00&676 



Acc# 



AB008676 



Description 



Escherichia coli 0157 DNA, map position at 46 mm., complete cds. 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length JL 



Z^aD.6.5.ai...Cl...a.5. 



TT5T" 



Protein name 



Locus Name 



probable PPE protein 



pir :D70604 



Acc# 



D70604 



Description 



1099 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
HI 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



z5taaa^2...ti...La i 14202 



AAID Length Length 

— 



ST 



Score Probability 
57 



0.041 



Protein name 



Description 



Locus Name 



gp:F23A5 



Acc# 



AC011713 



Arabiciopsis thaiiana chromosome 1 BAC F23A5 sequence, compietesequence . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
TkZE — 



Score Probability 



ST 



Protein name 

Description 
[NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



4204 



AAID Length Length 
MTS 



IT7T 



Score Probability 
TUUS — 



|3.5e-i£5 



Protein name 



Locus Name 



UDP-GlcNAc 2-epimerase 



gp:AF125164 



Acc# 



AF125164 



Description 



Bacteroides tragilis 6 38R polysaccharide B (PS B2 ) Joiosynthesislocus, 
complete sequence; and unknown genes. 



1100 



ORF Name 



NT ID 



AAID 



NT AA 
— — „ Score 
Length Length 



31444127 ±3 51 



\Z7UT 



TZTT 



IU7T 



\TT7T 



Probability 
5.3e-162 



Protein name 



Locus Name 



tructose-bispnosphatase , 



|pir:C69621 



Acc# 



C69621 



Description 



NT 



AA 



ORF Name 



NTID 



dil^a.7.S....X3..„.Z0. 



4206 



_ _ _ _~ — Ll — , , Score Probability 
AAID Length Length JL 

WETS 



TJT 



T2T 



s . ye-07 



Protein name 



Locus Name 



probable lipopolysaccnarxde O-side chain 
biosynthesis protein (0-antigen transpoter) 



Description 



tpir:P71152 



Acc# 



F71152 



NT 



AA 



ORF Name 



NTID 



AAID 



WITT 



TZIT 



Length Length 
TIT 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



3.5a6.6A3.7....t2...3.9.„ 



NTID AAID Length Length 

WI$% 1 — 



Score Probability 



WT 



T7T 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



— _ — Score Probability 
AAID Length Length z - 



3.5.3A3.7.5.3....C.1...7.7... 



¥21)3 1 ig*3l 1 FT 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



1101 



NT 



AA 



ORF Name 



NT ID 



.36134625 t2 36 



AAID Length Length 
3132 — 



Score Probability 
— 



|6.0e-138 



Protein name 



Locus Name 



glucose- l-phosphate tnymidyl transterase 



gp:AF048745 



Acc# 



AF048749 



Description 



Bacteroides tragxlis capsular polysaccharide biosynthesis operon, complete 
sequence . 



NT 



AA 



ORF Name 



NTID 



.4465002 ±3 50 



4211 



AAID Length Length 
54T5 



TF7T" 



Score Probability 




5 . 6e-96 



Protein name 



Description 



Locus Name 



Acc# 



sp:Ylt)fi_SCOLl 



HYPOTHETICAL 58.5 KD PROTEIN IN GLVC-IBPB INTERGENIC REGION (ORFA) 



NT 



AA 



ORF Name 



46.M0.5.2..±3....6.7.., 



NTID AAID Length Length 




Score Probability 
TT31 



|6.0e-115 



Protein name 



Locus Name 



Cap8E 



gp:SAU73374 



Acc# 



U73374 



Description 



Staphylococcus aureus type 8 capsule genes, cap8A, cap8B, cap8C,cap8D, 
cap8E, cap8F, cap8G, cap8H, cap8I, cap8J, cap8K, cap8L,cap8M, cap8N, cap80, 
cap8P, complete cds . 



NT 



AA 



ORF Name 



m0.3.2..7...±3....£l. 



NTID AAID Length Length 

MTE — 



5^4" 



Score Probability 

— 



l.Oe-7* 



Protein name 



Locus Name 



dTDP-6-deoxy-D-glucose-3, 5 epimerase 



bp:A?048?45 



Acc# 



AF048749 



Description 



Bacteroides iragilis capsular polysaccharide biosynthesis operon, complete 
sequence . 



1102 



NT 



AA 



ORF Name 



NT ID 



4941937 c2 112 



AAID Length Length 
9436 



T7T 



Score Probability 
— 



5 . 9e-124 



Protein name 



Locus Name 



Acc# 



GDP-mannose denydratase 



gp:AF04 747 8 



Description 



Brucella melitensis strain 16M lipopolysaccharide O side chainbiosynthesis 
gene cluster, complete sequence. 



NT 



AA 



ORF Name 



NT ID 



AAID 



5115527 ±2 42 



Length Length 



Score Probability 
4 . ie-45 



Protein name 



Locus Name 



pleiotropic regulatory protein DegT 



pir :D69025 



ACC# 



D69025 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



b.a.7.SL1..7.6....t3....6.a.. 



42±t> 


443S 




^ 252 




16 





Protein name 



reverse transcriptase like protein 1, 
int ron - encoded 



Description 



Locus Name 



pir:S5S503 



0 . 048 



Acc# 



S58503 



ORF Name 



Protein name 



NT 



AA 



NTID AAID Length Length 
^¥T^ 



TUTT 



Score Probability 
171 



3.6e-30 



Locus Name 



aspartate aminotransferase (aspJo-liJcel) 
PAB0774 



pir :D75096 



Acc# 



D75096 



Description 



1103 



NT 



AA 



ORF Name 



NT ID 



7314165 tl 11 



AAID Length Length 

mzv — 



Score Probability 

mi — 



6.Se-38 



Protein name 

Description 
HYPOTHETICAL PftOTfilN HiiMS 



Locus Name 



sp:YA3 8 HAEIN 



Acc# 



P44099 



NT 



AA 



ORF Name 


NTID 


AAID 


Length 


Length 


828957_t3_64 


4i>l$ 


9441 

















Score Probability 



Protein name 

Description 
|NO-flIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



ittimfli„.ci..iia i 



9442 



t — o-i. -r — , -t Score Probability 
Length Length JL 

WTT 



TUT 



7T5~ 



7.2e-7l 



Protein name 



Locus Name 



probable oxidoreductase 



gp:SCFll 



Acc# 



AL132662 



Description 
Streptomyces coelicolor cosmid Fll. 



ORF Name 



NTID 



AAID 



NT AA . , 
T — ^ n , — Ll Score Probability 
Length Length 



lia7.7.1IS....c2...215. | |¥22T 



SOT" 



T7T" 



3 .3e-34 



Protein name 



Locus Name 



snikimate S-cienyctrogenase 



pir :F70377 



Acc# 



F70377 



Description 



1104 



ORF Name 



NTID 



NT AA , , . . 
T — _ — Score Probability 
AAID Length Length 2 - 



11963262 C3 239 



MM" 



1WF 



3 . 6e-30 



Protein name 



Locus Name 



conserved hypothetical protein 



Description 



pir :G72409 



Acc# 



G72409 



ORF Name 



Protein name 



lemA protein 



Description 



NTID 



NT AA , , . . 
_ — _ — Score Prob ability 
AAID Length Length 2 ~ 



TFZT 



T7T" 



Locus Name 



bir:P72311 



\2.6e-4S 



Acc# 



F72311 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



Length Length 



Score Probability 

— 



2 . 9e-46 



Locus Name 



sp:ALR2_BACStf 



Acc# 



P94494 



PUTATIVE ALANINE KACEMASE, 



NT 



AA 



ORF Name 



NTID 



AAID 



lb.U5.1.5.6.Z.„cl..lS.l I WZtt 



Length Length 



Score Probability 



Protein name 
Description 

NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



lb.fo.b.:/.Ub.2L...c2...211 1 



Length Length 



Score Probability 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



1105 



NT 



AA 



ORF Name 



NTID 



lba2%75 r2 57 



WITT 



T — _ — _ Score Proba bility 
AAID Length Length ■ L 

^ 



TTT 



J 



2.3e-05 



Protein name 



Locus Name 



receptor antigen (RagA) 



gp:PGI130872 



Acc# 



AJ130872 



Description 



Porphyromonas gmgivalis W50 receptor antigen (rag) locus encodinga major 
immunodominant 55kDa antigen. 



NT 



AA 



ORF Name 



NTID 



16131S77 c2 231 



AAID Length Length 
54515 — 



Score Probability 




l.^e-3l 



Protein name 



Locus Name 



conserved hypothetical integral membrane 
protein HP1061 



|pir:ES4652 



Acc# 



E64652 



Description 



NT 



AA 



ORF Name 



NTID 



14225 



AAID Length Length 

msi — 



TttT 



3132 



Score Probability 




i.2e-134 



Protein name 



Locus Name 



£>eta-galactosidase 



pir:?722S3 



Acc# 



F72283 



Description 



NT 



AA 



ORF Name 



NTID AAID Length Length 



Score Probability 



\2&i±t)m&.±±..±i& I imu 



Protein name 



Locus Name 



glutamine-asparaglne rich protein 



gp:DDU07S17 



Description 



10.043 



Acc# 
U07817 



Dictyostelium discoideum AX3 glutamme-asparagme rich protemgene, partial 
cds . 



1106 



NT 



AA 



ORF Name 



NT ID 



'22147552 c2 232 



WITT 



AAID Length Length 
15553 — 



TFFT 



Score Probability 
BCT 



Protein name 



Locus Name 



3 -0- acyl transt erase , MdmB : mi dec amy c in 
biosynthesis enzyme 



Description 



pir :A42719 



8. Oe-22 



Acc# 



A42719 



ORF Name 



Protein name 



Description 



NO-HIT 



NT ID 



NT AA n ^ 
_ _ _ _~ _ — ^. — ^ Score Probability 
AAID Length Length L 



14232 



[3T5T 



155" 



[253" 



Locus Name 



Acc# 



ORF Name 



2.3.b.3.Z18..7„..CL...lSA.. 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



W7JT 



M55~ 



Length Length 



7ZT 



Score Probability 
555 



|i.4e-5i 



Locus Name 



Acc# 



sp:LPXA_ECOLI 



(EC 2.3.1.125) {u£P-N-AC2TYLGLUC0SAMiN£! ACYLTkANSPSRASE) 



NT 



AA 



ORF Name 



NTID 



l±y±211L...G±..;±±& I WZtt 



AAID Length Length 
^T55 — 



55T" 



TT7TT 



Score Probability 
T77 



i.ie-23 



Protein name 



Locus Name 



|sp:£Uft5_MfiTJA 



Acc# 



Q57656 



Description 

(AIRS) (PHOSPHOklBOSYL-AMINOIMIDAZOLE SYNTHETASE) — (AIR SYNTHASE) 



1107 



NT 



AA 



ORF Name 



NTID 



123850828 c3 254 



-n t> -i--p^ T — T — ^, Score Probability 
AAID Length Length z - 

S¥S7 — 



Protein name 

Description 
IbJO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



■£3.&b:j3.D.2....a±...211 1 14235 



AAID Le^th Le^th Probability 

3458 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA , , . . 

AAID Length Length Probability 



'<L&i)J.&b.:iZ...al...2±9. I l¥2T7 



rrnr 



fTTTT 



14 . se-90 



Protein name 



Description 



Locus Name 



sp:RFl_COXBU 



Acc# 
P47849 



peptide CHAIN RELEASE FACTOR I (RF-i) 



ORF Name 



NTID 



AAID 



NT AA 
r — ^ r — , -< Score 
Length Length 



l Z&±XVAll...z'2....221 1 KZTS 



9460 



T3u~ 



EST" 



Protein name 

Description 
INO-HIT 



Locus Name 



Probability 



Acc# 



1108 



ORF Name 



NTID 



AAID 



NT AA 
t — ^ t — Score 
Length Length 



TTUT 



Probability 
|9.2e-55 



Protein name 



Locus Name 



hypothetical protein sirl880 



pir:aV7134 



Acc# 



S77134 



Description 



ORF Name 



NTID 



NT AA . , 
A7 , Tn _ — _ — Score Probabil ity 
AAID Length Length JL 



Protein name 



ribosomal protein L20 



Description 



7T 



7TT 



0.033 



Locus Name 



pir :A75326 



Acc# 



A75326 



NT 



AA 



ORF Name 



NTID 



AAID 



ZilHlii6.1...c3....24.a.. 



Length Length 
54^ 



TS4T 



Score Probability 
ll.2e-36 



Protein name 



Description 



Locus Name 



sp:¥UAG_6ACSU 



Acc# 



032076 



HYPOTHETICAL 55 . 0 KB PROTEIN IN GLGB -GBSB IN T ERGENIC REGION 



NT 



AA 



ORF Name 



NTID 



AAID 



24iO.SA5..7....al...l9.& I |¥23!T 



Length Length 



T53T 



Score Probability 




7.0e-59 



Protein name 



Locus Name 



hypothetical protein S111582 



pir:S75309 



Acc# 



S75309 



Description 



1109 



ORF Name 



246446&I c2 256 



Protein name 



Description 



(riC 3.5.1.-) 



NT AA , . 

NT ID AAID Length Length Probability 



T3ZT 



2.7e-44 



Locus Name 



sp:LPXC_HAETN 



Acc# 



P45070 



ORF Name 



24640752 t2 45 



Protein name 



NT ID 



AAID 



NT 



AA 



Length Length 



Score Probability 



T754 - 



Locus Name 



Acc# 



Description 
psfO-MIT 



ORF Name 



Protein name 



NT ID 



NT AA o ^ ^ , . 
_ — _ — _ Score Probability 
AAID Length Length - L 



14245 



^4F" 



7¥T~ 



Locus Name 



OMP decarboxyiase-orotate phosphoribosyl 
transferase, 



pir :T30520 



Description 



1.4e-37 



Acc# 



T30520 



NT 



AA 



ORF Name 



NTID 



AAID 



2&b.&b3A2...c.lJ±&9. I 



14245 



Length Length 



Score Probability 
^5T5 



Protein name 



Locus Name 



ubiquinone / menaquinone JDiosyntne sis 
methyl transferase 



pir:F75277 



Description 



|4.7e-44 



Acc# 



F75277 



1110 



ORF Name 



NTID 



NT AA , , . , 

AAID Length Length Probability 



124735830 c3 258 



4247 



545$ 



1188 



Protein name 
Description 

NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



mTTT 



Length Length 



Score Probability 




0.0023 



Protein name 



Locus Name 



probable glycosyl hydrolase 



pir :T36467 



Acc# 



T36467 



Description 



ORF Name 



NTID 



NT AA , , , , 
, _ T — ^, — _ Score Probability 
AAID Length Length ^ 



Zb.Zllub..7....£1...3.6... 



Protein name 



polygalacturonase precursor 



Description 



Locus Name 



foir:S5780& 



0.017 



Acc# 



S57806 



NT 



AA 



ORF Name 



Zb.3.6.1^3.S....t2....aZ 1 14250 



NTID AAID Length Length 




Score Probability 



Protein name 

Description 
IHO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



iai44&ai..±i...iia 1 14251 



. TmTr ^ ^-„-r^ T — — ^ n Score Probability 
NTID AAID Length Length JL 

mn> — 



TUT" 



TTTT 



8.3e-2± 



Protein name 



Locus Name 



Acc# 



histidme Jcmase 



|gp:5P>AJ63!=>3 



AJ006393 



Description 

streptococcus pneumoniae rr03 and nk03 genes; two component system03. 



1111 



ORF Name 



1:41520840 t2 $5 



Protein name 



Description 



MO-HIT 



NT ID 



NT AA 

J. y> tt-\ T — ^.-u T — Score Probability 
AAID Length Length L - 



Locus Name 



Acc# 



ORF Name 



3..i23.5&aCL.3LZ...5t5... 



Protein name 

Description 
NO-HIT 



NT 



AA 



NTID 



14253 



AAID Length Length 
I5T75 — 



Score Probability 



64 



Locus Name 



Acc# 



ORF Name 



3.3.&.1Z7.&.2...£2....&1.. 



Protein name 



NT 



AA 



NTID 



, , „ — — _ ^ — _ Score Probability 
AAID Length Length 

wrm — 



I2W 



5.2e-iS 



Locus Name 



nypotnetical protein aq_246 



pir :E70322 



Acc# 



E70322 



Description 



ORF Name 



NTID 



AAID 



NT AA . , 
_ — _ — ^ Score Probability 
Length Length JL 



M1.7.117.5....cii...2i4 ....J I5255 1 



9477 



^5T" 



FIT" 



li.6e-55 



Protein name 



Description 



Locus Name 



sp:i>UR7_ARATH 



Acc# 



P38025 



(AC 6.3.2.6) (SAICAR ^YOThE?ASE) 



NT 



AA 



ORF Name 



NTID 



AAID 



J>.±b.£.b3.±2....tl..M. I 14255 



5478 



Length Length 



Score Probability 
[135 



7.7e-i5 



Protein name 



Locus Name 



conservea nypotnetical protein 



pir:C7236i 



Acc# 
C72361 



Description 



1112 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 
— 



Score Probability 



IT 



Protein name 
Description 

NO-HIT 



Locus Name 



Acc# 









NT 


AA 


ORF Name 


NT ID 


AAID 


Length 


Length 


3.s.£uia&3,...ti...£3.. , 


425S 


94^0 


147 


444 



Score Probability 



Protein name 

Description 
[NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



3A£2M5i..c2...21£ I 



AAID Length Length 
— 



Score Probability 
3T7 



12 . 



Protein name 



Locus Name 



sp:YQKD_BACSU 



Acc# 



P54567 



Description 

HYPOTHETICAL 24.6 KB PROTEIN IN GLNQ-MSR INTERGENIC REGION 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
34"S3 



1050 



Score Probability 




6.3e-56 



Protein name 

Description 
(Ud 2.3.1.-) 



Locus Name 



sp:LPXD_RIC!RI 



Acc# 



P32202 



(FIRA PROTEIN) (R1FAMPICIN RESISTANCE PROTEIN) 



1113 



ORF Name 



NTID 



NT AA 

, , „ T — _ — Score Prob ability 
AAID Length Length JL 



39b6556 c2 223 



2 . Oe-50 



Protein name 



Locus Name 



tKNA isopentenyipyrophospnate transierase 
mi a A 



pir :G69S57 



Acc# 



G69657 



Description 



ORF Name 



NTID 



NT AA 
_ — , , _ — Score Probability 
AAID Length Length z - 



\&ZAqA1.±L„11 



4262 



9484 



1.4e-37 



Protein name 



Description 



Locus Name 



sp:TkUA_BACSU 



Acc# 



P70973 



1) (i^EUD OUkil)IN E SVUTHASE I) (URACIL HVDR0LYA5E) 



NT 



AA 



ORF Name 



NTID 



AAID Length Length Probability 
S^SS — 



T7B" 



TTZW 



i.2e-30 



Protein name 



Locus Name 



conserved hypotnetxcal protein 



pir:G723il 



Acc# 



G72311 



Description 



ORF Name 



NTID 



AAID 



&:/.2.X5.b:2.„x±..±±b. I WZZ% 



Protein name 

Description 
INO-HIT 



NT 



AA 



Length Length 



Score Probability 



W5T 



Locus Name 



Acc# 



1114 



ORF Name 



NT ID 



NT AA 

, * ™ _ — _ — _ Scor e Probability 
AAID Length Length z - 



1 4767^52 ci 174 



5457 



TUTT 



7TTT 



5.ie-70 



Protein name 



Description 



Locus Name 



gp:BMAJ4 829 



Acc# 



AJ224829 



Bacillus megaterium DSM319 spolV operon, 5' tianking region, 3' Hanking 
region . 



NT 



AA 



ORF Name 



NTID 



AAID 



b082bl2 c^ 266 



Length Length 
JTT 



Score Probability 



P rote in name 
Description 

NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



[42FT 



_ — _ — ^, Score Probability 
AAID Length Length 

— 



7T 



155 



3 . 3e-ll 



Protein name 



Locus Name 



conserved hypothetical secreted protein 
HP0320 



pir :H64559 



Acc# 



H64559 



Description 



ORF Name 



:/.I.2.feA3.1...cl...l8.5... 



Protein name 



NTID 



AAID 



9490 



leech zinc linger protein 



Description 
h.tnseriaiis Jbzri gene. 



NT 



AA 



T — ^ r — ^ Score Proba bility 
Length Length JL 



7T 



0. 020 



Locus Name 



gprlfMafALZE 1 ! 



Acc# 



X91396 



1115 



NT 



AA 



ORF Name 



NT ID 



85882 £2 56 



AAID Length Length 
5351 — 



Score Probability 




2 . 3e-10 



Protein name 



Description 



Locus Name 



Sp:MECI_STAEP 



Acc# 



P26598 



METfilClLLlltf RESISTANCE REGULATORY PROTEIN MECi 



NT 



AA 



ORF Name 



NTID 



AAID 



c2 221 



Length Length 



T3TT 



Score Probability 
TJS 



l.Se-47 



Protein name 



Description 



Locus Name 



sp:YWE0_6ACStf 



Acc# 



P39651 



HYPOTHETI CAL 51.0 KB PRO T EIN IN PTA 3 1 REGION 



NT 



AA 



ORF Name 



NTID 



4271 



.^-^ _ — _ T — Score Probability 
AAID Length Length JL 

— 



237" 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



2L^U.7.a.7.QQ...t2...a.l.. 



WTTT 



Length Length 
T7T" 



Score Probability 
TZT2 



|3.8e-33 



Protein name 

Description 
kOD SI&PE- DETERMINING PROTEIN £0bA 



Locus Name 



Acc# 



|sp:RODA_ECOLI 



1116 



NT 



AA 



ORF Name 



NT ID 



AAID 



\2M$S1U c3 52 



Length Length 



Score Probability 



Protein name 

Description 
ISO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
TTKT 



Score Probability 
Tu3 



|6.6e-06 



Protein name 



Locus Name 



hypothetical protein PH0217 



pir :G71244 



Acc# 



G71244 



Description 



ORF Name 



NTID 



AAID 



NT AA 
— — Score 
Length Length 



4275 



FT 



T5T 



Protein name 



Locus Name 



hypothetical protein PH0219 



pir:A71245 



Description 



Probability 
|3.2e-06 



Acc# 



A71245 



NT 



AA 



ORF Name 



NTID 



AAID 



Z^4a.7.fl&l...dl...3La I 14276 



Length Length 



Score Probability 




5.3e-36 



Protein name 



Description 



Locus Name 



sp:METF_AOUAE 



Acc# 



067422 



b, iO-MBTHYL B WETETRAHVDROPOIATij REDUCTASE, 



NT 



AA 



ORF Name 



NTID 



AAID 



WTTT 



Length Length 



Score Probability 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



1117 



ORF Name 



NT ID 



NT AA 
AAID Length Length 



5322506 c2 



^5" 



7T 



Probability 
10.034 



Protein name 



Locus Name 



nypotnetical protein PH022 0 



pir:B71245 



Acc# 



B71245 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID 



14275 



Length Length 



Score 



1479 



Probability 
|4.Se-39 



Protein name 



Description 



Locus Name 



Sp : YAAT BACSU 



Acc# 



P37541 



HYPOTHETICAL 21.2 KB PROTE IN IN XPAC-ABRB I N T L1RGEN I C R E GION 



NT 



AA 



ORF Name 



NT ID 



rar 



AAID Length Length 




T7TT 



Score Probability 



l.ie-15 



Protein name 



Locus Name 



DNA polymerase III gamma suJDunxt 



pir : A70460 



Acc# 



A70460 



Description 



ORF Name 



10.^.&M.5....ai...2sL 



Protein name 

Description 
INO-HIT 



NT 



AA 



it-i-j-p -j-T-v tv^-tt^ x — ^ — ^ n Score Probability 
NT ID AAID Length Length 

w^n — 



Locus Name 



Acc# 



1118 



NT 



AA 



ORF Name 



NTID 



AAID 



iii37$28 tl 1 



WIST 



Length Length 



Score Probability 

w& — 



4 . 5e-ll2 



Protein name 



Description 



Locus Name 



|sp:BGLS_AGRTU 



Acc# 



P27034 



GLUC051£>£ GLUCO^ydrOlaSe) 



NT 



AA 



ORF Name 



NTID 



AAID 



126$&$$2 c3 12 



Length Length 



Score Probability 




S.3e-37 



Protein name 



Locus Name 



sp : MMSR_PSEAE 



Acc# 



P28809 



Description 
MMSAB OPERON REGULATORY PROTEIN 



NT 



AA 



ORF Name 



NTID 



AAID 



19.5A±±6£..±1.A I RE7B? 



Length Length 



Score Probability 



TuTT 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 




Score Probability 
S¥7 



1.3e-84 



Protein name 



Locus Name 



L-arat>inose transport (permease) araE 



pir:F69587 



Acc# 



F69587 



Description 



1119 



NT 



AA 



ORF Name 



NT ID 



24651537 4:2 14 



AAID Length Length 

wzm — 



1102 



Score Probability 
— 



5 .4e-217 



Protein name 



Description 



Locus Name 



sp: YPHG_ECOLI 



Acc# 



P76585 



HVPOTlfiMIGAL 127.3 KD MOTfllltf IN CSiE-GlVa iNTfiRGEKflC fefiGlON 



NT 



AA 



ORF Name 



NT ID 



34069436 tl d 



4287 



AAID Length Length 

— 



Score Probability 
E5I 



|2.Se-l8 



Protein name 



Locus Name 



beta-galactosidase 



gp:AF055482 



Acc# 



AF055482 



Description 



Tiiermotoga neapolitana galactose utilization operon, complete sequence . 



NT 



AA 



ORF Name 



I5..7.S.5.0.a.7....c2....!S.l I WZ$% 



NTID AAID Length Length 
fTOTT5 



Score Probability 

its — 



3 . be-14 



Protein name 



Locus Name 



RNA polymerase sigma tactor SrgZ-like protein 



|gp:AF137263 



Acc# 



AF137263 



Description 



Bacteroides tnetaiotaomicron 30S rit>osomal protein S16-iiKeprotem, tucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds . 



NT 



AA 



ORF Name 



NTID 



16.13.143.7....t2...2.Q I 14289 



AAID Length Length 
55TT 



Score Probability 
TT5 



0.0015 



Protein name 



Locus Name 



BcDNA . GH119 7 3 



gp:Affl4Sg7l 



Acc# 



AF145671 



Description 



Drosophila melanogaster clone GH11973 BcDNA. GH11973 (BcDNA.GH11973 ) mRNA, 
complete cds . 



1120 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



20855303 i2 21 



Protein name 



TZTT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



m.7.52...cl..Aa 



Protein name 



FIT" 



Locus Name 



1.2e-46 



Acc# 



receptor antigen (RagA) 



Description 



gp:Paiii0«72 



AJ130872 



Porphyromonas gingivalxs W50 receptor antigen (rag) locus encodmga major 
immunodominant 55kDa antigen. 



ORF Name 



NTID 



AAID 



NT AA 

— — , Score Proba bility 
Length Length 



243.3.5..7.S.1..±3....2£.. 



Protein name 



9514 



T4TT 



77" 



Locus Name 



0.0054 



Acc# 



Description 



sp:NOLE>_kHlL±» 



P23717 



MODULATION PROTEIN N0L1> 



ORF Name 



NTID 



NT AA n „ . . . . . . 

— — Score Probability 
AAID Length Length 



Protein name 



255" 



Locus Name 



Acc# 



Description 



[NO-HIT 



1121 



ORF Name NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 




2655406V_cl_3S 4294 


9516 320 963 




1.2e-08 


1 


Protein name 


Locus Name 


Acc# 




transmembrane sensor 


gp:AF0S169i 


AF051691 




Description 




Tseudomonas aeruginosa stress tactor A ipsrA) , ecf sigma tactorinuij, 
transmembrane sensor (fiuR) , and hydroxamate-typef errisiderophore receptor 
(fiuA) genes, complete cds . 




ORF Name NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 




4l03377_r3_30 4295 


55l7 


119 36U 








Protein name 






Locus Name 


Acc# 




Description 














MO-HIT 














ORF Name NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 




l3.m5.S.[)...±3....A0. 42 


$51$ 


548 1647 


2S31 


S.9e-^9S 




Protein name 


Locus Name 


Acc# 




neuraminidase precursor 


gp : BNRMANASE 


D28493 




Description 














Bacteroides rragilis nanH gene for neuraminidase, complete cas 






ORF Name NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 




i4s.7.s.iao.^c2^ao. 4297 


19519 


532 


L59S 


139 


6.7e-0a 




Protein name 


Locus Name 


Acc# 




unknown 


gp:U96771 


U96771 





Description 



Prevotella brya nbii putative polygalacturonase, B-l , 4-endoglucanase , and 
mannanase genes, complete cds; and unknowngenes . 



1122 



ORF Name 



16267638 C2 67 



Protein name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



282 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



215.9.5l6.6.3...±1...3.^ 



3252 



4 . 8e-84 



Protein name 



Locus Name 



115K outer membrane protein precursor : susc 
protein 



pir : J(je>U27 



Acc# 



JC6027 



Description 



ORF Name 



NTID 



9522 



Protein name 



hypothetical protein TMib24 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



55T 



2586 



TTTJI" 



Locus Name 



pir:H7^228 



l.Se-iil 



Acc# 



H72228 



ORF Name 



Protein name 



unknown 



Description 



NTID 



AAID 



— — Score Probability 
Length Length 



125 



|l.5e-06 



Locus Name 



|gp:U%771 



Acc# 



U96771 



Prevoteiia bryantn putative polygalacturonase, B-i, 4-enaogiucanase, ana 
mannanase genes, complete cds; and unknowngenes . 



1123 



ORF Name 



NT ID 



NT AA „ 

— — Score Probability 



2425968:4 £2 27 



AAID Length Length 



\JJT 



8 .le-35 



Protein name 



Locus Name 



sialic-acia o-acetyiesterase 



gp :MMU4U4Ub 



Acc# 



U40408 



Description 

Mus musculus lysosomal sialic acid o-acetylesterase mRNA, completecas . 









NT 


AA 


Score 


Probability 


ORF Name 


NTID 


AAID 


Length Length 




244lS8'77_t3_J3 


4303 


$523 


521 


1566 


107 


1.2e-u7 



Protein name 



Locus Name 



unKnown 



gp:U9677l 



Acc# 



U96771 



Description 



Prevotella bryantu putative polygalacturonase, B-i, 4 -enctogiucanase, ana 
mannanase genes, complete cds; and unknowngenes . 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 




Score Probability 



9526 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



SETT 



NT 



AA 



Length Length 




Score Probability 



Locus Name 



Acc# 



Description 



KO-HIT 



1124 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



t3 41 



[2W 



T2T 



2.6e-06 



Protein name 



Locus Name 



Acc# 



sp : PA1B_RAT 



035264 



Description 

ACTIVATING fACTOR aCE'TyLSYDROlaSS AL£>HA 2 StiSUNIT) (PAf-AJi ALPHA 2) 



NT 



AA 



ORF Name 



NTID 



AAID 



TT77WT7-cTTT 



Length Length 
— 



Score Probability 
3.0e-S5 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



pir : JC6027 



Acc# 



JC6027 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
TJUE 



Score Probability 
|1.2e-84 



Protein name 



Locus Name 



Acc# 



alpna-L- tucosidase , 1 precursor, 
tissue :alpha-L-fucosidase I :alpha-L-fucoside 
fucohyflrolase, 



pir:HWHUFA 



Description 



ORF Name 



NTID 



AAID 



a4Al&42J„.±L...ll I 



SETT 



Protein name 



Description 



NT 



AA 



Length Length 



fZUTT 



Score Probability 
$.2e-26 



Locus Name 



sp:HEXA_PORGl 



Acc# 



P49008 



(BETA-NAHASE) 



1125 



ORF Name 



ibibl583 c2 124 



Protein name 



Description 
[No-HIT 



NT ID 



AAID 



NT AA 
Length Length Probability 



PIT 



Locus Name 



Acc# 



ORF Name 



Protein name 

Description 
(BK'l'A-ilAHASfi) 



NT AA 

NTID AAID Length Length Probability 



14511 1 



11139 



l.tfe-I15 



Locus Name 



sp:Hi£XA PORGI 



Acc# 



P49008 



ORF Name 



Protein name 



unKnown 



Description 



NTID 



NT AA 

AAID Length Length Probability 



WITT 



15W 



TTT 



l.8e-06 



Locus Name 



gp:U96771 



Acc# 



U96771 



Frevoteiia Joryantu putatxve polygalacturonase, , 4- endoglucanase, and 
mannanase genes, complete cds; and unknowngenes . 



ORF Name 



NT AA 

NTID AAID Length Length Probability 



TJTT 



TTT 



Protein name 

Description 
jJsiO-HIT 



Locus Name 



Acc# 



1126 



ORF Name 



NTID 



NT AA 

AAID Length Length Probability 



■7tfl^32 ri 4 



Protein name 



receptor antigen (RagA) 



Description 



TTTT 



5.9e-S2 



Locus Name 



gp:MI130S72 



Acc# 



AJ130872 



Porphyromonas gmgivaiis W50 receptor antigen (rag) locus encodinga major 
immunodominant 55kDa antigen. 



ORF Name 



NTID 



NT AA 

AAID Length Length Probability 



781932 t2 16 



9537 



TTuT" 



b.le-83 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



pir: 



Acc# 



JC6027 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length Probability 



1120 I 



t>.^e-73 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



pir:JC6027 



Acc# 



JC6027 



Description 



ORF Name 



NTID 



AAID 



&b.vA±&...al..±l)A I WJT7 



Protein name 

Description 
pTO-HIT 



NT 



Length Length 



— , , Score 



Locus Name 



Probability 



Acc# 



1127 



NT 



AA 



ORF Name 



NTID 



lmsoo ti 15 



\zrnr 



AAID Length Length 

mm — 



3W 



Score Probability 
TZS 



4.7e-44 



Protein name 



Locus Name 



pro&able nagA protein 



pir :C70845 



Acc# 



C70845 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
— 



TT9T 



Score Probability 
\JW% — 



1.8e-35 



Protein name 



Locus Name 



hypothetical protein b!325 



bir:H64881 



Acc# 



H64881 



Description 



NT 



AA 



ORF Name 



NTID 



4320 



AAID Length Length 




W5T 



Score Probability 
WT5 



2.7e-48 



Protein name 



Locus Name 



hypothetical protein 



pir:A72430 



Acc# 



A72430 



Description 









NT 


AA 


ORF Name 


NTID 


AAID 


Length 


Length 


lDt5M17...±l...ia | 


4321 


<>543 


70 


213 





Score Probability 



Protein name 

Description 
INO-MT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



Zl&Z6.5.6....r.2...25„. 



WTTF 



AAID Length Length 
— 



Score Probability 
£75 



|3.8e-18 



Protein name 



Locus Name 



polysugar degrading enzyme homolog yKtC 



pir :A69856 



Acc# 



A69856 



Description 



1128 



NT 



AA 



ORF Name 



NTID 



AAID 



2350305 cl G2 



Length Length 
TuTT 



Score 



TIT 



Probability 
|1.5e-06 



Protein name 



Locus Name 



Hypothetical protein PH0217 



pir:G7i244 



Acc# 



G71244 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



|2i40.7.8.27.„±1..5.0.., 



Length Length 



Score Probability 
puTJ 



2.3e-5S 



Protein name 



Description 



Locus Name 



Acc# 



sp:HT£A_EC!OLl 



PROTEASE DO PRECURSOR , 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



2±6A12t±..±2...10. . 



SOT" 



Protein name 

Description 
[NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



\2±6£.5.5.&.1..±2...20. I WTZZ 



AAID Length Length 
^HTS 



Score Probability 
2S7 



3.4e-25 



Protein name 



Locus Name 



phosphate transport system regulator PhoU 



pir :G72275 



Acc# 



G72275 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



\25.9.112±2..±l.A I WJ77 



Length Length 
2TD 



Score Probability 



[57 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



1129 



NT 



AA 



ORF Name 



NT ID 



AAID 



26367177 c3 iiO 



Length Length 



Score Probability 
[253 



2.2e-19 



Protein name 



Locus Name 



sensory protein kinase 



pir:T30222 



Acc# 



T30222 



Description 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length * L 



Protein name 



ciostripain-reiateci protein 



Description 



T5T 



'3.3e-i2 



Locus Name 



pir:B723El 



Acc# 



B72351 



NT 



AA 



ORF Name 



NT ID 



Au&M2l2..±3....£l I iimr 



AAID Length Length 
-$552 



Score Probability 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NT ID 



3.3.3.a.7.ias...±i...i£). I istjt 



AAID Length Length 
— 



Score Probability 
5TJ 



1.2e-S3 



Protein name 



Locus Name 



SigA 



bp:CTCf677l8 



Acc# 



U67718 



Description 



Chiorobium tepidum SigA (sigA) gene , complete cds . 



1130 



ORF Name 



13339837'; ±3 43 



Protein name 



Description 



NTID 



AAID 



NT AA 

t~™i-* Score Probability 
Length Length 



TUT 



FT8~ 



1.8e-35 



Locus Name 



Acc# 



sp:ftlSA_BACSU 



RIBOFLAVIN SYNTHASE ALPHA CtlAlN, 



ORF Name 



3386387£ c2 46 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



Length Length 
5TT 



— Score Probability 



T5BT" 



Locus Name 



Acc# 



ORF Name 



Protein name 



Description 



[NO-HIT 



NT 



AA 



NTID 



AAID 



TUT 



Length Length 



Score Probability 



Locus Name 



Acc# 



ORF Name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 
4 . 2e-15 



141 



Protein name 
Description 

PUTATIVE NAT)(P)H NITROREDUCTASE YKJI, 



Locus Name 



sp : YDGIJBACSU 



Acc# 



P96707 



1131 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
TIT 



Protein name 



phosphate transport ATP binding protein 



Description 



TUT 



Score Probability 




|2.4e-47 



Locus Name 



pir :G70390 



Acc# 



G70390 



ORF Name 



aaa&2£...Gi...3.6L.. 



Protein name 



Description 



NTID 



AAID 



NT AA 

T^t-v, t~™^ Score Probability 
Length Length 



WTTT 



2.5e-35 



Locus Name 



sp:RBNJ»EIH 



Acc# 



P44608 



RIB0NUCLEA3E BN, (RNASE BM) 



ORF Name 



NTID 



\AB.BllBA...al..±12 J WTTZ 



Protein name 



AAID 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 
MO-HIT 



ORF Name 



NTID 



^6.7.i0.6.2...cl...lll I [STT? 



Protein name 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



TOT" 



Locus Name 



Acc# 



Description 



1132 



NT 



AA 



ORF Name 



NTID 



78158 Cl 67 



AAID Length Length 
— 



Score Probability 




Protein name 



Description 



Locus Name 



sp:PHOP_BACSU 



Acc# 



P13792 



PHOP 



NT 



AA 



ORF Name 



15103S60 c5 56 



4141 



NTID AAID Length Length 




Score Probability 
75 



u.Olg 



Protein name 



Locus Name 



response regulator 



gp:AF130997 



Acc# 



AF130997 



Description 



Enterococcus taecium strain BM4339 vanD glycopeptide resistancegene 
cluster, complete sequence. 



ORF Name 



NTID 



NT AA 
T — _ — Score Probability 
AAID Length Length 



li53.m„.ca...Ba.. 



[4342 



1158 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



NTID 



NT AA 

_ _ __ _ — ^, _ — ^, Score Probability 
AAID Length Length A ~ 



T72T" 



|2.3e-177 



Protein name 



Locus Name 



hypothetical protein 



pir : jgi020 



Acc# 



JQ102 0 



Description 



1133 



ORF Name 



NT ID 



Protein name 



hypothetical protein 



Description 



NT 



AA 



AAID Length Length 
SSFS — 



Score Probability 
7T3 



5.ie-70 



Locus Name 



pir : JQ1020 



ACC# 



JQ1020 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
T7Z 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 




Score Probability 
|6.8e-i4S 



Protein name 



Locus Name 



putative UDP-GlcNAc rundecaprenylphosphate 



gp:AF04&744 



Acc# 



AF048749 



Description 



Bacteroides tragilis capsular polysaccharide biosynthesis operon, complete 
sequence . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 




I25T 



Score Probability 
£TT7 



1.0e-16 



Protein name 



Locus Name 



putative glycosyl transterase 



gp:L£*mil 



Acc# 



AJ007311 



Description 



Legionella pneumophila serogroup 1 iipopolysaccharide biosynthesisgene 
cluster . 



1134 



NT 



AA 



ORF Name 



NT ID 



AAID 



Length Length 
5T~ 



Score Probability 



TUT 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



22fisa4aa„.ci„.a4 



AAID Length Length 



2T9 



Score Probability 
|2.7e-i0 



156 



Protein name 



Locus Name 



arylsulrotranst erase 



gp7XFT2S2uT" 



Acc# 



AF126201 



Description 



Pseuctomonas putrcla strain s-313 sultate ester desulturization genelocus, 
complete sequence. 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



252 



07031 



Protein name 



Locus Name 



sp : SPRC_XENLA 



Acc# 



P36378 



Description 

(OSTEONECTIN) (ON) (BASEMENT MEMBRANE PROTEIN BM-40) 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score 



218£0±28...±t..±S I 



4351 


5573 


83 252 


64 





Probability 
10.031 



Protein name 



Locus Name 



sp : SPRC__XENLA 



ACC# 



P36378 



Description 

(OSTEONECTIN) — [UN] — (BASEMENT MemBRaNE PROTEIN BM-40) 



1135 



ORF Name 



Protein name 



NT ID 



NT AA 
_ — ^, _ — _ Score Probability 
AAID Length Length i - 



Locus Name 



Acc# 



Description 
INO-HIT 



NT 



AA 



ORF Name 



NTID 



i2fi2i27.fiL2L„.c2L.„4ft„ 



AAID Length Length 
— 



Score Probability 
TT1 



3. 7e-08 



Protein name 



Locus Name 



CapSJ 



gp:SAUS1973 



Acc# 



U81973 



Description 



Staphylococcus aureus capsule gene cluster CapSA through Cap5Pgenes, 
complete cds . 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



2L7.427.Sfi...cl™lS I 14^4 



ST 



Protein name 

Description 
[MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



l3i2Lft£i&aa2...cL™ia.. 



Length Length 



Score Probability 




0.010 



Protein name 



Locus Name 



Acc# 



hypothetical protein, 57.8 kD 



|gp:POL24B436 



Description 



Pseudomonas putida OCT plasmicl alJt genes cluster and flanking DNA, strain 
TF4-1L. 



1136 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 
TST3 — 



I57TT 



Score Probability 
6.0e-19 



Protein name 



Locus Name 



probable ixpopolysaccnarrae O-sicte cnam 
biosynthesis protein (O-antigen transpoter) 



foir:t'711b2 



Acc# 



F71152 



Description 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score Probability 


| 


4357 


9579 


171 


516 


95 


9.be-Ub 

















Protein name 



Description 



Locus Name 



sp:DBH5_RHILE 



ACC# 



P02348 



DMA-BINDING PROTEIN HftLW 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



15.$JA9&±..±i...:m. 



4358 



T5T 



rnr 



Protein name 



Description 



Locus Name 



sp:HIPB_LI(JOLl 



Acc# 



P23873 



HIPB PROTEIN 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



3.3.0.?.l5.1..±l...ll.. 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



1137 



ORF Name 
3345302 tl 8 



Protein name 



Description 



NT ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



T3T 



Locus Name 



Acc# 



[NO-HIT 



ORF Name 



4I0.3.3.SL7....al...3.fL 



Protein name 



NT ID 



AAID 



probable rhamnosyl trans t erase 



Description 



NT 



AA 



Length Length 



Score Probability 
TM 



Locus Name 



pir:H75596 



|2.ie-U 



Acc# 



H75596 



ORF Name 



Protein name 



Description 



NT 



NT ID 



AAID 



Length Length 
EST 



AA 

— Score Probability 



37 



Locus Name 



Acc# 



[NO-HIT 



ORF Name 



Protein name 



unknown 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length *- 





77 


234 




52 





0.023 



Locus Name 



Acc# 



|gp:AF134706 



AF134706 



Description 

Smorhizobium meliloti insertion sequence ISRml4 / completesequence . 



1138 



ORF Name 



NT ID 



NT AA 

— — , Score Probability 
AAID Length Length ^ 



14158425 c2 47 



TIT 



1 . 6e-05 



Protein name 



Description 



Locus Name 



|gp:AB00Q^ZT 



Acc# 



AB000222 



Stapnylococcus capitis epr gene , complete cas . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



14704715 c3 62 



RUT 



TUTT 



144T 



4 . le-148 



Protein name 



Locus Name 



UT)P-glucose-4-epimerase/aTDP-giucose-4 / 6 



[gp:A]?04a j re5~ 



Acc# 



AF048749 



Description 



Bacteroides rragilis capsular polysaccharide Joiosyntnesis operon, complete 
sequence . 



ORF Name 



5.S.7.0.15.i..±l...l 



Protein name 



NT 



AA 



NTID AAID Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



14557 



AAID 



9589 



NT AA „ ^ i_ -u • i ■ 4_ 
— — , Score Probab ility 
Length Length 



Locus Name 



Acc# 



Description 



MO-HIT 



1139 



ORF Name 



NT ID 



NT AA 
— , — , Score 
AAID Length Length 



S725327 c2 bl 



BUT 



rrr 



Probability 
|3.fle-26 



Protein name 



Locus Name 



glycosyl trans t erase 



pir:G7bb96 



Acc# 



G75596 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
15551 — 



Score 



— I [3uT 



Probability 
|1.0e-i35 



Protein name 



Description 



Locus Name 



spTTSTBHSECFIT 



Acc# 



Q45120 



INSER T ION SEQUENCE mi-LIKE PUTATIVE ATP-BINDING PROTEIN 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score 



TTTTT 



Probability 
|7.4e-171> 



Protein name 



Description 



Locus Name 



sp:TRA2_BAOTR 



Acc# 



Q45119 



T RANSP03ASE EOk INSERT I ON SE QUENCE ELEMENT IS^l-LlKE 



NT 



AA 



ORF Name 



NTID 



WTTT 



AAID Length Length 



— Score Probability 



552" 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



1140 



ORF Name 



Protein name 



NT ID 



TTTT 



NT 



AA 



AAID Length Length 
53~" 



Score Probability 



T35~ 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



23.M.7.MA..±2...4 



Protein name 



Description 



NT 



AA 



NT ID 



TTTT 



AAID Length Length 
TUT 



Score Probability 
|2.7e-ii2 



TTUT 



Locus Name 



sp:TkA2_liA(JPk 



Acc# 
Q45119 



ORF Name 



Protein name 



NT ID 



\lt£15.162.±Z..J. I 



NT AA 

— — Score Probability 
AAID Length Length 



13535" 



KIT 



&7T 



Locus Name 



Acc# 



Description 



IN0-H1T 



ORF Name 



Protein name 



Description 



NTID 



TTTT 



AAID 



353T 



NT AA 

— — Score Pr obability 
Length Length 



TIT 



Locus Name 



Acc# 



MO-HIT 



1141 



NT 



AA 



ORF Name 



NT ID 



AAID 



12285135 c3 311 



Length Length 
T5T 



Score Probability 



I.3e-145 



Protein name 



Description 



Locus Name 



sp:BTOA_HAETN 



Acc# 



P44426 



NT 



AA 



ORF Name 



NT ID 



AAID 



12636437 c3 302 



14377 



Length Length 
TT70 — 



Score Probability 
|4.9e-56 



Protein name 



Locus Name 



immunoreactive 53 kd antigen vurzs 



gp:APl4464l 



Acc# 



AF144641 



Description 



immunoreactive b3 kd antigenPGl23 gene, 



Porphyromonas gingivalis strain WbU 
complete cds . 



ORF Name 



Protein name 



NTID 



AAID 



i2ai7.:m..±2.„aa 



tut 



9600 



— — Score Probability 
Length Length 



T7T 



Locus Name 



Acc# 



Description 



[NO -HIT 



ORF Name 



Protein name 



Description 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



TTTT 



135" 



10.019 



Locus Name 



Acc# 



sp : PKHA^OoLl — ~"| 



(SC 5.2.1.5) (PPlASEl) (RO'l'AMASii!) 



1142 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probabi lity 
Length Length 



113835937 ±2 100 



1365 



Protein name 



Locus Name 



|2.2e-78 



Acc# 



cLihydxolipoamiae 
dehydrogenase, : 2-oxoglutarate dehydrogenase 
complex rhain E3racetoin dehydrogenase complex 
Description 



pir : 140794 



140794 



ORF Name 



NTID 



NT AA. 

— — Score Probabil ity 
AAID Length Length 



114095405 cl 162 



Protein name 



|438l 



I4TT 



WIT 



Locus Name 



Acc# 



Description 



[NO-MIT 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Pro bability 
Length Length 



±47.2.7.3.3.1...cl...iai.. 



Protein name 



ATWT 



TTT 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



\xA112S.ai...Cil..:lll.. 



Protein name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 





129 


390 




132 





Locus Name 



9.2e-09 



Acc# 



hypothetical protein APE16 73 



Description 



pir :E7^b4H 



E72548 



1143 



ORF Name 



NT ID 



NT AA 

— — Score Probability 
AAID Length Length 



TTJT 



Protein name 



Locus Name 



|1.7e-i20 



Acc# 



receptor antigen (RagA) 



|gp:PSI13Qg72" 



AJ130872 



Description 



Porphyromonas gingivalis W50 receptor antigen (rag) locus encoctmga major 
immunodominant 55kDa antigen. 





ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


16§32§£5_£3_155 




$607 




603 


762 


1.6e-7£ 


Protein name 








Locus 


Name 


Acc# 



hypothetical protein 



pir : JQ1020 



JQ102 0 



Description 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



— — , Score Pro bability 
Length Length 

— 



Locus Name 



Acc# 



Description 



NO-HI* 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



— — Score Pr obability 
Length Length 

— 



71 



Locus Name 



Acc# 



Description 



1144 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 
— 



TIT 



Score Probability 
|4.9e-8i 



Protein name 



Locus Name 



Salmonella typhimurium transcriptional 



|gp:^TV^TMFI" 



Acc# 



AF170176 



Description 



Salmonella typhimurium Iragment STMFl . 



ORF Name 



NTID 



AAID 



NT AA 

— — , Score Pro bability- 
Length Length 



120750302 ±1 44 



459 



6.3e-7l 



Protein name 



Description 



Locus Name 



sp:0DB2_fiA(JiJU 



Acc# 



P37942 



CHAIN TRANS AC YLAS E ) 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
fZUl — 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



NTID 



— — Score Probability 
AAID Length Length 



\21Zll...c±..±h!J... 



Protein name 



Locus Name 



gp:AB0230b4 



Acc# 



AB023064 



Description 

Listeria monocytogenes DNA tor DnaK operon, complete cds . 



1145 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 




Score Probability 



0.031 



Protein name 



Description 



Locus Name 



sp:^k<J_XENLA 



Acc# 



P36378 



(OSTEONECTIN) — rON) — (BASEMENT MElMBRANfi PROTfil^F BM-4U) 



ORF Name 



NT ID 



NT AA 

— — Score Probability 
AAID Length Length 



23851527 C3 312 



[4uT 



[T2T5~ 



1.7e-l03 



Protein name 



Description 



Locus Name 



|sp:£10P_kAiillW 



Acc# 



P44422 



LIGASU) 



ORF Name 



NT ID 



NT AA 

— — Score Prob ability 
AAID Length Length 



TUT 



1.2e-06 



Protein name 



Locus Name 



unknown 



gp:U96771 



Acc# 



U96771 



Description 



Prevotella bryantix putative polygalacturonase, B-l, 4 -endoglucanase , and 
mannanase genes, complete cds; and unknowngenes . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 




Score Probability 



555 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



1146 



ORF Name 



NT ID 



AAID 



NT AA 

— — , Score Pro bability 
Length Length 



124544015 c3 ^6 



T5T 



7.4e-U 



Protein name 



Locus Name 



prolidase 



gp:AB0146i3 



Acc# 



AB014613 



Description 



Aureobacterium esteraromaticum gene tor prolidase, complete cas. 



ORF Name 



NTID 



AAID 



NT AA 

— — Score P robability 
Length Length 



2464525:4 t3 123 



WW 



|3.2e-45 



Protein name 



Locus Name 



immunoreactive bUKD antigen P<3b3 



bp:APl7S720 



ACC# 



AF175720 



Description 



immunoreactive 50KD antigenPG53 gene, 



Porpnyromonas gingival is strain W5 0 
complete cds . 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



43 55 



$620 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



hypotneticai protein A556L 



Description 



NT 



AA 



Length Length 
T51 



Score Probability 
uTQTS 



Locus Name 



pir:T180S^ 



Acc# 



T18058 



1147 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



9622 



1.5e-54 



Protein name 



Locus Name 



sp:YPOa_MJilTJA 



Acc# 



Q58903 



Description 

HYPOTHETICAL ABC! TRANSPOR TER Al'^-BllttDlNfl PROTEIN MJlbUb 



ORF Name 



255515 cl lbi 



Protein name 



NT ID 



NT AA 

— — , Score Prob ability 
AAID Length Length 



74" 



Locus Name 



Acc# 



Description 



DsfO-MiT 



ORF Name 



Protein name 



NT ID 



NT AA 

— — Score Prob ability 
AAID Length Length 



\Z6.16MA6...±2..&4. I 



1.3e-l8S 



Locus Name 



propionyi-CoA carboxylase 



gp:A£007000 



Acc# 



AB007000 



Description 



Myxococcus xanthus MxppcB gene tor propionyi-CoA carboxylase, complete cds . 



ORF Name 



NT ID 



AAID 



NT AA 

— — Score Probability 
Length Length 



4403 



[TOT 



5.6e-48 



Protein name 



Description 



Locus Name 



sp:Blbli_UAt!liJ 



Acc# 
P45248 



2) (DTB SYNTHETASE! 2 ) (DfBS 2] 



1148 



ORF Name 



NT ID 



— — Score Probability 
AAID Length Length 



5713" 



I57TT 



TZTT 



3.2e-i32 



Protein name 



Locus Name 



acetyl-CoA carboxylase uaiotin carboxylase 
subunit) accC 



foirrA^Stil 



Acc# 



A69581 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Pr obability 
Length Length 



TT5T" 



TUT 



5.6e-i5 



Protein name 



Locus Name 



hypothetical protein aq_294 



pir:K70326 



Acc# 



H70326 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



taut 



Length Length 
757 



TUT 



Score Probability 
l.Se-06 



Protein name 



Locus Name 



hypothetical protein APE1466 



pir :B7262b 



Acc# 



B72626 



Description 



ORF Name 



3.21I^3.V...±I...4i.. 



Protein name 



Description 



NTID 



14407 



9629 



NT 



AA 



AAID Length Length 



Score Probability 



TIT 



2.2e-30 



Locus Name 



sp:LPLA_MY<JPW 



Acc# 



P75394 



PROBABLE LIPOATE- PROTEIN LIGASE A, 



1149 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



32116703 c2 2by 



TTS" 



3.4e-09 



Protein name 



Locus Name 



nypotneticai protein APi^Obi 



Description 



bir:G72blO 



Acc# 



G72510 



ORF Name 



Protein name 



flavodoxin 



Description 



NT AA _ _ , , . - . . 
— — Score Probability 
NTID AAID Length Length 



i.ie-33 



Locus Name 



Acc# 



pir :A2867U 



ORF Name 



Protein name 



NTID 



conserved nypotneticai protein 



Description 



NT 



AA 



AAID Length Length 



Score Probability 
|2.0e-0$ 



TTT 



Locus Name 



Acc# 



G72385 



ORF Name 



Protein name 



NTID 



|3.3.2lIS.S.27....c3....3.0.0. I I^TT 



NT AA 

— — Score Pro bability 
AAID Length Length 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



144 



Locus Name 



methylmalonyl-coa decarboxylase gamma cnain 
PAB1771 



pir:F73l3S 



Description 



5.8e-14 



Acc# 



F75135 



1150 



ORF Name 



Protein name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



TJUT 



Locus Name 



Acc# 



Description 



[MO-HIT 



ORF Name 



Protein name 



NTID 



NT AA „ ^ , , . . . . 
— — Score Probability 
AAID Length Length 



2.2e-44 



Locus Name 



sp:BIOC_HAfc!lN 



Acc# 



P45249 



Description 

frUTAftVE BIOTIN SYNTHESIS fr ROTBlti StOC 



ORF Name 



NTID 



NT AA 

— — , Score Probab ility 
AAID Length Length 



Protein name 



membrane protein 



Description 



TTJT 



6.5e-22 



Locus Name 



pir :G645yo 



Acc# 



G64590 



ORF Name 



Protein name 



NTID 



14416 



AAID 



9635 



aspartate aminotransterase 



NT 



AA 



Length Length 




Score Probability 




Locus Name 



pir :D72220 



i.7e-85 



Acc# 



D72220 



Description 



NT 



AA 



ORF Name 



NT ID 



14337882 r3 Ibl 



4417 



AAID Length Length 



2uW 



Score Probability 
|4.4e~103 



1022 



Protein name 



Locus Name 



probable (pyruvate) oxoisovaierate 
dehydrogenase alpha and beta fusion 



|pir:GVib2e 



Acc# 



G71526 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID 



|4A6.25l3.5....:L1..A.. 



Length Length 
T3T7 



Score 



Probability 
|4.4e-23 



Protein name 



Description 



Locus Name 



sp:YCFW_ECOLl 



Acc# 



P75958 



HYPOTHETICAL 45.3 KB PfcOTUlN I N M^ ' D-^O BB IN'l'Ek^EMIC mitiloM 



ORF Name 



NTID 



NT AA 

— — Score Pro bability 
AAID Length Length 



\&&BM:±\L.a±..A±Q. 



T2¥S" 



TTT 



6.ie-35 



Protein name 



Locus Name 



nypotnetical protein APKlbbV 



pir:(^72S7b 



Acc# 



G72575 



Description 



ORF Name 



NTID 



AAID 



|48.0.7.S.8.7....c.3....3.0.i 



Protem name 



L-lactate permease (ictP) nomolog 



NT 



AA 



Length Length 
T5T2 



Score Probability 
1.7e-94 



MX 



Locus Name 



jpir:C70i7b 



Acc# 



C70175 



Description 



NT 



AA 



ORF Name 



NT ID 



14553527 ±1 SI 



AAID Length Length 



Score 



XuT" 



Probability 
1.0e-05 



Protein name 

Description 
THtOL PROl'fiAS / HEMAGGLtJT I N IN PRtJ frRfiCtJKdOK, 



Locus Name 



sp:PMT_l>Ok(il 



Acc# 



P43158 



ORF Name 



NTID 



NT AA 
— , — , Score 
AAID Length Length 



I45575S2 c2 261 



Probability 
•7.8e-l6 



Protein name 



Description 



Locus Name 



Acc# 



Q59288 



(CH0NDR01 T IN SULFATE LVA^) (CHONDkOlT IN AC! ELIMINATE) 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



— Score Probability 



5±9£±ll...z£Jllb. 



WIT 



ttut 



[2W 



4 . 7e-26 



Protein name 



Locus Name 



Acc# 



sensor protein piis 



bir:S70b2« 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



^4T 



Length Length 
TJUE — 



Score Probability 



£33" 



Protein name 



Description 



Locus Name 



Acc# 



[NO-HIT 



1153 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



54802 C2 264 



35TT 



TT7T 



Protein name 



Locus Name 



NADH dehydrogenase, : protein sir08bl:protem 
slr0851 



Description 



pir :S 7482b 



6.4e-63 



Acc# 



S74826 



ORF Name 



Protein name 



Description 



NTID 



AAID 



NT AA 
— — n S core 
Length Length 



|5.2.8.46.ii...cl...l7.2 1 WFZZ 



1 [XT7T5 



[35S~ 



Probability 
|6.9e-^4 



Locus Name 



sprYJV^EA^T 



Acc# 
P40896 



HYPOTHETICAL KB PROTEIN IN HXTtt-cJM'i IM ' UkciE KliO kUcilufl 



ORF Name 



Protein name 



NTID 



NT AA 

— — Score Pr obability 
AAID Length Length 



4427 



P5¥T 



[TTS5" 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



EpsG 



Description 



NT 



AA 



NTID 



AAID 



9650 



Length Length 
TT71 — 



Score Probability 
2.5>e-09 



Locus Name 



Acc# 



gp:A?0364tib 



Plasmid pNZ4 0 du , complete sequence. 



1154 



ORF Name 



NTID 



NT AA 

— — , Score Probab ility 
AAID Length Length 



1172330 ti '1 



4429 



1038 



Protein name 



Locus Name 



1.4e-106 



Acc# 



Ketol-acid reauctoisomerase 



|gp:^Pl6'/^T" 



Description 



Y16743 



Piromyces sp. E2 mRNA tor ketol-acia reauctoisomerase. 



ORF Name 



NTID 



20443775 cl 23 



AAID 



Protein name 



NT AA 

— — Score Pro bability 
Length Length 



1935 



7TT 



Locus Name 



5>.2e-7l 



Acc# 



nypotnetical protein T18E12.6 



Description 



pir :T02byy 



T02699 



ORF Name 



Protein name 



NTID 



AAID 



14431 



NT 



AA 



Length Length 
TTT? — 



Score Probability 



Locus Name 



Acc# 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— — , Score Pr obability 
Length Length 



|23.4.7.S.5.12..±l...l.. 



Protein name 



Locus Name 



Acc# 



Description 



NT 



AA 



ORF Name 



NTID 



24017527 12 12 



AAID Length Length 
[2T7~ 



7ZT 



Score Probability 
|2.5e-i5 



IT5? 



Protein name 



Locus Name 



palmitoyl-acyi carrier protein tnioesterase I igp : AF03426b 



Acc# 



AF034266 



Description 



Gossypium hirsutum palmitoyi-acyi carrier protein tnioesterase (FatBl) mRNA, 
partial cds. 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



124331376 H y 



4434 



7W 



3.5e-245 



Protein name 



Description 



Locus Name 



spiACOM^RAVK 



Acc# 



P49609 



HYDRO -LVA^E ) ( AdoNlTAt! fc! ) 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



TFT 



t . 8e-n 



Protein name 



Locus Name 



acetoiactate syntnase 



pir :ii704by 



Acc# 



E70459 



Description 



NT 



AA 



ORF Name 



NTID 



[24^" 



AAID Length Length 

— 



TUT 



Score Probability 
3.Se-S& 



Protein name 



Locus Name 



isocitrate dehyrogenase 



igpiBiiaoijrr 



Acc# 



Y13358 



Description 

Bacillus Israeli isocitrate aenyarogenase gene. 



1156 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



1250942b rl i* 



4437 



|1.7e-62 



Protein name 



Description 



Locus Name 



[gp :AB022gF7" 



Acc# 



AB022867 



£>revoteila ruminicola genes tor polyA polymerase, D-alanmegiycineperraease 
and cellulase, complete cds . 



ORF Name 



NTID 



NT AA 
— — , Score 
AAID Length Length 



H3837557 c3 73 



KIT 



TTT 



Probability 
|S.3e-07 



Protein name 



Description 



Locus Name 



|gp:M2BCW£g~ 



Acc# 



M36913 



2. mays cell wall protein mRNA, 3' end. 



ORF Name 



NTID 



NT AA 
— , — , Score 
AAID Length Length 



|lM5.1D.D.2..±2....i6.., 



TZTT 



Probability 
|3.3e-22 



Protein name 



Description 



Locus Name 



sp : YEBA_HAEIJsl 



Acc# 



P44693 



HYPOTHETICAL E^OTElltf HT04 09 



ORF Name 



NTID 



AAID 



NT AA 
— — „ Score 
Length Length 



Probability 



!&3A&t&l.±Z...l.& 



4440 



T54~ 



317 



2 . 2e-28 



Protein name 



Locus Name 



4-metnyl-5 (o-nyclroxyethyl) -tnxazole 
monophosphate biosynthesis protein (thiJ) 
homolog — 



pir:D70l77 



Acc# 



D70177 



Description 



1157 



NT 



AA 



ORF Name 



NT ID 



±1 7 



4441 



AAID Length Length 



SuT" 



Score Probability 
3.8e-33 



352 



Protein name 



Locus Name 



proJoaJDie nucieoside-aipnospnate Kinase, 



|pir:CViii6 



Acc# 



C71116 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



4442 


9664 




310 


933 591 





.2.ie-57 



Protein name 



Locus Name 



gp : PGPUT 



Acc# 



X97228 



Description 
P.gingivalis gpdxJ , put, ana ynt)G-pg genes. 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



4443 



Protein name 



nypotnetical protein 



Description 



77T" 



6.7e-100 



Locus Name 



pir : JQ1U2U 



Acc# 
JQ1020 



ORF Name 



Protein name 



NTID 



4444 



NT 



AA 



AAID Length Length 



Score Probability 



1335 



Locus Name 



Acc# 



Description 



[MO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



35" 



255 



Locus Name 



Acc# 



Description 



NO-HIT 



1158 



ORF Name 



NT ID 



NT 



AA 



AAID Length Length 



Score Probability 



3312913 t3 'Z'/ 



Protein name 



TIT 



672 



336 



Locus Name 



|2.2e-30 



Acc# 



conserved nypotnetical protein yacM 



Description 



ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


3.3.3.a^U2....tZ...Xd 


4447 


9669 


267 


S04 


587 


5 . 5e-57 



Protein name 



Locus Name 



Acc# 



triosephosphate isomerase 



Description 



bp:AF04jJri6 



AF043386 



Clostridium acet obutyiicum glyceraidenyae-3-pnospnate aenyarogenase igapj , 
phosphoglycerate kinase (pgk) , and triosephosphate isomerase (tpi) genes, 
complete cds; and 2 , 3-bpg- independent phosphoglyceratemutase (pgm-i) gene, 
partial cds . 



ORF Name 



NT ID 



NT 



AA 



AAID Length Length 



Score Probability 



Protein name 



9670 



[TOT 



Locus Name 



7.6e-m 



Acc# 



Description 



sp:ftfie<3_Sii!Jsl«3 



Q55681 



ATP - B E P E ND E N T DfrJA HELICASE RECG, 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


l&D&±&0.b...±l...ll 


4445 


5671 


8§ 


267 


110 


l.9e-06 



Protein name 



Locus Name 



Acc# 



hypothetical protein PHS004 



Description 



pir :F7124b 



F71245 



1159 



UKr jn a. me 


NT ID 


AAID 


NT 
Length 


AA 
Length 




Score 


Probability 


35205387_±3_26 


4450 


9672 


293 


882 








Protein name 








Locus 


Name 












sp:T0HB 


_NEIG0 


006432 


Description 
















TOON'S PROTEIN | 




ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 




Score 


Probability 


4lS3587_ti__24 


| 4451 


9673 


211 


636 




S7<5 


2.0e-66 



Protein name 



Locus Name 



pyridoxal pnosphate syntnetase 



bp:£>(5PUT 



Acc# 



X97228 



Description 

P. gingival is gpdxJ , put, ana ynJoG-pg 



genes . 



NT 



AA 



ORF Name 



NT ID 



AAID 



4452 



Length Length 




Score 



143 



TIT 



Probability 
|1.6e-09 



Protein name 



Description 



Locus Name 



sp:TOLR_HAElM 



Acc# 



P43769 



NT 



AA 



ORF Name 



NT ID 



AAID 



5.u5.7.9.12....a2...b.y... 



4453 



— — , Score Pr obability 
Length Length 



TTTT 



Protein name 



Locus Name 



Acc# 



Description 



1160 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



14454 



TTZT 



Protein name 



Locus Name 



nypotnetical protein 



|gp:P£Ti>4;^b4 



Acc# 



AJ243354 



Description 



Pseudomonas stutzeri hypl and comA genes and putative toiy, ex.bB,tolR ana 
exbD genes. 



ORF Name 



NTID 



10157 c2 2-/9 



Protein name 



carbonic annydrase Homo log ytiB 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



l.le-51 



Locus Name 



pir :F6yyy^ 



Acc# 



F69993 



ORF Name 



Protein name 



NTID 



4456 



AAID 



NT 



AA 



Length Length 

mn — 



Score Probability 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 
TIE 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



1161 



ORF Name 



12600240 t3 137 



Protein name 



NT ID 



NT 



AA 



AAID Length Length 




Score Probability 



353" 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



12fL7.6.D.6.1..±l...i.. 



Protein name 



NTID 



14459 



AAID 



NT 



AA 



Length Length 
T¥75 



Score Probability 



FT7¥- 



Locus Name 



Acc# 



Description 



[MO-HIT 



ORF Name 



Protein name 



Algl 



Description 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 
1.6e-56 



Locus Name 



|gp:PAUB02uT" 



Acc# 



U50202 



Pseudomonas aeruginosa alginate gene cluster Algl (algl) , AlgJ laigJ) ana 
AlgF (algF) genes, complete cds . 



ORF Name 



NTID 



NT AA „ _ 
— — Score Proba bility 
AAID Length Length 



l3.2&5.3.2:£L±2i...lll I WZZT 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



1162 



NT 



AA 



ORF Name 



NT ID 



14541001 t2 W± 



4452 



AAID Length Length 



TTZT 



Score Probability 
3.7e-2S 



TT5 



Protein name 



Locus Name 



thiamin biosynthesis protein nomolog 



pir :H6y^au 



Acc# 



H69260 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



3.7e-07 



Protein name 



Locus Name 



KIAA1275 protein 



IgprABO^^lOl 



Acc# 



AB033101 



Description 



Homo sapiens mRNA tor K1AA12/5 protein, partial cds . 



NT 



AA 



ORF Name 



NTID AAID Length Length 



Score Probability 



4464 



1623 



6.4e-67 



Protein name 



Locus Name 



outer membrane protein 



bpiBNROMWJ 



Acc# 



L77614 



Description 



Bacteroides thetaiotaomicron outer memJorane protein (susD) gene, complete 
cds . 



NT 



AA 



ORF Name 



NTID AAID Length Length 
TJ1 1 [4u"2 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



[NO-HIT 



1163 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



14954712 ti 4y 



S.0e-0a 



Protein name 

Description 
MULTIDRUG RESISTANCE) OPERO ^ REPRESSOR 



Locus Name 



sp:MEXRJ>yii!AE 



Acc# 



P52003 



ORF Name 



NT ID 



NT AA. 

— — Score Probability 
AAID Length Length 



l6l0lSS7 ±1 SO 



1.7e-37 



Protein name 

Description 
MULTIDRUG R E SISTANC E PROT EIN A HOMCLO(^ 



Locus Name 



sp:EMRA_HAEIN 



Acc# 



P44928 



ORF Name 



NT ID 



NT AA 

— — Score Probability 
AAID Length Length 



TTZT 



2.3e-177 



Protein name 



Locus Name 



hypothetical protein 



pir : jgiu^u 



Acc# 



JQ1020 



Description 



ORF Name 



±6.9.0£..±1...1$. 



Protein name 



NT ID 



AAID 



NT 



AA 



Length Length 



— Score Probability 



^4 



Locus Name 



Acc# 



Description 



NO-HIT 



1164 



ORF Name 



120015643 ci M'l 



Protein name 



NTID 



NT AA 

— — , Score Probab ility 
AAID Length Length jL 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



20:i6^26,..^l,.:112 



4471 



573 



|3.2e-20 



Protein name 



Description 



Locus Name 



sp:YM67_Akc4?'U 



Acc# 



028017 



(EC 1. -.-.-) 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
^F^5 — 



Score Probability 
3.3e-2§ 



Protein name 



Locus Name 



alpha-amyiase, precursor : protein cub^u 



bir:S730SV 



Acc# 



S73087 



Description 



ORF Name 



NTID 



NT AA 

— — Score Pro bability 
AAID Length Length 



\222B3M±..±±..±\IA.. 



3635 



1203 



TFT" 



Protein name 



Locus Name 



4.0e-10 



Acc# 



thiol:disuitide mtercnange protein nomolog 
yneN 



pir :E69891 



Description 



E69891 



1165 



ORF Name 



NT ID 



NT AA 

— — , Score Probability 
AAID Length Length 



22459655 C3 3 6 0 



or 



|4.7e-06 



Protein name 



Locus Name 



transmembrane sensor 



gp:AP0bl6yi 



Acc# 



AF051691 



Description 

Pseudomonas aeruginosa stress tactor A (pstA) , ECF sigma tactor(tiul) , 
transmembrane sensor (f iuR) , and hydroxamate-typef errisiderophore receptor 
(fiuA) genes, complete cds . 



ORF Name 



NT ID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



I2247S431 ±2 84 



Protein name 



Locus Name 



sp:YfcK0_SA(j£JU 



Acc# 



P54442 



Description 

HYPOTHETICAL 4^.4 KT) PROTEIN I N BLTk-S POIIKJ INTKk^islNicJ kUiiiuN 



ORF Name 



NT ID 



NT AA 

— — Score Probabilxty 
AAID Length Length 



\iz&&nn&::L±i..±$. | w^rrz 



WIT 



[77TT 



Ii.3e-ii3 



Protein name 



Locus Name 



cytosolic pnospnogiycerate Kinase l 



|gp:AB01&410 



Acc# 
AB018410 



Description 

£>opuius nigra tnCytSGKl mktfA tor cytosolic pnospnogiycerate Kmasei, 
complete cds. 





ORF Name 


NT ID 


AAID 


NT AA 
Length Length 


Score 


Probability 


22&£.&±2&...al„lA(). 


4477 




§3 252 


64 


0.031 


Protein name 






Locus 


Name 


Acc# 










sprSPRC 




P36378 




Description 















1166 



NT 



AA 



ORF Name 



NT ID 



.23554563 i'2 «1 



TUT 



AAID Length Length 
5?U 



57M 



TIT 



Score Probability 
5.0e-47 



Protein name 



Locus Name 



endonuclease in 



pir :B7iyiy 



Acc# 



B71919 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



TZ7T 



5B" 



0.0059 



Protein name 



Description 



Locus Name 



sp:A&CD_j>SEAE "[ 



ACC# 



P18275 



ARg l NimJ/ORNITHlNU ANTIPOkTKk 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 
— 



TTS2 



TIT 



Score Probability 
|4.2e-46 



Protein name 



Locus Name 



115K outer membrane protein precursor : busu 
protein 



pir:JC6027 



Acc# 



JC6027 



Description 



— — Score P robability 
Length Length 



AA 



ORF Name 



NT ID 



AAID 



\2&0±6525....z2..25& ...I 



TTVT 



TUJT 



I3.4e-ii9 



Protein name 



Locus Name 



|sp:ALF_Tkl^A 



Acc# 
083668 



Description 

^RUCtOSE-BlSPllOriPHAl'fi ALdOlA^, 



1167 



NT 



AA 



ORF Name 



NTID 



24017303 Cl 242 



AAID Length Length 

5704 | |677 | proa — 



Score Probability 
|4.6e-U&8 



Protein name 



Locus Name 



pullulanase 



gp:BTU6706I 



Acc# 



U67061 



Description 



Sacteroides tnetaiotaomicron pullulanase (pull) gene, complete cds. 



NT 



AA 



ORF Name 



NTID 



124041626 is 161 



4483 



AAID Length Length 
T355 



1552" 



Score Probability 
l.Se-06 



141 



Protein name 



Locus Name 



conserved nypotnetical protein MTH83 



foir:F6S2lO 



Acc# 



F69210 



Description 



NT 



AA 



ORF Name 



NTID 



|244I!ia:/.7...±i...li^ I 



AAID Length Length 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



14485 



9707 



Length Length 
TFT* 



Score Probability 
|4.0e-21 



Protein name 



Locus Name 



nypotnetical protein PH0272 



pir :A714b2 



Acc# 



A71452 



Description 



ORF Name 



NTID 



NT AA 

— — Score Probabil ity 
AAID Length Length 



TT7W 



S.0e-08 



Protein name 



Locus Name 



conserved nypotnetical protein BB0195 



|pir:C70124 



Acc# 



C70124 



Description 



1168 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length 



24685717 t3 lib 



WTJJT 



MI 



Protein name 



Locus Name 



antibiotic resistance protein fromolog ywoG 



Description 



pir:B70065 



1.7e-^ 



Acc# 



B70065 



ORF Name 



\216.$.5.1±&..±2J1L 



Protein name 



NT ID 



AAID 



4485 



T7TTT 



NT AA 

— , — , Score Probability 
Length Length 



TI5" 



Locus Name 



2.8e-78 



Acc# 



Description 

-TRNA LIPASE ALPHA CHAIN) (^miktS) 



sp:SVPA__BAO^U 



ORF Name 



Protein name 



NTID 



AAID 



T7TT" 



NT 



AA 



Length Length 
T7 



Score Probability 



Locus Name 



Acc# 



Description 



ORF Name 



Protein name 



NTID 



AAID 



TTTT 



NT AA 

— — , Score Probability 
Length Length 



Locus Name 



Acc# 



Description 



NO-HM 



1169 



ORF Name 



NTID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



25978377 t2 ttb 



T7TT 



ii.8e-i6 



Protein name 
Description 



Locus Name 



sp:V798_METJA 



Acc# 



Q58208 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



44 92 



9714 



508 



T5TT 



WW 



£.4e-5l 



Protein name 



Description 



Locus Name 



gp:AB0l&E>V8 



Acc# 



AB019578 



Microcystis aeruginosa mcyA, mcyB ana mcyC genes, complete cas. 



ORF Name 



Protein name 



NTID 



NT 



AA 



AAID Length Length 
[S7T5 



Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



|2fifl.7.IDL<i:/....ai...il!3L.. 



Protein name 



NTID 



AAID 



4454 



NT AA 

— — , Score Probability 
Length Length 



72" 



TFT 



Locus Name 



Acc# 



Description 



NO-HIT 



1170 



ORF Name 



3125006 c2 



Protein name 



NT ID 



AAID 



T7TT 



NT 



Length Length 
TB3 



AA 

— , Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT 



Length Length 



AA 

— Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



4497 



TTTT 



7TT 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



Description 



pir: JC6(W7 



4.9e-48 



Acc# 



JC6027 



ORF Name 



Protein name 



NTID 



NT AA 
— — , Score 
AAID Length Length 



|3.1£7.6.&&7...±l...n I 



acyl carrier protein 



Description 



WT 



[TUT 



Locus Name 



pir:S2847b 



Probability 
i.4e-05 



Acc# 



S28475 



1171 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



32620312 12 108 



T7F" 



Protein name 



Locus Name 



|4.8e-i0 



Acc# 



VceB 



gp:AF012101 



Description 



AF012101 



Vibrio cholerae ettlux gene A (vceA; and ettlux gene B (vceB) multidrug 
resistance pump genes, complete cds . 



ORF Name 



NTID 



3377027 cl 213 



Protein name 



NT AA 

— — , Score Pro bability 
AAID Length Length 



9722 



Locus Name 



1.3e-06 



Acc# 



nypotnetical protein 



Description 



pir:^22l6 



F72216 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 

Length 


Score 


Probability 


l±l±±052...alJlll. | 


4501 


3723 


175 


528 


285 


5.he-2h 

















Protein name 



Locus Name 



conserved hypothetical protein ag_2l7l 



Description 



pir :D704B6 



ACC# 



D70486 



ORF Name 



3.5.8.2M6.1...c3....il6.. 



Protein name 



NT 



NTID 



AAID 



Length Length 
TW2 



AA 

— Score Probability 



ST 



Locus Name 



Acc# 



Description 



NO-HIT 



1172 



ORF Name 



NT ID 



AAID 



NT AA 

— — , Score Probability- 
Length Length 



36360000 i2 112 



4503 



972b 



WIT 



TST5~ 



TTT 



0.00057 



Protein name 



Locus Name 



Acc# 



unknown 



gp:AF0i3216 



Description 



Myxococcus xanthus Dog l dog) , isocitrate lyase Uci) , Mis tmis),Uto (uto) , 
fumarate hydratase (fhy) , and proteosome major subunit (clpP) genes, complete 
cds; and acyl-CoA oxidase (aco) gene, partial cds . 



ORF Name 



351540 cl 201 



Protein name 



NT ID 



T5U4" 



NT 



AA 



AAID Length Length 




Score Probability 



1ST 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



neopuliulanase 



Description 



NT ID 



NT AA o _ , , . _ . . 
— — Score Probability 
AAID Length Length 



TTTT 



TSZT 



2.0e-l39 



Locus Name 



gp:BTU66tti#7 



Acc# 



U66897 



Bacteroides thetaiotaomicron neopuliulanase (susA) andalpna-glucosiaase 
(susB) genes, complete cds. 



NT 



AA 



ORF Name 



NTID 



AAID 



|410A0.6.2..±l...b.B.., 



4506 



9728 



Length Length 




W5 



Score Probability 
2.0e-20 



Protein name 



Locus Name 



probable riioosomal protein L31 



|pir:T36353 



Acc# 



T36353 



Description 



ORF Name 



NT ID 



NT AA 

— — Score Probability 
AAID Length Length 



4144002 cl 245 



TTZT 



57T 



E5T 



1.0e-25 



Protein name 



Locus Name 



RNA polymerase sigma r actor Sigz-lilce protein 



Acc# 



AF137263 



Description 



Bacteroides thetaiotaomicron 30S nbosomal protein sl6-±ik:eprotein, tucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds. 





ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


4l720l2_cl_225 


4506 


$750 


285 


852 


454 


5.0e-44 



Protein name 



Locus Name 



endo-JDeta-galactosidase 



gp:AF085^6 



Acc# 



AF083896 



Description 

Flavobacterium Keratoiyticus endo-beta-galactosidase gene, compietecas. 





ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


ASLactm^ti^aa 


4503 


5751 


520 


1563 


1554 


5.2e-i68 



Protein name 



Locus Name 



metnylmaionyl-CoA decarboxylase, aipna cnain 



pir :A49094 



Acc# 



A49094 



Description 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


&8.2.Q33J....t.2....XaQ. 


4510 


9752 


146 


441 242 




2 . oe-20 



Protein name 



Locus Name 



glutaconyl-CoA decarboxylase gamma subunit 



gp:AF050576 



Acc# 



AF030576 



Description 



Acidammococcus termentans methylmalonyl-CoA decarboxylase alpnasubunit 
(mmdA) gene, partial cds; and glutaconyl-CoA decarboxylasedelta subunit 
(gcdD) , glutaconyl-CoA decarboxylase gamma subunit (gcdC) , and glutaconyl-CoA 
decarboxylase beta subunit (gcdB) genes, complete cds. 



1174 



ORF Name 



1492206 tl lb 



Protein name 



NT ID 



— — , Score P robability 
AAID Length Length 



J5T 



Locus Name 



Acc# 



Description 



IMO-HIT 



ORF Name 



Protein name 



NT ID 



NT AA 

— — Score Probability 
AAID Length Length 



|4M£18.:L.c2...24& I 



\53% 1 [XFT7 



|0.00i$ 



Locus Name 



outer membrane protein 



Igp : BNROMPA 



Acc# 
L77615 



Description 



Sacteroides thetaiotaomicron outer membrane protein (susE) gene, complete 
cds . 



ORF Name 



NT ID 



Protein name 



iai2Lai&£...ci„ifti I F^tt 



AAID 



NT AA ^ _ , , . , . . 
— — Score Pr obability 
Length Length 



Locus Name 



|2.2e-30 



ACC# 



Description 



sp:MAlM4At^U 



MAF PROTEIN 



ORF Name 



Protein name 



NTID 



14514 



NT AA 

— — , Score Probability 
AAID Length Length 



purr 



Locus Name 



|4.8e-2S 



Acc# 



crossover junction endodeoxyribonuc lease 



bir:B7^60 



Description 



B72360 



1175 



ORF Name 



5890712 ±3 171 



Protein name 



Description 



NT ID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



4515 



TTJT 



Tnr 



T5T~ 



Locus Name 



Acc# 



NO -HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NT ID 



AAID 



Length Length 

wn — 



Score Probability 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



a.7..7.2.3.3.D....t^...Xlb... 



Protein name 



Description 



INO-HIT 



NT 



AA 



NTID 



AAID 



T5TT 



Length Length 



Score Probability 



1T5" 



Locus Name 



Acc# 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



T5TF" 



T7W 



Length Length 



Score Probability 



Locus Name 



Acc# 



InO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



14519 



AAID Length Length 
T53~ 



Score Probability 
|3.Se-37 



Locus Name 



oxaloacetate cle carboxylase , beta subunit 



pir :6^72324 



Acc# 



B72324 



Description 



1176 



ORF Name 



NTID 



— — , Score Probability 
AAID Length Length J ~ 



I3I34680 4y 



4520 



5.8e-0£ 



Protein name 



Locus Name 



beta-galactosidase , 



pir :T2y4i4 



Acc# 



T29434 



Description 









NT 


AA 


ORF Name 


NTID 


AAID 


Length 


Length 


I47iaaaa^ti™i4 


4521 


9743 


83 


252 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



T5T 



Score Probability 
1.3e-06 



Protein name 



Locus Name 



M-protem 



gp:SEU73l6i> 



Acc# 



U73162 



Description 

Streptococcus equi M-protem (seMJ gene, complete cds. 



NT 



AA 



ORF Name 



NTID 



4523 



AAID Length Length 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



1177 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



16504818 cl 71 



|4.7e-lb 



Protein name 



Locus Name 



Acc# 



colicin I receptor 



|gp:ECOC!iir 



Description 



E.coli colicin I receptor gene, complete ccis . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



17065076 tl lb 



4^" 



T7TT 



752" 



S.le-22 



Protein name 



Description 



Locus Name 



Acc# 



spiYJJSjiCiOLl 



HYPOTHETICAL 2b. 3 KB PkOTUlN IN klMl-P RFC INTEkGENIC kEciloN 



ORF Name 



NTID 



AAID 



*KTT AA 

— — Score P robability 
Length Length 



2D.7.8.i7.0.B...±l...i 



14526 



^6" 



TuT" 



WT 



0.021 



Protein name 



Locus Name 



Acc# 



glutamyl -tKNA reductase 



gp:AF0800^y 



Description 



Chlorobium vibriotorme glutamyl- tRNA reductase (hemA) gene, complete cas; 
and porphobilinogen deaminase (hemC) gene, partialcds . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
TIT 



Score Probability 
i.le-41 



Protein name 



Locus Name 



115K outer membrane protein precursor : susu 
protein 



pir:JC602? 



Acc# 



JC602 7 



Description 



1178 



ORF Name 



24225385 12 ib 



Protein name 



NT ID 



NT 



AA 



AAID Length Length 
TT1 



Score Probability 



7T 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



4529 



hypothetical protein 



Description 



NT AA 

— — Score Pro bability 
Length Length 



287 



0.0042 



Locus Name 



pir :Ti06yy 



Acc# 



T10699 



NT 



AA 



ORF Name 



NTID 



4530 



AAID Length Length 
SSu 



fZTT 



Score Probability 
5.7e-55 



Protein name 



Locus Name 



thymidine kinase 



gp:AF0^720 



Acc# 



AF028720 



Description 



Rhodotnermus sp. 1 I'M SlS* thymidine kinase {ZdK) gene, completecds . 



ORF Name 



NTID 



NT AA 

— — Score Pr obability 
AAID Length Length 



3.3.i3.3.0.1D....cl..A3... 



|2.3e-97 



Protein name 



Locus Name 



receptor antigen (Rag A) 



|gp:£>GIl30a72 



Acc# 



AJ130872 



Description 



Porphyromonas gmgivaiis W50 receptor antigen (rag; locus encodinga ma: or 
immunodominant 55kDa antigen. 



1179 



ORF Name 



34178252 cl 64 



Protein name 



NT ID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



Locus Name 



Acc# 



Description 



IN0-M1T 



ORF Name 



Protein name 



NT ID 



9755 



conserved nypothetical protein 



Description 



NT 



AA 



AAID Length Length 



Score Probability 




Locus Name 



5.7e-5B 



Acc# 



D72343 



ORF Name 



NT ID 



— — Score Probability 
AAID Length Length 



[5TTT 



3.ae-14 



Protein name 



Locus Name 



unknown 



gp: 1^6771 



Acc# 



U96771 



Description 



Prevotelia Joryantu putative polygalacturonase, B-l , 4-enaoglucanase, and 
mannanase genes, complete cds; and unknowngenes . 



NT 



AA 



ORF Name 



NTID 



19A&A12...Z1...12 



AAID Length Length 
— 



Score Probability 



Protein name 



Locus Name 



Acc# 



Description 



KfO-HIT 



1180 



NT 



AA 



ORF Name 



NTID 



14251417 cl b& 



AAID Length Length 
— 



Score Probability 



Protein name 



Locus Name 



probable permease perM homolog iperM) RP6 3 0 I |pir :E7i668 



Description 



i.4e-26 



Acc# 



E71668 



ORF Name 



^7.3.a.7..7.b....tZ...Zl.. 



Protein name 



NTID 



AAID 



9759 



NT 



AA 



Length Length 

or 



Score Probability 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



NTID 



AAID 



NT AA „ 

— — Score P robability 
Length Length 



11A.±A1.±2..2± 



2.8e-5B 



Protein name 



Locus Name 



endo-1, 4-Joeta-xylanase, 



pir :T309U9 



ACC# 



T30909 



Description 



ORF Name 



NTID 



AAID 



Protein name 



regulatory protein pcnR- 2 : protein 
slrl489 rprotein slrl489 



Description 



NT 



AA 



Length Length 
— 



Score Probability 



2.0e-20 



Locus Name 



pir :S74456 



ACC# 



S74456 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



ttfO-Ml 



1181 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



TIT 



Score Probability 
TTTJe^n 



Protein name 



Locus Name 



sp:YVBSJttA(Jfc!U 



Acc# 



032244 



Description 

HYPOTHETICAL 22,6 Kb PrOT2i N isf OPtfCA-ENO HWfiRflfliJIg kB^lOEJ 



ORF Name 



$$5240 c2 83 



Protein name 



Description 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



[1ST 



FTTT 



Locus Name 



|sp:YM23_YfiA£j'r 



Acc# 
P53832 



PRECURSOR 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID 



9765 



Length Length 




Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



4544 



adenylate cyclase Jiomolog 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



4083 



2.0e-4i 



Locus Name 



foirzTi ' /iiJV 



Acc# 



T17197 



1182 



NT 



AA 



ORF Name 



NT ID 



10737900 t3 43 



AAID Length Length 
TIT 



TTT7 



Score Probability 
S>.3e-0S 



Protein name 



Locus Name 



AnsH pnospnatase 



|gp:SCAHBA<^ 



Acc# 



AF131879 



Description 



Streptomyces collinus ansatrienin AHBA biosynthetic gene clusterregion z , 
complete sequence. 





ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


l6S32fi35J:2Ji8 


4546 


576$ 


15$ 


477 








Protein name 








Locus 


Name 




Acc# 



hypotnetical protein 



pir : JQ1020 



JQ1020 



Description 



ORF Name 



NT ID 



Protein name 



AAID 



nypotnetical protein 



Description 



NT 



AA 



Length Length 




FOT" 



Score Probability 
|2.3e-I77 



rrzr 



Locus Name 



bir:JOi020 



Acc# 



JQ1020 



ORF Name 



NT ID 



iamniii...ti™tt 



4548 



Protein name 



AAID 



9770 



NT AA 

— — Score Pr obability 
Length Length 



Locus Name 



Acc# 



Description 



MO-HIT 



1183 



ORF Name 



NT ID 



NT AA 

— , — , Score Probabi lity 
AAID Length Length J ~ 



T7TT 



0.0095 



Protein name 



Locus Name 



putative giucosyl Hydrolase precursor 



|gp:AF047a55" 



Acc# 



AF047839 



Description 



Pseudoalteromonas sp. S9 putative giucosyl Hydrolase precursor andadaptive 
response regulatory protein (ada) genes, complete cds . 



NT 



AA 



ORF Name 



NT ID 



l$SSS?7a ci 87 



AAID Length Length 



Score Probability 
S.6e-l2 



Protein name 



Locus Name 



MsmR 



Acc# 



U49397 



Description 



Streptococcus pyogenes MsmR imsmRj gene, partial cds; LepA UepAj,cpa 
(cpa) , and Nra (nra) genes, complete cds; SsbA (ssbA) gene, partial cds; and 
unknown genes . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
TT^ 



Score Probability 



TO 



ff^5~ 



|4.7e-35 



Protein name 



Locus Name 



hypothetical protein PAB0790 



bir:H7!>0iJB 



Acc# 



H75098 



Description 



ORF Name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



Protein name 



Locus Name 



0.031 



Acc# 



Description 



P36378 



(OSTEONECTIN) (ON) (BASEM E NT MEMBRANE PROTEIN BM-40) 



1184 



ORF Name 



NT ID 



AAID 



NT AA 

— — , Score Pro bability 
Length Length 



Protein name 



Locus Name 



0.031 



Acc# 



Description 



sp:SPRC_XSNLA 



(OSJBOtiBCTtH) (ON) (BASflME!N? MflMfeRAtia P ROTEllN BM-40J 



P36378 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



|2422$677 c2 64 



Protein name 



Locus Name 



Acc# 



Description 



ORF Name 



2S0..7.0.10...±2...1^... 



Protein name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



TT7T 



Locus Name 



|4.2e-l<5 



Acc# 



hypothetical protein PHlluv 



Description 



pir:D710bl 



D71051 



ORF Name 



NTID 



lB.19£tb.b..±l...lb. 



4556 



Protein name 



NT 



AA 



AAID Length Length 
TH> 



Score Probability 



TT 



Locus Name 



Acc# 



Description 



1185 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



33442 ti 9 



TTTT 



TT7T 



Protein name 



Locus Name 



transcription regulator AraC/Xyis tamiiy 
homo log ydeE 



Description 



toir:ti697V7 



i.5e-16 



Acc# 



G69777 



ORF Name 



NTID 



AAID 



NT AA 

— — n Score Probability 
Length Length 



3.3.6.25.4.0.7....cl...b.i).. 



75T 



i.6e-06 



Protein name 



Locus Name 



transposase 



Acc# 



AF038866 



Description 



transposase (bipHJ andmotnlization 



Bacteroiaes tragxlis transposon Tnbb^o 
protein BmpH (bmpH) genes, complete cds 



NT 



AA 



ORF Name 



NTID 



AAID 



3.5.5i5.5.1SL.7....c3....aa., 



Length Length 
TT5 



TTT 



Score Probability 
|i.2e-05 



TTT 



Protein name 



Locus Name 



transposase 



gp:AF038S66 



Acc# 



AF038866 



Description 



transposase (bipH) andmoDilization 



Bacteroides tragilis transposon Tn5520 
protein BmpH (bmpH) genes, complete cds 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Pro bability 
Length Length 



TTT 



FT7T" 



|4.5e-07 



Protein name 



Locus Name 



hypotneticai protein MTH628 



pirT^T^T 



Acc# 



E69183 



Description 



1186 



ORF Name 



NT ID 



NT AA 

— • — , Score Probability 
AAID Length Length 



35523937 c2 72 



TTTT 



4.7e-34 



Protein name 



Locus Name 



sialic acid-speciric 9-o-acetyiesterase 



gp : MMAS y U A 



ACC# 



X98625 



Description 



M. musculus mRJMA tor sialic acid-specitic 9-o-acetylesterase . 



NT 



AA 



ORF Name 



4095052 cl b3 



NT ID AAID Length Length 

FIT" 



Score Probability 
0.00020 — 



5ff 



Protein name 



Locus Name 



oligopeptide ABC transporter, ATP-binding 
protein 



pir:£)?22&9 



Acc# 



D72289 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— , — n Score Probability 
Length Length 



Protein name 



70 



TFT 



Locus Name 



Acc# 



Description 



ORF Name 



|4I10.0.Iu..±3....47... 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 




Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



1187 



ORF Name 



|43S4£S2 c2 



Protein name 



Description 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length 



7T 



ITT 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NT ID 



AAID 



4566 



T7W 



Length Length 
ITT 



Score Probability 
3.0e-40 



Locus Name 



sp: YJV8_YEAST 



ACC# 



P40892 



(BC 2.5.1. 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID 



Length Length 
fTTTT 



TTT 



Score Probability 
3.0e-l7 



TTT 



Locus Name 



hypothetical protein TM0383 



Acc# 



G72383 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
TUT 



Score Probability 



73T" 



Protein name 



Description 



Locus Name 



Acc# 



1188 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



7031556 ti ii 



Protein name 



9791 



T7T" 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



a&6.£3.7...±3....3.b... 



Protein name 



NTID 



14570 



NT AA 

— — , Score Probability 
AAID Length Length 



9792 



11555 



Locus Name 



4 . 4e-4U 



Acc# 



alpha -glucosidase 



Description 



gp:M'U66^7 



U66897 



Bacteroides thetaiotaomxcron neopulluianase (susAJ andalpha-glucosidase 
(susB) genes, complete cds . 



ORF Name 



Protein name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



118 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



±103AC>.6.1.±1...±1 1 



Protein name 



AAID 



NT 



AA 



Length Length 



Score Probability 



T5T 



Locus Name 



Acc# 



Description 



MO-HIT 



1189 



NT 



AA 



ORF Name 



NT ID 



193757 t2 21 



4573 



AAID Length Length 



9795 



Score Probability 
9.8e-83 



pro" 



Protein name 



Locus Name 



115K outer membrane protein precursor : Susc 
protein 



pir : JC6027 



Acc# 



JC6027 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



57351T 



T7W 



5.9e-163 



Protein name 



Locus Name 



immunoreactive 8 7JcD antigen PG92 



gp:AF175724 



Acc# 



AF175724 



Description 



Porphyromonas gmgivalis strain WbO immunoreactive 87 KD antigenPG92 gene, 
complete cds. 



ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


22115.1&±.±1JU 


4575 


9797 


188 


567 


215 




l.4e~l/ 



















Protein name 



Locus Name 



RNA polymerase ECF-type sigma t actor sigw 



pir:H69706 



Acc# 



H69706 



Description 



ORF Name 



Protein name 



NT 



AA 



NT ID 



AAID 



9798 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



1190 



ORF Name 



NTID 



24023387 ±2 12 



Protein name 



NT 



AA 



AAID Length Length 



Score Probability 



TTUT 



i.2e-iii 



Locus Name 



Acc# 



putative secreted £>eta-galactosictase 



Description 



gpTSCFHl 



AL133171 



Streptomyces coelicolor cosmid F81. 



ORF Name 



NTID 



242l$562 hi 9 



Protein name 



NT 



AA 



AAID Length Length 



Score Probability 



1.3e-36 



Locus Name 



Acc# 



hypotnetical protein 



Description 



tpir:£7S054 



S76053 



ORF Name 



Protein name 



NTID 



NT AA 

— — Score Probabil ity 
AAID Length Length 



TTuT" 



TTJT 



9.6e-i7<> 



Locus Name 



Acc# 



ABC transporter (ATP -binding protein) homo log 
ykpA 



pir :E6yyfei 



E69861 



Description 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID 



Length Length 

tzui — 



Score Probability 



5ff 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



25306512 ci 56 



TJUT 



i.2e-i32 



Protein name 



Locus Name 



immunore active heat shocK protein DnaJ 



gp:AF145797 



Acc# 



AF145797 



Description 



Porphyromonas gmgivalis strain W50 immunoreactive neat shockprotein DnaJ 
gene, complete cds . 



ORF Name 



NTID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



34181503 tl 2 



4582 



9804 



54¥~ 



IFX5~ 



S.le-0& 



Protein name 



Locus Name 



outer membrane protein 



|gp:fiKtftOM&E" 



Acc# 



L77614 



Description 



Bacteroides tnetaiotaomicron outer membrane protein (susDj gene, complete 
cds . 



ORF Name 



NTID 



NT AA 

— — , Score Prob ability 
AAID Length Length 



3.45.5.3.3.7.5....c2....7.& I 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



|3.5.7.3..7.2D.8...±2...14 



AAID Length Length 




mm 



Score Probability 
i.le-70 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



|pir:JC60a7 



Acc# 



JC6027 



Description 



1192 



ORF Name 



NT ID 



NT AA 

— , — , Score Probabi lity 
AAID Length Length 



36360255 t2 26 



i.5e-22 



Protein name 



Locus Name 



sp : PLC_BACCE 



ACC# 



P14262 



Description 

(MOSpaAfiaYLlNOSlTOL-fiP^ClP lg £H6Sfc>H0tlPA5fl C) (Pt-PLC) 



ORF Name 



Protein name 



NTID 



9808 



NT 



AA 



AAID Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



lAiaaiai...ci...aj3i.. 



Protein name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



surtace antigen BspA 



Description 



Locus Name 



pxr :T31Uy4 



5.7e-u7 



Acc# 



T31094 



ORF Name 



\1&3M.21.±±.A 



Protein name 



NTID 



AAID 



MIT 



probable transmembrane protein 



Description 



NT AA 

— — , Score Probability 
Length Length 



3.ie-22 



Locus Name 



pir:T34651 



Acc# 



T34651 



1193 



ORF Name 



NT ID 



NT AA 

— — , Score Probability 
AAID Length Length ^ 



4589 



est 



|2.2e-78 



Protein name 



Locus Name 



immunoreactive 87KD antigen P(iy2 



|gp:AP175'^4 



Acc# 



AF175724 



Description 



Porphyromonas gingivalis strain W50 immunoreactive 87K:D antigenPG92 gene, 
complete cds. 



ORF Name 



NT ID 



NT AA 

— — Score Probability 
AAID Length Length 



6640682 cl 5$ 



$812 



1.6e-27 



Protein name 



Description 



Locus Name 



sp:<3ftfit_PftATU 



Acc# 



P48204 



GkPE PROTEIN 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



14551 



l.Ie-58 



Protein name 



Locus Name 



integrase 



gp:BFU7b371 



ACC# 



U75371 



Description 



feacteroides tragilis transposon Tn4555 TnpA ttnpAJ , integrase imtj , Tnpc 
(tnpC) , excisionase (xis) , mobilization protein (mobA),and beta-lactamase 
(cfxA) genes, complete cds; and unknown genes. 



NT 



AA 



ORF Name 



NT ID 



AAID 



9814 



Length Length 

vm — 



Score Probability 



1T7T 



T7T 



5.3e-34 



Protein name 



Locus Name 



hypothetical protein sll08bb 



lpir:S74g33 



Acc# 



S74833 



Description 



1194 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



1203515 C3 268 



Protein name 



T2T 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



NT 



AA 



AAID Length Length 
W5~ 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



Protein name 



T5W 



Locus Name 



5.9e-10 



Acc# 



nypotnetical protein jnp065i 



Description 



pir :EViyUb 



E71905 



ORF Name 



NTID 



\±11&12±1.±1„±().L I 



Protein name 



AAID 



NT 



AA 



Length Length 
255 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



AAID 



NT AA 
— — . Score 
Length Length 



Probability 



Protein name 



F7~ 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NT ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



14970628 Cl 165 



i.3e-53 



Protein name 



Locus Name 



Acc# 



K+ transport protein nomolog 



pir:H70430 



H70430 



Description 



ORF Name 



l£MALlB...±2Jlb... 



Protein name 



NT ID 



AAID 



NT AA 

— — Score Probability 
Length Length 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



±6All±&0....alJlb±, 



Protein name 



Description 



NT 



AA 



NT ID 



AAID 



Length Length 
73 ] pi 



Score Probability 



Locus Name 



Acc# 



MO-HIT 



ORF Name 



±6£&5A6.2„±l..±ttA.. 



Protein name 



NT ID 



AAID 



NT 



AA 



Length Length 



Score Probability 



TUT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



i&a3.2&8.5....ca...;L6.&.. 



Protein name 



NT ID 



AAID 



nypotnetical protein 



Description 



NT 



AA 



Length Length 
TTK> — 



WTT 



Score Probability 
a.3e-17V 



TTIT 



Locus Name 



|pir:J010^0 



Acc# 



JQ1020 



1196 



NT 



AA 



ORF Name 



NTID 



176875 ci l±b 



AAID Length Length 



Score Probability 



585 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



iaaQiQ...ti...za.. 



Length Length 
IT5S 



Score Probability 
i.5e-06 



TIT 



Protein name 



Description 



Locus Name 



Sp : YY02_METJA 



Acc# 



Q60301 



HYPOTHETICAL PROTEIN MJfiC&02 



NT 



AA 



ORF Name 



NTID 



AAID 



ia3ifiia3.„±i™a | 



Length Length 
T473 



Score Probability 



Protein name 



Description 



Locus Name 



sp : GATfiJ&AC^'k 



Acc# 



P45737 



CATALASE, 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
TTF7 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



1197 



ORF Name 



20101577 11 50 



Protein name 



hemin permease 



Description 



NT 



AA 



NTID 



AAID Length Length 
— 



1005 



Score Probability 
BT3 



3.0e-4$ 



Locus Name 



pir :S54438 



Acc# 



S54438 



ORF Name 



2ullu9.3.0...±l...IA.. 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



Locus Name 



tryptophan synthase, subunit beta (trpB-l) 
homolog 



pir :G69404 



Description 



3.4e-128 



Acc# 



G69404 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



20.2±2B.21...al„.26.b. I 



T5T 



2.7e-l6 



Protein name 



Locus Name 



RNA polymerase sigma tactor sigz-like protein 



gp:AF137263 



Acc# 



AF137263 



Description 



Bacteroides thetaiotaomicron 30S ribosomal protein si6-iiKeprotem, rucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds . 



NT 



AA 



ORF Name 



NTID 



2.Q3.2.ai.7.2...t.Z....7.5.., 



AAID Length Length 
3T7 



Score Probability 
10.0042 



TUT 



Protein name 



Locus Name 



branched- chain amino acid ABC transporter, 
ATP-binding protein (braG-4) homolog 



pir:D5$41>3 



Acc# 



D69423 



Description 



1198 



NT 



AA 



ORF Name 



NT ID 



14511 



AAID Length Length 



Score Probability 
1.7e-54 



Protein name 



Description 



Locus Name 



Acc# 



sp : ENr>4_ETOLI 



atfDOttUCLEA££ IV, (fiNDODEiOxYfelfeONtrGLEAgE IV) 



ORF Name 



NT ID 



NT AA 

— — , Score Probability 
AAID Length Length 



20517142 i'A 74 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



NTID 



AAID 



NT AA 

— — n Score Probability 
Length Length 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



Score Probability 
l.le-06 



TU7 



Protein name 

Description 
HYPOTHETICAL PROTEIN MJECL35 



Locus Name 



sp:YS3S_METJA 



Acc# 



Q60291 



1199 



ORF Name 



2150262 tl 22 



Protein name 



NT ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



TUT 



3.9e-05 



Locus Name 



bp:BPU53767 



Acc# 



U53767 



Description 

Bacillus pumxlus plasmid pSH1452, Rep gene, complete ccts . 



1200 



ORF Name 



NTID 



NT AA „ _ , , . _ . . 
— , — , Score Probability 
AAID Length Length 



22S60128 c2 211 



4619 



0.031 



Protein name 



Locus Name 



sp:SPRC_XEMLA 



Acc# 



P36378 



Description 

(OSTEONECTIN) (ON) (BASfiMEMT MEMBRANE! i>ft<MEllN BM-40) 



NT 



AA 



ORF Name 



NTID 



AAID 



23475176 il 2b 



Length Length 
T75 



Score Probability 
0.00045 



5? 



Protein name 



Locus Name 



TnpC 



|gp:feW?537l 



Acc# 



U75371 



Description 



Bacteroides tragilis transposon Tn4b55 TnpA (tnpA) , mtegrase lint) , TnpC 
(tnpC) , excisionase (xis) , mobilization protein (mobA),and beta-lactamase 
(cfxA) genes, complete cds; and unknown genes. 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



215.1B.26.2.±2..M. I 



MI 



7TT 



ITT 



ST 



Trmnr 



Protein name 



Locus Name 



hypothetical protein BB0404 



pir:C70150 



Acc# 



C70150 



Description 



ORF Name 



NTID 



NT AA 

— , — , Score Proba bility 
AAID Length Length 



4622 



Protein name 



Locus Name 



4 . 9e-39 



Acc# 



putative aipha-glucosidase 



gp:AAC2521£>i 



Description 



AJ252161 



AIicycloiDacillus acidocaldarius maltose/maltodextrme transportgene region 
(malEFGR genes, cdaA gene and glcA gene) . 



1201 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length 



24020212 c2 221 



Protein name 



Locus Name 



|2.7e-37 



Acc# 



115K outer membrane protein precursor : Susc 
protein 



bir:JC6027 



Description 



JC6027 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length -L 



2i23.7.7.6.2...cl...i:/.6... 



Protein name 



J7T 



Locus Name 



Acc# 



Description 



INO-HTT 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



Protein name 



1.6e-160 



Locus Name 



Acc# 



beta-lactamase, A precursor : cephalospormase 



bir:1401y2 



Description 



ORF Name 



2^3.2L9..7.a2L...CZ...iaZ.. 



Protein name 



NTID 



NT AA 

— — , Score Probabi lity 
AAID Length Length 



T7I~ 



57T 



Locus Name 



0.055 



Acc# 



unknown 



Description 



|gp:AP04^749 



AF048749 



Bacteroid.es tragilis capsular polysaccharide biosynthesis operon, complete 
sequence . 



1202 



NT 



AA 



ORF Name 



NTID 



AAID 



24415522 ±3 107 



Length Length 
WTT 



Score Probability 

0.00030 — 



Protein name 
Description 

HYPOTHETICAL £>R0T£ltf 1110665 



Locus Name 



sp:Y665JHAkIN 



Acc# 



P44033 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



2464S550 ±3 10$ 



TTT 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



l$£S&±&i....al...ll2 1 



Length Length 
2Tu 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



25.9a5.x5.2.±2...1x I [4^7 



US4F" 



0 . 014 



Protein name 



Locus Name 



rhoptry protein 



bir:T28676 



Acc# 



T28676 



Description 



1203 



NT 



AA 



ORF Name 



NT ID 



AAID 



126594683 c2 185 



Length Length 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



|2&5a.7&12..±l...Iia I WZTZ 



AAID Length Length 



Score Probability 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



10M.111..±2..£6. 



AAID Length Length 
— 



Score Probability 
4.0e-06 



Protein name 



Locus Name 



AblEii 



gp:LLU368^7 



Acc# 



U36837 



Description 



Lactococcus lactis plasmict pNP40, abortive infection locus, AbiEi , AJoiEii , 
RecA(LP) , AbiF genes, complete cds . 



ORF Name 



NTID 



NT AA 

— t — , Score Probability 
AAID Length Length 



3LQ12La2LCta...c2...2LQ& I 14634 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



1204 



ORF Name 



NT ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



32243757 tl 26 



ST 



IT7TT 



Protein name 



Locus Name 



Acc# 



Description 



(NO-HIT 



ORF Name 



NT ID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



Protein name 



11110 



7.6e-24 



Locus Name 



Acc# 



Description 



sp :XYJ_.B_BACOV 



P49943 



ORF Name 



NT ID 



AAID 



NT AA „ „ , , . _ . ^ 
— — , Score Probabxlity 
Length Length ^~ 



Protein name 



7T 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NT ID 



NT AA 

— — Score Probability 
AAID Length Length 



1S.6£S.$.S.1.±1.A5. I 



Protein name 



TTT 



5.0e-S3 



Locus Name 



Acc# 



hypothetical protein 2 



Description 



pir:I40233 



140233 



NT 



AA 



ORF Name 



NTID 



!iaiaia&i...ci..iai.. 



AAID Length Length 




Score Probability 



Protein name 



Locus Name 



DnaK 



gp:AB015879 



Description 



Porphyromonas gmgivalis dnaK operon genes, complete ccis . 



Acc# 



AB015879 



1205 



NT 



AA 



ORF Name 



NT ID 



3959627 cl 22$ 



AAID Length Length 
531 



T5W 



Score Probability 
|4.8e-26 



Protein name 



Locus Name 



gp:AB015a7SJ 



Acc# 



AB015879 



Description 

Porphyromonas gingivalis dnaK operon genes, complete ccLs , 



NT 



AA 



ORF Name 



NT ID 



14147125 ±1 VI 



14641 



AAID Length Length 
5553 — 



Score Probability 
|2.3e-5l 



Protein name 



Locus Name 



5 ' -nucleotidase 



gp:CLIl3l24:i 



Acc# 



AJ131243 



Description 

Columoa iivia mRNA tor 5 ' -nucleotidase . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



i 4msiii...ci..m.. 





4542 9564 


62 


159 



Protein name 



Description 



Locus Name 



Acc# 



[NO-HIT 



ORF Name 



NTID 



AAID 



NT AA 

— — , Score Probab ility 
Length Length 



53" 



2F5~ 



TFT 



3.1e-ll 



Protein name 



Locus Name 



Na+-ATPase ciiam J: protein slrlbuy iprotem 
slrl509 



bir:S754bb 



Acc# 



S75455 



Description 



1206 



ORF Name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



14727280 t2 87 



Protein name 



Locus Name 



8.5e-23 



Acc# 



Description 

HYPOTHETICAL £>R0T£ltf MJ0878 



sp: Y87S METJA 



Q58288 



ORF Name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



14727337 tl 25 



[4^43" 



Protein name 



TFTT" 



Locus Name 



1.4e-l3 



Acc# 



nypotnetical protein PAB1002 



Description 



pir :G?S654 



G75064 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



Protein name 



1ST 



Locus Name 



Acc# 



Description 



W0-H1T 



ORF Name 



5.Z.7AQD.3....C1...L3.S... 



Protein name 



NTID 



4547 



AAID 



NT AA „ ^ , , . - . ^_ 
— — , Score Probability 
Length Length 



Locus Name 



3.0e-B2 



Acc# 



otnA protein 



Description 



pir :S70958 



S70958 



ORF Name 



NTID 



AAID 



NT AA 

— — „ Score Probability 
Length Length 



a.7.aaA&7....ai...i.^a.. 



Protein name 



4548 



W7TT 



73" 



Locus Name 



Acc# 



Description 



NO-HIT 



1207 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



9S4805 ci 158 



TTT 



7UT 



|6.2e-26 



Protein name 



Locus Name 



Acc# 



conserved hypothetical protein aq_1503 



pir :G70430 



G70430 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



Score Probability 
5u 



0.006& 



Protein name 



Locus Name 



CrylA toxin receptor A 



|gp:APi73552 



Acc# 



AF173552 



Description 



Heliothis virescens CrylA toxin receptor A mRNA, complete cds . 



NT 



AA 



ORF Name 



NTID 



±llLS.m...Q±Jlll I 



AAID Length Length 

— 



TTZT 



Score Probability 
1.3e-3i 



Protein name 



Locus Name 



putative putrescme/ spermidine binding 
protein 



|gp:PSEPAHE> 



Acc# 



L49465 



Description 



Pseudomonas tluorescens Hypothetical metaJDOlite transport protein, positive 
transcriptional regulator (phnR) , phosphonoacetatehydrolase (phnA) , 
2-phosphonopropionate transporter (phnB) , putative putrescine/spermidine 
binding protein, and putativemethionine sulfoxide reductase genes, complete 
cds . 



NT 



AA 



ORF Name 



NTID 



\±ZM&2&5..±±...2$ J 



AAID Length Length 
5573 — 



1069 I nzru 



Score Probability 




i.9e-43 



Protein name 



Locus Name 



histidme protein kinase homolog GacS 



gp:AP197i)l^ 



Acc# 



AF197912 



Description 



Azotobacter vmelandii histidme protein Jcmase homolog GacS (gacS) gene, 
complete cds . 



1208 



ORF Name 



NTID 



Protein name 



hypothetical protein APE2061 



Description 



NT 



AA 



AAID Length Length 
I3B 



Score Probability 




i.6e-il 



Locus Name 



bir:G72510 



Acc# 



G72510 



ORF Name 



Protein name 



Description 



PROTEIN TLPA) 



NT 



AA 



NTID 



AAID 



W7F" 



Length Length 



Score Probability 




|4.5e-16 



Locus Name 



|sp:TLPA_BRAJA 



Acc# 



P43221 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



^13" 



2.0e-27 



Locus Name 



hypothetical protein MTH6 71 



pir :D69189 



Acc# 



D69189 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



1^6.3.QD.3.5....C2....Z7.1.. 



Length Length 

tt&z — 



Score Probability 
— 



i.5e-i34 



Protein name 



Locus Name 



probable V- type ATPase, subumt A (atpA-lj 



bir:G71325 



Acc# 



G71325 



Description 



1209 



NT 



AA 



ORF Name 



NT ID 



AAID 



15712666 11 24 



Length Length 



Score Probability 
^T2 



|2.6e-18 



Protein name 



Description 



Locus Name 



sp:YJJP_HAEIN 



Acc# 



P44520 



HYPOTHETICAL PRCtffiltf HlOlOS 



NT 



AA 



ORF Name 



NT ID 



AAID 



15715042 c2 270 



MIT 



Length Length 



Score Probability 
53 



S.le-06 



Protein name 



Locus Name 



hypothetical protein BB0095 



pir:G70lll 



Acc# 



G70111 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
5HBI — 



Score Probability 




5,0e-40 



Protein name 



Locus Name 



2-keto-3-deoxygluconate kinase 



pir:G72422 



Acc# 



G72422 



Description 



NT 



AA 



ORF Name 



NTID 



I6.6.1D.&27...±1...17... 



AAID Length Length 
~5%W2 



1407 



Score Probability 
753 



1.4e-74 



Protein name 



Locus Name 



Na+/H+ antiporter (nhaC-1) homo log 



pir:D70173 



Acc# 



D70179 



Description 



1210 



NT 



AA 



ORF Name 



NTID 



16833455 c2 316 



AAID Length Length 
3553 — 



JUT 



Score Probability 
5^3 



3.4e-57 



Protein name 



Locus Name 



cation ettlux system protein 



gp:AF203881 



Acc# 



AF203881 



Description 



Zymomonas mobiiis strain ZM4 cione 43F4, compiete sequence . 



ORF Name 



15687750 cl 228 



Protein name 



NTID 



AAID 



NT 



AA 



Length Length 
I22B 



Score Probability 



75 



Locus Name 



Acc# 



Description 

etcpext 



ORF Name 



Protein name 



Description 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



ff55T 



3555" 



55" 



255" 



75" 



Locus Name 



|sp:£>RS6_MAN£!E 



Acc# 



P46507 



26S PROTEASE REGULATORY 3UBUNIT SB {ATPASE MS73) 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



41SI5£i7A™ca.„aa4 1 



98SS 



555" 



^5S5~ 



T5"2T" 



i.5e-15S 



Protein name 



Locus Name 



nypotnetical protein PH1512 



pir:D71027 



Acc# 



D71027 



Description 



1211 



NT 



AA 



ORF Name 



NTID 



22445301 12 87 



AAID Length Length 
— 



Score Probability 
TZ2 



2.0e-07 



Protein name 



Locus Name 



unknown 



gp:AF125164 



Acc# 



AF125164 



Description 



Bacteroides tragilis 638R polysaccharide B (PS B2) brosynthesislocus , 
complete sequence; and unknown genes. 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



122454707 13 161 



11071 



1 . 7e-142 



Protein name 
Description 

(RIBONUCLEOTIDE REDUCTASE) 



Locus Name 



|sp:Rift2JfftEJ»A 



Acc# 



083092 



NT 



AA 



ORF Name 



NTID 



23.5.a5.1S.D....Cl...Z6.6.. 



14567 



AAID Length Length 

— 



TUT 



1212 



Score Probability 
2£3 



i.7e-2i 



Protein name 



Locus Name 



sp:VRK0_BACSU 



Acc# 



P54442 



Description 

HYPOTHETICAL 46.4 KD PROTEIN" IN BLTR-SpOIIIC IttTSRGEMIC REGION 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
TIE 



444 



Score Probability 
II . 2e~102 



1018 



Protein name 



Locus Name 



probable V- type ATPase, subunit B (atpB-1) 



bir:H7l525 



Acc# 



H71325 



Description 



1212 



ORF Name 



NTID 



NT AA 
— , — , Score 
AAID Length Length 



124252311 13 167 



7TT 



Probability 
3.9e-15 



Protein name 



Description 



Locus Name 



Acc# 



|gp:AB016260 



Agrobacterium tumelaciens plasmid pTi-SAKURA, complete sequence . 



ORF Name 



NTID 



AAID 



NT AA 
T - h Score Probability 
Length Length 



24257692 ll $ 



l.Se-63 



Protein name 



Locus Name 



TonB- dependent receptor HmuR 



|gp:£>OT673« 



Acc# 



U87395 



Description 



Porphyromonas gingival is TonB -dependent receptor HmuR (hmuR) gene , complete 
cds . 



NT 



AA 



ORF Name 



NTID 



243.47.15.3....c3....3.3.0... 



14671 



AAID Length Length 




Score Probability 
^3 



5.1e-54 



Protein name 



Locus Name 



V- type ATPase, s uhun it I homo log 



pir:C70ill 



Acc# 



C70111 



Description 



ORF Name 



NTID 



AAID 



2&5AS£.l±...al...l5A I WZTl 



Protein name 



2-keto-3-deoxygluconate kinase 



Description 



NT 



AA 



Length Length 
14T" 



Score Probability 
3.3e-47 



ASA 



Locus Name 



pir:(T72422 



Acc# 



G72422 



1213 



NT 



AA 



ORF Name 



NTID 



AAID 



24805557 ci 238 



Length Length 
TIT" 



Protein name 



conserved Hypothetical protein MTH12 85 



Description 



3STT 



Score Probability 
3135 



Locus Name 



pir :A69038 



5.6e-2i 



Acc# 



A69038 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length A ~ 



as4£iaai2-.±a...ia<L 



7^" 



|4.2e-82 



Protein name 



Locus Name 



3 OS ribosomal protein S16-like protein 



gpTCTTT7^T 



Acc# 



AF137263 



Description 



Bacteroides thetaiotaomicron 30S ribosomal protein S16-likeprotein, lucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds . 



ORF Name 



NTID 



AAID 



NT AA 
Length Length 



— — Score Probability 



2L5aa4aaa„±i„A3. i f^tf 



Protein name 

Description 
KO-HIt 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



2£.2&ZAAl...c>l...ll& I WZTZ 



AAID Length Length 




Score Probability 
TT1 



|i.fie-id 



Protein name 



Locus Name 



conserved hypothetical protein yvbK 



pir :B70030 



Acc# 



B70030 



Description 



1214 



NT 



AA 



ORF Name 



NTID 



26365651 c2 287 



AAID Length Length 




7JT 



Score Probability 
Ttt 



1.0e-22 



Protein name 



Locus Name 



hypothetical protein 



fpir:B75629 



Acc# 



B75629 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
fTTT 



Score Probability 

cm — 



7.6e-25 



Protein name 



Locus Name 



hypothetical protein 



pir:H75628 



Acc# 



H75628 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
Ml 



Score Probability 



FT 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



ORF Name 



NTID 



AAID 



NT AA 

j^i-'h Score Probability 
Length Length 



FT" 



T3T 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 

-mji — 



Score Probability 
7ul5 



5.8e-69 



Protein name 



Locus Name 



sp:YHOA_BACSU 



Acc# 



P54585 



Description 

HYPOTHETICAL 58.3 KD PROTEIN IN GLPD-CSPB INTERGENIC REGION 



1215 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



TUT 



Score Probability 
TUZ 



0.021 



Protein name 



Locus Name 



probable erythrocyte -binding protein MAEBL 



pir :T09129 



ACC# 



T09129 



Description 



NT 



AA 



ORF Name 



NTID 



, „ „ — ^. — ^. Score Probability 
AAID Length Length JL 

wus — 



Protein name 

Description 
(NO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA 
T — ^ n r — * Score Probability 
AAID Length Length JL 



3.223.4..7.5.2...XZ...9.1 



Protein name 



hypothetical protein MTH6 70 



Description 



TTT 



Locus Name 



pir :C69189 



ACC# 



C69189 



ORF Name 



±1216&S.b....al...lll., 



Protein name 

Description 
NO-HIT 



NT 



AA 



NTID 



AAID Length Length 




Score Probability 



Locus Name 



Acc# 



ORF Name 



NT 



AA 



NTID 



AAID 



d3AU15.5.2...al...23.Q I l4£3£ 



9908 



Length Length 



Score Probability 

hss — 



l.le-35 



Protein name 



Locus Name 



peptide ciiam release tactor nomolog prtH 



pir :E64748 



ACC# 



E64748 



Description 



1216 



NT 



AA 



ORF Name 



NTID 



33711081 t2 135 



T — _ — Score Probab ility 
AAID Length Length JL 

— 



TTT 



2 . 4e-06 



Protein name 



Locus Name 



hypothetical protein 



|gp:flSU18930 



Acc# 



Y18930 



Description 



Sultolobus soltatarxcus 281 KJd genomic DNA tragment, strain P2 . 



ORF Name 



NTID 



NT AA 
T — ^. — ^ Score Probability 
AAID Length Length JL 



1467 



4404 



SFT 



4 ,4e-125 



Protein name 



Locus Name 



cobalamin biosynthesis protein N 



pir :C69048 



Acc# 



C69048 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



±±0.65££6....c.±J±6S. I WZ%5 



WFTT 



Length Length 
TTT 



Score Probability 
231 



S.8e-26 



Protein name 



Locus Name 



hypothetical protein aq_106 0 



pir:D70391 



Acc# 



D70391 



Description 



ORF Name 



NTID 



3.±Q£±&Q3....a±...2±2 I 14590 



Protein name 



hypothetical protein PHS0 04 



Description 



NT 



AA 



AAID Length Length 

wm — 



FF" 



VET 



Score Probability 
TTu 



l.de-06 



Locus Name 



|pir;F7i245 



Acc# 



F71245 



1217 



ORF Name 



NT ID 



NT AA 
AAID Length Length C ° re 



■£4242265 ci 221 



TUT 



3T7T 



Probability 
l.Se-44 



Protein name 



Locus Name 



spermidine/putrescine ABC transporter, 
permease protein (potC) homolog 



pir:G?0179 



Acc# 



G70179 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 




Score Probability 



ITT 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 
— 



Score Probability 
T? 



0.038 



Protein name 



Locus Name 



hypothetical protein DKFZp566D1824 . 1 



pir:T14767 



Acc# 



T14767 



Description 



ORF Name 



Protein name 



NTID 



4694 



NT 



AA 



AAID Length Length 
— 



Score Probability 




Locus Name 



spiYJJPJECOLI 



5 . le-22 



Acc# 



P39402 



Description 

HYPOTHETICAL 30.5 KD PROTEIN IN DNAT-BGLJ INTERGENIC REGION (F277J 



1218 



NT 



AA 



ORF Name 



NT ID AAID Length Length 

— 



Score Probability 

pus — 



|I.4e-I5 



Protein name 



Locus Name 



TonB- dependent receptor HmuR 



gp:PGU87395 



Acc# 



U87395 



Description 



Porphyromonas gingival is TonB - dependent receptor HmuR (hmuR) gene, complete 
cds . 



ORF Name 



NTID 



NT AA 

_ _ _ _ — ^ _ — ^ Score Probability 
AAID Length Length JL 



4069152 Cl 241 



^4" 



l.Se-22 



Protein name 



Locus Name 



2 - dehydro - 3 - deoxypho sphoglucona t e 
aldolase/4 -hydroxy- 2 -oxoglutarate aldolase 



pir:F72452 



Acc# 



F72422 



Description 



NT 



AA 



ORF Name 



&ia3.53.U...C3....3.2L6.. 



NTID AAID Length Length 

— 



T7T 



Score Probability 
Hd2 



Protein name 



Locus Name 



V- type ATPase, subunit E homolog 



par :H70111 



Acc# 



H70111 



Description 



ORF Name 



Protein name 

Description 
NO-HIT 



NTID 



AAID 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



1219 



NT 



AA 



ORF Name 



NT ID 



485680 tl 45 



AAID Length Length 
5521 — 



Score Probability 



OF 



T5T 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NT ID 



AAID 



4ftftft2L7.fiL±2„.a2L 



FT7inr 



Length Length 



Score Probability 
9.2e-293 



Protein name 
Description 

(RIBONtWLfiOflDfi REiDtfOTASEl) 



Locus Name 



sp:RIRl_TREPA 



Acc# 



083972 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
3323 



Tl4~ 



JUT 



Score Probability 
^ 



10.012 



Protein name 



Locus Name 



conserved hypothetical protein AF1223 



pir :F69402 



Acc# 



F69402 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



m6.0M.l..±l..±£b. I [¥7U7 



Length Length 
7^— 



Score Probability 



2_nr 



Protein name 
Description 

iroir 



Locus Name 



Acc# 



1220 



ORF Name 



NTID 



AAID 



NT AA 
— . — _ Score 
Length Length ■ 



FT7uT" 



[T¥TT 



Protein name 



Locus Name 



spermidine/ put res cine ABC transporter, 
ATP-binding protein (potA) homolog 



pir :A70180 



Description 



Probability 
B.6e-iOI 



Acc# 



A70180 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length J ~ 



5l2L8.5l22L.7....G1...2L14.. 



[¥7uT" 



sir 



TTT 



2.9e-19 



Protein name 



Locus Name 



Acc# 



probable V- type ATPase, subunit D (atpD-1) 



bir:A7m£ 



A71326 



Description 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length ^ 



saaai&i...ca.„i4£ i muz 



TUUT 



l.le-101 



Protein name 



Locus Name 



Acc# 



rtcB protein 



|pir:D7552l 



D75521 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



SLfifla5ii...ci.-.m i mur 



9925 



Length Length 



11782 



I3TT 



Probability 
|4.8e-60 



Protein name 



Locus Name 



Acc# 



UDPgiucose- -glycogen glucosyitransterase, , 
skeletal muscle : glycogen (starch) 
synthase ; glycogen ( starch) synthase 



pir:A3336y 



Description 



1221 



NT 



AA 



ORF Name 



NT ID 



6912588 c2 278 



AAID Length Length 
— 



Score Probability 
1525 1 |S.3e-40 



Protein name 



Locus Name 



spermidme/putrescine ABC transporter, 
permease protein (potB) homolog 



pir:H70179 



Acc# 



H70179 



Description 



NT 



AA 



ORF Name 



70..7.29.5.1...C1...243. I 



NTID AAID Length Length 

mjv — 



Score Probability 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



7.ais&i„.c3L...m I 



AAID Length Length 

mn — 



TIT 



Score Probability 
B53 



Protein name 



Locus Name 



glycine-rich RNA-bmding protein (clone A81J 



Ipir:33i443 



Acc# 



S31443 



Description 



ORF Name 



.7.ai5La.7....Cl...2L2L5t.. 



Protein name 



Description 



NT 



AA 



NTID 



4710 



AAID Length Length 

— 



Score Probability 
S3 



Locus Name 



gp:CPU53466 



Acc# 



U53466 



CycLia pomoneila granulosis virus ORF13L gene, partial eels, ORF15L, ORF15R, 
ORF16L, ORF17L genes, complete cds, ORF17R gene, partialcds. 



1222 



NT 



AA 



ORF Name 



NT ID 



AAID 



5702£2 c2 274 



14711 



Length Length 



Score Probability 
TZ§ 



6 .le-iS 



Protein name 



Locus Name 



nypotnetical protein PH198 0 



foir:D71214 



Acc# 



D71214 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



W7TF 



Length Length 



Score Probability 
7¥S 



|3.7e-74 



Protein name 

Description 
HYPOTHETICAL PROTEIN HI 003 5 



Locus Name 



Isp : YIDE_HAETN 



Acc# 



P44472 



NT 



AA 



ORF Name 



NTID 



AAID 



lift&2a2fl..±L..24..... I WTTE 



Length Length 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
7F~ 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



1223 



ORF Name 



1152950 ti 6 



Protein name 

Description 
TO^HTT 



NT 



AA 



NT ID 



AAID 



Length Length 
F7~™ 



Score Probability 



Locus Name 



Acc# 



ORF Name 



Protein name 

Description 
HO-HIT 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



AAID 



14717 



hypothetical protein TM0280 



Description 



NT AA 

— , — , Score Probability 
Length Length 



I220S 



|3.4e-76 



Locus Name 



pir:F72395 



Acc# 



F72395 



ORF Name 



NTID 



14718 



AAID 



9940 



Protein name 
Description 

ftAS-LIKE GT£-£ilNDltfG J>ftOTSltf 



NT 



AA 



Length Length 



Score Probability 
0.017 



S3" 



Locus Name 



sp:RVL2_YARLI 



Acc# 



P41925 



1224 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 
5331 — 



Score Probability 



WW 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



UM.9Ah.b.2.±2...SJ. 



AAID Length Length 

mzi — 



Score Probability 



I2TTT 



Protein name 

Description 
[NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



IAM&7.ttI...cl...m I FT72T 



_ i — j -* — ^. Score Probability 
AAID Length Length SL 

s&n — 



TFT 



5.4e-50 



Protein name 



Locus Name 



FucR 



gp:AF1372S3 



Acc# 



AF137263 



Description 



Bacteroides thetaiotaomicron 30S rilDosomal protein sl6-liJceprote±n, tucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



ia.7.6.3.7....G2...L3.Q I WTI2 



Length Length 



Score Probability 



l^T 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



1225 



NT 



AA 



ORF Name 



NTID 



AAID 



22253385 t2 44 



Length Length 



Score Probability 

— 



6.0e-2§ 



Protein name 



Locus Name 



nypotneticai protein sir0698 



bir:^7703§ 



Acc# 



S77038 



Description 



ORF Name 



NTID 



NT AA 
— — Score 
AAID Length Length 



14724 



Protein name 



hypothetical protein 



Description 



1ST 



Probability 
5.3e-91 



Locus Name 



pir :H72299 



ACC# 



H72299 



ORF Name 



NTID 



AAID 



NT AA 
T , Score Probability 
Lengtn Lengtn 



|3.le-4l 



Protein name 

Description 
BETA- GALACTOS IDAS E , (LACTASE) 



Locus Name 



sp : 6(jAL_TMSTU 



Acc# 



P26257 



NT 



AA 



ORF Name 



NTID 



Zi6.Z5.9.Z6..„aO,...lZ6. i 



AAID Length Length 
— 



TT7T 



Score Probability 
B?B 



|6.8e-iS 



Protein name 



Locus Name 



unknown 



gp:AP14i532 



Acc# 



AF141932 



Description 



RJiizobium ieguiriinosarum Jdv. trifolii plasmid PRlel62Y10C rspDEFoperon, 
partial sequence. 



1226 



ORF Name 



NTID 



24259438 ci 125 



WTZT 



Protein name 



protein kinase, , cGMP - dependent 



Description 



NT 



AA 



AAID Length Length 
— 



Score Probability 
55 



Locus Name 



pir :B2826S? 



Acc# 



B28269 



NT 



AA 



ORF Name 



NTID 



AAID 



2>^> 3. 3. 5. 3. 0. 1 • • > £. 2L >• .4i 3i • > 



Length Length 



Score Probability 
T2T7 — 



l.le-126 



Protein name 



Description 



Locus Name 



sp:LCPH_HAEIM 



Acc# 



P44446 



ACYL-COA SYNTHETASE) (LAC!^) 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length 



TTIT 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



l±6A03£.2...a!.AZ. I W7JU 



Length Length 



Score Probability 



TT3T 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



.265a£ft6j...±i...i£ i mn 



Length Length 



Score Probability 
WFL 



3.7e-20 



Protein name 



Locus Name 



Hypothetical protein MTH1451 



pir : C69060 



Acc# 



C69060 



Description 



1227 



NT 



AA 



ORF Name 


NTID 


AAID 


Length Length 


2946280I_tI_2 


4732 




9954 




62 189 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



FT73T 



AAID Length Length 

mss — 



12547 



Score Probability 
~JT3 



|8.0e-26 



Protein name 



Locus Name 



putative alpha ~L-arabinoturanos±dase 



gp:ATACuii708 



Acc# 



AC011708 



Description 



Arabidopsis thaliana chromosome III BAC T7M13 genomic sequence, complete 
sequence . 



NT 



AA 



ORF Name 



NTID AAID Length Length 



Score Probability 



Mim0.5,.±1...14 „..J [4734 



1 TUJE 



5.3e-0? 



Protein name 



Locus Name 



sp : PORP_PSEAE 



Acc# 



P05695 



Description 

PORIN P PRECURSOR (CUTER MEMBRANE PROTEIN Di) 



NT 



AA 



ORF Name 



NTID 



pm5~ 



AAID Length Length 
— 



Score Probability 
TT5 



l.le-05 



Protein name 



Locus Name 



Styrene sensor kinase 



gp:PS^TYCATA 



Acc# 



AJ000330 



Description 

Pseudomonas sp. DNA tor styrene catabolxsm genes. 



1228 



ORF Name 



NTID 



NT AA 
» — — _ Score 
AAID Length Length 



34406517 C2 127 



Protein name 



Locus Name 



Probability 

17 .2e-97 
Acc# 



receptor antigen (RagA) 



bp:WJ.T:L30872 



AJ130872 



Description 

Porpnyromonas gingival is W50 receptor antigen (rag) locus encodinga major 
immunodominant 55kDa antigen. 



NT 



AA 



ORF Name 



NTID 



34410751 t3 75 



~, _ — _ ^ T — — ^_ Score Probab ility 
AAID Length Length 

|3.1e-37 



4737 






2533 


41$ 





Protein name 



Locus Name 



unknown 



gp:AF0073Sl 



Acc# 



AF007381 



Description 



Flavobacterium johnsoniae gliding motility protein (gldAJ gene , complete 
cds; and unknown genes. 



NT 



AA 



ORF Name 



NTID 



AAID 



'±2.111.2. 5.3....t3....6.6..... 



FT73F" 



Length Length 



Score Probability 
TU£S — 



i.ie-127 



Protein name 



Locus Name 



hypothetical protein SCF34.07 



pir :T36406 



Acc# 



T36406 



Description 



ORF Name 
|ii7A46.2...c2...12S.., 



Protein name 

Description 
IMO-HIT 



NTID 



AAID 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



1229 



NT 



AA 



ORF Name 



NT ID 



14^41^3 ±3 71 



AAID Length Length 
— 



IU55" 



Score Probability 




1 . 2e-68 



Protein name 



Description 



Locus Name 



Acc# 



Sp:RF2_ECOLI 



ftBMlDE C HAirt release! Factor 2 (ft*-2) 



NT 



AA 



ORF Name 



NT ID 



43303l2 cl 9S 



AAID Length Length 
9963 



WIT 



Score Probability 
T5B 



S.0e-l3 



Protein name 



Locus Name 



nypotnetxcal protein 



pir :T33724 



ACC# 



T33724 



Description 



NT 



AA 



ORF Name 



NTID 



14742 



AAID Length Length 




Score Probability 



Protein name 

Description 
PT^TTTT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



5.3.45.19.2...c:L..9.9. 



14743 



9965 



Length Length 



1545 



Score Probability 
7U5 



1.7e-69 



Protein name 

Description 
(BETA-NAHASE) 



Locus Name 



Acc# 



P49008 



1230 



NT 



AA 



ORF Name 



NTID 



587787 c2 131 



FT7¥T 



AAID Length Length 

mzz — 



FHJ¥" 



T2T5" 



Score Probability 
FHTu" 1 |3.6e-37 



Protein name 



Locus Name 



unsaturated glucuronyi hydrolase 



|gp:AB01S6i$ 



Acc# 



AB019619 



Description 



Bacillus sp. GL1 genes for ort and unsaturated glucuronylhydrolase, 
complete cds . 



ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


S33$50_cl_S4 


4745 




d$61 


474 1425 




451 




2 .6e-42 



Protein name 



Locus Name 



adenylate cyclase 



|gp:£)8$625 



Acc# 



D89625 



Description 

Anabaena sp. cyaC gene tor adenylate cyclase, complete cds . 



NT 



AA 



ORF Name 



NTID 



6£±2&ll...al.A&£. I 



AAID Length Length 

mzz — 



T7TT 



TTTT 



Score Probability 
B59 



Protein name 



Locus Name 



probable succmyl -diammopimelate 
desuccinylase 



Ipir:ll70608 



Acc# 



H70608 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— L . T — L1 Score Probability 
Length Length 



ia9.6.aZU3...X2....1Q., 



Protein name 



TTTT 



Locus Name 



Acc# 



Description 
MO-HIT 



1231 



NT 



AA 



ORF Name 



NTID 



AAID 



15755751 t2 15 



Length Length 



Score Probability 
3.2e-07 



Protein name 



Locus Name 



cytochrome b 



|gp:<3PA2433$5 



Acc# 



AJ249395 



Description 



Giobodera pallida mitochondrial COII, ND4 , COIII, ND6 , NDl , ND3 andcytb 



genes . 



NT 



AA 



ORF Name 



NTID 



AAID 



13S3S463 12 11 



Length Length 



Score Probability 



Protein name 

Description 
IMO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



lS.7.0.3.7.S.£L.±l...ll 



Length Length 



Score Probability 



Protein name 

Description 
[NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



Score Probability 



74T - 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



1232 



NT 



AA 



ORF Name 



NTID 



AAID 



22848775 cl 23 



Length Length 
T5W 



Score Probability 
32 



0.00035 



Protein name 



Description 



Locus Name 



|sp:15HU4_RHORU 



Acc# 



P15017 



tftOfiABLfi TRA^SCftiPMOKrAL k£GtrLAfO£ itf AftASfi Cf(6) RUGlOtf (tJRl?4) 



NT 



AA 



ORF Name 



NTID 



AAID 



23444088 Cl 24 



Length Length 
S3 - 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID AAID Length Length 

wm> — 



7TT 



Score Probability 
73 



Protein name 



Description 



Locus Name 



gp:MUSIGKSJ 



Acc# 



M13606 



Mouse ig active Kappa- cnam VJ2 mRNA trom HP22.134. 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 



Score Probability 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



1233 



ORF Name 



34570927 tl 2 



m7T 



Protein name 



Description 



Locus Name 



Acc# 



INO-HIT 



NT 



AA 



ORF Name 



NT ID 



1L1&2&.11...G1..A& I W757 



AAID Length Length 
— 



(T5~ 



Score Probability 
75 



0.021 



Protein name 



Locus Name 



hypothetical protein C17F3.3 



pir :T32879 



ACC# 



T32879 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



|il6.2&I&..±2...I4 1 



T7"5F" 



Length Length 



Score Probability 

m 



Protein name 



Locus Name 



conserved hypothetical protein BBI40 



|pir : 



:G70244 



Description 



0.021 



Acc# 



G70244 



ORF Name 



Protein name 



unknown 



Description 



NT 



AA 



NTID 



AAID Length Length 
15951 — 



Score Probability 
10.0055 



Locus Name 



gp:AF033S5$ 



Acc# 



AF033858 



Pediococcus pentosaceus strain ATCC432 00 plasmid pMD136, completeplasmid 
sequence . 



1234 



NT 



AA 



ORF Name 



NTID 



11769375 c2 42 



AAID Length Length 

mm — 



Score Probability 

— 



e^T 



Protein name 



Locus Name 



receptor antigen (RagA) 



gp:PGI130872 



Acc# 



AJ130872 



Description 



Porphyromonas gmgivalis W50 receptor antxgen (rag) locus encodinga major 
immunodominant 55kDa antigen. 



ORF Name 



NTID 



NT AA , , . , . 
T — _ — L1 Score Probability 
AAID Length Length JL 



c3 45 



[5TT" 



4uT" 



2.$e-37 



Protein name 



Locus Name 



|sp:Y4£L ftfMSM 



Acc# 



P55617 



Description 

PUTATIVE INSERTION SEQUEN CE ATP -BINDING PR0TU1N Y4PL 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
S3 - 



Score Probability 



Protein name 

Description 
INO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



4753 



Length Length 
HIT 



Score Probability 



Protein name 

Description 
NO-HIT 



Locus Name 



Acc# 



1235 



ORF Name 



NTID 



NT AA 

_ _ _ _ — _ — Score Probability 
AAID Length Length JL 



34181512 c2 38 





4764 




4986 




207 


524 



Protein name 

Description 
PT^TTTT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



3.6.42&7.S.!...ci...3.4... 



NTID AAID Length Length 

wzi — 



IT 



Score Probability 




0.011 



Protein name 



Description 



Locus Name 



gp:ATAC01i020 



Acc# 



AC011020 



Arabidopsis thaliana chromosome I BAC F12B7 genomic sequence, complete 
sequence . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 

sws — 



Score Probability 
T7 



0.015 



Protein name 



Locus Name 



probable sigK protein 



pir :F70830 



ACC# 



F70830 



Description 



ORF Name 



Protein name 

Description 
iOTT 



NTID 



AAID 



NT 



AA 



Length Length 



Score Probability 



\T7T 



Locus Name 



Acc# 



1236 



NT 



AA 



ORF Name 



NT ID 



c2 41 



AAID Length Length 

ss^u — 



ITT 



Score Probability 
7.4e-40 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



pir : JC6027 



Acc# 



JC6027 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID 



±±8.B3A5.8....ZZ..:3. I 14769 



Length Length 



[TEuT" 



Score Probability 




1.9e-54 



Protein name 

Description 
BETA-GALACTOSIDASE , (LACTASE) 



Locus Name 



sp:BGAL_THETU 



Acc# 



P26257 



NT 



AA 



ORF Name 



NTID 



AAID 



1D.S.3..7.S.S..7....C3....3.Q., 



14770 



Length Length 



Score Probability 
TuTI — 



|6.5e-102 



Protein name 



Locus Name 



CDP-glucose-4, 6-denyciratase 



pxr :D47070 



Acc# 



D47070 



Description 



ORF Name 



NTID 



AAID 



NT AA 

— , — , Score Probability 
Length Length ■ £ - 



±UA0£.15....a±...l± I WT7T 



W5T 



l.Se-95 



Protein name 



Locus Name 



CDP-tyvelose epimerase 



|gp:YPU25551 



Acc# 



U29691 



Description 



Yersinia pseudotuberculosis group 
IVACDP-4-keto-6-deoxy-D-glucose-3-dehydrase (ddhC) gene, partial 
cds,CDP-paratose synthetase (prt) and CDP-tyvelose epimerase (tyv) genes, 
complete cds, and putative 0 antigen export protein (wzx)gene, partial cds . 



1237 



NT 



AA 



ORF Name 



NTID 



WTJT 



AAID Length Length 
— 



JUT 



Score Probability 

fizs — 



1.3e-18 



Protein name 



Locus Name 



dTDP-glucose 4, 6 -dehydratase 



Description 



pir:H5510S 



Acc# 



H69105 









NT 


AA 


ORF Name 


NTID 


AAID 


Length 


Length 


2L0L5.D..7.Zld...a3....Za 


4773 




86 


261 



Protein name 

Description 
MO-HIT 



Score Probability 



Locus Name 



Acc# 



ORF Name 



Protein name 

Description 
INO-HIT 



NT 



AA 



NTID AAID Length Length 

£771 — 



Score Probability 



WW 



FT" 



Locus Name 



Acc# 



ORF Name 



Protein name 

Description 
INO-HIT 



NT 



AA 



NTID 



4775 



T — _ — ^, Score Pro bability 
AAID Length Length ^ 

3SF7 



Locus Name 



Acc# 



ORF Name 



Protein name 



NTID 



AAID 



NT AA 
— — Score 
Length Length 



WW 



Probability 
|4.ie-45 



Locus Name 



glucose -1-phospnate cytxdylyl trans t erase, 



pir : C47070 



Acc# 



C47070 



Description 



1238 



NT 



AA 



ORF Name 



NT ID 



13070515 ci ii 



WT7T 



AAID Length Length 
— 



£53T 



Score Probability 
P73 



3.4e-6S 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



pin JC6027 



Acc# 



JC6027 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 
10000 



7S5~ 



Score Probability 
RE25 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



Description 



pir : JC6027 



8.0e-4i 



Acc# 



JC6 027 



NT 



AA 



ORF Name 



lH£15filfiL±:L..l£ I FTm 



NT ID AAID Length Length 

llOOOl 



Score Probability 



[4TT 



l2F0~ 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA 

^ T — . — _ Score Probability 
AAID Length Length -L 



I1S.^3.2lu3....c1...2lS.3. I 



10002 



3TT 



|2.5e-08 



Protein name 



Locus Name 



gp:D86934 



Acc# 



D86934 



Description 

Staphylococcus aureus genes, mec region, partial and complete cds . 



1239 



NT 



AA 



ORF Name 



NT ID 



12238437 c2 380 



W7TT 



AAID Length Length 
10003 



Score Probability 

rzn — 



1.7e-21 



Protein name 



Description 



Locus Name 



sp:YYAM_BACSU 



Acc# 



P37511 



HYPOTHETICAL 32.$ K£) P^OTEllN IN TETB-£xOA iNTElRGEtflC RfiGlOtf 



NT 



AA 



ORF Name 



NTID 



13865675 Cl 304 



AAID Length Length 
10004 



TFT 



Score Probability 
^4 



|6.2e-97 



Protein name 



Locus Name 



homos er me O - sue cinyltransx erase 



pir:C72324 



Acc# 



C72324 



Description 



ORF Name 



13.B.6.&25.5...£3....2.aa., 



Protein name 



NTID 



AAID 



10005 



NT 



AA 



Length Length 
TTT 



Score Probability 



Locus Name 



Acc# 



Description 
KO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



10006 



NT AA , , , n , 

t ^^-h x^^v, Score Probability 
Length Length 



ITT 



Locus Name 



Acc# 



Description 
KfO-SIT 



1240 



ORF Name 



14S407S2 t2 103 



Protein name 
Description 



NT 



AA 



NT ID 



AAID Length Length 



Score Probability 





4785 


10007 




26i 



Locus Name 



Acc# 



ORF Name 



Protein name 



NT ID 



NT AA , , n , 
T — ^, — ^ Score Probability 
AAID Length Length JL 



10008 



conserved hypothetical protein 



Description 



TTZT 



2 . 3e-lQ 



Locus Name 



pir :E75439 



Acc# 



E75439 



NT 



AA 



ORF Name 



NTID 



AAID 



Protein name 
Description 

NO-HIT 



10005 



Length Length 



Score Probability 



UTS" 



Locus Name 



Acc# 



ORF Name 



Protein name 

Description 
PfiftftfifiOXIS — 



NT 



AA 



NTID 



AAID 



10010 



Length Length 
7^ 



Score Probability 

srra — 



|6.^e-17 



Locus Name 



sp:FER_BUTME 



Acc# 



P14073 



1241 



ORF Name 



NT ID 



NT AA 

— — Score Probability 
AAID Length Length 



16615912 ti 2bl 



4785? 



10011 



67 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NT ID 



AAID 



T7W 



10012 



Length Length 
— 



Score Probability 
|2.3e-iV7 



TTZT 



Protein name 



Locus Name 



hypothetical protein 



pir : Jyi020 



Acc# 



JQ1020 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



110013 



2604 



1.5e-3i 



Protein name 



Locus Name 



hypothetical protein fiomio.su 



pir :T04 / f'A 



Acc# 



T04772 



Description 



NT 



AA 



ORF Name 



NTID 



14792 



AAID Length Length 
10014 I | 



Score Probability 
175 



0.012 



Protein name 



Locus Name 



tap:T?CtJ6472y 



Acc# 
U64729 



Description 

Toxocara cams TcH SLdT.46 0 iilrJNA, complete cas, 



1242 



NT 



AA 



ORF Name 



119770066 &2 Jbi 



4793 



NT ID AAID Length Length 

110015 



1170 



Score Probability 
7.1e-64 



Protein name 



Locus Name 



potassium- dependent ATPase susumt u 1 



|gp:AP212466 



Acc# 



AF213466 



Description 



Anabaena sp. L-31 kdp operon, complete sequence. 



ORF Name 



I20ll303;i tl 10 



Protein name 



Description 



NT 



AA 



NT ID 



AAID 



1OO16 



Length Length 
575 



Score 



T5T 



Probability 
14 . le-20 



Locus Name 



|sp : £>NUc!_SAL [ rY 



Acc# 



P24520 



ORF Name 



Protein name 



Description 



NT 



AA 



NT ID 



AAID 



4795 



10017 



Length Length 



Score Probability 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



Protein name 

Description 
INO-HIT 



NT 



AA 



NT ID 



AAID 



1001S 



Length Length 
Z7~ 



Score Probability 



Locus Name 



Acc# 



1243 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



22306532 £2 123 



1001S 



|6.0e-87 



Protein name 



Locus Name 



hypothetical protein 



bir:S76076 



Acc# 



S76076 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
T?l 



10020 



3T" 



Score Probability 

0.051 



Protein name 



Description 



Locus Name 



sp : SPRC_XENLA 



ACC# 



P36378 



(OS TE ON E C TI N) (ON) MEMBRAMl! P ROTEIN BM-40J 



NT 



AA 



ORF Name 



NTID 



14 7 £9 



AAID Length Length 
TZ%% 



10021 



Score Probability 
|4.0e-8S 



Protein name 



Locus Name 



probable pnosphonopyruvate decarboxylase, I I bir :D69154 



Acc# 



D69154 



Description 



NT 



AA 



ORF Name 



NTID 



23.&.2b.:/.ll...C3....4.a&.. 



AAID Length Length 



110022 



Score Probability 
|6.4e-221 



Protein name 



Locus Name 



potassium- transporting ATPase, B subunit 



|pir:A75627 



Acc# 



A75627 



Description 



1244 



NT 



AA 



ORF Name 



NTID 



23632875 c3 403 



AAID Length Length 
10023 



T72T 



Score Probability 
|6.5e-125 



Protein name 



Locus Name 



potassium- translocating ATPase A chain 



|gp:AAC243194 



ACC# 



AJ243194 



Description 



AlicycloJDacillus acidocalctarius KctpA gene. 



NT 



AA 



ORF Name 



NTID 



AAID 



23560013 c3 4ltf 



10024 



Length Length 
554" 



Score Probability 
2.4e-ll5 



5^ 



Protein name 



Locus Name 



putative secreted protein 



[gp 



:SCF41 



Acc# 



AL117387 



Description 

Streptomyces coelicolor cosmid F41. 



NT 



AA 



ORF Name 



NTID 



AAID 



'lA0A116±...a±...2$.l I FFuT 



10025 



— , — , Score Probability 
Length Length 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



2.4.3.2LZU63....CZ..3.fe.Z.. 



10026 



Length Length 
T3T~ 



Score Probability 



TTTuT" 



Protein name 



Description 
[NO-HIT 



Locus Name 



Acc# 



1245 



ORF Name 



NT ID 



NT AA 

— , — , Score Probability 
AAID Length Length 



24322712 cl 235 



10027 



5.3e-132 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



Description 



pir : JC6027 



Acc# 



JC602 7 



ORF Name 



NT ID 



Protein name 



lAllA6£2.±1...2&h I W&JZ 



10028 



NT 



AA 



AAID Length Length 




Score Probability 



?TZT 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



2±6AQ.$±S...±2..±t).L. 



Protein name 



NT 



AA 



NTID AAID Length Length 

10025 



hypothetical protein jhpl211 



Description 



"TUT 



Score Probability 
|3.0e-26 



Locus Name 



Acc# 



C71832 



ORF Name 



NTID 



AAID 



2±6A2%±&.±2...±%1 I 



10030 



Protein name 



Description 



NT AA 

— , — , Score Probability 
Length Length 



118 



3B" 



Locus Name 



gp : SSKiiMUdAl 



3.6e-05 



Acc# 



Y13052 



S.sciurl mecAl gene, strain K3 (MM2) 



1246 



ORF Name 



245046^1 cl 248 



Protein name 



NT ID 



AAID 



10031 



NT 



AA 



Length Length 



Score Probability 



775" 



Locus Name 



Acc# 



Description 



NO -HIT 



ORF Name 



Protein name 



NT ID 



NT AA 

— - , — , Score Probability 
AAID Length Length 



14810 



10032 



F7T" 



1.2e-58 



Locus Name 



aspartate Kinase, / homoserme dehydrogenase, 
T16H5 . 70 rprotein T16H5 . 70 : protein T16H5.70 



pir :TU4 7b2 



Acc# 



T04752 



Description 



NT 



AA 



ORF Name 



2±a££7.A£...cl...m I wzn 



NT ID AAID Length Length 

10033 



Score Probability 
TTi 



i.5e-39 



Protein name 



Locus Name 



VicK protein 



|gp:EPA0120B0 



Acc# 



AJ012050 



Description 



Enterococcus taecalis vie operon and flanking genes. 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length A - 



10034 



7W 



\TTTT 



Protein name 



Description 



Locus Name 



Acc# 



KTO-HIT 



1247 



ORF Name 



NTID 



NT AA 

— , — , Score Probability 
AAID Length Length 



2557712 cl 333 



Protein name 



hypothetical protein sll0687 



Description 



l.Se-12 



Locus Name 



pir:S7441b 



Acc# 



S74416 



NT 



AA 



ORF Name 



NTID 



AAID 



lS&153A2...al...l&5. I W$T% 



10036 



Length Length 



TuTT" 



Score Probability 
7.6e-06 



ITT 



Protein name 



Description 



Locus Name 



ACC# 



P23485 



FECR PROTEIN 



ORF Name 



NTID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



14815 



110037 



TUT 



1206 



Protein name 



Description 



Locus Name 



sp :AAT_BACST 



ACC# 
Q59228 



ASPARTATE AMlMOTkANiiP E R&SE, (T RANSAMINASE A) (ASPA 1 ! 1 ) 



ORF Name 



NTID 



NT AA 

— „ — , Score Probability 
AAID Length Length 



1003S 



TSS4 - 



7W 



|2.§e-7S 



Protein name 
Description 

PROBABLE ASPARTOKINASE, (ASPARTATE KINASE) 



Locus Name 



sp:AK_METJA 



Acc# 
Q57991 



1248 



ORF Name 



3001402 fi 2^2 



Protein name 



NTID 



NT AA ^ ^ _ , . _ . ^ 
— , — , Score Probability 
AAID Length Length 



10039 



TTS" 



1.5>e-06 



Locus Name 



Acc# 



U40158 



Description 



Staphylococcus carnosus response regulator -liJte protein (ortx)gene, partial 
cds . 



ORF Name 



NTID 



NT AA 
— — Score 
AAID Length Length 



30££S2$S cl 294 



10040 



l^T 



Probability 
2.$e-2§ 



Protein name 



Locus Name 



RNA polymerase sigma tactor SigZ-liJte protein 



Acc# 



AF137263 



Description 



Bacteroides thetaiotaomicron 30S rifcosomal protein S16-HKeprotem, tucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds . 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



1Q.6.6±6&1.±1..±55. I 



10041 



fTTT 



8.2e-3b 



Protein name 



Locus Name 



putative aspartate Kinase 



gp:ATAC0107yV 



Acc# 



AC010797 



Description 



Arabidopsis tnaliana chromosome III BAC F28J7 genomic sequence, complete 
sequence. 



ORF Name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



10042 



75T 



S.0e-33 



Protein name 



Locus Name 



Acc# 



sp:ATKC_MYCrU 



P96369 



Description 



C CHAIN) 



1249 



ORF Name 



NT ID 



NT AA 
T — _ _ — ^. Score Probability 
AAID Length Length 



Ji^OlI t3 222 



10043 



l.ie-25 



Protein name 



Locus Name 



SttP 



gp:AF126201 



Acc# 



AF126201 



Description 



Pseudomonas puticla strain S-313 sultate ester aesulturization genelocus, 
complete sequence. 



ORF Name 



NT AA 

™,™ _ _ _ _ . T — ^. — ^. Score Probability 
NTID AAID Length Length 1 ~ 



J132040 ti 120 





10044 


3SS 


1158 


126 



6.$e-12 



Protein name 

Description 
(L-ASNASE I) 



Locus Name 



sprASfiiJBCttLl 



Acc# 



P18840 



NT 



AA 



ORF Name 



NTID 



AAID 



14523 



10045 



Length Length 



Score Probability 
571 



l.le-97 



Protein name 



Locus Name 



NADH oxidase (noxA- 3 ) homo log 



|pir ; 



:H69299 



Acc# 



H69299 



Description 



ORF Name 



NTID 



NT AA , . , 
, T — _ — ^. Score Probability 
AAID Length Length JL 



3.3A&&.5Q.2L...cA..A0.2l I I4S24 



1004£ 



Protein name 



transmembrane sensor 



Description 



114" 



0.0006S 



Locus Name 



gp:AF05169i 



Acc# 



AF051691 



Pseudomonas aeruginosa stress tactor A (pstAj , ECF sigma tactor (tiul) , 

transmembrane sensor (f iuR) , and hydroxamate-typef errisiderophore receptor 
(fiuA) genes, complete cds . 



1250 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



35245635 c3 410 



10047 



5TT" 



T5TS" 



2.4e-09 



Protein name 



Locus Name 



unknown 



lgp:U5£77i 



Acc# 



U96771 



Description 



Prevotella Pryantu putative polygalacturonase, B-l, 4- endogiucanase , ana 
mannanase genes, complete cds ; and unknowngenes . 



NT 



AA 



ORF Name 



NTID 



5^375662 ci 323 



AAID Length Length 
1355 — 



10048 



Score Probability 
2.§e-$4 



Protein name 

Description 
THREOWlJslE SYNTHASE, 



Locus Name 



|sp:TffllC_HAEI*l 



Acc# 



P44503 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



10045 



1242 



|2.ie-126 



Protein name 



Locus Name 



sp:RADA_fiA0£JU 



Acc# 



P37572 



Description 

tMk RSPAI& EROTStiSl ftADA HOMoLOG (OaJA RE PAIR £R0T£1N SMS HOMoLuU) 





ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


3.3.3.a3.a.7....ci...^3.7. 


452$ 


10050 


305 




551 


3.6e-53 















Protein name 



Locus Name 



putative 3 0.6 kDa protein 



gp:AF037440 



Acc# 



AF037440 



Description 



Edwards lei la ictaluri D-3 -pfrospiiogly cerate denydrogenase (serA)gene, 
partial cds; ribose- 5 -phosphate isomerase (rpiA) , inhibitorof chromosome 
initiation (iciA) , putative 26 kDa protein (yggE) , putative 30.6 kDa protein 
(yggB) , and fructose 1 , 6-bisphosphatealdolase (fda) genes, complete cds; and 
phosphoglycerate kinase (perk) gene, partial cds. 



1251 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



3542202 c2 rid 



10051 



TFTTT 



2.3e-38 



Protein name 



Description 



Locus Name 



spiARSFHTIMMJ 



Acc# 



P54793 



ARYL S trLFATAS fi ? PRECURSOR, (ASF) 



ORF Name 



NTID 



NT AA 

— — , Score Probability 
AAID Length Length 



4076255 cl 



10052 



F£34" 



T¥F5~ 



4^" 



2.4e-47 



Protein name 



Locus Name 



tripeptidyl ammopeptidase 



gp : STMTPAP 



ACC# 



L46588 



Description 



Streptomyces iividans tripeptidyl ammopeptidase gene, compietecas . 



ORF Name 



NTID 



NT AA „ ^ , , . _ . . 
— — , Score Probability 
AAID Length Length 



|410.M10^c3^3.S.D. I 



110053 



1557 1 



Protein name 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



NT 



AA 



NTID 



A4i4aafiL„.ai™2aa i 



AAID Length Length 
10054 



Score Probability 
§.3e-^6 



Protein name 



Locus Name 



response regulatory protein (rrp-2) homo log 



bir:B70i9b 



Acc# 



B70195 



Description 



1252 



ORF Name 



NTID 



AMD 



NT AA 

— , — , Score Probability 
Length Length 



4594037 c2 ^ 



T7W 



TTTT 



2.5e-123 



Protein name 



Locus Name 



GTP cyclohydrolase II, / 3 , 
4-dihydroxy-2-butanone 4 -phosphate synthase, 
ribA:ribA protein _ 



foir:C7(mi 



Acc# 



C70331 



Description 



NT 



AA 



ORF Name 



NTID 



4SS1507 c3 4^9 



AAID Length Length 



[10056 



Score Probability 
|3.2e-53 



WIT 



Protein name 



Locus Name 



115K outer membrane protein precursor : susc 
protein 



Description 



pir : JC6027 



Acc# 



JC6027 



ORF Name 



NTID 



AAID 



NT AA 

— — , Score Prob ability 
Length Length 



10057 



FTT" 



9.ie-135 



Protein name 
Description 

PUTATIVE PROTEASE YDCP PRECURSOR, 



Locus Name 



Acc# 



sp:YDCP_E0OLl 



NT 



AA 



ORF Name 



NTID 



AAID 



I4§3£ 



1005$ 



Length Length 
TUT 



'2124 



Score Probability 
3 . 9e-08 



Protein name 



Locus Name 



heme receptor 



gpTVTEHUTS" 



Acc# 



L27149 



Description 

Vibrio cholerae heme receptor (hut A) gene, complete cas . 



1253 



NT 



AA 



ORF Name 



NT ID 



AAID 



5272212 c2 



Length Length 
7S2 



TUT 



Score Probability 
HB 



0.0013 



Protein name 



Locus Name 



hypotnetical protein Rv0587 



Description 



pir :F70907 



Acc# 



F70907 



ORF Name 



5.2..7.5AZ5....tZ...0.iA.. 



Protein name 



Description 



NT 



AA 



NT ID 



AAID 



10060 



Length Length 



Score Probability 



Locus Name 



Acc# 



NO-HIT 









NT 


AA 


ORF Name 


NT ID 


AAID 


Length 


Length 


S2aaaii.±i...2Ai 


483<» 


10061 


124 


375 



Protein name 



Description 



Score Probability 



Locus Name 



Acc# 



[NO-HIT 



ORF Name 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



10062 



Length Length 
TTT 



Score Probability 



Locus Name 



Acc# 



NO -HIT 



ORF Name 



Protein name 



Description 



NTID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



14841 



10063 



P3" 



Locus Name 



Acc# 



MO-HIT 



1254 



NT 



AA 



ORF Name 



NT ID 



AAID 



10054 



Length Length 
313" 



Score Probability 
35 



0.010 



Protein name 



Locus Name 



outer membrane protein 21, Omp2l 



bpiCAAJlSitt 



Acc# 



AJ001918 



Description 

Comamonas acidovorans omp21 gene. 



NT 



AA 



ORF Name 



NT ID 



AAID 



5411152 c3 420 



10055 



Length Length 
77~ 



Score Probability 



2TT 



Protein name 



Description 



Locus Name 



Acc# 



ORF Name 



NTID 



AAID 



NT AA „ _ , , . . . . 
— — Score Probability 
Length Length 



TttT 



10055 



ITST 



7TT 



10.033 



Protein name 



Locus Name 



hypothetical protein APE1598 



bir:A72b3y 



Acc# 



A72539 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



10057 



Length Length 
T73 



Score Probability 
0.042 



Protein name 



Locus Name 



nypotnetical protein ORF87 



pir :T3Q436 



Acc# 



T30436 



Description 



1255 



ORF Name 



NT ID 



AAID 



NT AA 
— , — , Score 
Length Length 



10068 



I3TT 



Probability 
|1.0e-94 



Protein name 



Locus Name 



WbnE 



Acc# 



AF172324 



Description 



Escherichia coli GalF (galF) gene, partial cds; O-antigen repeatumt 
transporter Wzx (wzx) , WbnA (wbnA) , O-antigen polymerase Wzy(wzy) , WbnB 
(wbnB) , WbnC (wbnC) , WbnD (wbnD) , WbnE (wbnE) , UDP-Glc-4-epimerase GalE 
(galE) , 6-phosphogluconate dehydrogenaseGnd (gnd) , UDP-Glc- 6 -dehydrogenase 
Ugd (ugd) , and WbnF (wbnF) genes, complete cds; and chain length determinant 



ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


2Ifi.2flJ5M„jdL64 


4847 


10069 


474 




142b 


1297 




3 .2e-132 





Protein name 



Locus Name 



3-isopropylmalate denydratase, large cfiain 



pir :T2y083 



Acc# 



T29083 



Description 



ORF Name 



NT 



AA 



NT ID 



AAID Length Length 



Score Probability 



Protein name 



10070 



£u"6~ 



Locus Name 



9.7e-44 



Acc# 



Description 



sp:LEUD_HAElW 



P44438 



(ISOPROPYLMALATE I 50M E RA5E) (ALPHA- I PM ISOMERASE) 



ORF Name 



NTID 



AAID 



NT AA 

— — , Score Probability 
Length Length 



Protein name 



10071 



Tu~7¥" 



Locus Name 



2.4e-189 



Acc# 



Description 



sp:LKU3_BACFR 



P54354 



(IMDH) (3-IPM-I5U) 



1256 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score 



24270450 ci 67 



4850 



10072 



Probability 
I2.0e-14 



Protein name 



Locus Name 



unknown 



gp:AF036677 



Acc# 



AF036677 



Description 



Salmonella typhimurium putative operon regulated Jay PmrAB, necessary tor 
4-aminoarabinose lipid A modification and polymyxinresistance, PmrG (pmrG) 
gene, partial cds; PmrF (pmrF) gene and 6orfs, complete cds; and PmrD (pmrD) 
gene, partial cds. 



NT 



AA 



ORF Name 



NTID 



AAID 



2433S131 11 25 



10075 



Length Length 



T5T 



Score Probability 
S.le-06 



Protein name 



Locus Name 



hypothetical protein PH02iy 



|pir:A7124L, 



Acc# 



A71245 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



2&11116.b....z±...6A.. 



14852 



110074 



TUJT 



3096 



2.6e-86 



Protein name 



Locus Name 



115K outer memJorane protein precursor : SusC 
protein 



pir : JC6027 



Acc# 



JC6 02 7 



Description 



ORF Name 



2&210±±2..A2.. 



Protein name 



NTID 



AAID 



110075 



NT 



AA 



Length Length 
TS1 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



1257 



NT 



AA 



ORF Name 



NT ID 



AAID 



13408140b c2 71 



10076 



Length Length 




TO 



Score Probability 
1.9e-06 



Protein name 



Locus Name 



nypotnetical protein PHS004 



foir:F7i24S 



Acc# 



F71245 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



— Score Probability 



\i±±ojaxl.q2J1i ...j mss 



10077 



1 [TTOT 



[T2uT" 



li.le-122 



Protein name 



Description 



Locus Name 



sp:LEUl_HAli!lJsJ 



Acc# 



P43861 



5 YNTHA&k ) (AL1>HA-1PM SYMTHJaiTAtSK) 



NT 



AA 



ORF Name 



NT ID 



4856 



AAID Length Length 



1007^ 



Score Probability 
2.3e-65 



Protein name 



Locus Name 



2-isopropyimaiate syntnase (leuA-ij nomoiog | [pir :E6936^ 



Acc# 



E69369 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID 



2TOT 



10075 



Length Length 
£S3 



Score Probability 
0.034 



71 



Protein name 



Locus Name 



nypotnetical protein pho^u 



tpir:B7124^ 



Acc# 



B71245 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID 



4858 



Length Length 



Score 



Probability 
|2.0e-lfi 



Protein name 



Locus Name 



lipid A disaccharide synthase 



|pir:B71>0i4 



Acc# 



B72014 



Description 



1258 



ORF Name 



NT ID 



AAID 



NT AA 

— — Score Probabil ity 
Length Length 



5901877 &2 7b 



looai 



Protein name 



Locus Name 



dolicnol-pnosphate mannosyl transferase 



Description 



pxr:<4704fei 



Acc# 



G70463 



ORF Name 



Protein name 



NTID 



4860 



AAID 



10082 



NT 



AA 



Length Length 



— Score Probability 



Locus Name 



Acc# 



Description 



[MO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



proline- ricn protein precursor 



Description 



NT AA 

— — Score Probability 

Length Length 

T5T~ 



T7TT 



i.7e-12 



Locus Name 



FpTr7S2T7TT 



Acc# 



S23737 



ORF Name 



lQ2L£143.6....tl...8...... 



Protein name 



NTID 



AAID 



10084 



NT 



AA 



Length Length 
142 I 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



100S5 



araJDinogalactan-liJce protein 



Description 



NT 



AA 



Length Length 
"BTT5 



Score Probability 
1.4e-05 



TTT 



Locus Name 



pir :S52994 



Acc# 



S52994 



1259 



ORF Name 



NTID 



Protein name 



NT AA 
— , — , Score 
AAID Length Length 



10086 



ttypotnetical protein kv3 864 



Description 



Locus Name 



pir :E706bb 



Probability 
|4.4e-07 



Acc# 
E70656 



NT 



AA 



ORF Name 



NTID 



AAID 



10087 



Length Length 
^7" 



Score Probability 
3.2e-40 



Protein name 



Locus Name 



receptor antigen tRagA] 



Igp:^i±i0872 



Acc# 



AJ130872 



Description 



Porphyromonas gingivaiis WSO receptor antigen tragj locus encoamga major 
immunodominant 55kDa antigen. 



ORF Name 



Protein name 



NTID 



ii3.o.ao.aio...±i...iib. I wszz 



N'T AA 

— — Score Probability 
AAID Length Length 



Locus Name 



Acc# 



Description 



IMC-MIT 



ORF Name 



Protein name 



NTID 



AAID 



10089 



— — Score Probability 
Length Length 



TTT 



Locus Name 



Acc# 



Description 



1260 



ORF Name 



NTID 



NT AA 

— — , Score Proba bility 
AAID Length Length 



13956536 ci 



110090 



KIT 



TZW 



Protein name 



Description 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



10091 



Length Length 

Tm — 



TIT 



Score Probability 
14 . 5e-22 



T7TT 



Protein name 



Description 



Locus Name 



Acc# 



sprTIKMJiUOLl 



MA MlMASfl TftAC, (REPLICATI ON PR1MAS4) 



NT 



AA 



ORF Name 



\±A1212B±..±2...12 1 FF7U 



NTID AAID Length Length 

I100S2 



Score 



[ITT 



1ST 



Probability 
5.8e-05 



Protein name 



Description 



Locus Name 



sp:Ml3_AkAl l U 



Acc# 



P30185 



DEHYDR1N kAUlB 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score 



14871 



10093 



TIT 



TUT 



Probability 
0.036 



Protein name 



Locus Name 



exodeoxyribonuclease V, gamma chain (recCJ 
homo log 



pir :A70179 



Acc# 



A70179 



Description 



1261 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



I202S0 t3 147 



110034 



T77TT 



F5T 



0.041 



Protein name 



Locus Name 



unknown 



gp:U96771 



Acc# 



U96771 



Description 



Prevoteila bryantii putatxve polygalacturonase, B-i, 4-endoglucanase, ana 
mannanase genes, complete cds; and unknowngenes . 



ORF Name 



NTID 



20506501 C2 202 



4873 



10035 



Protein name 



nypotnetical protein bl488 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



Locus Name 



pir :C64902 



l.le-lS 



Acc# 



C64902 



ORF Name 



Protein name 



NTID 



10096 



NT 



AA 



AAID Length Length 
T33 



Score Probability 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



|2119.uI3.„.ci...i47.. 



Protein name 



NTID 



AAID 



NT AA 

— — Score Probability 
Length Length 



10097 



400 



TZUT 



115 



,i.5e-oa 



Locus Name 



hypothetical protein SC6G4.36C acbU4.Jbc 



pir:T3bb87 



Acc# 



T35587 



Description 



1262 



NT 



AA 



ORF Name 



NT ID 



121507^ tl 11 



AAID Length Length 

Tm — 



Score 



10058 



7TT 



559 



Probability 
|2.3e-&5 



Protein name 



Description 



Locus Name 



|gp:BFU63P55" 



Acc# 



U63096 



Bacteroides tragilis IDctA) gene, complete cas . 



ORF Name 



NTID 



NT AA 
— — , Score 
AAID Length Length 



|2l5?§375 cl 197 



[4T7T 



10055 



77F" 



1ST 



Probability 
3.2e-l2 



Protein name 

Description 
Ateime Herpesvirus i complete genome. 



Locus Name 



|gp:AP0^^424 



ACC# 



AF083424 



ORF Name 



NTID 



NT AA 
— — , Score 
AAID Length Length 



I225.iaia.l..±2...b.l 



10100 



[T7T 



Probability 
5.2e-06 



Protein name 



Locus Name 



nypotnetical protein T15B7.3 



bir:T^2i>0 



Acc# 



T32250 



Description 



NT 



AA 



ORF Name 



NTID 



12$A6.5.0£...alJ±&Z I m75 



AAID Length Length 



10101 



— ^ Score Probability 
5.4e-25 



3T5" 



Protein name 



Locus Name 



Acc# 



mobilization protein c 



gp:AFll&^ 



AF118243 



Description 

Bacteroides tragilis mobilization protein (J tmor>c; gene, compietecas. 



1263 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



23485336 ci 14b 



10102 | [77 | pTT 



0.0098 



Protein name 



Locus Name 



&07E5.1 protein (clone Kuvabj 



pir:343604 



Acc# 



S43604 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



10103 



TOT" 



TOT" 



UTT 



i.5e-06 



Protein name 



Locus Name 



hypothetical protein PH0217 



pir :G71244 



Acc# 



G71244 



Description 



ORF Name 



Protein name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



10104 



Locus Name 



1.4e-10 



Acc# 



Description 



sp;?ESAj3A(JfciU 



HYPO T H E TICAL 21.1 KB PRO T EIN I N COTb -KDUD INTERGENIC Ri^loN 



P50838 



ORF Name 



Protein name 



NTID 



14883 



110105 



NT 



AA 



AAID Length Length 

— 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



2A&B.0.1B;i.±±...lb... 



Protein name 



AAID 



10105 



KTT AA 

— — Score Probability 
Length Length 



T5T 



Locus Name 



Acc# 



Description 



NO-HIT 



1264 



ORF Name 



Protein name 



NTID 



14885 



AAID 



10107 



— — Score Probability 
Length Length 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



2i5.0A0.6.2..±l...LL 



Protein name 



NTID 



AAID 



10108 



NT 



AA 



Length Length 
ZTB 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



14887 



AAID 



10109 



NT 



AA 



Length Length 



Score Probability 



576 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



10110 



NT 



AA 



Length Length 

— 



Score Probability 



Locus Name 



Acc# 



Description 



NO-HlT 



ORF Name 



Protein name 



NTID 



AAID 



10111 



— — Score Probability 
Length Length 



7TJT" 



Locus Name 



Acc# 



Description 



NO-HIT 



1265 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



Protein name 



4890 



10112 



7T 



TIT 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



2Sft7.aai2„±I...la. 



Protein name 



NTID 



WZTT 



AAID 



10113 



"NTT AA 

— — Score Probability 
Length Length 



TUT 



330 



F5" 



Locus Name 



0.0044 



Acc# 



antigen 5401 



Description 



pir :A60b43 



A60643 



ORF Name 



Protein name 



NTID 



10114 



NT 



AA 



AAID Length Length 



Score Probability 



837 



TTT 



Locus Name 



2.0e-05 



Acc# 



chromosome partitioning ATPase S03 



Description 



bir:D7bb70 



D75570 



ORF Name 



NTID 



Protein name 



AAID 



10115 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



[MO-HIT 



ORF Name 



Protein name 



NTID 



4894 



AAID 



Il0ll6 



NT AA 

— — Score Probability 
Length Length 



flsT 



Locus Name 



Acc# 



Description 



MO-HIT 



1266 



ORF Name 



NT ID 



NT AA „ „ , , . . , 
— — Score Probability 
AAID Length Length 



25354152 c2 J 



14 8 95 



10117 



75" 



0.027 



Protein name 



Locus Name 



Acc# 



gp:S631^ 



S83195 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID 



|3l363l§ Cl 189 



llOllS 



Length Length 
1235" 



Score Probability 
ITGS 



|2.3e-0& 



Protein name 



Locus Name 



sperm mitochondrial capsule selenoprotein I tpir :A3 viyy 



Acc# 



A37199 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



1±A21^±..±±..±1 



4897 



10115 



T5T 



7.5e-06 



Protein name 



Locus Name 



major ampuiiate tiJDrom protein 



pir :A36068 



Acc# 



A36068 



Description 



NT 



AA 



ORF Name 



llAllOAl^al^lll ] |4&9a | [ 



NTID AAID Length Length 

110120 



HUT" 



[TUT" 



Score Probability 
[TT5 



|3.8e-0£> 



Protein name 



Locus Name 



KIAA0775 protein 



|gp:AB01*my 



Acc# 
AB018318 



Description 

ttomo sapiens mMA tor KlAA077b protein, complete cas 



1267 



NT 



ORF Name 



NT ID 



AAID Length Length 



AA 

— Score Probability 



|31836b61 cl Ibb 



10121 



1791 



ITT 



|3.1e-27 



Protein name 



Description 



Locus Name 



jgp : CBCjIDPAH 



ACC# 



Y10436 



C. burnetii put. genes ior encoding glucose mnimteti. divisionprotem A ana 



B. 



ORF Name 



333Sab68 cl 14b 



Protein name 



Description 



NT 



AA 



NTID AAID Length Length 

2T7 



Score Probability 



TUTZT 



Locus Name 



Acc# 



MO-HIT 



ORF Name 



lll$.llX$...±'2..Ml 



Protein name 



Description 



NT 



AA 



NTID 



AAID 



10125 



Length Length 
TT74 



Score Probability 



Locus Name 



Acc# 



M0-U1T 



ORF Name 



Protein name 



troponin T 



Description 



NT 



AA 



NTID 



AAID 



4902 



10124 



Length Length 



24?T 



— ^ Score Probability 
0.00056 — 



ttt 



Locus Name 



pir :SU27Ub 



Acc# 



S02708 



1268 



ORF Name 



34181583 ti lib 



Protein name 



NTID 



4903 



AAID 



NT 



AA 



Length Length 




Score Probability 



T5T 



Locus Name 



Acc# 



Description 



IN0-H1T 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



!1440..7.3.:Ab...±1...2.U. 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



Protein name 



10127 



TUT~ 



'JUT 



Locus Name 



Acc# 



Description 



1NO-HIT 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



Protein name 



4906 



110128 



756 



Locus Name 



Acc# 



Description 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



M49.5.9Ab...±i..J.l.. 



Protein name 



14907 



10129 



TTTT 



Locus Name 



Acc# 



Description 



NO-HIT 



1269 



UKr iMame 




NT ID 


NT 

AAID Length 


AA 

— , Score 
Length 


Probability 




34S5203i4_t3_i07 


4908 


10130 | [407 | |1224 162 




3 . ue-ub 




Protein name 










Locus Name 




Acc# 




hypothetical 119. bK protein 


(uvrA region) :ORF 


pir : Jy040b 




JQ0405 




1 protein 


















Description 


















ORF Name 




NT ID 


NT 

AAID Length 


AA 

— , Score 
Length 


Probability 




3.5.2L3.^b.d^CL^l.:/.y. 


4y0y 


10131 134 40b 100 




6 . be-Ub 




Protein name 










Locus Name 




Acc# 




latent nuclear antigen 




gp:AL'0^bOl 




AF083501 




Description 


















THacaca muiatta rhadmovirus 17S77 Rl, dihydrololate reductase, complement: 
binding protein, ssDNA binding protein, transportprotein, glycoprotein B, 
DNA polymerase, R2, thymidylate synthase ,R3, Bel -2 homolog, capsid protein, 
tegument protein, thymidinekinase , glycoprotein H, major capsid protein, 
capsid protein, kinase, alkaline exonuclease, crlvcoprotein M, 




ORF Name 




NTID 


NT 

AAID Length 


AA 

— , Score 
Length 


Pi 


-obability 




as2js.ujia...jc3...ii*3 




4910 


10132 476 1431 89 




0.0035 




















Protein name 










Locus Name 




Acc# 




beta-D-galactosidase 




gp:BkPLAC201 


M63097 




Description 


















Brugia maiayi peta-D-gaiactosidase (laczj mRJNA, 


partial ccts . 








ORF Name 




NTID 


NT 

AAID Length 


AA 

— , Score 
Length 


Probability 




Z&MlM^cX^M 


4911 


10133 286 861 179 




y . ie-i^ 




Protein name 










Locus Name 




Acc# 




MocB (Tn43yy) 


pir :B48487 




B48487 





Description 



1270 



ORF Name 



NT ID 



NT AA 

— — Score Probability 
AAID Length Length 



25150277 ci 164 



10134 



Protein name 



Description 



Locus Name 



Acc# 



[NO-HIT 



NT 



AA 



ORF Name 



NT ID 



AAID 



|3£442^...c3....27.D. 



10135 



Length Length 




Score Probability 
0.014 



Protein name 



Locus Name 



envelope protein 



gpTTTTVETTVim" 



Acc# 



M61052 



Description 



Human T-ceii leuke mia virus I (HTLVl) envelope lenvj gene, b' ena. 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



16£a$a&i.±±.xi± I mrz 



10135 



^4" 



i.7e-3l 



Protein name 



Locus Name 



nypotnetical protein slrll3b 



|pir:S7743y 



Acc# 



S77439 



Description 



NT 



AA 



ORF Name 



NTID 



3.5.S2fl3J..7....cl...ly.b.. 



AAID Length Length 



Score Probability 



TUTT7 



TTF 



Protein name 



Description 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



I0I38 



Length Length 
3TT7 



TFB" 



Score Probability 
6.1e-19 



228 



Protein name 



Locus Name 



DNA repair protein Raae 



pir:C7043$ 



Acc# 



C70439 



Description 



1271 



ORF Name 



402713b ti 110 



Protein name 



NTID 



"NTT AA 

— — Score Pr obability 
AAID Length Length 



TTT 



TUTT 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



transposase 



Description 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



14918 



10140 



TTZT 



1.2e-86 



Locus Name 



gp:AF0^6fe 



Acc# 



AF038866 



feacteroides iira gilis transposon Tn552U transposase IJsipH) anamofrilization 
protein BmpH (bmpH) genes, complete cds . 



ORF Name 



NTID 



14515 



10141 



Protein name 



nypotneticai protein 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



TUT 



6.7e-05 



Locus Name 



pir :B40bUb 



Acc# 



B40505 



ORF Name 



Protein name 



NTID 



— — Score P robability 
AAID Length Length 



ftiaafli..±A...m I 



110142 



putative resolvase 



Description 



KIT 



[5T 



0.0034 



Locus Name 



Acc# 



gp : DAS OK 



Desulturolobus ambivalens tnpA, tnpB, rtbD and sor genes ana ORF2 , OKfc'J / 
ORF4 and ORF5 . 



1272 



ORF Name 



12944067 t'l 4 



Protein name 



NTID 



AAID 



110143 



NT 



AA 



Length Length 
TUT75 — 



Score Probability 



TIT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



4922 



AAID 



10144 



NT 



AA 



Length Length 
9375 



Score Probability 



JUT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



\±216M.l...z±.± 



Protein name 



NTID 



AAID 



10145 



NT 



AA 



Length Length 
522 



Score Probability 



Locus Name 



Acc# 



Description 



NO -HIT 



ORF Name 



Protein name 



Description 



NTID 



AAID 



— — Score Probability 
Length Length 



4924 



110146 



[TT75~ 



H2UB" 



8.6e-i23 



Locus Name 



IspiKBLJiC'oLl 



Acc# 



P07912 



(GLYCINE AC ET YLTftAK SFERASU) 



1273 



ORF Name 



25783462 tl V 



Protein name 



NT ID 



10147 



NT 



AA 



AAID Length Length 




Score Probability 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



10148 



NT AA 

— — Score Probability 
Length Length 



TUTT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



4927 



10149 



nypotnetical protein PHOV/tf 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



T3T 



TT 



Locus Name 



pir :D71I2b 



0.018 



ACC# 



D71126 



ORF Name 



Protein name 



NTID 



110150 



NT 



AA 



AAID Length Length 



Score Probability 



JUT 



TUT 



Locus Name 



gp:D42067 



0.0078 



Acc# 



D42067 



Description 

£orphyromonas gmgivalis DNA tor Fimfcrilin, ORFI-4, complete cas . 



1274 



ORF Name 



NT ID 



NT AA 

— Score Probability 



AAID Length Length 



167688V tl h 



10151 



HIT 



T3TT 



Protexn name 



Description 



Locus Name 



gprVCimiiOfc 



Acc# 



AJ231106 



Vibrio cholerae z47t gene. 



ORF Name 



NT ID 



AAID 



10152 



Protein name 



hypothetical protein 



Description 



NT 



AA 



Length Length 




Score Probability 
3.5e-55 



Locus Name 



plrTT^MTT 



Acc# 



T29433 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score 



10153 



T77TT 



TUT 



Probability 
3 .4e-ov 



Protein name 



Locus Name 



unknown 



gp:U96771 



Acc# 



U96771 



Description 



Prevotella bryant ii putative polygalacturonase , h-i , 4-enctogiucanase, ana 
mannanase genes, complete cds; and unknowngenes . 



ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


±&te&ia^.aX..±Q.h 


4$32 


10154 


1060 


3lS3 


565 


l.Se-57 



Protein name 



Locus Name 



beta-N-Acetylglucosammidase 



gp:AB00fcJ771 



Acc# 



AB008771 



Description 

Streptomyces thermoviola ceus nagA gene torfceta-N-Acetyiglucosaminiaase, 
complete cds . 



1275 



ORF Name 



NT ID 



— — Score Probability 
AAID Length Length 



114657927 ti 7o 



110155 



TIT 



li.Se-54 



Protein name 



Locus Name 



sp:Y7S6_METJA 



Acc# 
Q58206 



Description 

HYPOTHETICAL ABO TRANSPORTER Al^P- BINDIN G PROTEIN MJ073b 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



17010202 12 40 



10156 



Protein name 



Locus Name 



conserved hypotnetical protein MTH6 y b 



[pir:F6919r 



Description 



1.5e-l7 



Acc# 



F69192 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



^T5~ 



10157 



i.5e-i7 



Protein name 



Locus Name 



&NA polymerase sigma tactor sigz-nke protein I igp ; AFij'/^bi 



Acc# 



AF137263 



Description 

Bacteroides theta iotaomicron 30^ nsosomai protein S16 -iijceprotem, rucose 
gene cluster, and RNA polymerase sigma f actorSigZ-like protein (sigZ) genes, 
complete cds . 



ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 




Score 


Probability 





. 4936 




1007 


3024 




566 


- 5.5>e-58 


Protein name 








Locus 


Name 


Acc# 



beta-N-Acetylglucosammidase 



bpiABOOdVVl 



AB008771 



Description 

Streptomyces thermoviolaceus nagA gene rorfceta-N-Acetyiglucosaminiciase , 
complete cds . 



1276 



ORF Name 



NT ID 



— — Score Probability 
AAID Length Length 



212 75 7 cl yu 



W5TT 



110159 



TZUT 



8.9e-24 



Protein name 



Locus Name 



Acc# 



sp:MUT^__TH£!AO I Q56215 



Description 

f)MA MISMATCH feKPAlR E kOTElfrJ MU'l'S 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



— Score Probability 



2168150^ ri 74 



101S0 



S3T 



|3.2e-l4 



Protein name 



Locus Name 



transmembrane sensor 



|gp:AP0bl69l 



Acc# 



AF051691 



Description 



Pseudomonas aeruginosa stres s tactor A (psrA) , ECF sigma tactor iiiuu ; 
transmembrane sensor (fiuR) , and hydroxamate-typef errisiderophore receptor 
(fiuA) genes, complete cds , 



ORF Name NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


21ilW.b-i..±±Jll). 4939 


10161 


1085 32b8 


583 


1.8e-93 


Protein name 








Locus 


Name 


Acc# 


receptor antigen iRagA) 


gp:PGIl30872 


j AJ130872 


Description 
















Porpnyromonas gmgivaiis Wbu 
immunodominant 55kDa antigen. 


receptor 


antigen 


(rag) locus en 


codinga major 




ORF Name NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


\2±b3.115A...al..±SA 494 0 




343 1032 


512 


4.9e-4$ 


Protein name 








Locus 


Name 


Acc# 


glucose Kinase 




gp : BMGLDCKiJN 


AJ000005 


Description 















Bacillus megatenum giK gene 



1277 



ORF Name 



NTID 



ioi6i 



Protein name 



hypothetical protein siri^uv 



Description 



— — Score Probability 



AAID Length Length 
TTT5T 



156 



Locus Name 



tair:SV7t>41 



3.4e-16 



Acc# 



S77541 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



14942 



10164 



6.2e-6b 



Protein name 



Locus Name 



immunoreactive 51kD antigen PUb^ 



gp:AP17b71y 



Acc# 



AF175719 



Description 

Porphyromonas gmgivalis strain Wbu 
complete cds. 



immunoreactive 51KD antigenp^b^ gene, 









NT 


AA 


Score 


Probability 


ORF Name 


NTID 


AAID 


Length 


Length 








l^M^l^tl^ll 


4943 


10165 


119 


360 


299 




i.Se-26 


Protein name 








Locus 


Name 




Acc# 



Description 



I sprRLl^'l'kTk 



034031 



50a kll*O&j0MAL kkoTE lN Ll9 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


i&2ftiL±l..i& 


4944 


|l0l66 


117 


351 




0.027 


Protein name 








Locus 


Name 


ACC# 



hypothetical protein BB0794 



pir:A7019y 



Description 



1278 



NT 



AA 



ORF Name 



NTID 



26750178 Cl 104 



AAID Length Length 
10167 



Score Probability 
TW9 



5.1e-15 



Protein name 



Locus Name 



UDP- sugar Hydrolase 



pir :A72201 



ACC# 



A72201 



Description 



ORF Name 



NTID 



NT AA 
T — _ _ — _ Score Probability 
AAID Length Length JL 



|3IE 



>.3.3.22....X.2....5l1 1 



1016S 



[775" 



2T7F" 



8.ie-lli 



Protein name 



Locus Name 



melibiase 



gp : TEMELA 



Acc# 



Y08557 



Description 



T . ethanolicus melA and lacA genes. 



ORF Name 



NTID 



NT AA 
T — _ — ^ Score Probability 
AAID Length Length JL 



3.5.27.&Z53....C2....:L2.SL. 



10165 



TU7T 



|2.3e-73 



Protein name 



Locus Name 



115K outer membrane protein precursor :SusC 
protein 



pir : JC6027 



Acc# 



JC6027 



Description 



ORF Name 



£I15.$.2.7....al...S.l.. 



Protein name 



NT 



AA 



NTID 



AAID 



14948 



110170 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 
NO-HIT 



1279 



ORF Name 



NTID 



14305138 ti 22 



10171 



^ — . , Score Probability 

$.3e-14 



AAID Length Length 
TZTT 



Protein name 



Locus Name 



alpna-xylosiaase 



pir:A7^394 



Acc# 



A72394 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
T3TT 



10172 



— ^ Score Probability 
|4.2e-27 



TUB" 



Protein name 



Locus Name 



conserved nypothetical protein yknzl 



pir rEbydbb 



Acc# 



E69858 



Description 



NT 



ORF Name 



NTID 



AAID Length Length 



AA 

— Score Probability 



M.8.&5A2..±l...iy... 



l6l73 



3.0e-l8 



Protein name 



Locus Name 



hypotneticai protein 



pir :£7£>94b 



Acc# 



S76946 



Description 



ORF Name 



NTID 



AAID 



NT AA 
— — Score 

Length Length 



10174 



TFT" 



Probability 
l.Se-09 



Protein name 



Locus Name 



putative alpna-giucosiaase 



gp:AA(J2b2lfol 



Acc# 



AJ252161 



Description 



Alicyclobacillus acidocaidarius malto se/maitodextnne cransporcgene region 
(malEFGR genes, cdaA gene and glcA gene) . 



1280 



ORF Name 



NT ID 



— — Score Probability 
AAID Length Length 



|5333Mi ti 7b 



11017b 



[TFT 



I3.ie-10 



Protein name 



Locus Name 



putative alpna-glucosiaase 



|gp:AA02b2lbl 



Acc# 



AJ252161 



Description 



Aiicyciobaciiius acidocaida rius maitose/maitodextrine transportgene region 
(malEFGR genes, cdaA gene and glcA gene) . 



ORF Name 



NT ID 



AAID 



— — Score Probability 
Length Length 



£22127 cl 10b 



10176 



Protein name 



Description 



Locus Name 



sp:5^T£)_DiSOM 



Acc# 



P29240 



5 ' -HUCLKOTIDAaK PRECURSOR, ( E CTO - N U (JLEut i das a ; 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



|lD.3A5.m...c2...Z 



10177 



TTT" 



3^" 



2£T 



i.2e-2i 



Protein name 



Locus Name 



115K outer membrane protein precursor : susO 
protein 



pir : 



Acc# 



JC6027 



Description 



ORF Name 



Protein name 



NT 



NT ID 



AAID 



TUTTW 



Length Length 
JT7 



AA 

— Score Probability 



ITS 



Locus Name 



Acc# 



Description 



NO-HIT 



1281 



ORF Name 



15589717 c2 28 



Protein name 



NT ID 



NT AA 

— — , Score Probability 
AAID Length Length 



10179 



167 



Locus Name 



2.2e-09 



Acc# 



Description 



gp:IYLMSA<a 



M.mazei surtace antigen genes orr4y2, ort375 and orr.783, 



X84710 



ORF Name 



24024182 r3 13 



Protein name 



NT ID 



UOlSO 



NT 



AAID Length Length 
535 



AA 

— Score Probability 



ITT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NT ID 



AAID 



iOlSl 



— — Score Probability 
Length Length 



Locus Name 



Acc# 



Description 



MO- HIT 



ORF Name 



Protein name 



NTID 



43FTT 



AAID 



101S2 



NT 



Length Length 
T557 



AA 

— Score Probability 



5TT 



Locus Name 



Acc# 



Description 



NO -HIT 



1282 



NT 



AA 



ORF Name 



NTID 



26370887 4b 



AAID Length Length 
52u 



Score Probability 



10183 



TTT 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



L962 



10184 



Length Length 
321 



Score Probability 
0.0075 



Protein name 



Locus Name 



hypothetical protein u&uziz 



bxr:D70l2fo 



Acc# 



D70126 



Description 



ORF Name 



NTID 



AAID 



3.U&lS.6A..±i....l2... 



14 96 3 



110185 



— — Score Probability 
Length Length 





TOT 



0.024 



Protein name 



Locus Name 



proJoaJDle ciaitmase 



pir :T42071 



Acc# 



T42071 



Description 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



4964 



.10186 



£4~ 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



10187 



Length Length 




^4" 



Score Probability 
1. 6e-44 



[TTT 



Protein name 



Locus Name 



otnA protein 



pir:S709S8 



Acc# 



S70958 



Description 



1283 









NT 


AA 


ORF Name 


NT ID 


AAID 


Length 


Length 


S495337_rl_l 


4966 


ioiaa 


132 


399 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



iiio.o.&a.bu.±i...i 



10189 



Length Length 
551 



Score Probability 
|2.4e-0b 



TIT 



Protein name 



Locus Name 



nypotnetical protein slrisis 



pir:U7b4fo4 



Acc# 



S75464 



Description 



NT 



AA 



ORF Name 



NTID 



12.19.S.3.B....a2....y.b.., 



I496S 



AAID Length Length 
T5T7 — 



10190 



Score Probability 
7 .9e-17 



Protein name 



Locus Name 



unknown 



gp:TJ96V7i 



Acc# 



U96771 



Description 



£>revotella bryantii putative polygalacturonase, b-1 , 4-enaogiucanase, ana 
mannanase genes, complete cds; and unknowngenes . 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



±163:ilb±..cl...m 



4969 



10191 



1719 



9.7e-60 



Protein name 



Locus Name 



carboxyl- terminal proteinase 



pir :F7U^by 



Acc# 



F70369 



Description 



1284 



ORF Name 



NT ID 



AAID 



— — Score Probability 
Length Length 



110192 



ITT 



[2TT 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



TT7T 



Length Length 
1ST" 



Score Probability 



450 



Protein name 



Description 



Locus Name 



Acc# 



[MO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score 



10154 



T7T 



2219 



Probability 
uTu 



Protein name 



Locus Name 



Deta-glucosxaase 



gp:AF0066SB 



Acc# 



AF006658 



Description 



Bacteroides tragilis beba-giucosidase gene, complete cas . 



ORF Name 



NTID 



16195 



NT Score Probability 

7.9e-0^ 



AAID Length Length 
2W7 



104 



Protein name 



Locus Name 



unknown 



|gp:APll>4J4y 



Acc# 



AF124349 



Description 

Zymomonas mobilxs ZM4 tosmid clone 41A4, complete sequence. 



1285 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



2638B883 c3 lul 



10196 



3 . 5e-39 



Protein name 



Locus Name 



Acc# 



|sp:YZ37_^YNY3 | Q55480 



Description 
itV&OTfHfiTlCAL SUCJAR KltfAkh! SLRObiV 



ORF Name 


NTID 


AAID 


NT AA 
Length Length 


Score 


Probability 


3324206;>__c3_100 


4375 


10197 


824 2475 


802 


8 .8e-82 



Protein name 



hypothetical protein tmo^bu 



Description 



Locus Name 



Acc# 



F72395 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



ll&6Ai:i£...a±Jl& 



10198 



1.3e-92 



Protein name 



Locus Name 



receptor antigen (RagA} 



lgp:PG11308V2 



Acc# 



AJ130872 



Description 



Porphyromonas gm givalis W50 receptor antigen irag) locus encoamga major 
immunodominant 55kDa antigen. 



NT 



AA 



ORF Name 



NTID 



I^il3..1:/....cz...y.:/. 



AAID Length Length 
Ii0l99 



TTTTT 



Score Probability 
ITB 



5.4e-06 



Protein name 



Locus Name 



lsp:XYNb_±>kUkU 



Acc# 
P48791 



Description 

i,4-BETA-XYLuaibAtjlil) (EXO-bUTA - (1,4) -X¥LANArilj) 



1286 



NT 



AA 



ORF Name 



NT ID 



|436ifa7tt c2 8b 



AAID Length Length 
— 



Score Probability 



10200 



PTT 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NT ID 



AAID 



10201 



Length Length 



Score Probability 



TTTT" 



Protein name 



Description 



Locus Name 



Acc# 



[NO-KIT 



NT 



AA 



ORF Name 



NTID 



AAID 



|lM.a.7.UBLLL..XJL...i... 



— — Score Probability 
Length Length 





TTT 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
110205 



Score Probability 
fTZZ 



|6.0e-lb 



Protein name 



Locus Name 



sp : YJNHE_ECOLl 



ACC# 
P77522 



Description 

HYPOTHETICAL 56.^ KB PROTEIN IN LPk-A£ 0D lNTEk^ENlcJ kiilUluN 



1287 



ORF Name 



NTID 



NT AA 

— — , Score Pro bability 
AAID Length Length 



25815&41 cJ 4 



10204 



77 



TIT 



TTT 



2 . le-18 



Protein name 



Locus Name 



probaJDie oxiaoreductase 



pir :T34yyj 



Acc# 



T34993 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



1020b 



Length Length 
353 



Score 



TFT" 



ITT" 



Probability 
2.2e-28 



Protein name 



Locus Name 



4 -methyl -5 (JD-nydroxyetnyi) -tmazoie 
monophosphate biosynthesis protein (thiJ) 
homolog 



bir:D70lV7 



Acc# 



D70177 



Description 



NT 



AA 



ORF Name 



NTID 



2I5a5^i...c2L...2L„. 



4984 



AAID Length Length 
WU2 



10206 



Score Probability 
1.3e-14 



Protein name 



Locus Name 



I15K outer membrane protein precursor : Susc 
protein 



pir;JC6027 



Acc# 



JC6027 



Description 



NT 



ORF Name 



NTID 



AAID 



10207 



Length Length 



AA 

— Score Probability 



70 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



1288 



ORF Name 



13707338 tl 1 



Protein name 



omp85 analog 



Description 



NT ID 



— — Score Pro bability 
AAID Length Length 



TUJUW 



Locus Name 



pir :D72Uy4 



0.0014 



Acc# 



D72094 



ORF Name 



170.7.0.iO.O....ci....lb... 



Protein name 



NT ID 



NT 



AA 



AAID Length Length 




Score Probability 



Locus Name 



Acc# 



Description 



WO- HIT 



ORF Name 



Protein name 



NT ID 



AAID 



10210 



NT 



AA 



Length Length 
553 



Score Probability 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



NT ID 



— — Score Probability 
AAID Length Length 



\2S2&Lo.b:.L±i^&. I 



110211 



I5B" 



10.00010 



Locus Name 



Acc# 



spiYJDBJ^OLl 



Description 

riVEOlflfiTttgAL 61.7 KB PftOf^lN IM BAj^-AM Y l^T^ENIC RKCjIUIM 



1289 



NT 



AA 



ORF Name 



NT ID 



AAID 



32031466 ri i 



4990 



TUZTT 



Length Length 



133 



Score Probability 
0.00026 



tott 



Protein name 



Description 



Locus Name 



Acc# 



P75785 



HYPOTHETICAL kd prOTe IeJ irt OmpX-mOBb intergenic region 



NT 



AA 



ORF Name 



NTID 



10535006 ti 1 



4991 



AAID Length Length 
324 



Score Probability 



10213 



TUT 



Protein name 



Description 



Locus Name 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



10214 



Length Length 
1W 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



10215 



I2T0" 



2.2e-53 



Protein name 



Locus Name 



Acc# 



mobilization protein A 



lgp:APli^241 



AF118241 



Description 

Bacteroides tragi lis mobilization protein A imor>Aj gene ; compietecas. 



1290 



ORF Name 



12541^ c2 3fo 



NTID AAID Length Length 

TT7 



— — Score Probability 



10216 



TM" 



Protein name 



Description 



Locus Name 



Acc# 



[NO-HIT 









NT 


AA 


Score 


Probability 




ORF Name 


NTID 


AAID 


Length 


Length 






IbSA&l&l^al^A 


4995 


10217 | 


276 


831 


51 


0.047 





Protein name 



Locus Name 



polymorphic outer memorane protein g tamiiy |gp:ABU33794 
Description 



Acc# 



AB033794 



Chlamydophila pneumoniae p mp_3 . 1 gene tor polymorpmc outermembrane protein 
G family, complete cds . 



NT 



AA 



ORF Name 



NTID AAID Length Length 

TBS 



Score Probability 



10218 



Protein name 



Description 



Locus Name 



Acc# 



[NO-HIT 



NT 



ORF Name 



[4997 



NTID AAID Length Length 

10219 



AA 

— Score Probability 



1488 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



1291 



ORF Name 



|2464iti77 ci J8 



Protein name 



NT ID 



AAID 



10220 



KIT A A 

— — Score Probability 
Length Length 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



4999 



AAID 



110221 



NT 



AA 



Length Length 




Score Probability 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



10222 



hypothetical protein Y26D4A.9 



Description 



NT 



AA 



Length Length 

run — 



T5W 



Score Probability 
032 



Locus Name 



pir :T26!>69 



Acc# 



T26569 



ORF Name 



5.16.i£).A2...cl...zB... 



Protein name 



NTID 



10223 



— — Score Probability 



AAID Length Length 



Locus Name 



Acc# 



Description 



NO -HIT 



ORF Name 



Protein name 



Description 



NTID 



AAID 



10224 



NT 



AA 



Length Length 




Score Probability 



Locus Name 



Acc# 



IN0-H1T 



1292 



NT 



AA 



ORF Name 



NT ID 



2636875V c2 2y 



[5UUT 



AAID Length Length 
T25~ 



Score Probability 



10225 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



27.3.6.6.5.:/....c2...i.i.. 



Z3 [SuW 



10226 



7T 



I2TF" 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



\±9A0.2Z.0....U...±1 



5005 



10227 



708 



TFZT 



3.3e-iVtt 



Protein name 



Locus Name 



conserved nypotnetical protein ydci 



bir:(ib9V7i 



Acc# 



G69773 



Description 



NT 



AA 



ORF Name 



NTID 



AAID 



6.0.5A0.ai...c2...3.6. I 



1022S 



Length Length 
TZA 



tut 



Score Probability 
0.0099 



75 



Protein name 



Locus Name 



E3 class 2 protein 



pir :B4bJU» 



Acc# 



B46308 



Description 



NT 



ORF Name 



NTID 



AAID 



1022$ 



Length Length 
TTZ 



AA 

— Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



RTO-HIT 



1293 



ORF Name 



NTID 



— — Score Probability 
AAID Length Length 



14882768 tJ 23 



10230 



Protein name 



conserved nypotnetical protein yisg 



Description 



JUT 



587 



Locus Name 



pir:H698i7 



|5.5e-57 



Acc# 



H69837 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


±5A&2ti6±.±±..:2.b. 


500^ 


10231 


I 2 5 7 1 


774 


213 


2.4e-17 



Protein name 



Description 



Locus Name 



IgprSRI^ib 



Acc# 



U59236 



Synechococcus 1X^7942 ribos omal protein si or 3US rioosome irpsij ,OKF2/i, 
ORF231, ORF341, ca rboxyl trans f erase alpha subunit (accA) , ORF245 , ORF227, and 
GTP cyclohydrolase I (folE) genes, completecds, and ORF205 gene, partial 
cds . 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



SuTu" 



110232 



Protein name 



Locus Name 



2.4e-17 



Acc# 



sp:VhJBR_hl(JULl 



Description 

HYPOTHETICAL 20.3 KB L>kOTUlN I N L>RtJ-i^H A IKi'EkcJUNIC kklciloN 



ORF Name 



NTID 



— — Score Probability 
AAID Length Length 



ZmD.D.D.b.S^.^b.^ 



110233 



\7UT 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



1294 



ORF Name 



Protein name 



NT ID 



110234 



NT 



AA 



AAID Length Length 
¥F5 



Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



2.a47i£27...±l...l.... 



Protein name 



NTID 



AAID 



— — Score Probability 
Length Length 



10235 



TOT 



i.Se-14 



Locus Name 



Acc# 



two component sensor 



|gp:AF0303b2 



AF030352 



Description 

f>seudomonas aeruginosa two component sensor (lemAj gene, partiaicds . 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Pr 


obability 


|aiaflLisii«.ta^2L3. 


5014 


1023£ 


463 


1392 


414 


l.2e-38 



Protein name 



Locus Name 



conserved nypotnetical protein 



|pir:G72220 



Acc# 



G72220 



Description 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



5015 



1752 



5TTT 



5.3e-W 



Locus Name 



2 ' , 3 1 -cyciic-nucleoticle 2 ' -pnospnoaiesterase, 
precursor 



BTrTT^453T" 



Acc# 



H64532 



Description 



1295 



ORF Name 



NTID 



— — Score Probability 
AAID Length Length 



Protein name 



purr 



10238 



TIT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



\±9£A&±b...±±J.L 



Protein name 



TUZTT 



Locus Name 



7.le-l8 



Acc# 



Description 



spiCIRAJilCOLi 



P17315 



COLIOlfrJ I ftSCfclPTok P RECURSOR 



ORF Name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



&&DA:L.cl...3.Z 



Protein name 



10240 



Locus Name 



Acc# 



Description 



WO-H1T 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



13.0.0.S1$.5l...c1...Z2... 



Protein name 



10241 



T552" 



Locus Name 



Acc# 



Description 



NO-HIT 



1296 



ORF Name 



NTID 



— — Score Probability 
AAID Length Length 



25555015 c2 '11 



110242 



2520 



7.4e-74 



Protein name 



Locus Name 



Acc# 



115K outer membrane protein precursor : susc 


pir:JC6027 


1 l TP602 7 


protein 














Description 














ORF Name 


NTID AAID 


NT 
Length 


AA 
Length 




Score 


Probability 


±±Q&215....z±Jl± 


5021 10243 


62 : 










Protein name 






LOCUS 


Name 


Acc# 


Description 














NO-HIT | 


ORF Name 


NTID AAID 


NT 
Length 


AA 

Length 




Score 


Probability 


IWXIUx&JlIJl 


5022 10244 


151 


576 




475 


4 . le-4b 


Protein name 


Locus 


Name 


Acc# 


hypotnetical protein 


jhp0042 




pir:H71981 


H71981 


Description 


ORF Name 


NTID AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


li&2112.™tl^X 


5023 10245 


|202 


609 




127 


1.3e-0S 


Protein name 




Locus 


Name 


Acc# 



TonB- dependent receptor HmuR 



Description 



Porphyromonas gxngivalis To nB - dependent receptor httiuk tnmuRj gene , complete 
cds . 



1297 



ORF Name 



11058140 ±1 2 



Protein name 



NTID 



AAID 



10246 



— — Score Probability 
Length Length 



3288 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



10247 



— — Score Probability 
Length Length 

T7T5 



35 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



Description 



NTID 



AAID 



— — Score Probability 
Length Length 



10248 



330 



9.4e-30 



Locus Name 



sp:MSCL_EkW<JA 



Acc# 



068284 



LARGE - CONDUC T ANCE MECHAN0&11NS IT1VK CHANNEL 



NT 



AA 



ORF Name 



NTID 



5027 



AAID Length Length 
7S2 



Score Probability 



1024^ 



53" 



Protein name 



Locus Name 



Acc# 



Description 



I N0-H1T 



1298 



NT 



AA 



ORF Name 



NT ID 



13017676 ci 8b 



AAID Length Length 



10250 



531 



Score Probability 
i.2e-07 



ITT 



Protein name 



Locus Name 



Acc# 



trsi protein ttraij 



gp:AE00l272 



AE001272 



Description 

Lactococcus iacti s DfrC3l47 piasmid pMRCUi, complete plasmiasequence. 



ORF Name 


NT ID 


AAID 


NT 

Length 


AA 

— , Score 
Length 


Probability 


I37$l250_c2_l06 


5029 


10251 1 


67 | 204 202 


3.5e-16 


Protein name 




Locus Name 


Acc# 


nypotneticai protein l 


pir: 140237 


140237 


Description 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— Score 
Length 


Probability 


UffiiliLtIM 


5030 


10252 


721 2166 121 


0.0007b 


Protein name 








Locus Name 


Acc# 










gp:T7CG 




Description 












Genome ot Jsacteriopnage T /. | 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 

— Score 
Length 


Probability 


(l^.i^J.^al^.?^ 


5031 


10253 


140 


423 




Protein name 








Locus Name 


Acc# 


Description 













MO-HIT 



1299 



NT 



AA 



ORF Name 



NT ID 



5032 



AAID Length Length 
110254 



Score 



Probability 
0.0005^ 



Protein name 



Locus Name 



ras interacting protein R1PA 



|gp:At'lby241 



Acc# 



AF159241 



Description 



Dictyosteiium discoideum r as interacting protein KiPA (npA) mRNA, complete 
cds . 



ORF Name 



NT ID 



— — Score Probability 
AAID Length Length 



23^6550 12 2b 



7.3e-l0 



Protein name 



Locus Name 



tetracycline resistance element mobilization 
regulatory protein rteC 



bir:A36927 



Acc# 



A36927 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



10256 



5.2e-06 



Protein name 



Locus Name 



clostripam-related protein 



IpircB^bl 



Acc# 



B72351 



Description 



ORF Name 



Protein name 



NTID 



— — Score Probability 
AAID Length Length 



10257 



ST 



Locus Name 



Acc# 



Description 



NO-HIT 



1300 



ORF Name 



NT ID 



— — Score Probability 
AAID Length Length 



24641637 c2 lo3 



5036 



10258 



TTT 



3.5e-46 



Protein name 



Description 



Locus Name 



Acc# 



spiAQWJiiCoLl 



AOtfAfrOftlttf 2 (fiACjI'lil RlAL SfoDliLlKf-LiiflB INTRINSIC PROTEIN) 



NT 



AA 



ORF Name 



NTID 



AAID 



24645261 c3 132 



Length Length 
TTT7 



T7F" 



Score Probability 
1.2e-36 



755 



Protein name 



Description 



Locus Name 



|sp:¥HCtij5e6Ll 



Acc# 



P45423 



HYPOTHETICAL 4i.3 KB PROTEIN IN INTklkl^lNIO k^loN «J3;b) 



□ 



ORF Name 



Protein name 



aS.3.7.5.6.2...ai...6.a i F^s 



NT 



AA 



NTID AAID Length Length 

110260 



Score Probability 



Locus Name 



Acc# 



Description 



{NO-HIT 



ORF Name 



|26.5.M£).R7...±2...2.b.. 



Protein name 



NTID 



AAID 



10261 



NT 



AA 



— — Score Probability 
Length Length 

TT1 



TJT 



Locus Name 



Acc# 



Description 



[NO-HIT 



1301 



ORF Name NT ID 


7\ 7\ TT"\ 

AALu 






NT 
Length 


AA 
Length 


Score 


Probability 


32317217_ti_4 5040 


10262 






143 | 432 






Protein name 










Locus Name 


Acc# 


Description 
















NO-HIT 














1 


ORF Name in i ±u 


& ATT") 






NT 
Length 


AA 
Length 


Score 


Probability 


limilb^alJli), 5041 


10263 






235 708 


170 


l.Se-13 


Protein name 










Locus Name 


Acc# 


immunoreactive 42KD antigen 


PG33 








gp:API71>7ib 


AF175715 


Description 




Porpiayromonas gingival ±s strain Wbu 
complete cds . 


immunoreactive 42KD antigenic 3 gene, 




ORF Name NT ID 


AAID 






NT 
Length 


AA 
Length 


Score 


Probability 


12^3AD.:L.al...24 5042 


10264 






91 | 276 


76 


0.023 


Protein name 










Locus Name 


Acc# 


elongation lactor Ts 


gp:AFlybyb2 


AF195952 


Description 




Ehaeodactylum tricornutum ribuiose-i , 5 -bispnospnacecarJDoxyiase/ oxygenase 
large subunit (rbcL) , ribulose-1 , 5-bisphosphate carboxylase /oxygenase small 
subunit (rbcS) , and elongation factor Ts (EF-Ts) genes, complete 
cds;chloroplast genes for chloroplast products. 




ORF Name NT ID 


AAID 






NT 
Length 


AA 
Length 


Score 


Probability 


maniLcm bu4i ' 


10265 






87 264 







Protein name Locus Name Acc# 



Description 
NO-HIT 



1302 



ORF Name 



Protein name 



NT ID 



NT 



AA 



AAID Length Length 



Score Probability 



432 



Locus Name 



Acc# 



Description 



NO -HIT 



ORF Name 



Protein name 



NTID 



FOTT 



110267 



DNA- binding protein, hu 



Description 



— — Score Probability 



AAID Length Length 



"ITT 



Locus Name 



pir:H72^ye 



Lie- 12 



Acc# 



H72396 



ORF Name 



Protein name 



NTID 



5046 



AAID 



10268 



NT 



AA 



Length Length 



Score Probability 



FT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



10269 



DNA topoisomerase ill topB 



Description 



— — Score Probability 
Length Length 



I2.0e-9I 



Locus Name 



pir:H6y724 



Acc# 



H69724 



1303 



ORF Name 



NTID 



5250000 c2 lori 



10270 



NT — Score Probability 

14 . Oe-10 



AAID Length Length 



5TT 



Protein name 



Locus Name 



high molecular weignc giutenm summit 



Acc# 



U39229 



Description 

Aegilops tauschii high molecular weignt giutenm suDunit igiu-1-2 ) gene , 
complete cds. 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 




Score 


Probability 


58dSl67_tI_5 


504$ 


10271 | 


165 


4$S 










Protein name 








Locus 


Name 




Acc# 


Description 


















NO-HIT | 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 

Length 


Score 


Probability 


fiSiSia^t^ia 


5050 


|l0272 




210 










Protein name 








Locus 


Name 




Acc# 


Description 


















NO -HIT 1 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Pr 


obability 


LiafiftMi..±i...3L 


5051 


10275 


163 


492 




183 




3.6e-l4 


Protein name 








Locus 


Name 




Acc# 



unknown 



Description 



gp:AF04*J74y 



Bacberoides rragilxs capsu lar poiysaccnaride biosynthesis ope r on, complete 
sequence . 



1304 



ORF Name 



NTID 



KPT 1 AA 

— — Score Pr obabilxty 
AAID Length Length 



10274 



T7¥" 



T7T 



3.2e-ii 



Protein name 



Locus Name 



unknown 



gp:AP04^749 



Acc# 



AF048749 



Description 



Sacteroides rragr iis capsular polysaccharide biosyntnesis ope r on, complete 
sequence . 



ORF Name 



134266*86 cl 4 



Protein name 



NT 



AA 



NTID AAID Length Length 





Score Probability 



10275 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



im$.3.2...±l...l 



Protein name 



NTID 



— — Score Probability 
AAID Length Length 



[3W 



TIT 



Locus Name 



1I5K outer membrane protein precursor : susc 
protein 



Description 



9 . 5e-50 



Acc# 
JC6027 



ORF Name 



Protein name 



Description 



NTID 



AAID 



10277 



NT 



AA 



Length Length 




Score Probability 
i.Oe-OB 



Locus Name 



sp :tfOLB_kAklN 



Acc# 



P43748 



DMA POLYMERASE 111 , DELTA ' 5UBUNIT, 



1305 



ORF Name 



i£44^6 ti 22 



Protein name 



NTID 



5056 



10278 



NT 



AA 



AAID Length Length 

Tn — 



Score Probability 



IT 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



15057 



TUTTT 



TZT 



Locus Name 



carJooxyl- terminal proteinase 
ctpB: hypothetical protein slr0257 rhypothetical 
p-mf^-in s1r0257 _ 



Description 



bir:S74t»79 



0 .00036 



ACC# 



S74579 



ORF Name 



\X&l$Ab...a'±...± , 2... 



Protein name 



NTID 



5058 



10250 



NT 



AA 



AAID Length Length 
5TS 



— Score Probability 



T7T 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID 



51753" 



10281 



Length Length 
355 



Score Probability 



Locus Name 



Acc# 



Description 
NO-HIT 



1306 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



24648y4l C3 41 



10252 



8.7e-7b 



Protein name 



Locus Name 



putative Tonl3- dependent outer membrane 
receptor 



gp:AF04874y 



Acc# 



AF048749 



Description 



Sacberoides rrag ilis capsular poiysaccnaride mosyntnesis operon, complete 
sequence . 



ORF Name 



i25785l<>l c3 34 



Protein name 



NTID 



— — Score Probability 
AAID Length Length 



TUZZT 



TFT" 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



NTID 



TOST - 



AAID 



10284 



NT 



AA 



Length Length 
1^5 



Score Probability 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



5063 



10285 



hypothetical protein PFB0b4Uw 



Description 



— — Score Probability 
Length Length 



2796 



0.00047 



Locus Name 



pir:D71€>12 



ACC# 



D71612 



1307 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



13145252 cl 24 



10284 



837 



|1.3e-06 



Protein name 



Description 



Locus Name 



spiAPRFJ^Al! 



Acc# 



Q03027 



ALKAL1NS ££0fEA£iii SeCrEtioN prOteIsT aprf 



ORF Name 



|35l$4686 13 2S 



Protein name 



NTID 



— — Score Probability 
AAID Length Length 



110267 



E4T 



Locus Name 



Acc# 



Description 



ORF Name 



Protein name 



putative HiU>2 0 



Description 



NT 



AA 



NTID 



AAID 



10288 



Length Length 

mi — 



Score Probability 

0.0036 



Locus Name 



lgp:AF07287h 



Acc# 



AF072875 



Mycobacterium smegmatis putative HSP20 msp) gene, complete cas. 



ORF Name 



NTID 



— — Score P robability 
AAID Length Length 



15067 



110285 



RT75" 



11428 



[2.2e-05 



Protein name 



Locus Name 



ORF MSV251 leucine rich repeat gene ramiiy I igp : AFU6 J«bb 



Acc# 



AF063866 



Description 

Melanoplus sanguimpes entomopoxvirus, complete genome. 



1308 



NT 



AA 



ORF Name 



NTID 



10429b67 ±3 3 



AAID Length Length 
77 



Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



[MO-HIT 



NT 



AA 



ORF Name 



NTID 



4iAl£3.5L„±i...4 



AAID Length Length 



10291 



353" 



— ^ S core Probability 
1.3e-!>4 



FIT" 



Protein name 



Locus Name 



115K outer membrane protein precursor : sus<J 
protein 



|pir:J(Jb02V 



Acc# 



JC6027 



Description 



NT 



AA 



ORF Name 



NTID 



±03.&b±b.b....£<L..AA.. 



5TT7TT 



AAID Length Length 

im — 



Score Probability 



10252 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



ORF Name 



NTID 



m — Score Probability 



AAID Length Length 



5071 



110253 



393 



THTT 



14. 8e-20 



Protein name 



Locus Name 



conservea nypotnetical protein 



|pir:H72^7^ 



Acc# 



H72273 



Description 



ORF Name 



Protein name 



NTID 



AAID 



i£^7.££S.S...±:l..i 



FTT7T 



10254 



conservea nypotnetical protein yngK 



Description 



— — Score Probability 
Length Length 



W5T 



1475 



Locus Name 



[pir:H69893 



3 . le-63 



Acc# 



H69893 



1309 



ORF Name 



NT ID 



— — Score Probability 
AAID Length Length 



1215219:47 cl 7u 



BUTT 



1029b 



|2.9e-19 



Protein name 



Description 



Locus Name 



Acc# 
P26984 



FRuOTO^lNASK, 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



240234b7 ts b2 



10296 



T7T 



fi.2e-3l 



Protein name 



Description 



Locus Name 



sprLEMAj^W^ 



Acc# 



P48027 



SENSOR PROTEIN LfelMA, 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score Probability 



242L9.&26.2L...C.1...7A.. 



10297 



i.fle-74 



Protein name 



Locus Name 



sp : PROW _ECoLl 



Acc# 



P14176 



Description 

GLYCINE! IjETAlNhl j L- PROLINE TkAN&PORT SYS TEM PhlRMijAsa PRUTmiN fkuW 









NT 


AA 


Score 


Probability 


ORF Name 


NTID 


AAID 


Length 


Length 








|2Mm&R...a2...10.1 


5076 


|10293 


285 


855 


453 


8.7e-4^ 





Protein name 



Locus Name 



Acc# 



glycine-betame binding permease protein 



gp:A*'l39b7b 



AF139575 



Description 

Lactococcus lactis BusAA (bus AA) and glycme-betame Dinaingpermease 
protein (busAB) genes, complete cds . 



1310 



ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 




Score 


Probability 


24644706_cl_7i 


5077 


10259 


909 2/30 




2 . 7e-55 


Protein name 


Locus 


Name 


Acc# 


hybrid mstiaine 


Kinase 








gp:AP0297O4 


AF029704 


Description 


















Dictyostelium discoideum nyJD 


rid hist 


idine Kinase (dhKD; 


mKNA, complete cas . j 


ORF Name 


NT ID 


AAID 




NT 
Length 


AA 
Length 


Score 


Probability 


24648432J:l_iO 


5078 


|l0300 




214 64b 








Protein name 










Locus 


Name 


Acc# 


Description 




































ORF Name 


NTID 


AAID 




NT 
Length 


AA 
Length 


Score 


Probability 




2A6£A&±l...al..±±± 


5075? 




i 


28l t 


346 




1490 


l.le-lb2 




Protein name 










Locus 


Name 


Acc# 


tructanase 


pir :A3691b 


A36915 


Description 


















ORF Name 


NTID 


AAID 




NT 
Length 


AA 
Length 


Score 


Probability 


i£aam«x...ci.ii4 


5080 


10302 




390 


1173 




268 


1.3e-2i 


Protein name 










Locus 


Name 


Acc# 



Description 



|sp:GLU£>_6&UAB 



Q44623 



GLUCOS E /GALACTO^ TRANi^UkTEk 



1311 



ORF Name 



NT ID 



— — Score Probability 
AAID Length Length 



12817217 ci 72 



110303 



[2Tu~ 



14. ye- 17 



Protein name 



Locus Name 



Acc# 



P75707 



Description 

HYPOTHETICAL 14.4 iCD frftOT Elfl lH j 1'ESb-HHA intergenic rkgiujn 



ORF Name 


NT ID 


AAID 


NT 
Length 


AA 

— , Score 
Length 


Probability 


30084532_c2_l0o 


50&2 


10304 | 


412 


1235 |1028 


1. 0e-l63 



u 



Protein name 



Locus Name 



ATPase nomoiog gjdua 



bp:A?0i98Jb 



Acc# 



AF039835 



Description 

Listeria monocytogenes A'i^ase homolog GbuA (gbuAj , putative gxycineioetaine 
membrane transport protein GbuB (gbuB) , and putativeglycine betaine binding 
protein GbuC (gbuC) genes, complete cds . 









NT 


AA 


Score 


Probability 


ORF Name 


NT ID 


AAID 


Length 


Length 








aafiaiiiiajti^ii 


50&3 


10305 


102 


305 


125 




S.Oe-0^ 


Protein name 








Locus 


Name 




Acc# 



hypothetical protein APK2U61 



pir:G7^l0 



G72510 



Description 





ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 




Score 


Probability 




\&2&ix±'L.±lJ±± 


5084 


10306 


152 


579 




132 


9. oe-oy 




Protein name 








Locus 


Name 


Acc# 



conserved Hypothetical protein 



pir:G7bbbb 



G75555 



Description 



1312 



ORF Name 


JN 1 ID 




NT 


AA 
Liencrth 

J-J^^ JLXV*] ^li 


Score 


Probability 


4712825_ci_88 


5085 


10307 




1197 






Protein name 








Locus 


Name 


Acc# 


Description 














MO -HIT | 


ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


54.7.:/.S.2..±i...y. 


|5086 


|10308 


1193 | 


3582 


3752 


0.0 


Protein name 








Locus 


Name 


Acc# 



pyruvate terrectoxm oxidoreauctase 



|gp:CJ^A1772 7 



Y17727 



Description 

Clostridium pas teurianum genes encoaing putative pyruvateterredoxm 
oxidoreductase (800S bp) . 



ORF Name 



NTID 



— — Score Probability 



AAID Length Length 



2.0.3.l4.6.1...cZ...b.., 



10309 



2Sl~ 



846 



1. 8e-OB 



Protein name 



Locus Name 



hypotnetical protein aq_l4/v 



bir:D70428 



Acc# 



D70428 



Description 



ORF Name 



Protein name 



NTID 



3.M3.1&&il.±2...i 



10310 



NT 



AAID Length Length 



AA 

— Score Probability 



Locus Name 



Acc# 



Description 



1313 



ORF Name 



|2156i4fefe7 11 4 



Protein name 



NTID 



— — Score Probability 
AAID Length Length 



110311 



|4.6e-211 



Locus Name 



Acc# 



Description 



|sp:ILVl)_HAElN 



P44851 



frltiYDROXY-ACilD D^Hrt &ATASll, (DAD) 



ORF Name 



NTID 



237l27ab ±2 la 



5uW 



TUJTT 



Protein name 



NT 



AA 



AAID Length Length 



Score Probability 



T7Tu~ 



l.Oe-1^6 



Locus Name 



Acc# 



acetolactate synthase, large suJounit 



Description 



pir:fi72362 



B72362 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



Protein name 



\l±$M.b.±&..±2Jl I RT^T 



10313 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



Protein name 



NTID 



— — Score Probability 
AAID Length Length 



10314 



0. 0053 



Locus Name 



Acc# 



capsict portal protein 



Description 



gp:6lU3^1>22 



Bactenopnage 186, complete sequence. 



1314 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



4822001 ti ±2 



10315 



tut 



|2.4e-20 



Protein name 



Description 



Locus Name 



gp:AF0B34l>4 



Acc# 



AF083424 



Ateline herpesvirus 3 complete genome. 



ORF Name 



Protein name 



NT ID 



— — Score Probability 
AAID Length Length 



10316 



Locus Name 



Acc# 



Description 



INO-Hl'l 1 



ORF Name 



23.5Ll£M2...ai....b... 



Protein name 



NT ID 



AAID 



10317 



NT 



AA 



Length Length 
5T5 



Score Probability 



JUT 



Locus Name 



Acc# 



Description 



[MO-HIT 



ORF Name 



Protein name 



NT 



AA 



NT ID 



AAID Length Length 



Score Probability 



10318 



83 



i.4e-oy 



Locus Name 



sp:Y0b2_BUKBU 



ACC# 



051081 



Description 

HYPOTHETICAL TkMA/kRHA METHVL TRAN^FKkA^E 



1315 



ORF Name 



I24422S7 tl 1 



Protein name 



NTID 



AAID 



110319 



"NTT Z\A 

— — Score Probability 
Length Length 



Locus Name 



Acc# 



Description 
MO-HIT 



ORF Name 



Protein name 



Description 



NTID 



AAID 



— — Score Probability 
Length Length 



110320 





140 | 


423 




257 





|5.Ie-22 



Locus Name 



sp:VM64_Ak(JPU 



Acc# 



028020 



ORF Name 



Protein name 



NTID 



5099 



AAID 



10321 



NT 



AA 



Length Length 
ST5 



Score Probability 



TOT 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



ki^aii..±A...a 



15100 



AAID Length Length 
110322 



I33TT 



Score Probability 
|9.5e-3b 



[T77 



Locus Name 



sp:YXEHJsA<^U 



Acc# 



P54947 



Description 

HYPOTHETICAL J 0.2 KB PkOTEiN IN 1DH-DE 0R iJsl'1'EkCjENiC kiiicjiuN 



1316 



ORF Name 



NT ID 



— — Score Probability 
AAID Length Length 



6444402 i2 b 



10323 



1488 



: 7. Oe-121 



Protein name 



Locus Name 



cystemyi-tKNA syntnetase 



pir:A7biferi 



ACC# 



A75368 



Description 



ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Pr 


obability 


12125.^^2^4. 


| 5102 


|10324 


214 


645 


220 




4.3e-i& 


Protein name 








LOCUS 


Name 




Acc# 



2,3,4, 5- tetrahydropyriaine-2-carJooxyiate 
N- succinyltransf erase-related protein 



Description 



bir:H7224b 



H72245 



ORF Name 
l2MSL.7.6.2..±2...b,. 



Protein name 



NTID 



AAID 



10325 



NT 



AA 



Length Length 



Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



|2ai3.M2...t2...i... 



Protein name 



NTID 



AAID 



10326 



NT 



AA 



Length Length 

Tn — 



Score Probability 



73 



Locus Name 



Acc# 



Description 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



10327 



T35T 



JUT 



7.3e-35 



Protein name 



Locus Name 



TonB- dependent receptor HmuK 



gp:PGU87iDb 



Acc# 



U87395 



Description 

Porphyromonas gmgivalis T onfe- dependent receptor HmuR inmuKj gene y compie 
cds . 



te" 



ORF Name 



Protein name 



NT ID 



— — Score Probability 
AAID Length Length 



1032S 



7T 



Locus Name 



Acc# 



Description 



[NO-HIT 









NT 


AA 


Score 


Probability 


ORF Name 


NTID 


AAID 


Length 


Length 








±&6£A0x:L.aA..A 


5107 


1032S 


210 


630 


375 




S.le-34 



Protein name 



Locus Name 



receptor antigen (RagAj 



gp:PGI130&72 



Acc# 



AJ130872 



Description 



Porphyromonas gmgivalis W5 0 recepbor antigen (rag J locus encociinga major 
immunodominant 55kDa antigen. 



ORF Name 



NTID 



— — Score Probability 
AAID Length Length 



\io±6A(>.6.b^t±jL I [srtrs i prnr 



68 



I5T 



10.0055 



Protein name 



Locus Name 



50JcDa lectin 



|gp:BMOb0Kl)AL 



Acc# 



D14168 



Description 

Silk worm mMA tor 50KDa lectin, complete cas. 



1318 



ORF Name 



NTID 



NT AA 

— — , Score Probabil ity 
AAID Length Length 



2460837 13 2 



Protein name 



10331 



Locus Name 



5.5e-54 



Acc# 



adenylate cyclase nomolog 



Description 



pir:T17i^7 



T17197 



ORF Name 



NTID 



NT AA 

— — Score Pr obability 
AAID Length Length 



Protein name 



Ili9.23£6.3....cl...ltt I PTTU 



110332 



Locus Name 



Acc# 



Description 



[NO -HIT 



ORF Name 



NTID 



NT 



AA 



AAID Length Length 



Score Probability 



±S3A2±XL±±..:<L.. 



Protein name 



10333 



TO" 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



amaai7....ci...ii.. 



Protein name 



10334 



T3T 



582 



JUT 



5.7e-l>V 



Locus Name 



Acc# 



conserved Hypothetical protein 



Description 



bir:G7^380 



G72380 



1319 



NT 



AA 



ORF Name 



33772907 C2 lb 



NTID AAID Length Length 



HIT 



Score Probability 
fTI 



10. 025 



Protein name 



Locus Name 



HdcB 



gp:00U^6b 



Acc# 



U58865 



Description 



Oenococcus oem histidine decarboxylase thacA) gene, complete cas;ana hclcb 
(hdcB) gene, partial cds . 



ORF Name 



— — Score Probability 
NTID AAID Length Length 



3526703? c2 16 



5114 



KIT 



354 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



ORF Name 



NTID 



AAID 



10337 



Length Length 



AA 

— Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



2u5.u0.6.5.:/...±3...A 



^Ti£~ 



AAID Length Length 
T543 



1033^ 



Score Probability 
|4.6e-10ti 



Protein name 



Locus Name 



inorganic pyrophosphatase 



gp:D88820 



ACC# 



D88820 



Description 

Acetabularia mediterranea mRNA lor xnorganic pyrophospnatase, complete cas. 



1320 



ORF Name 



NT ID 



AAID 



— — Score Probability 
Length Length 



10544500 c'A y 



5TTT 



TUTJT 



1255" 



5uTT 



|6.4e-3i 



Protein name 



Locus Name 



|gp:PIGUFMk 



Acc# 



M30284 



Description 

Pig uteroterrin mRNA, complete ccis . 



ORF Name 



NTID 



§343^3 tl 1 



5118 



10340 



Protein name 



hypotnetical protein 



Description 



NT — Score Probability 



AAID Length Length 



TUT 



492 



6.5e-3B 



Locus Name 



[pir:S76672 



Acc# 



S76672 



ORF Name 



Protein name 



NTID 



10241 



NT 



AA 



AAID Length Length 
TUU2 



Score Probability 



Locus Name 



Acc# 



Description 



pSf6-MT 



ORF Name 



Protein name 



NT 



AA 



|23.:/.0Ab.:/^...c^l0...... I 



NTID AAID Length Length 



Score Probability 



|5.5e-lii4 



Locus Name 



heat snocK protein 6 0 



|gp:SP06bl6 



Acc# 



AJ006516 



Description 

Bacteroides torsythus groKL gene, strain atcjc 43037. 



1321 



ORF Name 



NT ID 



— — Score Probability 
AAID Length Length 



~5T 



12 . Oe-36 



Protein name 



Description 



Locus Name 



|sp:CHI0J*Ok<Jl 



Acc# 
P42376 



10 KD dHAPE&OtilN (PkO?SlN (JPNIU ) (PRC^lN (ikUKS) 



ORF Name 



10334667 ±2 16 



Protein name 



NTID 



10344 



NT 



AA 



AAID Length Length 
T72 



Score Probability 



Locus Name 



Acc# 



Description 



tMO-MM 



ORF Name 



144S£0.10...±1...&.. 



Protein name 



NTID 



AAID 



10345 



NT 



AA 



Length Length 
TIT 



Score Probability 



^4" 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



AAID 



|l0346 



— — Score Probability 
Length Length 

TS3 



Locus Name 



Acc# 



Description 



MO-HIT 



1322 



ORF Name 



Protein name 



NT ID 



AMD 



10347 



NT 



AA 



Length Length 
553 



Score Probability 



TIT 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



13.B.140.b.:A..±l...:L 



Protein name 



NT ID 



AAID 



10248 



NT AA 
— , — , Score 
Length Length 



Locus Name 



Probability 



Acc# 



Description 



NO-HIT 



ORF Name 



\22A1^5.\L±'L.:U.. 



Protein name 



NT ID 



AAID 



10249 



NT 



AA 



Length Length 




Score Probability 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



NT ID 



— — Score Probability 
AAID Length Length 



\115SAi)±:L±L±& I I5T35 



110350 



75 



10.03& 



Locus Name 



|sp:UCRH_ViilA^T 



Acc# 
P00127 



Description 

(M IT OCHONDRIAL HINGE PROTEI N) (COMPLEX III polypeptide vu 



1323 



ORF Name 



— — Score Pr obability 
NTID AAID Length Length 



24024067 ti b 



Protein name 



10351 



224 



Locus Name 



|2.5e-2a 



Acc# 



Description 



gp:BFU6:40<*fe 



U63096 



Bacteroides tragilis lactA) gene, complete cas. 



ORF Name 



NTID 



NT AA 

— — Score Pr obability 
AAID Length Length 



I243&SS17 b0 



10352 



Protein name 



T5T 



304 



5.4e-27 



Locus Name 



Acc# 



conserved nypotnetical protein 



Description 



pir:G72^80 



G72380 



ORF Name 



NTID 



AAID 



— — Score Pro bability 
Length Length 



\2&6A&4.1L.±'L..± 



Protein name 



pur 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



!25.3.7.ai...cl...6.4.. 



Protein name 



10354 



7T 



Locus Name 



Acc# 



Description 



NO-HIT 



1324 



ORF Name 



NTID 



"NTT AA 

— — Score Probability 
AAID Length Length 



|2544iS67b rl 4 



Protein name 



I5T3T 



110355 



WZT 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



Protein name 



5124 



10356 



T7F 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



25£Xil±2..±±..A\L 



Protein name 



NTID 



5TT5- 



AAID 



10357 



NT 



AA 



Length Length 
T3TT5 — 



Score Probability 



Locus Name 



Acc# 



Description 



no-hit 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



Protein name 



5136 



I1035& 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



Protein name 



NTID 



KFT AA 

— — Score P robability 
AAID Length Length 



\26.11&.5.6A..±1..A1 I 



110359 



10 .045 



Locus Name 



H+- transporting ATP syntnase, protein b 



Jpir:Tlll21 



Acc# 
T11121 



Description 



1325 



ORF Name 



NTID 



AAID 



26854156 tl Z 



Protein name 



hypothetical protein H02Fuy.J 



Description 



NT AA 

— — Score Probab ility 
Length Length 



TO" 



7.1e-06 



Locus Name 



EirTTmFT 



Acc# 



T33369 



ORF Name 



Protein name 



NTID 



AAID 



110361 



NT 



AA 



Length Length 

wn — 



Score Probability 



Locus Name 



Acc# 



Description 



MO-HIT 



ORF Name 



NTID 



AAID 



120.6MA'L±l..Ab. I \5T%U 



10352 



Protein name 



ATP syntnase, subunit F 



Description 



NT 



AA 



Length Length 
515 



Score Probability 
T3T0T3 



78 



Locus Name 



pir:HS5227 



Acc# 



H69227 



ORF Name 



Protein name 



Description 



NTID 



imteiu 



AAID 



16363 



NT AA 

— — Score Probability 
Length Length 



TTT 



TUT 



5.4e-05 



Locus Name 



sp:Y£l6_eLOPii! 



Acc# 



P18017 



HYPOTHETICAL 15.7 KD PROTEIN (ORt'6) 



1326 



ORF Name 



NTID 



NT AA 

— — Score Probability 
AAID Length Length 



35" 



10.04 y 



Protein name 



Locus Name 



ORF MSV223 hypothetical protein 



|gp:AF06m6 



Acc# 



AF063866 



Description 



Melanoplus sangumipes entomopoxvirus, complete genome. 



ORF Name 



1831463 cl bb 



Protein name 



NTID 



NT 



AA 



AAID Length Length 
^4 



Score Probability 



fZTT 



Locus Name 



Acc# 



Description 



IN6-H1T 



ORF Name 



Protein name 



integrase intwi 



Description 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



10366 



9.2e-0& 



Locus Name 



bp:fitTOSl9lV 



Acc# 



U51917 



Bacteroides unitormis inse rtion element NBUl tragment, integraseintNl gene, 
complete cds . 



NT 



AA 



ORF Name 



NTID 



AAID 



10367 



Length Length 




|7T 



Score Probability 

utttts 



52 



Protein name 



Description 



Locus Name 



sp:<33±>_y(JMMA 



Acc# 



P20287 



LARVAL ^UR^ACE ANTX flfiM) J 



1327 



ORF Name 



NTID 



Protein name 



— — Score P robability 
AAID Length Length 



10368 



neuroendocrine protein 7B2 



Description 



Locus Name 



0.0027 



Acc# 



S03938 



ORF Name 



Protein name 



NTID 



[STTT 



110369 



NT 



AA 

— Score Probability 
AAID Length Length 



Locus Name 



Acc# 



Description 



IN0-H1T 



ORF Name 



Protein name 



NTID 



AAID 



10370 



NT 



AA 



Length Length 




Score Probability 



72 



Locus Name 



Acc# 



Description 



[MO-HIT 



ORF Name 



Protein name 



NTID 



— — Score Pr obability 
AAID Length Length 



|5.2S£8.a£L.c2L...:Z I |5I?5 



110371 



i.ie-23 



Locus Name 



methyl transferase 



gp : STRMTK 



Acc# 
L29323 



Description 

Streptococcus pne umoniae methyl transferase gene cluster, completesequence . 



1328 



ORF Name 



|28i27aJ3 C2 b 



Protein name 



Description 



GaLACtOSIDASE) 



NT 



AA 

— Score Probability 
NTID AAID Length Length 



TUTTT 



286 



3.8e-i>4 



Locus Name 



sp:B<JAL_HUMAN 



Acc# 



P16278 



ORF Name 



5867213 cl 4 



Protein name 



NTID 



AAID 



TUTTT 



NT 



AA 



Length Length 



Score Probability 



2TT 



Locus Name 



Acc# 



Description 



ORF Name 



Protein name 



NTID 



AAID 



10574 



NT 



AA 



Length Length 
TUm — 



Score Probability 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



NT 



AA 



NTID 



AAID Length Length 



Score Probability 



5153 



10575 



1.3e-34 



Locus Name 



Joeta-glucosrdase 



Acc# 



U92808 



Description 

Ruminococcus albu s beta-giucosiaase tgiuA; mKJMA, complete cas . 



1329 



NT 



AA 



ORF Name 



NT ID 



24644b7!> cl 33 



AAID Length Length 
WT5 



TUT7S 



Score Probability 
|2.6e-id 



227 



Protein name 



Locus Name 



| S p:YlBlQlCUinr" 



Acc# 



P37690 



Description 

Hyt>OTflfifi(JAL 46.6 Kb PfeOT^IN IJj StiCB-TDH IN TERtiKfllCJ RKCilUN 



ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


4476427_cl_34 


| 5155 


1057? 


1050 | 


3155 


334 


3.3e-26 



Protein name 



Locus Name 



Acc# 



jdzip nistiaine Kinase 



|gp:J>PUViy245 



Y18245 



Description 

Pseudomonas putida tocft, todX , todcil, lodCA, fcodb, tooA, toOD,toaE, 
todl, todH, todS, todT genes. 



ORF Name 



NT ID 



— — Score Probability 
AAID Length Length 



I5.2.7M.I1..C2....40. I 



HUT7B" 



|2.3e-I9 



Protein name 



Locus Name 



response regulator DrrA 



(pir:DV2228 



ACC# 
D72228 



Description 



ORF Name 



iiia^LtLi 



Protein name 



Description 



NT ID 



1037$ 



NT 



AA 



AAID Length Length 



Score 



Locus Name 



sp:BtiLa_AcJkTU 



Probability 
1.2e-5l 



Acc# 



P27034 



1330 



NT 



AA 



ORF Name 



NT ID 



10251900 ci 94 



AAID Length Length 
551 



Score Probability 



103&0 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



ia6.6.Z.7.hB....tl...ZZ.. 



10381 



TFT 



1.7e-liS 



Protein name 



Description 



Locus Name 



gp:ATAC006^02 



Acc# 



AC006202 



Arabidopsis thaliana chromosome II bac T3B23 genomic sequence, complete 
sequence . 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



5F5 



Score Probability 

0.0022 



Protein name 



Locus Name 



nypotnetical protein C0404U 



pir :S75406 



Acc# 



S75406 



Description 



ORF Name 



±2±9A±5A±2..A±.. 



Protein name 



NT ID 



SIFT" 



NT AA 

— — Score Probability 
AAID Length Length 



10583 



Tuir 



3.5e-10 



Locus Name 



Acc# 



P71786 



Description 
HYPO T HET I CAL 2 7 .1 KB PROTEIN CY277 .2^C 



1331 



NT 



AA 



ORF Name 



1126876^ ti lb 



NTID AAID Length Length 

110384 



WIT 



Score Probability 

fum — 



|2.2e-133 



Protein name 



Locus Name 



|sp:UXUA_HAUllNl 



Acc# 



P44488 



Description 

MAtWOHATli! DfitiYDRATASfi , (0-MANNU NA?E! HYDROLASE J 



ORF Name 



NTID 



NT AA 

— — Score Pro bability 
AAID Length Length 



H45fi8ifcJ7 ci 163 



10355 



£3T 



Protein name 



Description 



Locus Name 



Acc# 



INC -Ml l l 



NT 



AA 



ORF Name 



NTID 



AAID 



10386 



Length Length 




Score Probability 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



10357 



Length Length 
7^ 



Score Probability 
6.2e~28 



Protein name 



Locus Name 



dihydrodipicolinate reductase 



~j ETrT 



A72246 



Acc# 



A72246 



Description 



NT 



AA 



ORF Name 



|2Liaft3.m»±1..^4 1 l^TSS 



NTID AAID Length Length 
110355 



Score Probability 
3.5e-lb 



ET5 



Protein name 



Locus Name 



Acc# 



nypotnetical protein v 



pir :S2U'/yy 



Description 



1332 



ORF Name 



1222527b c2 120 



Protein name 



NTID 



AAID 



10385 



NT 



AA 



Length Length 




Score Probability 



"5T 



Locus Name 



Acc# 



Description 



NO-HIT 



ORF Name 



223.&Ib.b.I...rJ....!^.., 



Protein name 



NTID 



AAID 



10350 



NT 



AA 



Length Length 
— 



Score Probability 



Locus Name 



Acc# 



Description 



[NO-HIT 



ORF Name 



Protein name 



NTID 



10351 



hypotnetical protein mexF 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



TTTT 



Locus Name 



pir :T3083U 



3.8e-113 



Acc# 



T30830 



ORF Name 



Protein name 



oxictoreductase 



Description 



NTID 



— — Score Pr obability 
AAID Length Length 



10392 



TTEe^S" 



Locus Name 



gp : NOSHI^MA 



Acc# 



L37087 



tiostoc sp. AfCC 25133 oxid oreductase (hrmU) ana HrmA (nrmAj genes , complete 
cds . 



1333 



ORF Name 



NT ID 



— — Score Probability 



\242'2H6ti2 11 !> 



[5T7T" 



AAID Length Length 



10393 



753" 



5.0e-47 



Protein name 



Description 



Locus Name 



sp:VHlU_Ji!CQLl | 



Acc# 



P37636 



PRECURSOR 



NT 



AA 



ORF Name 



I2450S680 tA 40 



NTID AAID Length Length 

Ilu3$4 



[TTT 



Score Probability 
|7.8e-lb 



Protein name 



Locus Name 



prokaryotic type i sxgnai peptidase axpj?- 



bpiAtObblby 



Acc# 



AF065159 



Description 

Bradyrhizobium japonicum putative aryisultatase (arsAj , putative soiur>ie 
lytic transglycosylase precursor (sltA) , dihydrodipicolinate synthase (dapA) , 
MscL (mscL) , SmpB (smpB) , BcpB (bcpB) , RnpO (rnpO) , RelA/SpoT homolog (relA) , 
PdxJ (pdxJ) , andacyl carrier protein synthase AcpS (acpS) genes, complete 
cds; prokaryotic type I signal peptidase SipF (sipF) gene, sipF-sipSallele , 



NT 



AA 



ORF Name 



NTID 



AAID 



5T7T 



Length Length 
— 



Score 



Probability 
4 .4e-48 



Protein name 



Locus Name 



OprM 



|gp:Al301138I 



Acc# 



AB011381 



Description 



Pseudomonas aeruginosa gene tor oprlYL, complete cas . 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Pr 


obability 


2&^±AXLJl±J*U. 


5174 


10396 


312 


539 


247 




B.9e-2i 



Protein name 



Locus Name 



conserved Hypothetical protein 



|pir:H724iV 



Acc# 



H72417 



Description 



1334 



ORF Name 


NT ID 


AAID 


NT 
Length 




AA 
Length 


Score 


Probability 


25820437_t3_b6 


5175 


10297 


564 


1695 


827 






Protein name 




Locus Name 




Acc# 


hypothetical protein 


mexF 








pir :T30830 




T30830 


Description 




















ORF Name 


NTID 


AAID 


NT 
Length 




AA 
Length 


Score 


Probability 


|25.3.6.0..7.I7....cl...iO^. 


517& 


10358 


208 


627 


321 






Protein name 












Locus Name 




Acc# 


phosphoglycolate phosphatase 


(gph) homolog 






pir:C701B4 


1 


C70184 


Description 




















ORF Name 


NTID 


AAID 


NT 
Length 




AA 
Length 


Score 


Pr 


obability 


I6£xl±l^tl^l 


5177 




171 


516 


354 




2.7e-3^ 


Protein name 












Locus Name 




Acc# 


poiysialic acid capsule expression protein 




pir :B7U434 




B70434 


Description 




















ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


ISAl&ltt^tlJll 


5179 


10400 


1030 . 


*093 


1827 




z . ze- 100 


Protein name 




Locus Name 




Acc# 


beta-galactosictase 




gp:A^0Sb4B2 




AF055482 


Description 




















Thermotoga neapoiitana galactose utilization 


operon, 


compietesequence . | 


ORF Name 


NTID 


AAID 


NT 
Length 




AA 

— , Score 
Length 


Probability 


3.aaiZ5.1^cl^y.^. 


517$ 


10401 


450 


13b3 








Protein name 












Locus Name 




Acc# 


Description 





















IMO-HIT 



1335 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
E35 



Score Probability 



IT 



Protein name 



Description 



Locus Name 



Acc# 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID 



Length Length 
H7T 



Score 



Probability 
|§.7e-43 



Protein name 



Description 



Locus Name 



Acc# 



sp:LPXA_ECOLl 



(EC 2 .3 .1.125) (UDP-N-Am'YL^LUCOaAMl^ A CVLl'kANS^kA^J 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score 



10404 



Probability 
1.3e-48 



Protein name 



Description 



Locus Name 



Acc# 



Q04805 



HYPOTHETICAL l>kO(Jl«IS Smti kkuTEAiW , (ukFW 



NT 



AA 



ORF Name 



NTID 



\±±&A1±2..±1..±& 



AAID Length Length 
1497 



Score 



10405 



498 



T7T 



Probability 
i.5e-2i 



Protein name 



Locus Name 



sp:Ltiy_fc»ALTV 



Acc# 



P23697 



Description 

SIQtftL PiilPflSASfi 1, (S£A&a 1) (L2AbKR Pij&l'lDAsri l) 



1336 



ORF Name 


NT ID 


AA1JJ 


NT 
.ueng un 


AA 
Length 




Score 


Probability 


549091_cJJ_74 




|10406 


112 


339 








Protein name 








Locus 


Name 


Acc# 


Description 
















NO-HIT | 


ORF Name 


NT ID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 


&AlZLl\x...te..A* 




10407 


371 


1116 






4.8e-26 


Protein name 








Locus 


Name 


Acc# 



protem-tyrosine pnospnatase 



gp:AB028630 



AB028630 



Description 



Clostrxdrum perirm gens hyp27, bacH, ptp, cpa genes rorhypotneticai 
protein, bacterial hemoglobin, protein- tyros inephosphatase, 2', 3'-cuclic 
nucleotide 2 1 -phosphodiesterase, partial and complete cds . 





ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 




iu fi&m&^ci^ 1 


5186 


10408 


214 


645 










Protein name 








Locus 


Name 


Acc# 






Description 


















MO-HIT 
















P 


ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Probability 






amo^t^n 


5187 


10409 


485 


1455 


880 


4.3e-88 





Protein name 



Locus Name 



ATP-dependent RNA heiicase nomolog yaoK 



pir :D69772 



Acc# 



D69772 



Description 



1337 









NT 


AA 


ORF Name 


NTID 


AAID 


Length 


Length 


12402iab_c3jL2 


| 51S8 


10410 


144 


432 



Protein name 



Locus Name 



Acc# 



Description 



NO-HIT 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 



Score 



lSfi.2ttm„.cl..A 



110411 



TTZT 



Probability 
|4.5e-32 



Protein name 



Locus Name 



histiame Kinase 



gp:AF114442 



Acc# 



AF114442 



Description 

Wostoc punctirorme histidme kinase tnepKj gene, complete cas. 







NT 


AA 


Score 


Probability 


ORF Name 


NTID AAID 


Length 


Length 








21^ASA...al..X^ 


5l$0 |l04l2 


124 


375 


302 




2.6e-26 


Protein name 






Locus 


Name 




Acc# 



2 , 3-bisphosphoglycerate-inaepenaent 



gp:Af 12 0091 



AF120091 



Description 



Bacillus stearotnermophilus 
2 , 3-bisphosphoglycerate-independentphosphoglycerate mutase (pgm) gene, 

complete cds . 



ORF Name 



NTID 



AAID 



— — Score Probability 
Length Length 



10413 



|3.0e-lV 



Protein name 



Description 



Locus Name 



|sp:YP20J*A(JLl 



Acc# 



P05332 



HYPOTHETICAL £20 ^ ROtfllN 



1338 



ORF Name 



NT ID 



— — Score Probability 
AAID Length Length 



111777161 cJ fob 



Protein name 



110414 



Locus Name 



Acc# 



Description 



(NO-HIT 



ORF Name 



Protein name 



NT ID 



AAID 



i4£4ft5i£..±I.-i4 



1041b 



— — Score Probability 
Length Length 



TTF" 



1.5e-24 



Locus Name 



Acc# 



hypothetical protein 



Description 



pir :D69060 



D69060 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



16.tt.L±tXb±clJ*l | 

Protein name 



10416 



2.7e-l6 



Locus Name 



probable nyarolase 



rirTTT7TJ2~ 



Acc# 



T37132 



Description 



ORF Name 



NTID 



AAID 



ixrarr 



Protein name 



hypotnetical protein 



Description 



— — Score Probability 
Length Length 



ITIT 



5TT 



TUZT 



3.5e-103 



Locus Name 



pir : jgiu^u 



Acc# 
JQ1020 



ORF Name 



NTID 



NT 



AA 



AAID Length Length 



— Score Probability 



Protein name 



110418 



Locus Name 



Acc# 



Description 



NO-HIT 



1339 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



— Score Probability 



19728433 cl 4y 



10415 



WIT 



T2W 



4.2e-l& 



Protein name 



Locus Name 



DNA damage- inducible protein. PAbi^a 



pir :C7b0b3 



Acc# 



C75053 



Description 



ORF Name 



NTID 



AAID 



^ ^ Score 
Length Length 



10420 



FT 



252 



Probability 
0.031 



Protein name 



Description 



Locus Name 



sp:SP£C_XENLA 



ACC# 



P36378 



(OS TE ONECTIN) (ON) (BA&EMKNT MEMijkANE j^ROTfellM faM-4u) 



NT 



AA 



ORF Name 



NTID 



AAID Length Length 
I35S 



110421 



Score Probability 
OTTJ^ 



75 



Protein name 



Locus Name 



gp:ACu0bbbb 



Acc# 



AC005565 



Description 



Homo sapiens chromosome lb, cosmid clone 444B9 (LAND , compietesequence . 



ORF Name 



NTID 



— — Score Probability 
AAID Length Length 



\2±&±0£&.l.cl1..&± I iraro 



10422 1 [2^1 I 



Protein name 



Description 



Locus Name 



Acc# 



MO-HIT 



1340 



ORF Name 



12441590^ ±2 18 



Protein name 



NT ID 



— — Score Probability 
AAID Length Length 



10423 



Locus Name 



Acc# 



Description 



IN0-M1T 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



5202 



10424 



TTT 



TZTF 



1.4e-i25 



Protein name 



Locus Name 



7 -alpha-hydroxysteroid denyarogenase 



|gp:A^173 8TT 



Acc# 



AF173833 



Description 



Sacteroides iragiiis 7 -alp ha -hydroxy steroid denyarogenase mdhAjgene, 
complete cds . 



ORF Name 



NTID 



10425 



Protein name 



hypothetical protein cobiuw 



Description 



NT 



AA 



AAID Length Length 



Score Probability 



IT 



0.021 



Locus Name 



pir :Tl«4b0 



Acc# 



T18460 



ORF Name 



Protein name 



NTID 



10426 



NT 



AAID Length Length 



AA 

— Score Probability 



Locus Name 



Acc# 



Description 



MO-Hlf 



1341 



NT 



AA 



ORF Name 



NTID 



|2630534<> ci 44 



AAID Length Length 
373 



10427 



Score Probability 
3.5e-10 



Protein name 



Locus Name 



sp : FEOB_Mt!TJA 



Acc# 



Q57986 



Description 

gBftftOtte 1R6M TRANSPORT PROl'i^N B tiOMOhOCj 



ORF Name 


NTID 


AAID 


NT 
Length 


AA 
Length 


Score 


Pr 


obability 


2S5S53l7_c2_5S 


5206 


1042S 


1S6 


471 


302 


S.7e-27 



Protein name 



Locus Name 



hypothetical protein ^Cl^OA.ly 



pir :T3bvyy 



Acc# 



T36799 



Description 



ORF Name 



Protein name 



NTID 



— — Score Probability 
AAID Length Length 



10425 



T5T 



6.9e-ib 



Locus Name 



transcription regulator Arau/xyis tamny 
homo log ydeE 



tpir:a69777 



Acc# 



G69777 



Description 



NT 



AA 



ORF Name 



NTID 



mB3.1^L^..5A I RTTS 



AAID Length Length 



10430 



Score Probability 
|2.4e-24 



[273 



Protein name 



Locus Name 



Acc# 



sp:YUXK_HAcJt!U 



Description 

HWOTtiril'lCAL lb. 7 KB PROl^lN IN PBPD-C OmA IMl'KRflBtiiC! Rii^ION [ORfJ) 



1342 



ORF Name 



NT ID 



NT AA 
— — , Score 
AAID Length Length 



)60AMi>'A ±3 3b 



5209 



110431 



F7T 



Probability 
1 . 7e-bb 



Protein name 



Description 



Locus Name 



sp:UNG_HUMAW 



Acc# 



P13051 



U^AClL-bNA GhMQSYLA&i PRhiUURS fta, (UDtJ) 



ORF Name 



NT ID 



AAID 



NT AA 
— — , Score 
Length Length 



6653437 c3 71 



TMTT 



TUZT 



Probability 



Protein name 



Locus Name 



1I5K outer membrane protein precursor 
protein 



;SusC 



pir:J06027 



Acc# 



JC6027 



Description 



ORF Name 



NTID 



AAID 



NT AA 
— — , Score 
Length Length 



10433 



ITS" 



Probability 
7.8e-42 



Protein name 



Locus Name 



nypotnetical protein 



pir : Jgiu^u 



Acc# 
JQ1020 



Description 



ORF Name 



Protein name 



Description 



NTID 



S.g.3..7.B.3.:L.c2...fiU 



5212 



AAID 



10434 



NT 

Length Length 



AA 

— , Score 



7¥T~ 



Probability 
5.5e-25 



Locus Name 



sp:YTPt!_kAklW 



Acc# 



P45312 



HY&Ol'Mlj'l'ieAL PRO'l^l Kf fitl677 



1343 



NT 



AA 



ORF Name 



NTID 



24417250 ±3 7 



AAID Length Length 
110435 



Score 



FT" 



IT 



Probability 
I 10.029 



Protein name 



Locus Name 



OmpK3 7 porin 



gp:KPN011502 



Acc# 



AJ011502 



Description 



Klebsiella pneumoniae { strain SD8) ompK3 7 gene. 



ORF Name 



NTID 



• NT AA. 
„ , „ _ — L1 _ — ^ Score Probability 
AAID Length Length ^ 



6725l§2 t3 5 



10436 



ST5~ 



2448 



3.1e-12 



Protein name 



Locus Name 



Acc# 



colicm I receptor 



gp:ECOCIR 



Description 



E.coli colicm I receptor gene, complete cds. 



ORF Name 



NTID 



AAID 



NT AA 
— — Score 
Length Length 



2.O.O.6.40.O...±2...5... 



5215 



10437 



7TT 



225 



Protein name 

Description 
ET^ITTT 



Locus Name 



Probability 



Acc# 



NT 



AA 



ORF Name 



NTID 



AAID 



3.2.X83A2...XX..X 



10438 



Length Length 



Score Probability 



Protein name 

Description 
MO-HIT 



Locus Name 



Acc# 



1344 



NT 



AA 



ORF Name 



NT ID 



AAID Length Length 



Score Probability 



755" 



Protein name 

Description 
IKTO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA 
— — Score 
AAID Length Length 



4i.7.aaa2L..±a...a i fsts 



10440 



Protein name 

Description 
INO-HIT 



Locus Name 



Probability 



Acc# 



NT 



AA 



ORF Name 



NTID 



l,5.6.2525.2....tl...l | pTET5 



AAID Length Length 
"1ST 



Score Probability 



110441 



Protein name 

Description 
[NO-HIT 



Locus Name 



Acc# 



ORF Name 



NTID 



NT AA 

„^ TT , T — _ — ^, Score Probability 
AAID Length Length J - 



i7..7iS12...tl..i 



10442 



i.4e-5i 



Protein name 



Locus Name 



115K outer membrane protein precursor : SusC 
protein 



pir :JC6 02 7 



Acc# 



JC6027 



Description 



ORF Name 



NTID 



AAID 



3.5.3.5.15.B.3....dl.„3. I I527T 



10443 



Protein name 
Description 

prcmrr 



NT 



AA 



_ — Ll — _ Score Probability 
Length Length 

7T" 



Locus Name 



Acc# 



1345 



— — Score Probability 



AA 



ORF Name 



NT ID 



AAID Length Length 



10444 



T5T 



5.9e-0V 



Protein name 

conserved hypothetical protein yKnk; 



Locus Name 
pir :E6y^btf 



Acc# 



E69858 



Description 



1346 



CLAIMS 



1 . An isolated nucleic acid encoding an B. fragilis polypeptide of SEQ ID NOS: 
5223 - 10444. 

2. A recombinant expression vector comprising the nucleic acid of Claim 1 operably 
linked to a transcription regulatory element. 

3 . A cell comprising a recombinant expression vector of Claim 2. 

4. A method for producing an B, fragilis polypeptide comprising culturing a cell of 
Claim 3 under conditions that permit expression of the polypeptide. 

5. An isolated nucleic acid selected from the group consisting of: 

(a) SEQ ID NOS: 1 -5222; 

(b) a complement of SEQ ID NOS: 1- 5222; or 

(c) an RNA of (a) or (b), wherein U is substituted for T. 

6. A recombinant expression vector comprising the nucleic acid of Claim 5 operably 
linked to a transcription regulatory element. 

7. A cell comprising a recombinant expression vector of Claim 6. 

8. A method for producing an B. fragilis polypeptide comprising culturing a cell of 
Claim 7 under conditions that permit expression of the polypeptide. 

9. A probe comprising a nucleotide sequence consisting of at least eight contiguous 
nucleotides of a nucleotide sequence selected from the group consisting of: 

(a) SEQ ID NOS: 1-5222; 

-1347- 
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(b) a complement of SEQ ID NOS: 1- 5222; or 

(c) an RNA of (a) or (b), wherein U is substituted for T. 

10. An isolated nucleic acid comprising a nucleotide sequence of at least eight 
5 nucleotides in length, wherein the sequence is hybridizable to a nucleic acid 

having a nucleotide sequence selected from the group consisting of: 

(a) SEQ ID NOS: 1 -5222; 

(b) a complement of SEQ ID NOS: 1 -5222; or 

(c) an RNA of (a) or (b), wherein U is substituted for T. 

10 

11. A vaccine composition for prevention or treatment of an B, fragilis infection 
comprising a nucleic acid of Claim 5 and a pharmaceutically acceptable carrier. 

12. A vaccine composition of Claim 1 1, further comprising an adjuvant. 

15 

13. A vaccine composition of Claim 11, further comprising one or more additional 
ingredients. 

14. A method of treating a subject for B. fragilis infection comprising administering 
20 to a subject a vaccine composition of Claim 11, such that treatment of B. fragilis 

infection occurs, 

15. A method of Claim 14, wherein the treatment is a prophylactic treatment. 

25 16. A method of Claim 1 4, wherein the treatment is a therapeutic treatment. 

17. A recombinant or substantially pure preparation of an B. fragilis polypeptide or a 
fragment thereof, wherein said B. fragilis polypeptide is SEQ ID NOS: 5223 - 
10444. 

-1348- 



A vaccine composition for prevention or treatment of an B. fragilis infection 
comprising an B. fragilis polypeptide of Claim 17 and a pharmaceutical^ 
acceptable carrier. 

A vaccine composition of Claim 18, further comprising an adjuvant. 

A vaccine composition of Claim 18, further comprising one or more additional 
ingredients. 

A method of treating a subject for B. fragilis infection comprising administering 
to a subject a vaccine composition of Claim 18, such that treatment of B. fragilis 
infection occurs. 

A method of Claim 21, wherein the treatment is a prophylactic treatment. 

A method of Claim 21 , wherein the treatment is a therapeutic treatment. 

A method for detecting the presence or absence of a Bacteroides nucleic acid in a 
sample comprising: 

(a) contacting a sample with the nucleic acid of Claim 5 under conditions in 
which a hybrid can form between a probe comprising a nucleotide 
sequence consisting of at least eight contiguous nucleotides of a 
nucleotide sequence selected from the group consisting of SEQ ID NOS: 
1-5222 or a complement of SEQ ID NOS: 1-5222 and a Bacteroides 
nucleic acid in the sample; and 

(b) detecting the hybrid formed in step (a), wherein detection of a hybrid 
indicates the presence or absence of a Bacteroides nucleic acid in the 
sample. 



-1349- 



A computer readable medium having recorded thereon a nucleotide sequence 
selected from the group consisting of: 

(a) SEQIDNOS: 1-5222; 

(b) a complement of SEQ ID NOS: 1- 5222; 

(c) an RNA of (a) or (b), wherein U is substituted for T; or 

(d) a fragment of (a), (b) or (c). 

A computer based system for identifying fragments of the Bacteroides genome 
of comprising; 

a) a data storage means comprising a nucleotide sequence selected from the 
group consisting of SEQ ID NOS: 1-5222, a complement of SEQ ID 
NOS: 1-5222, or a fragment thereof, 

b) a search means for comparing a target sequence to the nucleotide 
sequences of the data storage means of step (a) to identify homologous 
sequences, and; 

c) a retrieval means for obtaining said homologous sequences(s) of step (b). 

A method of identifying nucleic acid fragments of a Bacteroides genome 
comprising comparing a database comprising a nucleotide sequence selected from 
the group consisting of SEQ ID NOS: 1-5222; a complement of SEQ ID NOS: 1- 
5222; or a fragment thereof with a target sequence to obtain a nucleic acid 
molecule comprised of a complementary nucleotide sequence to said target 
sequence, wherein said target sequence is not randomly selected. 

A method for identifying an expression modulating fragment of the Bacteroides 

genome comprising comparing a database comprising a nucleotide sequence 

selected from the group consisting of SEQ ID NOS: 1- 5222; a complement of 

SEQ ID NOS: 1-5222; or fragment thereof with a target sequence to obtain a 

nucleic acid molecule comprised of a complementary nucleotide sequence to said 
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target sequence, wherein said target sequence comprises sequences known to 
regulate gene expression. 
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ABSTRACT OF THE DISCLOSURE 

The invention provides isolated polypeptide and nucleic acid sequences derived 
from Bacteroides fragilis that are useful in diagnosis and therapy of pathological 
conditions; antibodies against the polypeptides; and methods for the production of the 
polypeptides. The invention also provides methods for the detection, prevention and 
treatment of pathological conditions resulting from bacterial infection. 
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SEQUENCE LISTING 



<110> Gary L. Breton 

<12 0> NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO BACTEROIDES FRAGILIS 
FOR DIAGNOSTICS AND THERAPEUTICS 

<130> 2709.1001-001 

<160> 10444 



<210> 1 
<211> 420 
<212> DNA 
<213> B.fragilis 



<400> 1 

aacaaggaac 

cacttcctgc 

gaagaccata 

gtaacgatgc 

tcatatcccg 

gggctggtat 

acaggcaccg 



ggttcaggac 
tcgaccaagg 
ccctgggaaa 
tgacggacca 
gacagcaaat 
atccgctcag 
aatttacgat 



catcttccac 
caaaaaaaca 
catcagtctg 
cggaatgttt 
ctccatcttc 
tgacttcagt 
ccatgcggag 



caggagacga 
atcatcctcg 
ctgatagact 
attccggcat 
aacttcaacg 
aactggtggc 
ggagactact 



acgatcagac 
tgggtgcaac 
atatgaaagc 
cggggcggaa 
ccaccggact 
agggtacgct 
tggtttatct 



caaagcggta 
cggtaaacgg 
aggagcgcag 
ctgtttcaag 
gagggccgat 
gaacgaagcg 
gaattactga 



60 

120 

180 

240 

300 

360 

420 



<210> 2 
<211> 1836 
<212> DNA 
<213> B.fragilis 



<400> 2 

agccaccgtt cgatacttcc agccccgtgg ggaagacaga cgacaggttt tgagacggca 6 0 

gccgtccaga aatcggtaag cgtactcccc acacaaactt actacacgtt tacttgcggt 12 0 

ccggtggaac tggacctggt gtttaccgca cctttgatga tggacgacct cgatttgttg 180 

tctactcccg ttaattatat ttcttaccgc gttcgttcgc tggacaaaaa gcaacatgat 240 

gtgcagatgt atgtggagac caccccgcag cttgccatca atgaactgac gcaacctacc 3 00 

cgttcgaaag tgatccgccg taacggtatc aattatgtac aggcagggac tatcgaccag 3 60 

cctatcctcg cacgaaaagg agacggtatc tgtattgatt ggggatatgc ttatctggca 42 0 

ggaaatatag gtgccaatac agctgtcagc ctgggtaact actatggtat gaagaacgag 480 

tttgctacca agggttcttt gttgcctaca caagccgagt gcgtgacccg tcgtgccgac 540 

cagatgccgg ctatggccta tactgacgat ctgggtgaag taggtaccga tggcaaatcc 600 

ggcttcctga tgttgggtta cgatgatatt tatgctatcg aatacttcta tcaacctcgt 660 

atggcctact ggaagcatga tggtaaggta agcatcttcg atgcctttga gcgtgccaaa 72 0 

gcaaactatg cgtctgtcat ggaacgttgc cgtgcttacg acgaaatgat tctgaacgat 780 

gcagaaaaag caggtggcaa agaatactct gaactgtgtg cattggctta ccgtcaggtg 840 

attgccgccc ataagctgtt caaggatgcg gatggtaact tactcttctt ctctaaagag 900 

aacaatagta acggttgtat caatactgtc gacctgactt atccgtctgc tccgctcttc 960 

ctggcttata accccgaatt gcagaaaggc atgatgacca gtatctttga atatagtgcc 102 0 

agcggacgtt ggaacaagcc tttcccggct cacgacctgg gaacttatcc tattgctaac 1080 

ggacaggtat acggtggtga catgccgatt gaagaaggcg gaaatatggt agtcctggct 1140 

gctgctattg ccaaggtaga aggtaacgcc gactatgcta agaagtattg ggatttactg 12 00 

accatttgga ctgattatct ggcggaatac ggacaagatc ccgagaacca actctgtact 1260 

gatgactttg ccggacactg ggcacataac gccaaccttt cggtaaaagc gatcatgggt 132 0 

gtagctgctt acagtgaaat ggcccgtatg ctcggtatgg atgatgtagc cgaccgatat 13 80 

gctgccaaag ccaaagcaat ggctaccaaa tgggaacaaa tggctcgtga gggtgatcat 1440 

tatcgtctgg cattcgaccg tgagaatacc tggagccaga agtacaatat ggtttgggac 1500 

aagatgtgga atctgaacct tttccccaat aatgtgattg agaaagaaat ttcttattat 1560 

cagaccaaac tgcaaaaccc ttatggactt ccgttggatt cccgcaagga atatactaaa 162 0 
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tccgactgga ttatgtggac tgctgccatg tcttctgata aggctacttt cgagaaattt 1680 

atttctccgg tatataagta tgctaatgaa accgtatcac gtgttccgct gagcgactgg 1740 

catcataccg atagcggtaa gtttgtcggt ttcaaggcac gttccgtgat cggtggttat 1800 

tggatgaagg tattaatgga taaaatgcag aagtaa 183 6 

<210> 3 
<211> 750 
<212> DNA 
<213> B.fragilis 

<400> 3 

tgtggtggat tatttcgctg gcacgtgctt acgaaatcac gaatgacccg aaatacctgg 60 

cacatgcctc ttcgggattc tacccatgtc tggaaagagt cgtatgataa agaaaggggg 12 0 

ggcctgtggt ggaacttcaa gcacgatgga aagatggctt gcatcaacta tccgactacg 180 

gtgggcgcca tgactcttta taatgtgacc aaagatcccg attatctgga aaaggcaaaa 240 

agtgtatatg catggtcgag ggatgttttt ttcgacaagg agaaaggccg catagcagac 3 00 

aatatgcact atcatttcca aagacagaac ggtatggaca tagactggac aacccaactt 3 60 

tataatcagg ctacatttat cggttcggcc gtgatgctgt acaaagcaac cggcgagaaa 42 0 

gcttatctgg acgatgccgt tctggctgcc gactacgtcc gcaacgagat gtgtgatgcc 480 

gatggattgc ttccgttcaa aaatggcgtg gaacagggaa tttatgctgc catctttgca 540 

cagtacatca ttcgcctgat agaagatggc aatcagcccc aatatatgga ctggcttcgt 600 

cacaacatag acgtggcgtg gaacaaccgg gatgtaaacc gtaatgtgac attcaaggat 660 

gcaaccaaac cctgcccgac aggtgtgatg gaaagctatg atgccagcgg atgtcccgca 72 0 

ctgatgcaag tgatttctcc attcaaataa 750 

<210> 4 
<211> 1446 
<212> DNA 
<213> B.fragilis 

<400> 4 

ctgaaattgt ccgtccttat tcttgccacg catcagctta tgctctttat cgtacaggtt 60 

cttgtagttc atggcacgct gtgcataaac ggctatctcc tcttcgggct tgttcaaggc 120 

tttacccaac tgatagatgc accagtcatc ataagcatat tccagcgtac gggcggcatt 180 

ttcattgatg cctacattat aaggtacgta gcccagttga ttgtaatatt cataaccgag 240 

acgtccggta gacgaaacct gcggatgaac agcatttgca ccgtgtttca cagcttccca 300 

aagagtttct atatcgtaac ctttcaatcc tttcagatag gcatcagcca ctaccgaagc 3 60 

ggaattgtta cctaccatac agcccctgtg cccgggactt gcccattcgg gaaggaatcc 42 0 

gctttccttg taagtatttg ccagtccctc ctgcatcttt tcgttcatcg aaggatacat 480 

caggttgagg aacgggaaca ggcagcggaa tgtatcccag aaaccggtat cggtaaacat 540 

atatcccggc agcactttac cgttgtaggg actgtaatgt accggtttcc ctttggcatc 60 0 

cagttcgtag aagcttctcg ggaaaagtac cgaacgatag aggcaagagt agaatgtacg 660 

gagatgatcg gtattatcgt cttccacctc aatacgtccc agaaccttgt tccattcctg 72 0 

gcgtcctttg gctgcaacag cttccagatt gtctttaccc aactctttca agttctgttc 780 

tgcctgctcg gggctgataa aagaagaagc cactcgtacg ttgaccgtct ctccacgacg 840 

tgtagagaac ccgatgatac cacctgcatg tttatctttc gattccagct cacccggacg 90 0 

gatgttgccg ttggtaactg ctgcggtaaa agtgaacggc ttatcgaaca ccagtacaaa 960 

ataattctta aagttctccg gcactcctcc actgttcttg gttgtgtagc cgatgatctt 1020 

gttctcttcc ggaatcactt tcacatacga accgttgtcg aaagcatcta ccactacata 1080 

agaatcctta ctctcaggaa aagtgaaacg aaacatcgcc gcacggctgg tcggagcaat 1140 

ttcggtcgtg acatcatgat cggccagata tactttataa taatacggtt tggcaacctc 12 0 0 

agccttatgc gagaaccagc tcgcacgctg atcctgatcg aacacgacct ttcccgtaac 1260 

aggcataatg gcaaactgcc cgtagtcatt aatccacggg ctgggctggt gagtctgctt 1320 

aaatcccctg attttatcgg catcataggt ataagcccat ccatcaccca tctttccggt 13 80 

ttgtgccacc cagaagttca ttccccaagg catggcaata gccggatatg tatttccggt 1440 

agataa 1446 



<210> 5 
<211> 2367 
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<212> DNA 

<213> B.fragilis 

<400> 5 

tcatttttcc cattgatact gaagattaaa aaaactgtgt tagtccgtgt aatctgtggt 60 

gaatttaaaa ctattaatat catgaagaaa ttagcgctat tacttgttgg cgtattgggt 12 0 

actgctttct gcacttttgc aaagagtacg acagaaccgg tggattatgt aagcccactg 180 

gtcggtaccc agtcaaagca tgctttatct accggaaata catatccggc tattgccatg 240 

ccttggggaa tgaacttctg ggtggcacaa accggaaaga tgggtgatgg atgggcttat 3 00 

acctatgatg ccgataaaat caggggattt aagcagactc accagcccag cccgtggatt 3 60 

aatgactacg ggcagtttgc cattatgcct gttacgggaa aggtcgtgtt cgatcaggat 42 0 

cagcgtgcga gctggttctc gcataaggct gaggttgcca aaccgtatta ttataaagta 480 

tatctggccg atcatgatgt cacgaccgaa attgctccga ccagccgtgc ggcgatgttt 540 

cgtttcactt ttcctgagag taaggattct tatgtagtgg tagatgcttt cgacaacggt 600 

tcgtatgtga aagtgattcc ggaagagaac aagatcatcg gctacacaac caagaacagt 660 

ggaggagtgc cggagaactt taagaattat tttgtactgg tgttcgataa gccgttcact 72 0 

tttaccgcag cagttaccaa cggcaacatc cgtccgggtg agctggaatc gaaagataaa 780 

catgcaggtg gtatcatcgg gttctctaca cgtcgtggag agacggtcaa cgtacgagtg 840 

gcttcttctt ttatcagccc cgagcaggca gaacagaact tgaaagagtt gggtaaagac ' 900 

aatctggaag ctgttgcagc caaaggacgc caggaatgga acaaggttct gggacgtatt 9 60 

gaggtggaag acgataatac cgatcatctc cgtacattct actcttgcct ctatcgttcg 102 0 

gtacttttcc cgagaagctt ctacgaactg gatgccaaag ggaaaccggt acattacagt 1080 

ccctacaacg gtaaagtgct gccgggatat atgtttaccg ataccggttt ctgggataca 1140 

ttccgctgcc tgttcccgtt cctcaacctg atgtatcctt cgatgaacga aaagatgcag 1200 

gagggactgg caaatactta caaggaaagc ggattccttc ccgaatgggc aagtcccggg 12 60 

cacaggggct gtatggtagg taacaattcc gcttcggtag tggctgatgc ctatctgaaa 132 0 

ggattgaaag gttacgatat agaaactctt tgggaagctg tgaaacacgg tgcaaatgct 1380 

gttcatccgc aggtttcgtc taccggacgt ctcggttatg aatattacaa tcaactgggc 1440 

tacgtacctt ataatgtagg catcaatgaa aatgccgccc gtacgctgga atatgcttat 1500 

gatgactggt gcatctatca gttgggtaaa gccttgaaca agcccgaaga ggagatagcc 1560 

gtttatgcac agcgtgccat gaactacaag aacctgtacg ataaagagca taagctgatg 162 0 

cgtggcaaga ataaggacgg acaatttcag tcaccgttca atccgctgaa gtggggcgat 1680 

gccttcaccg aaggaaacag ttggcactat acctggtctg tattccatga tcctcaggga 1740 

ctgatcgacc tgatgggcgg acagcaaggg ttcaatcaga tgatggattc tgtctttatc 180 0 

ctgcctcctg tatttgatga cagctattac ggcggtgtga ttcacgaaat ccgtgaaatg 1860 

cagattatga atatgggaca gtatgcacac ggtaaccaac ccatccagca catgctatat 192 0 

ctgtacaatt actcgggaca accgtggaaa gcacagcatt ggattcgtga agtgatggat 1980 

aaactctata cacccaatgc cgacggttat tgcggtgacg aagataacgg acagacttcg 2040 

gcatggtatg tattttctgc tatgggattc tatcccgttt gccccggaac ggatcagtac 2100 

gtgatgggta cgccgtactt caaacagatg aagctgcatc tggagaatgg caagaccgtg 2160 

cagatcagcg caccgggcaa tagcgatgaa aaccgttaca ttgcgtcaat gaccgtaaac 222 0 

ggtaaaacat tgactcgcaa ctacctgaca cataaagaac tgatgaacgg agcgaagatt 22 80 

acgatgaaaa tgtcgtctac tccgaacaaa cagcgtggag tacgcgagtc ggatttcccg 23 40 

tattcgttct ctaaagaggt acgttga 2367 

<210> 6 
<211> 2514 
<212> DNA 
<213> B.fragilis 

<400> 6 

atggtaaaaa ctataaaaaa agaatctgaa gttatgaagc ttaaactatc gactctgttc 60 

ttgggtgcag ctgccatgct gagcagttgt ggggcatcgc aggacgtcaa gagtgaaaaa 12 0 

agtgagatgc gtgcaccggc ctatccgttg gtgatgattg acccttacac cagtgcctgg 180 

tcgtttacgg ataatctgta tgacggaccg gtgaaacact ggaccggtaa ggacttcccg 240 

ttcttgggtg ttgccaaggt agacggacag atttaccgtt tcatgggaac ggaagaactt 3 00 

gagctgcttc cgctggttaa gacctcggaa caaggcagat ggacagctaa gtatacaaca 3 60 

aagaaaccgg ctgacggctg gcagaatgcc gactttaatg atgcggcatg gaaagaagga 42 0 

gaaggtgctt tcggtactat ggagaatgaa agtacagcca agacccagtg gggagaagag 480 
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tatatctgga tacgccgtaa agcggatatt aaagacaacc tgcaaggtaa aaatgtatat 540 

ctggaatatt ctcacgacga tgacgccatc atctatgtga atggcgtgaa ggtggtggat 600 

accggtaact cggctaaaaa acatatgctt gccaaactgc cggaagaggc tgtggccgca 660 

ctgaaacagg gagagaacct gattgcaatt tactgtaaca accgtgttgc caacggtctg 72 0 

atcgattgcg gtctgttggt agagaaagac aatacacaga actttactca gacagcagta 780 

cagaaatcgg tagacgtgca ggctatgcag accaactacg aatttacttg cggaccggtg 840 

gacttgaaac ttgcgttcac ttcacctctc tttatggata atctcgattt gatgactcgt 900 

ccggtgagct atcttaccta tgaagtggct tcgaatgacg gaaataaaca taacgtagaa 9 60 

ctttatttcg aagcaggacc gcagtgggca ctcgaccagc ctcatcagga agctgtagcc 102 0 

gaaagcttta cagaaggtaa tttgctatac ctcaagacgg gaagccgcaa ccaggaaata 1080 

ttgggtaaaa agggagatga tgtccgcatt gactggggat acttctacat ggctgccgat 1140 

aaggagaaca gttcatgcgc taccggagag ggaaagaccc taagaaagag tttcatcgac 1200 

ggaaaattga catcatccaa gaccgatgga agtgacaagc tggcactggt tcgctcactg 12 60 

ggtgaaacga agaaagcgga aggacacttg ctgctgggtt acgatgactt gtactctatt 132 0 

cagtatttcg gtgaaaatct tcgtccgtac tggaaccgca atggaaacga aaccattcag 13 80 

tcgcagtttg cgaaagctga taaggaatat gatgcagtga tggataaatg tgctgcattc 1440 

gatgctaacc tgatgaaaga agctactgaa gtaggcggac gtaagtatgc cgaactctgt 1500 

gcattggctt atcgccaggc aatcgctgcc cacaaattgg tggaagcccc caacaaagac 1560 

ctgctgttcc tctcaaaaga gaacttcagc aacggttcga tcggtacggt ggatatcact 162 0 

tatccttctg ctcctttgtt cctggtatac aatccggaat tggcaaaagg tctgatgaac 1680 

cacatcttct attatagcga aagcggaaaa tggaataagc cgttcgctgc acatgatgta 1740 

ggtacttatc cgttggctaa cggccagaca tacggtggag atatgccgat cgaggaatcg 1800 

ggaaatatgc tgatcttgag tgcagccatc gccattgtcg aaggaaatgc cgactatgcg 1860 

cagaaacact gggatgtatt gacaacctgg accgattatc tggctcaata tgggctcgat 192 0 

ccggaaaatc aactttgtac agacgacttt gccggacact ttgcacacaa cgccaacctg 1980 

tctatcaaag ccatcctggg tgtagcgtct tatggctatc tggccgataa gttaggcaag 2 040 

aaagaagtgg ctgagaaata tacacagaaa gccaaagaaa tggctgccga atgggtgaag 210 0 

atggcagacg acggcgatca ctaccgcctg acttttgaca agcccggaac atggagccag 2160 

aaatacaatc tggtatggga taaactgatg aatctgcaga tattccctga aacagttgca 222 0 

cagaaagaga tagcttacta tcttggcaaa cagaatcaat atggattgcc gctggataac 22 80 

cgtgaaactt ataccaagac cgactggatt atgtggactg ctacactggc accggacaaa 23 40 

gctacattcg agaagtttat cgatccggtt tatctgttca tgaacgagac gaccgatcgc 2400 

gtgccgatgt ccgactgggt atttaccgat cgtccgaacc agagaggttt ccaagctcgt 2460 

tcggtagtag gcggatacta tatcaagatg cttgagaaga agttgaaaaa ataa 2 514 

<210> 7 
<211> 1221 
<212> DNA 
<213> B.fragilis 

<400> 7 

tttgtccttt tgttcatctg tctaaaaaaa ctcagtggga ctccgtgtcg ctctgtggtg 60 

aataaaactc aaaacagttt aataatgaga aaattagcta tgtgggcact gggtgccctc 12 0 

tttgtagccg gttgtgcaga gacagaaaag gctactacgg attccggttt ggtaaagagc 180 

aatttccaga ctgaggtggg cggaaagaaa accgatttgt atgtactccg taatcagaac 240 

aacatggagg tttgcgtcac taattttgga ggacgtattg tttcggtaat ggttcccgat 3 00 

aaagaagggg tgatgcgtga tgtagtgttg ggcttcgact ctattcagga ttacatcagc 3 60 

aagccttcgg acttcggtgc cagcatcggt cgttatgcca atcgcatcaa tcagggaaaa 42 0 

tttactttgg atggagttga ataccagttg ccgcgcaata actacggaca ttgcctgcac 480 

ggtggtccga aaggattcca atatcaggta tacgatgcca agcaggtggg accgcaggaa 540 

cttgagttga cttatctttc aaaagacggc gaggaaggtt tccccggtaa tatcacctgt 600 

aaggttatta tgaagctgac agatgataat gccatcgata tcaagtatga ggcagaaacg 660 

gataaaccga ccattgtcaa tatgaccaac cactcttatt tcaatctgga cggagatgca 72 0 

ggcagcaatg ccgatcatct gctgactatc gacgccgatg cttatactcc cgtggacagt 780 

acctttatga ccagtggcga gattgtaaca gtggaaggta ctccgatgga cttccgcaca 840 

ccgactccgg ttggaaaacg cattaatgat ttcgatttcg tgcagttgaa gaacggtaat 900 

ggttacgacc ataactgggt gttgaatgcc aaaggcgata ttacccgtaa ggccgctact 960 

cttgaatcac ccaaaaccgg tatcgtactc gatgtataca ctgacgaacc cggtattcag 102 0 

gtatatgcag gaaacttcct tgacggttcg ctgaccggaa agaaaggtat tacttacaat 108 0 
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caacgtgctt ctgtctgtct ggagactcaa aagtatcccg atactccgaa caaacctgaa 1140 

tggccttcgg ctgtattgcg tccgggtgaa acctacaata gtcattgtat cttcaaattc 1200 

tcggtagata acggaaaata a 1221 

<210> 8 
<211> 3258 
<212> DNA 
<213> B. fragilis 

<400> 8 

cgatgctttt atacattaaa gatttataga acctgttttt taaatttatg tactatgcgt 60 

aaaaaagaac aaatgttttg gctcgcaagt cgtagcagaa tgtggcgcat accactttgt 12 0 

atggctgcct tttcgctttt gccaagtgcc tacagtttcg ccagtgccga aaacccggca 180 

acagaaactg tattggcagt gaactccgtt caacaacaac ggactgtaaa gggtatagtt 2 40 

atcgacgcca atggcgaagc ggtaattggt gctaacgtaa aagagccggg aagtacaact 3 00 

ggtaccatca ccgatatgaa cggtgaattc tcgctgagtg tcggccctaa ag£tacactt 3 60 

gaaatctcat ttattggtta cacaacacaa aaagtaaacg taggcgcatc aaacacagtt 42 0 

aaagtaattc tgcaggaaga cacaaaagta cttgatgaag tcgttattac cggtttcggt 480 

atggcacaaa agaaagcgac tttaacaggt gctgtttctg caattaaatc aacagatatc 540 

gagcgttcag ctgcatctac tgcttcagga gcattagttg gtaagattgc aggtttgaac 600 

actcgtatgc aagatggtcg tcctggtgct tctactgcat tgcagattcg taatatgggt 660 

accccgttat ttgtgattga tggtgtgcag tcggatgaag ggcaatttaa taatatggac 720 

tttaatgata ttgaaaatat ttcaatattg aaagatgcct ctgctgctat ttatggtatt 780 

cgtgctgcta acggtgtagt agtggtaact actaagaaag ggcaacagaa gagcaaaaat 840 

acagtttctg ttaatgctta ttatggttgg cagaaaaact caagatggat tcaacccgct 9 00 

gatgctaaaa cgtatgtaaa tgcatatacg gctgctgaga cttgggccgg ccgaactgat 960 

ggagaacgta aattctcaag agaggattat gataagtgga tggctggtac ggagaaaggt 102 0 

tatacaggat ttgactgggg agattatatt tggaagacat ctccacaata ttatgtaaat 1080 

actaactttt ccggaggttc agataaagcc aactattatg tatctgtatc acatattaat 1140 

caagatgcta cagtgcgtaa ttacggtgga ttcaaacgta ccaatgttca gatgaatgtt 12 0 0 

gatatgaaag tcaatgatcg ttttaagatc ggagcaagta tgaatggtcg tatcgaatca 12 60 

cgtaagaatc ctggagttcc gggaggtgat gattatgatc ttcctttgta ttctaacttg 132 0 

aagaactggc cgacaatggg tccgtacgca aatgataatc ctctttatcc gcaaaaggtt 13 80 

tcaacagata ttaataccaa ttttgccctt ttgaactatg agaactctgg taaaatgacg 1440 

gatgattggc gtgtgcttca aatgcaggct acagcagaat acgaactact gaaaggattg 1500 

aaggccaagg gaatggtggg atactatttc gcttatagag aaatggaaaa tcatgaatat 1560 

ccttttaaat tgtatcgata taatcaggct aatgatactt atgaagtagc tgaatcaatg 162 0 

aatactcctt atcgtgaacg tattcgtcat agaaatgaag atttattttc taatttccag 1680 

ttgaattttg atcgtaagtt tggggatcat tatattaatg ctattgctgg ttttgaagct 1740 

tctcaacgca agagtccgaa tttcaatata atctcaactc cggtagctaa taatttgaac 1800 

ttgattcaat ttaaagaaat taaaacgttt aatgataacg gaaatgatac ccaagctcga 1860 

atgggatatt taggacgcat caattatagt tatgctgata aatatttagt tgaatttatt 1920 

ggtcgttggg atggttcttg gaagttccgt ccgggaaatc gttggggatt cttcccttcg 1980 

gcatctttag gatggagaat ttctcaggaa aaattctggc aagaaagtaa attggcaaat 2 040 

attttctcag actttaagat tcgtggttct tatggtgtag taggagatga taatgtaagt 2100 

gactattctg catttgatta tttggccggt tatgattata atagaggagg ttcggttatt 2160 

gatggacagt atgttgtagg atctgctcct cgtggattgc ccaatcagac attatcatgg 222 0 

ataaaagcca aaatattgga tattggtgtg gatatgggtt tcttcaataa tcggttgacc 22 80 

gctcagtttg acttcttccg ccgattacgt acaggtatcc cggaatcacg ttatgatgta 2340 

ttgttacctt ctgaagttgg ttttggattg ccaaaagaaa atctgagatc agatcttcat 2400 

atcggttatg atgcgatggt acgttgggcg gataatatca atgatttcaa ttatagtgtt 2460 

ggtgctaatg ttacttattc tcgtttctat gattgggaac aatatgatga tcgtcgcagt 252 0 

aactcttggg atagatatcg taatagtatt tggcatcgtg taggctatat aaattgggga 2 580 

tatgaggctg ttggacgttt tgaaaattgg gagcagatag cgacttatcc tgtagatatt 2 640 

gaccgaaaag gtaatcgtac agtagttccg ggtgatatta tatacaagga tgtaaatggt 2700 

gatggtgtta ttaactatat ggatgaacgt cctattggtt acagacaaga tggaactcct 2 7 60 

aacttgaatt ttggtatcaa tttatctgct agttggaaag gttttgatct ttcaatggac 2 82 0 

tggacaggtt ccggaatgac ttcatggatg caaaaatggg aaactgcacg tccattccag 2 880 

aatgatggaa atagtccggg tgaagtgttg aaggattctt ggcatttagc agatgtttgg 2 940 
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gatgctgata gtgaattgat tcctggtaaa tatcctttga tacgtatgaa taatgccgaa 3000 

acatctgctt atgaaaagag cacgttctgg ctgcataatg tacgctatat caagttgcgt 3 060 

aatttggagt ttggatatac tttaccgaaa gctttgttgg caaaatctgg tattagtaat 3120 

ttgcgtgtat atttgtcggg aactaacttg gtgacattga ctaatgtacc tattattgat 3180 

ccggagggtt ctaaagataa tggcttgatt tacccaacgc cgcgtattat aaatcttggt 3240 

attaacctca aattttag 3258 

<210> 9 
<211> 198 
<212> DNA 
<213> B.fragilis 

<400> 9 

gggagtaaac atcagtatgc taccatactt ctaatggtat gtatacctgt atgttatgtc 60 

ttattctgta atctttatct tgtctcgcgt acccgggctg ccgtgcccga gtatgtggat 12 0 

tcatgggcac aggcacccgg actcatgggc ccaggtatac aggacgaagt aaagatcgca 18 0 

ccatttccaa ctaattaa 198 

<210> 10 
<211> 2175 
<212> DNA 
<213> B.fragilis 

<400> 10 

caacacaaac acatgagtaa gagggtatta gttttgattg gtttgttttt ggcctgtggc 60 

ggcgtctatt ctcaaaccgc cacaggcaca aaaacgaatt tccagacagc agagtcgtgg 12 0 

aaaccggaaa cagatgtgcg tgccgatgcc gtgatggtat acggaacgct ggataagaaa 180 

ggcgttactt ttgagcaacg cgttcaatcg tggagagaca aagggtaccg ggccgagttt 240 

atgacagggg tagcctgggg agattatcag gattattttc ttggaaaatg ggatggggtg 3 00 

aaagatcacc tcaaagaggg tcagcgcgac cgcgaggggc gggagattgc gcacgggcac 3 60 

ttgatcccct atattgtgcc tactgagagt ttcatccgtt acatgcagga gaagcagatt 42 0 

aaacgggtga tcgatgcggg cattacttct atttatctgg aagaaccgga attctggatg 48 0 

cggggcggat acagtgaagc cttcaaatcc gaatggcaaa aatattatgg attcccgtgg 54 0 

agagctcaac atgaatctcc tgagaataca tacttatcga ataaactgaa atattatctg 600 

tactacaacg cgcttaatca gatattcacg tatgccaaaa cgtacggcaa atcgaaagga 660 

ctggatgtga agtgctttgt gcctacacat tcacttgtca actatacatc gtggcagatc 720 

gtaagtcccg aagcgagcct ggcttcgctg gattgtgtgg atgggtatat cgcgcaggtc 780 

tggacgggaa ccgcccgcga accgaactat tatgacggag tgaagaaaga acgggtattt 840 

gagaatgctt ttcttgaata tggatgtatg aaatcgatga cagctccgtt aaaccgcaag 90 0 

atgtacttcc tgaccgaccc catcgaggac cgggcgaaag actggctcga ttataaaata 96 0 

aattaccagg ctacttttgc cgcccaactg atgtatccgg cggtagacac atacgaagtg 102 0 

atgccttggc cggaccgtat ttatcagggg ctgtaccaag tagcgggaac agaccggaaa 10 80 

gaacgtattc cgcgtgacta ctccacacag atgcagatta tggtgaatac gctgaacgac 1140 

atacggactt cggaaactca ggtgagcggt acgcacggca tcggggtact gatggcaaat 12 00 

tcgctgatgt ttcagcgttt ccccggtcat gatggctatg acgatccgca attcagcagt 12 60 

ttctatggac agaccctgcc gttattgaaa agaggaatac cggtggagtt ggtacatatg 132 0 

gagaatactc cgttcggaga cacattcaag gggctgaagg tgcttgtaat gtcttattcg 13 80 

aacatgaaac cgatggaacc ccgatatcat gattttctgg cagactgggt gagaaaaggc 1440 

ggagcgctga tctattgcgg cgaagatatc gatccttatc agtcggtgct tgaatggtgg 1500 

aattccaacg gaaatcaata taaggctcct tcggaacatt tgttcgagaa actgggactg 15 60 

gacagagtac ctgctgccgg aacttaccct tgtggaaaag gtatggtgac cgttatacgt 162 0 

gaagatccga aacactttgt gctgaaaagc ggaaatgacc ggcaatattt cgatgcggtt 1680 

tcggctgctt acagaaagag tgccgggaaa gaagtagaac tgaaaaacag tttcctgctc 1740 

gaacggggac catataccat tgctgccgta ttggacgaaa gtgtatcgga tgctccgatg 1800 

gaactttcgg gggtgtatat cgacctgttt gataaagatc ttccggtatt gacacataag 1860 

gtgattcgtc cgggcgaaca aggttactta tataatgtca aacggatttc gggacgggca 192 0 

aaggcgaaag tgctttgcgg tgcttccagg atatacgatg aaaaggcagg aaaacgaagt 19 80 

tattcgtttg tggccaagag cccgttgcat acgactaatg cttcgaggat attgctccct 2 040 

aaacagccga tacgggtttg cgtgaacggg aaggaagagc ctcagccgga gaaattgtgg 2100 



7 



gaagagcgtt cgcgtaccct gctcttgaag ttcgagaacg accccgccgg cgtgcaagta 216 0 

gatattgaat ggtaa 2175 

<210> 11 
<211> 258 
<212> DNA 
<213> B.fragilis 

<400> 11 

cacagagtta aagcttgttc tttggatgtg aataagaagt cctttaaatg caaaagactt 60 

gttatttgcg cacaagaacc tgccaacctg caaaaggcgt taacaatgtt aattgaaaaa 12 0 

aggtataagg atgaagatac cggttcagac ggcgtaaact cacttccgga acttgagcta 18 0 

tcttattcag ccggtgtctg ttttttctta ttaaagcaag caaaaaggac aattatcaac 240 

ttgaaaataa agaaataa 25 8 

<210> 12 
<211> 1482 
<212> DNA 
<213> B.fragilis 

<400> 12 

aaaccaatca agattatgcc aggaaaaaac tcaaagaaaa tgatcggagc atgtgtcgtt 6 0 

actgcggcac tgctctgtgc gccttcagca ctgaaggccg aaggtatgtt gtcgcattat 120 

acttgtgtgg cagatgctat tcagaaagac aaccgtccgg aacccgctaa gcgtctgttc 180 

cgttcgcagg ctgtagaaaa cgaaatcata cgtgtacaga aactgttgcg taactcaaag 2 40 

ctggcctgga tgtttaccaa ttgtttcccc aatacactgg ataccaccgt acacttccgc 3 00 

aaaggcaaag acggcaaacc cgatactttt gtatatacag gagatattca tgccatgtgg 3 60 

ctccgtgact cgggggctca ggtatggcct tatgtacaac tggccaattc cgatccggaa 42 0 

ctgaaaacga tgcttgccgg agttatcaac cgccagttta aatgtatcaa tatcgatccg 480 

tatgccaatg cgttcaatga tggccctaaa gggggtgaat ggatgagcga cctgacggat 540 

atgaaacctg agttgcatga acgcaaatgg gagatcgact cgctttgcta tccgttgcgc 600 

ctggcttatc agtactggaa gacaacaggg gatgccagta tcttcgatga agaatggata 660 

caggcaatca ccaacatatt gcgtactttt aaggaacaac agcgcaaaga cggtgtgggt 720 

ccgtataagt tccaacgtaa gacagagcgt gctctcgata cagtgaccaa tgacggactg 780 

ggtaatccgg tgaaacctgt cggactgatt gtttccactt tccgtccttc ggacgatgcc 840 

acgacattgc agtatctggt tccgtccaac ttctttgccg tatcttcact ccgcaaggca 900 

gccgagatac tgacaaccgt gaataaaaag acggctttgg ccaatgaatg caaggctttg 960 

gcaaacgagg tggaaacagc cctgaagaaa tatgccgttt acaatcatcc caaatacgga 102 0 

aagatttatg ctttcgaggt ggacggtttt ggtaaccaca tgctgatgga cgacgccaac 1080 

gttccgagcc tgctggcaat gccttatctg ggtgatgtgt cgattgatga tccgatttat 1140 

cagaataccc gccgttttgt atggagcctc gacaatcctt acttcttcaa aggtaaggca 12 00 

ggcgagggca ttggcggacc acacatcgga tacgatatgg tatggcccat gagtatcatg 12 60 

atgaaagctt tcaccagcaa ggatgatgcg gagatcaagt cgtgcatcga gatgctgatg 132 0 

aatacggatg caggtacagg cttcatgcac gagtctttcc ataaagacaa tcctgagaaa 13 8 0 

tttacccgtg cctggtttgc atggcagaat actttgttcg gtgagttgat cctgaaactg 1440 

gtgaatgaag gtaaagtgga tatgctgaat agtatacagt aa 1482 

<210> 13 
<211> 3624 
<212> DNA 
<213> B. fragilis 

<400> 13 

gcaaacatga aattacacat tgctatgctg gcagctaccc tgctgttgtc cggaggagcc 60 

tcgtacgctc aagggaacaa acaggagaaa aaggcgaaag cctacatggt agcagatgcc 12 0 

catctggaca ctcagtggaa ctgggatgta cagactacca ttaaagagta tgtatggaac 180 

acgatcaacc agaacctgtt tctgctgaaa aagtatccga actatgtatt caactttgaa 240 

ggcggagtga aatatgcctg gatgaaggag tactatcctg cacaatacga agaaatgaag 300 

aagtacatcg gggaaggccg ctggcacatt tccggaagta gctgggatgc aacggacgct 360 
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ctggtacctt cgactgaatc gttcatccgc aatattatgc tgggacaaca gtattacaga 42 0 

caagaattcg gagtggaaag cacggacatc ttcctgcccg actgtttcgg atttggctgg 480 

acactgccta ctatcgcttc gcactgcggc ctgattggtt tctcttcaca aaaactggac 540 

tggcgtgtgc atccgttcta tggtaagagc aagcatccgt ttacaatcgg cttgtggaaa 600 

ggaatagacg gatcgtctat catgctggca catggatatg attatggcag aagatggaac 660 

gacgaagacc tttcagaaaa cgaacaactg aaagaactgg caggccgtac acctcttaat 72 0 

acagtataca gatattacgg tacaggcgat atcggcggat caccgacact ggcctctgtc 78 0 

cgctcagtgg aaaaaggact tcgcggaaac ggtccggtag aaattgtcag tgcaaccagc 840 

gaccagcttt acaaagatta ccttccttat aagaatcatc cggaattacc ggtattcgac 900 

ggtgaactgt tgatggatgt tcacggtaca ggatgctaca cctcacaggc tgccatgaaa 960 

ttgtacaacc gccagaacga attattggga gatgcggccg aaagagctgc cgtaactgcc 102 0 

gaatggctga atcaggccaa atatccggga agcaccatca atgaagcatg gaaacgcttc 1080 

atttatcatc agttccacga tgacctgaca ggaaccagta taccgcgtgc ctatgaattt 1140 

tcatggaacg atgaactgat ctcactgaaa cagttctcca atgtactgac ttcttccatt 12 00 

catggtatcg gcagggaatt ggatacacgg gtcagcggta ttccggtaat cctttataat 12 60 

gcactcggat ttacggttac agatattgcg gaaatagaac ttgaccttcc aaaagccccc 132 0 

aaagggataa cggtgtacga tgaaaagggc aaaaaagtat cggctcagct catttcttat 13 80 

accgacggaa aagcacgcat cctggtagaa gcaacagttc cggctacagg atatgtggta 1440 

tatgacgtac gcacatcagg aaccggtgca agcaacgtct cgacgaacgt caataccttg 15 0 0 

gaaaactctc tgtacaagat tacattggat aaaaatggag atatcgtctc tctgactgac 15 6 0 

aaaaagaacg gcaaagagct ggtgaaagcc gggaaagcaa tccgcctggc agtcttcact 162 0 

cagaacaagt catacaattg gccggcatgg gaagtgttga aagagacaac cgaccgtact 1680 

ccggtttcga ttacgaatga cgtgaaaata actttggttg aagacggaac tttacgtaaa 17 4 0 

tcgctttgtg tcgagaaacg tcacggagaa tctgtcttcc gtcaatacat acgtctgtat 1800 

gaaggtagcc gtgcagaacg catcgacttc tataacgaaa tagactggca atcgaccaac 186 0 

gcattgttaa aagccgaatt cccgctcaat attgaaaacg aaaaggctac gtacgacttg 192 0 

ggtatcggca gcatacaacg tggcaacaat accgaaacag cttacgaagt atatgcacaa 19 80 

tattgggccg acctgaccga tcgtgacgga agttatggtg tatcggtgat gaacgacagc 2040 

aaatatggat gggacaagcc ggataaccat acgatccgtc tcaccctgct ccacacaccg 2100 

gaaacacgcg gaggttacgc atatcaggat catcaggatc tcggtcatca taccttcacc 2160 

tacagcctga taccacatca gggagccttg gataaacccg ccactgtaga gaaagccgaa 2220 

aaactgaacc agcaactgaa agccttccgt acggaaaagc acaaaggaaa tgccggaaaa 22 8 0 

tcgttctcgt ttgtcgcttc ggacaaccgc aatgtattga tcaaggcact gaagaaagcg 2340 

gaagaaaccg atgagtatgt agtacgcgta tacgaaaccg aaggccggaa agcacagagc 24 00 

gccacactga cctttgcagg ggaaatcatc agtgccagcg aagccaacgg tacagaaaag 2460 

acaatcggca atgcaacttt cgaaggaaac aagttgcagg taaacatcac tccttattct 2520 

gtaagaactt acaaagtacg cctcaaacca tcgggacgtg agacgtctcc gatcgaatat 2 5 80 

gccgctttac cgcttgacta cgaccgcaaa tgtgcttctt ataatgaatt ccgtggagaa 2 640 

ggcgacttcg aatcgggcta ttcttttgca gccgaacttc tgccggactc actgatagcc 2700 

ggtcagatca ctttccgttt gggagaaaaa gagattgcga acggaatgac ttgtgaaggt 27 60 

gataccttgc aactgcctgc gggaaacaaa tacaaccgtc tctatatcct cgccgcctct 2 82 0 

accgaaggag acaatcaggc cgacttccgc attggcaagc agaccgcttc attcgttgta 2 880 

ccttcttata ccggcttcat cggccaatgg ggacataaag gacacaccga aggatatctg 2940 

aaagatgctg agattgccta tgtaggtaca caccgccatg catccaacgg tgatcagcct 3 000 

tatgaattca cttatatgtt caaatttggt atggatattc cgaagggagc taccagcgta 3 060 

atcttgcccc gaaatgagaa agtggttttg tttgctgcta ctctggttgc cgaaaatgaa 312 0 

ccggctacaa ccgttgccgg cactcttttc >cgcaccaata acgtaggtaa tgcagctact 3180 

gccggaaatg atgaagaagc agtacgcgaa aatatcctga aaagagctaa aatcattgct 32 40 

tgctccggat ataccaacga cgaagaaaaa ccggacttcc tgctggatgg taaaacggat 33 0 0 

acaaagtggt gtgacgtttc gcagactccg aactacgtag acttcgatct gggtgaagca 33 60 

caaaacatca gtggttggaa gatggtgaac gccggacagg aaagtcactc atacatcacc 342 0 

aatggttgct tcctgcaagg taaaatgaac ccgggcgatg aatggacgac tctggatgct 3480 

atcgacggta accatgcaaa tgtcgtttca cgtccgctga actatgacgg aaaggtacgt 3540 

tacatccgtc tgcttgtgac tcgtcctaca cagagcaccg gaggcagaga tacacgtatc 3 600 

tacgaactgg aagtttataa ataa 3 624 



<210> 14 
<211> 1860 
<212> DNA 
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<213> B.fragilis 



<400> 14 

caaacaatga 

tgcagtggat 

gatgtcaatc 

catgtaggag 

gatgatcagg 

atcaaccaat 

cctatggaag 

ttaggaggaa 

actactcttc 

tgcaaaacta 

aaatgggctg 

tacaatacag 

tcagagaaag 

agtggaaaat 

aacttcttta 

tatatttatc 

gaagatggag 

atagctacag 

aaattttata 

attctgtatc 

attaagaatt 

gatcaaggaa 

tgtaatcgta 

gaccgtggat 

gcagaagctg 

gcagtacgtt 

catgagaatc 

cgtcaagctg 

ttgtggcctt 

gaaaacatga 

gagctagata 



aaaagagaaa 
tcctggacca 
tgacaaaatc 
atacggatgg 
aagtagatcg 
tcttaaaagg 
gagaggctcg 
tgccgattgt 
aagttcctcg 
ttgcagaaat 
caaaaatgct 
ttgccgatta 
caactgatta 
attctttgat 
aggctgtatg 
cgggacaaac 
ggaatagccg 
atactccggg 
ctaaccctga 
caggttcttc 
ctgatggaca 
gatatgttac 
caggcttcta 
ctgaaatgtg 
cttatgaact 
ccagggcagg 
aagtagagtt 
ataagatctg 
tcttagttgt 
atcgttatta 
atggttggct 



ttttatagct 
aaagccagat 
tgtattggca 
gttcgctctg 
taactggtgg 
gttgagagaa 
ttttatacgt 
aggcgatgaa 
tgcaacagaa 
gttgccaacc 
tgaagcgaga 
tccattgttg 
ttataagaaa 
gagagttgct 
tgaaaaaaat 
tcatggttat 
tttgtctgca 
agaaggagct 
agatctgttt 
ttttagagat 
atgggaacag 
agccttgaat 
tgttcgtaaa 
gaatgtttat 
taatggtgga 
tgttaaagaa 
tgcctttgaa 
gacaggctca 
ctctgacgat 
tagaaatcca 
gaataacaat 



gttgctgctt 
cgtattatga 
aatttctatg 
ttggatgagg 
cgtacatatg 
tcgactgcat 
gcatgggttt 
gtatatgatt 
tctgcaatgt 
gaaccttcaa 
gctgctgttt 
aacccggaaa 
gcgttagctg 
gatgatgcta 
ggcaatacag 
accaagtctg 
ttgctaaatt 
aagtttgatg 
gtaggtcgtg 
agaactgtcg 
aagttgggac 
ggtccgatgg 
tatcttgata 
ttccgtcttt 
agtgatgcta 
ttggcttctg 
ggtcatcgct 
gaaatggata 
gataagaatg 
ttgaaatgtc 
ccgaaactgg 



gtgcactggc 
cagaagatca 
aacgtatatc 
ccattactta 
attatacatt 
tgtctgaagt 
atttttgtac 
atacttctgg 
atgattatat 
agaatggagc 
atgcaggttc 
caggagtagt 
ctgcagaaga 
ctccgcaaga 
aagtcatttg 
tacagcctca 
tggtagaggc 
ttggtacaaa 
atcctcgttt 
ttttacaaac 
agagtttggg 
tacgtaatga 
aaacaacttc 
ctgaagctta 
ctgctttgaa 
ttaaccatca 
ggtgggattt 
tcacggctac 
gaaaatgggt 
tacctaaaca 
tgaagaaccc 



attaagtagt 
agtttatgga 
actaggccaa 
tgatactaaa 
gatccgtaat 
tgagaaagct 
ttgtcgtact 
tatggacatt 
tatagaagag 
tcgcgcaact 
tattgcccga 
aggtatctct 
agttatcaat 
gaaagcagac 
gtcacgtgat 
tgatggtgcc 
ttttgaacct 
ggataatcct 
ggcagggaca 
aggacaatgg 
agaaaaagat 
ccaacgtgaa 
tgcgggaact 
tttgatagct 
atatataaat 
acagattatg 
gaaacgttgg 
acgccgtggc 
gttctttgaa 
ttattatgct 
gtatcaataa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 



<210> 15 
<211> 1284 
<212> DNA 
<213> B.fragilis 



<400> 15 

ggtccggcag 

atcatgacac 

cgggggttcg 

ttagctccgg 

ctctattttt 

tttttcatgc 

tggaggcttg 

ttttttatgg 

agatggttgc 

gtatctctgc 

cggctttttg 

ttcaatacgt 

ctccagttat 

agtgaggaaa 

gcagtatttt 

cgggtaggac 

tgcggtttca 

ccggtgggac 



atctgcttct 
atacacaaac 
cactgattgg 
ttgtggagtc 
tgttttccgg 
agatggagtc 
ccttattgtt 
tctatgccgt 
tggtgctatg 
tcagcgacaa 
aaagggcggc 
tcgacggaca 
tgggactgtt 
agatggtgaa 
atgctgttgc 
agactctttt 
ctttgctgta 
gaatgagtgt 



atatccttat 
gatcacacct 
catcatgctc 
tcctttctgg 
gaaatcatac 
tcaggcagct 
cctgtttggc 
attaggagtc 
cattttgctg 
tgtggctaac 
cgatgtcttt 
gtcggccaaa 
tattgccgga 
gtacagtcgt 
cttcctgctt 
caagacgtat 
ttatcggtat 
gacgaactat 



gcctcttttt 
aaaaagcgta 
ttgcattgca 
caggcaatag 
gctatgtttt 
aaaggagtcg 
tatatcaacg 
ttcttgattc 
tttctgcaga 
gaaccgactg 
atcaatggat 
tgcctgtggg 
atgctgatcg 
ctgtttttac 
ccggtatggg 
ggcaatctgg 
aaggggcaga 
atggcgcagt 



atatcctgtt 
ttaattcgat 
tggagcgttt 
atacggcagt 
cccttttgtt 
atttccgggg 
gattggtcta 
ctctttataa 
taccggcagt 
ctgcggcagc 
cactgatgga 
tattcaataa 
ggcgtcaggg 
cttattgtct 
gagtggacgg 
gacagatgat 
aagtgctcga 
cgatagtcgg 



tttaattcat 
cgatgccttg 
cgacctgact 
atacgattca 
cggtttgagc 
acgcttcctc 
tatgggagag 
agtttccacc 
cattagtttt 
ctatatggac 
tgtactgagt 
tttccgttac 
tattcacaaa 
ggctttctgg 
gtttgcgttg 
ggtttatttc 
ccgtattgct 
agtttcccta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 
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ttctatggtt ttggcgggaa ctttgctgtc gagttcaact atttgcagag ctttttgctg 1140 

ggagcggctt tctgtgtcat ccagattgct tatagcaatt ggtggattaa gagattctac 1200 

tatggtccca tggagtggct gtggcgttcg cttacctggt ttcaggtggt gccgttgtca 12 60 

aggcgtaaag cttcgcttgg ataa 12 84 

<210> 16 
<211> 477 
<212> DNA 
<213> B. fragilis 



<400> 16 

aacagatatt 

ggattgatcc 

caacagtcgg 

ggaggagaga 

ggaggtacgg 

ggacacttca 

tgggtctatc 

aagataggga 



ggcttatgtt 
tgttgggggc 
aaagacaggt 
cagcccgtaa 
tggagtgggg 
agtcaccgga 
tgaatgaaca 
ttgtaaatcc 



tataaattct 
gtttaccctg 
gcagcaagtt 
cagtggaaga 
gccgggtctc 
cggagtcctg 
gtctgacaac 
agcttgtttc 



ttaaggaaat 
ttatctgttc 
cctttccttc 
ggaggctcaa 
cagcagggat 
gcacatgtga 
caaacggtgt 
tccgctccca 



cccctttttc 
ctgtgtatgg 
aattcaactt 
aatatgatgc 
cggcaagact 
aagatttcac 
gtcttcacca 
tcgatcttgg 



ctgttctctg 
gcagcagata 
cgatgaacag 
ccggatcacc 
cagtaacaaa 
tttgtctgtc 
cgggagctgg 
atattaa 



60 

120 

180 

240 

300 

360 

420 

477 



<210> 17 
<211> 1011 
<212> DNA 
<213> B. fragilis 



<400> 17 

ataaggggac 

actgtttttt 

tatagccttc 

acggaaaata 

gtgggtacgg 

gcaccggata 

ggtggagagt 

accgatcatg 

ttctatatag 

ggagcagaac 

gcaggcacag 

gcctctatcg 

tcgaaatact 

catgaagtgc 

gtaacagatg 

gaaggaagag 

ggtgatacac 



ttttcttaat 
ctttctgtgt 
ccgaccctac 
tcagaaatct 
cttttaccaa 
taaataaaat 
ggacctgcgg 
gaaaactgtt 
aagatggtgg 
taagtgatga 
cctatgaagg 
gtcgttgctg 
tatttggccc 
tgattgataa 
ataaaggagc 
tgttgatgct 
cgtcgttgca 



ttatatattc 
tccttccatt 
ggttatcaag 
tcctattcac 
tgaaacccgc 
cggtgatcgg 
gattggtgtg 
ccggagcaat 
taagaaatac 
cgggctctcc 
aacttatatt 
tgaaggatta 
ttatgtagat 
aaatgaagca 
ggattgggtg 
cgaccgtgta 
agcaaaagca 



agcatgaaga 
gcacagcagt 
gcagatgatg 
cgatcgaaag 
cctacatttg 
tatgtgatgt 
gctacggcag 
gaaataggga 
cttttttggg 
ctgaaagaag 
cacaaacggg 
aagagtacgt 
aaaaaagggg 
ttcgtgggtc 
ttttatcacg 
aactggaaaa 
cctgttatcc 



aacttttatt 
attccaaccc 
gttattatta 
atatggtaaa 
agccgaaagg 
attactccat 
ataaacctga 
ttcagaattg 
gaagctttca 
gaatgaaacc 
gaggctatta 
ataccacagt 
aatcgatgct 
ccgggcacaa 
ctgtcagtgt 
aagggtggcc 
agcataaata 



ttcccttttt 
cgttattaat 
cctgtacgca 
ttggagcttt 
aaatctttgg 
gtctgtttgg 
gggatctttt 
tatcgatcct 
tggcatttat 
acagcaggtg 
ctacttgttt 
ggtggggcgg 
ggagaatcat 
ctcggagatc 
ggccaatcct 
tgttgtagag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1011 



<210> 18 
<211> 444 
<212> DNA 
<213> B. fragilis 



<400> 18 

attaattgta 

atgacaaatg 

aaagtgacag 

tattctattg 

cttcgtattg 

acatctgaca 

ataagtacta 

tattattata 



ttatgaacaa 
taatggctaa 
ctactgagta 
aacttttaga 
aaataaattg 
gttggactgg 
gttataatga 
ttagagaaag 



gaaattcttt 
tacatcgaag 
ttatttagaa 
ttcctctcaa 
tcacggacat 
agaggtaata 
aagtgctggt 
atag 



atcgcaatgt 
gatgaagtag 
gctgcgggac 
agaaaactaa 
ggaattgttc 
actgatgaaa 
agtagctcta 



tggcatttgc 
tagaagtaag 
agagtggatc 
agattactgc 
cagatgatcg 
ttaatactga 
cgactaaata 



attatttggt 
tagcacgaat 
tggtcttttt 
aaaaagagga 
tttattctat 
agcttttact 
tgaaactcgg 



60 

120 

180 

240 

300 

360 

420 

444 
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<210> 19 
<211> 486 
<212> DNA 
<213> B.fragilis 



<400> 19 

ggcaaaaaca tgaaaaacaa agtactgatc 
gcaatggcat gggcacatca gcccgcagac 
gcgacgacgg cgatggatgc ttttcattct 
gccatatctt cggatatgaa aggaagagct 
atgattatga acgcgtataa acgtacgaag 
gtgtatcagg gaggatacga acaatacgat 
tttatttatg atgatatgat gtggtggatt 
aatgacccga aatacctggc acatgcctct 
gtatga 

<210> 20 
<211> 723 
<212> DNA 
<213> B.fragilis 



atagtagcta tcttattact tcttcccaat 60 

ggaaacctga agcattttac aaagaaagac 12 0 

actttttata atccggatat gaagttgtat 180 

gccatctggg tacaggctat ctattgggat 240 

gctcctaaat atcgccggtt gatagaagag 3 00 

aaatacaatt gggacaataa aatcgaatgg 3 60 

atttcgctgg cacgtgctta cgaaatcacg 42 0 

tcgggattct acccatgtct ggaaagagtc 480 

486 



<400> 20 

aaagaaataa 

gttaccagtt 

gtttatgaag 

caacgtggtt 

tctgcatgtt 

tcagaagaag 

gaagttgttc 

acagcttcct 

ttaggaacaa 

gatggcgcca 

gactatacag 

atttgtatat 

taa 



tgaaagcaat 
gtacaaagga 
gagaggcttt 
atgaaaagca 
tatttgatgg 
gccgtgatac 
cttattacat 
tcaatgtaga 
ctcaatacat 
aaatggctga 
acaataaaat 
ggccaaaagg 



atccaaaatt 
taattatgat 
gcaactgcgt 
tgatccgatt 
tgagtatcaa 
tattaatgtt 
ggtcagaaat 
aaagattgcg 
aaatgatggt 
aatcaatgta 
gtttcaaaca 
ttctgatcaa 



ttttcagcac 
gctccggaat 
ggaaatgagg 
gaagtttttg 
ttgatcacca 
attgtctcag 
gctgagatga 
ggaaaagaaa 
gaacacaatg 
actggtgcac 
gctttgaaaa 
ggaatttact 



tactattggt 
caatgctgac 
cggtccgatt 
taaatcaaga 
aaagcggtaa 
gcaatactgt 
aactaaatgg 
tagatcgtgt 
ttgatcgttt 
gttatgagtt 
gaggtactct 
ctgaagtaat 



aatgatagtt 
tggaaaggtc 
gttcctttat 
cggagcttat 
tggcccttgg 
acaaaatgta 
taatgttgtg 
cttctttatg 
tgatgatgca 
tactcctaga 
ttttggccgt 
ccgactgaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

723 



<210> 21 
<211> 429 
<212> DNA 
<213> B.fragilis 



<400> 21 

cttctaatat tatatataat gaacaaaaga 

gtgagtatgg caaatgcaaa ggctgatatt 

acattagttg aagttgaatc tcccagtggc 

aatgagtatg cttacaattc tactgaaata 

gttagaatag attgggttgc agaaggtaac 

gacaataccg ttataatacc aaattttttg 

acatttgctg gaagtatggc tggaatacag 
agatcttaa 



tttattattg 


tattattagc 


atttgttttt 


60 


cctaaagttt 


gggaagtaaa 


tggggtttat 


120 


tatgggctta 


taaaatcaat 


tacaattgga 


180 


aaaattgttg 


ttattgatgg 


tgtaaacagc 


240 


cgatatcctg 


ttacgtatag 


cgttggttat 


300 


agacgtcgat 


ttattgtgag 


tgtagaatat 


360 


tatgcttatg 


aaacgagagt 


ttttgagata 


420 
429 



<210> 22 
<211> 1263 
<212> DNA 
<213> B.fragilis 



<400> 22 

gaaattcccc cttttggagg aattttttca atcatggaaa aatttggact catgcttttc 
acccgtaatg gactcaccct gggtaaaaga tgcaccagtt tcttcggata tcagttcagg 



60 
120 
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aggatagtcc gttcgctgat gagcgtttat ttctgtggcg gttcatgcgt ggaagatgta 180 

acgtcacaac tgatgcgcca tctctcgtat catcctacct ttcgtacatg cagctctgat 240 

accatcctca gagccatcaa ggaactgaca caggaaaaca tctcctatac ttccgaccaa 300 

ggcaagacct atgatttcaa tactgcagac aaactcaaca cattgcttat aaacgctttg 360 

gtttctacag gcgagttgaa ggaaattgag gaatacgatg ttgactttga ccatcagttc 42 0 

cttgaaacgg agaagtatga tgcaaaaccg acctacaaaa agttcctcgg ctacaggcct 480 

ggcgtatatg ttatcggtga caagatagtc tatatcgaga acagcgatgg taacacgaat 54 0 

gtgcgttttc atcaggcaga cacccataag agattcttcg ctcttctgga atcccagaac 600 

atccgtgtaa atcgcttcag ggcagactgc ggttcctgct cgaaggaaat cgtcagtgag 660 

atagagaagc attgcaaaca tttctacatc cgtgccaacc gatgcagttc gctctacaat 720 

gacatctttg ctctgagagg atggaagacg gaggagatta acggcatcca gttcgaactc 780 

aattccattc tcgttgagaa atgggaaggc aagtgctatc gtcttgtcat ccagagacaa 840 

agacgcaaca gtggcgacct tgacctgtgg gaaggcgaat acacttaccg ttgtattctg 900 

accaacgatt acaagtcatc gacaagggac attgttgaat tctacaatct gcgtggcggc 9 60 

aaggaacgta tctttgacga catgaacaac ggattcggtt ggagcaggct ccccaagtca 102 0 

ttcatggcgg agaatactgt ctttcttctg cttactgcat tgatacacaa tttctacaag 1080 

accatcatga gcaggcttga caccaaggct tttgggctca agaaaacgag tcgcataaag 1140 

gcttttgtct tcagattcat ctccgtacct gccaagtgga tcatgactgc aaggcaatac 1200 

gtgctgaata tctacacaga gaaccgagct tatgcaaaac ccttcaaaac agaattcgga 1260 

taa 1263 

<210> 23 
<211> 2574 
<212> DNA 
<213> B.fragilis 

<400> 23 

cgactaaaag aatcgcgttt cacgggccaa cgttttcggg aggcacaaga gcgggattat 60 

cggcattatg accgttttgt cgaaaaaatc atcccggatt ctgtcaattt ttaccggact 12 0 

tatgtaaact atcactcttt cgagcgctat ctggagcgtt tgaagtggta taaacgcggg 180 

ttagagaaac gttgggcaat acaggatgcc aggaaacgtc gtccggaccc tttgctgttg 240 

cgttttgata tgttcaaccg tcaggtaggc aggcgggaca gcctgatgaa aagtcgtatg 3 00 

ttggataact ctcaacgaat gattacccgg cagtggtggc gatacggtcg tgcatgggag 3 60 

cggatgaatg acaccttaca gtttcaaagt aggcatctgc tggaacgttt ccgcttcttc 42 0 

aacaataaat gggccgataa tgccgctttc caatccgatg gactgatagc ccgcaagaat 48 0 

tattttcgcg acaaagcact gagtacccct atgtggcagg caaagcgtgc actctataaa 540 

gcggacccgg atgctgcgat acgaatatat gcttctcgct ttggctattt taatgataaa 600 

atggaacggc tggatgctac cctttaccga tattatcgca ccaaaggcgc gcgtgctgaa 660 

agtagggaag gagtaagatt tctgcgagct tttatggtgg gacgtgatac tactttgtcg 72 0 

tacctgaacc gcaaccaatt aacggagaaa tatattcgtc gctacgagaa ggtgaaaaat 7 80 

ttcttcccga tgtttcattt ccgccgtccg gatccggata cgttatctcc tttgtgggag 840 

acgaggacac ggatagatac aatgcagaca cggcatacat tgctgtcgaa gctttcaaaa 900 

gaagatatat acgaatatta tgtccggcag caacaagggg tatctgatag gggaatgata 96 0 

ggaccttttc gtggtcttct gcctctatat acctatcatc gtgatttgcc cgattctata 102 0 

gtattgcgtg tcccgggacg taaaacacgg cgggattttg aactcagccg gtttgattca 10 80 

gctactacgg tcaatcgtta tatcggtcgt tacgagtttc tgcgatcaac ttatccgcaa 1140 

taccatttga tacgtaaatt gtataacata catccgcctg ctctgcggca tgcggcccgg 1200 

caggcgagct atgaagagcg actggcacgt atcaattctc ttgattcgac cagtctgata 12 60 

aagatgttct ataatacaca gaaaattgcc cgtaatgagg cgcgtaaggc gatgaaagat 132 0 

acaaaatacc gtgatatcgt tcgttttccg ttcaatcctg aagcgcagct cgatacggtg 13 80 

atttatgcta ctgatcaggt acatttcctc tactcgcaga aagtaccggc agatgaaaat 144 0 

tcggcacgta tgaaggtata tgtagttggt gatgtgctga atagtaatgg aagcaggttt 15 0 0 

tcccttccgt actcggatac gctgacttat ctggtgagtt cgatgactaa gtttgttgac 1560 

aggacgccac gctttgttcg aaaaatagtt acccgtgatg cggaagcaaa tgctagtgta 162 0 

aacttttact ttccaaaaaa cagttttcgt atggatgaaa ctattgacat aaaccggcag 168 0 

ggagtgaagc aggtacataa ccttactctg gcattaatga ccgatccggt atatatcata 1740 

gacagtctga cgcttttggc tacctcgtca cccgaaggta actggcacgt taatggagaa 1800 

atatcccgaa aacgtgcgga atcaatccgt aatattttgg tggaggactt caaactgctt 18 6 0 

tatgattcat tggctatcgg tgctgctatc gagatggatg agacgggcaa catcatccgg 192 0 
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caggagatga aggacgggat tccgaacttg ccggagttga taaagatacg taccgtacct 19 8 0 

gaggggtggg agaaactgcg ccgtctgatt gtaaatgata aaaattttca aggcaataaa 2 040 

ggtgcaatct tgagaattat tgatcgtgaa caggagcccg atcggcgcga atggctgatt 210 0 

aaaagtcagt ataagacaga atatgcctat atgcttgaca aactctatcc ggcagtacgc 2160 

agggtggatt tccttttcag tctctcccgt cggggtatgc ggcaggacac actctatacc 2220 

aatgaaccgg atacaatgta tgcccgagct gtggattatc tagagaaacg taaatatgag 2280 

caagctttgg aaattctgcg tccgtacgag gatgtaaata ctgcaattgc ctatatgtct 2340 

ttgggatatg ataaggctgc cttacgaata cttgaacaat cgtcgcagac tgccgaaacc 24 00 

caatatatgc aggctattct gaatgctcgt ctgggtaatg agcagcgggc tgtatcgttg 2460 

ttgctcagtg cggcggaagt ggatgaccgg ataagatt cc gagccaatct agatccggaa 252 0 

ttatctctat tagtgaagaa atatggcttg tttaaagagg atgatttgtg gtaa 2574 

<210> 24 
<211> 883 
<212> DNA 
<213> B.fragilis 

<400> 24 

gccatagagc gcgtctactt ccagccccgt ggtgaagacc tattgaaaaa cgatgctttg 60 

ttacctttaa ataaagaaaa gattaaatct gtagccgtag tagggccgtt tgccgattac 12 0 

aattatttgg ggggatatag cggacagcct ccttattcgg ttagcctttt gaaaggagtg 180 

aaggagctga taggtaaaaa agggaaagtc acttatctga acggaatggg aacctctgcg 240 

gattctatag cgcaagtggt aaaaggggca gatatagtac ttgtagcttt gggtagtgat 3 00 

gaaaaaatgg cacgagaaaa ccatgatatg ccttctattt atttaccgga gggacaagag 3 60 

aagcttctaa aagagattta tcaggtgaat ccgagaattg tattggtttt ccacacggga 42 0 

aatccgttga cttccgaatg ggcggataca catataccgg ctattatgca ggcttggtat 480 

ccgggacagg aagcgggtag ggctttggcc aatttgctgt ttggaaatga aaatccgtcg 540 

ggtaagttgc ctatgactat ctacagaacc gaagaacagt taccggatat actggatttt 600 

gatatgtgga aagggcgtac ttatcgttat atgaaagggg aacctttata tggtttcggc 660 

catggattga gttatacatc ttttgagttc gataatatac aagggaatga tactttgcag 72 0 

ccggatgcga ttttacaatg ttcggtcgag ttatccaatt caggtcagtt agcaggagaa 7 80 

gaagtggtcc aagtctatgt ttcgagggag aatactcctg tttacacata tccgttgaaa 840 

aaattagtgg catttaaaaa agtaaaactt gctttcagtg aga 883 

<210> 25 
<211> 513 
<212> DNA 
<213> B.fragilis 

<400> 25 

aaactgttac aatgtagaaa aagaaaagag gccctcatga cttcacttta tgatttttcc 60 

gttttgaacc aaaacaacca agcaactccc ttggatagct atcgtggcaa agttctcttg 12 0 

attgtcaaca ctgctactgg atgtggttta acgccccagt accagggact tcaagaactc 180 

tatgaacgct atcaagatca gggcttcgaa atattggatt tcccttgcaa tcagtttatg 240 

ggacaagcac ccggcagcgc agaggaaatc aacgccttct gtagcctaca ttttcaaacc 3 00 

accttcccac gttttgccaa gattaaggtc aacggtaagg aagcagaccc tctctatgtc 3 60 

tggttaaaag accataaatc tggcccacta ggaaaacgaa tcgaatggaa tttcgctaag 42 0 

tttctcatta gtcgtgatgg gcaagtcttt gaacgctttt cttcaaaaac agacccaaaa 48 0 

caaattgaag aggcgataca aactctacta taa 513 

<210> 26 
<211> 273 
<212> DNA 
<213> B.fragilis 

<400> 26 

aaggaggaaa acaaattgaa aatttttaag ggagagtttt atcgaatctc tgtattaaca 60 

gacaagctag taaggttaga atactctcaa actggaagtt ttgaggatag aacgacacaa 12 0 

cttatctata atagagattt tggccaagtt tcgttagatt atatcgagac atcaaacgta 180 



14 



ctagatatta tgacggacta ttttcatctg cactttaata aaggagaatt taacgccgaa 240 
aatttattta tagaattaaa aggaaatttt gcc 273 

<210> 27 
<211> 885 
<212> DNA 
<213> B. fragilis 



<400> 27 

agacatgccc 

gatgtaatga 

gaaggcgaga 

cccgcaagcg 

caatgggacc 

ccgggcgacc 

atcactcccg 

acttatttta 

ggtttttggg 

tcggccaatg 

cgcgcccgtg 

ctgcgtgatg 

tacgacctgg 

tatcaaccca 

gtactggtac 



gtacgtacat 
attcagggca 
acagcagtga 
ataaaatcgg 
tcggctgggg 
cccgcaaaga 
agaataccaa 
acaagaaagc 
taaatatccg 
aattgggtaa 
gcaacaaccc 
ccatccgcca 
tacgctgggg 
agaatgcctt 
agaacccgga 



gatgcgcaat 
atacaatctg 
gtcggtattc 
cagccagttc 
atggcacatg 
tgctacgctg 
caaaccttat 
ttataccaac 
cattatccgc 
aacaggtgaa 
ggacattctg 
cgaacgacgg 
catcgcttcg 
gctaccgctt 
ttatttagag 



gactggcaga 
aatactcctt 
gagttgcaat 
tgcgaagtac 
ggaaccgagc 
ctttacttcc 
ggagagtctc 
ccggcactcc 
tatggtgacg 
gcttccaact 
cctaaagtga 
gtagaactgg 
caagtgctcc 
tcgcaggacg 
cacaccacag 



acatgtatac 
atgatgtcat 
gcgcatctac 
aaggtgtacg 
tgatgggtga 
gtcgttcgga 
cggtatctca 
gtgaagagtt 
tggtgctgat 
atctggaaat 
cttcattaga 
gactggaatc 
atgctgcagg 
aaattgataa 
agtaa 



cgctgcgacc 
cttcaccgat 
tgccgctttg 
cggttccggc 
agcgttcgaa 
tactgatccg 
agccgacggt 
tacccggcac 
ggctgccgag 
ggtacgagcc 
tcagaccgtg 
gggacgtttc 
caaaacgggt 
atcaaaaagc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

885 



<210> 28 
<211> 1482 
<212> DNA 
<213> B. fragilis 



<400> 28 

tcaacaaaca 

gcaggaacac 

gtcgttttca 

aagttcgaaa 

tattcgggta 

ggacatacgg 

gcagatgcac 

ggtaaatggg 

gagttctatg 

gacaatgaca 

acttattctc 

agcggagaat 

cccgaagaca 

accgaaccgg 

gccacatttg 

ttgaaagaga 

cacatggaag 

aaacgcgact 

gtacagccca 

cgtgaaattc 

ttgctcgaaa 

atgaacggac 

ggcaagaatc 

gtgctgaacc 

atacctaacc 



cttttatgaa 
aacaggcgct 
tcctggccga 
ctcccaacat 
ccactgtcag 
ccattcgtgg 
aaaccatctt 
gactcggttt 
gctacaactg 
aacgagtaga 
aggatctgat 
cgttctgcat 
gcattataaa 
gtaatcccgc 
cagctatggt 
tgggggttta 
gtggagccga 
tgtacgaagg 
gtactcaaac 
tgaatccgaa 
accgcaaagg 
gccaagccgt 
cgtattatga 
aatatccgga 
cgaacttccc 



tcaaaaatta 
tgcacaaaaa 
cgacctcgga 
agacaagttg 
cgccccttcg 
aaatgtagaa 
ccacgatttc 
tatcggttcc 
ccagttgctg 
actgaaagat 
tcactcaaag 
gtggtatccc 
gaagttccgc 
attccgcaag 
ctatcgtctg 
tgacaatacg 
tccggacttc 
aggaatccgt 
cgacttcatg 
agcaaagaat 
gcagaaagaa 
acgcaaagga 
actctataat 
aaaggtgacg 
gttacttccg 



ttgttcagta 
aagaaagtcc 
ttcggtgacc 
gctcaggaag 
cgctcttgcc 
ctcgatccgg 
cagaacgcag 
acgggtgatc 
gcacacagtt 
aacacactgg 
gcacttgact 
accatcatcc 
ggcaaatatc 
ggcggttact 
gatgtatatg 
atcatcatct 
ttcaacagca 
gtaccgatga 
tgttcgtttt 
cagcaaatgg 
catgaatatc 
ccgtggaaac 
ctcaattctg 
gaattgaagg 
ggagaaaaat 



gcgcgttgct 
aggatcaaaa 
tcagttgtta 
gaatgcgctt 
tgctgaccgg 
aaggacaatt 
gatacaagac 
ctaaaaaaca 
attatcccga 
acgtacagta 
tcctcgatcg 
cccacgccga 
ccgaaaaacc 
gctcacaatt 
taggtcagat 
ttgcaagtga 
acggaatctg 
ttatttcatg 
gggatgtaat 
atggtgtcag 
tgtactttga 
tggtccacat 
atccgtcgga 
ctatcatgca 
aa 



tgtcggcata 
gagacccaac 
cggacaagag 
cacccagtgt 
tacccacagt 
ccctctaccg 
cggcgctttc 
tggcatcgac 
tcatctgtgg 
cggtaaaggt 
gatgggaaaa 
actgattgta 
tttccatgga 
ctatccacac 
tgtacagaaa 
caacggtccg 
gcgcggatac 
gccgggacgt 
gcctacgttc 
tctgctaccg 
atttcaggag 
gaatgttcgt 
acgacataat 
gtcatcgcat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1482 



<210> 29 
<211> 1653 



15 



<212> DNA 

<213> B.fragilis 



<400> 29 

accatgaaca 

acagcctgcc 

gagattgtcc 

tacccggcaa 

gaaatgaccg 

cttgctgtag 

aaggttcttt 

aaagctcttc 

accgactttc 

ccggtcagct 

cattggcacc 

acagagatcg 

gggaagcccc 

gccgaccgtt 

ctggcttctt 

ggtgtcttcc 

gtattggatg 

tgtccgaaaa 

atcaaagccc 

ctggagaaag 

ggaggtctga 

gctgcccgcc 

ccgcgtatca 

gtacctgcag 

tggaccgaat 

gctttgtccg 

cgcctacccg 

ttcgcggcgg 



agaaactact 
gcccggcagc 
ttgcccggga 
ccaatgaaaa 
gaaccgaggt 
actccacaat 
tgacgggagg 
cgatcctgaa 
cccgtttccg 
accttaagca 
tgaccgaaga 
gttctaaacg 
atagcggatt 
tcattacggt 
atccggaact 
ccgatgtact 
aaattatgga 
gccgttggga 
tgcctaaaca 
aaatcaatgc 
ctccgaactc 
agcatcatga 
ataaaatgac 
aactgaccga 
ggacagccga 
aaatacaatg 
agatgctgaa 
ataccttgaa 



atcccgttta 
caccgtaaag 
cactactcct 
gatgcatcgt 
tcgtgtatcg 
ggggcatccg 
cagtgaagcc 
agacggtaag 
ttaccgggga 
gatgattgac 
tcagggatgg 
agactctacc 
ttatacacag 
agttcccgaa 
gggatgtaca 
ctgtgcgggt 
tatcttccct 
gaagtgcccc 
ttcgaaagag 
tcacggacgc 
cactatcatg 
tgtcattatg 
gggattcgaa 
tgcggaaaag 
ttcaacgaag 
gacattgccg 
gatttattct 
gactcataaa 



gctcccggtt 
ggtaatttgg 
tttattattg 
actgctgatt 
gacaaagaga 
gaaggttata 
ggtgtctttt 
gtggcagctg 
ttcatgatcg 
ctgatggcac 
cgaatcgaaa 
attatcgatt 
gacgaagctc 
attgaccttc 
ggtggtccgt 
aatgaccaga 
tccgaatata 
aaatgtcagg 
aatcagttgc 
cgtatgctgg 
tcatggagag 
actcctattc 
tggatgaacc 
aagtttgtga 
atggagtggc 
gagcataaga 
tctctggatt 
taa 



tgtttgccgt 
acgtaatccc 
accgcagcac 
ttctggctac 
aaagcagcaa 
aacttcaaat 
atggtatcca 
cccttcctgc 
atgtaggccg 
tgcataacat 
tcaagaaata 
gggaaaccaa 
gtgagattgt 
cgggacatac 
acaaagtact 
cacttcagtt 
ttcatatcgg 
ctaaaattaa 
aaacctactt 
gatgggatga 
gaatccaggg 
agcggctcta 
gtgtatacaa 
ttggtactca 
agattctgcc 
actttgagcg 
atggttatcg 



tgtgctattc 
tcaaccgcag 
tacgattgtc 
ttttattaaa 
tgctattatt 
cactcctgaa 
gactattcat 
cggtacggtt 
tcacttcttc 
caactacttt 
tcccaaactg 
gaaattcgac 
tcgctatgct 
tactgctgca 
ttgctcattc 
caccaaagat 
cggtgacgaa 
ggagttgggt 
catgtccgag 
ggtattggaa 
aggaatcgaa 
tttcagtaat 
ctttgaaccg 
aggatgtatc 
ccgaatggct 
tttcatggag 
ggaagatgta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1653 



<210> 30 
<211> 2943 
<212> DNA 
<213> B.fragilis 



<400> 30 

cgtgatagaa 

gtacttttta 

tccctgaagg 

gtatggatag 

gtccaccggc 

gatattcaac 

agtgacgatt 

tgcctgacag 

gacagccgat 

gccatcagtc 

ctgatcaatc 

atggcactcc 

cgctgcttca 

agcaacaacg 

ggaggaggca 

ccgggagata 

aataacatgt 

aagacctata 

agtctgtatc 

aacagcctcg 



aaacatccat 
ctgtctgttg 
agggacttcc 
gaaccaatgc 
aagaggacgt 
ataatatatg 
tcgccattcc 
aacaaggggt 
cgataaaact 
ggtgggatga 
tccgttcggg 
tgatcgattc 
accccgaagg 
tggtgctgag 
tcaacatcat 
actattcgct 
gggcaggcag 
ccgatgtctt 
aagacgaacc 
accctgtcac 



gaaaaacaat 
cctcccgctg 
ctctactgta 
aggattggga 
acattcgttg 
gatactgaca 
gctagacgat 
aatcttcggc 
cctgctggat 
agagactctg 
cgaacgccgt 
tcacaaccgc 
ccgactgctg 
catggccgaa 
acacccggac 
tccggttaac 
tatccgcaaa 
tcccggaagc 
gaacggacga 
cgaagagttc 



ccatataccg 
aaagccagcc 
cgttgtgtct 
cgatttgacg 
ccgcacaact 
gacggtggaa 
cgggggcatc 
ggacgcaacc 
tttagttccg 
ctctgttgca 
ctcccccctt 
atctggctgg 
gcctcgtaca 
cgggactcac 
agccatcgaa 
tccatcttgt 
ggattgatca 
acccaaggcc 
atctggatag 
cgtcacgacc 



gctttctgac 
attactacta 
ataccgagcc 
gacaaaagct 
acatccatca 
tagcgcaata 
cgatcctcgc 
gcatctatcg 
atccttattt 
gccgttggca 
tcgactgcgg 
ctccttataa 
ccaccgacaa 
acatctgggt 
tcacagtact 
cactctacaa 
atatccgaga 
tgagtgaccc 
gtacagatgg 
gttctacctg 



ctggctgacc 
taaacaaatt 
aaaaggattt 
gagaaaatat 
aattaccgaa 
tcgcaggtca 
ctactctgcc 
ctacgactat 
cgctatctcc 
agggctccga 
aaaggagatc 
tgaaggactc 
ttccgggctc 
cggtacagac 
cgaacatatt 
tgacaattac 
agtgtccatg 
caccgtgctg 
tgggggtgtc 
gggtgacaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 



16 



gtggtctcga 
ttgtttgttt 
ctcaaacaat 
agcgtacttt 
gtagtaaatg 
gaacgtttca 
aggctgaaaa 
gagaaaggtg 
cggcaatatc 
catcggggaa 
cggaagttca 
ccccggctcg 
atcgataacc 
cgtgtaaacg 
ccacaagaca 
aaaaaaagat 
ccggaactgg 
actcaaaatg 
tggtaccgaa 
gctatcatct 
cgggaattgc 
ctcccatcgt 
cgtctccgta 
gtcaaggagg 
accgaccatc 
agccgcgcct 
atcaataaat 
acagagatag 
caatataccg 
tag 



tcaccggatt 
ataacaaaga 
atatttacta 
tgttggcagg 
aagaagaagg 
cttacctgca 
aacttttcag 
atttctggat 
acccattgat 
aagtatggat 
ttcttttcgg 
tctcagggaa 
gctttccggc 
gtgaaccggc 
gccgggcaat 
accgctaccg 
tcatccgctc 
gcgactggac 
gcggatggtt 
tcgccatttt 
aggaatatga 
tcacggagaa 
acggcgaaaa 
aactcagcca 
tggacagtcc 
cactctataa 
tccgtatgga 
cagaaaagat 
gagaaacacc 



tacccgagaa 
gaacggtaag 
tagcggtatg 
acatacttac 
tatggagatc 
cgacagtcgt 
ctgcacgggc 
aggcagcaat 
cacttcactg 
tggtgcagac 
tgaatcggac 
gggtgaagtc 
aacttcctcc 
cacgaaccgg 
caccctgcgc 
gatagacgga 
actaccggca 
tcccttccac 
catcatctgc 
gcgccgtagg 
agaaaagatt 
gggggaacgg 
aagcaaagca 
acccgatgaa 
cgaactggac 
caaactaaaa 
aaaggccatc 
aggatttaca 
gactcagtat 



tccatccttt 
cggaaaccgc 
gcggtcaata 
cggtatgaca 
gcaggcagta 
accctctacg 
gacacactgc 
acagggttgg 
ttcggcgaag 
cacatgttat 

ggggtcattc 
tatatggggg 
aactatccgg 
acggcaggaa 
gtgatgtctc 
ctcaatgagg 
ggcaactacc 
ccgatcctgt 
ttgttactat 
aagaaccggt 
cgattcctcg 
gaactgcaaa 
cctgccgaaa 
actttcctac 
gtcacattcc 
gcaatgacca 
caactgatat 
acatcccgct 
aaagagaaaa 



tatcggtctt 
tacctattga 
tctaccagga 
tcggttcgca 
tgaatgccat 
aactggatcg 
tctactccgt 
gacaatacag 
ccagttcggt 
ttgcctggat 
cgaatgaata 
gtgtcaatgg 
aagtagtact 
atcccgacaa 
acgaagaaga 
aaccgatcga 
gcatacaggc 
cactgaccat 
ttgtgtcggg 
tgaaatggga 
tcaatgtcag 
tcgtagaact 
tagcaagcag 
ggaagctgaa 
tctgcacgga 
atatgggagc 
caaccaccga 
atttcagtac 
taaggaaaag 



ttccagggga 
ccatcccgac 
cgaaccgggc 
gaaaatacgc 
tgcgcataac 
aaccggcaat 
ttcgatggac 
catccggacc 
gatatgtgat 
gctgcaatcc 
tcttgccaag 
gttgctctgt 
gaccgatgta 
actcacccta 
catcttccga 
atcatacgat 
tgcatgcagt 
actccctccc 
aggcatcacc 
attaaaagaa 
caatgaactc 
gatacgtaac 
tccgaatatt 
ccagttgatc 
gatgggattg 
caacgattac 
tctcactttc 
atcatttaag 
cagtaaggtg 



1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
2943 



<210> 31 
<211> 2361 
<212> DNA 
<213> B. fragilis 



<400> 31 

ttcctaaatg 

ctgctatccg 

cccggaacag 

caggccggcc 

attcttatca 

cccggacaaa 

tataaaaacg 

gtgaacggga 

ttcaacggat 

gcgggaggtt 

gaaggcaaag 

atgggtatgc 

ggggaatggt 

gacctgcaaa 

aaaaccggaa 

cgcaccttta 

ggagccctga 

ataggaggca 

catatcaccc 

tcccctctgt 

cccctcaccc 

ggagagacag 



aacaacacat 
cccaaacaga 
cggaaaagcc 
ggcgttccat 
caaacgagac 
aggcggtcat 
gtatcatgca 
aaaaacagat 
atgcagctga 
atctgcacgc 
acgccaaggg 
accacaccta 
attttgataa 
ccgccttgtt 
gtccggtccg 
tgaaaaccaa 
taattgaaaa 
atgctatctt 
gtataggagc 
tcgagtacgg 
ccgactatcc 
aaaagcaggg 



gagaaagctc 
gataacactc 
gatggctact 
caccatctac 
ttcgggcaca 
cagcggttcg 
ggccaaagtg 
atcggcacgg 
cgcctgctca 
catgcacagc 
cgaactgata 
ccacatggta 
agaaacccat 
cgaagtgccg 
tcacgtatcc 
tgaaccccta 
tgccgaaaaa 
cttctccaat 
cagtgctgtt 
aaagtcgcaa 
ctcagactgc 
agccggtatc 



ttttttccat 
tatgtatcac 
ttagaatatg 
tgcgaaggca 
cccgaacatc 
cgtatactcc 
gaagaagaac 
tatccgaatt 
cccgaacgtg 
agagaatggg 
ctgaaaggcg 
gaaaacatct 
acactctatt 
caggcagaaa 
gtagaccatt 
ttgcgcagtg 
tgctctgtaa 
tataaccgca 
tgctttgtag 
acctgggagc 
ctggtggacg 
caactatcta 



tactactatt 
cttcgggtag 
cctggaaaaa 
ccaactacct 
cgatccgttt 
ggaacctgcg 
tgatccccga 
ttgatccgga 
tgaaaaactg 
gaggctacca 
gatttcagaa 
ttgaagagtt 
tctatccgcc 
acctctttat 
tggaactgac 
actggaaaat 
acggctgcta 
accaccgtgt 
gctctccgga 
agatggataa 
acaatctgat 
tgtccgcacg 



cgtatccggt 
cgaccatcat 
ggcctcacgg 
gtccgctccg 
ttcttcgtat 
ttggaaagag 
ccagctcttt 
tatacgcatc 
gagtaacccc 
atacagcatc 
caaccgccag 
ggatgccgaa 
gcgagaactc 
cctgaaagga 
acaaaccctg 
ctatcgggga 
cctgcacgat 
cagccaaaat 
tgccgtccgt 
agggacaggt 
tcactcgatc 
aatcaccatc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 



17 



cgtaacaaca gtatttacga cctgccccgt gccggcatca acgtcagtga aggtacctgg 1380 

ggaggacatc tgatagaagg aaacgatgtg ttcgacaccg tacttgagac aggcgaccac 1440 

gggtccttca actcctgggg acgcgaccgc tattggcatc ccgaccggaa tgtgatggat 15 0 0 

gaattcgcga aagaacatcc tcaaatggta ttccgggacg ctaccgaaac gactgtcatc 1560 

cgcaacaacc gctggaggtg cgaccatgga tgggacatcg atctggacga cggttcttcc 162 0 

aactatcaca tctacaacaa cctctgccta cacggaggat tgaaattgcg cgaaggcttc 1680 

gcgcgaacgg tggaaaacaa cattatggtc aacaacacat tccatccgca cgtatggttt 1740 

gcaaactctc aagacatttt ccgtcataac atcgtcacga ctccctatcg ccccattcag 1800 

gtaaaggaat ggggaaagga aacagacact aacttttttg tcaccaagca aggactggaa 18 60 

caggcacaaa agagaggaac ggacctccat tcactttacg gtgatccgct cttcatcgct 192 0 

cctgaaaaag gagactaccg ggtaaaagaa aattcgcctg ccttgaagac ggggttccgg 1980 

aatttcgata tggagcactt cggcgtacaa tgcccacacc tgaaagcttt ggcggctact 2040 

ccgaaattgc cggttttcaa aattccggaa gaaaagccgg agacggtaca gacgtattcg 210 0 

tggaaggggt taacattgaa agaggtgtcg accgaaggag aacgttcggc cacggggctc 2160 

gacaaaatac gaggcatact ggtagtgcag gtcgaaaagg gaataaccgc cctgcaagcc 222 0 

aacgacgtga ttctgcgcat taacggcaaa ccggtagata accggacgga tatggaaacc 22 80 

gagatccgga agtcacccga aggcaataag ttccggatca tcttcttccg aaatcagaaa 2 3 40 

gaaaatgcgg taacgatgta a 23 61 

<210> 32 
<211> 1608 
<212> DNA 
<213> B.fragilis 

<400> 32 

tccctaatag aaaaactact tatgatgaac aatctaccat ccggaattct ctactcgctg 60 

accggtgcgg cagctgtagc ttctttgact tcatgtgcca cgggcaaaca gaaagaagag 12 0 

caaaaacctc tgaacattgt ttatattatg acggacgatc atacggcgca aatgatgagc 180 

tgctacgata cccgttatat agaaactccc aacctcgatc gcattgcccg cgatggcgtg 240 

cgctttacga attcttttgt agccaactca ctgagcggcc ccagccgtgc ctgcatgatc 3 00 

accggcaaac atagctgtgc caataaattc tacgacaata cgacttgcgt gtttgacagt 3 60 

gcccagcaaa ctttcccgaa actgcttcag aaagccggtt accaaaccgc tcttgtaggt 42 0 

aagtggcact tggagagcct gccctcaggc ttcaattatt gggagattgt gcccggacaa 480 

ggcgactatt ataatcccga cttcattaca caagataacg ataccgttca gaaacacggt 540 

tatatcacca acctgatcac tgatgacgct atcgactgga tggagaataa gcgtgacgag 600 

agcaaaccgt tttgcctgtt gattcatcat aaagctattc accgtaactg gatggcagat 660 

acttgtaacc tggctttgta cgaggacaaa accttcccgc tacccgataa cttctttgac 72 0 

gattacgaag gccgtccggc tgctgcggca caggagatga gtatcgtgaa ggacatggac 78 0 

atgatttatg acctgaagat gctgcgtccg gataaggact cacgtctgaa atcactttat 840 

cagaagtttc tgggacgtat ggacgaagga cagcgtgcgg catgggacaa gttctatggt 900 

ccggtgatcg atgacttcta caagcaaaac ctgagtggga aggaattggc tgactggaag 960 

ttccagcgct acatgcgcga ctacatgaag actgtgaagt cactggatga caatgtggga 102 0 

cgtgtgctcg actatcttga aaagaaggga ttactggaca acacgttggt ggtctatacc 10 8 0 

tccgaccagg gcttctatat gggcgaacac ggttggtttg acaagcgttt catgtatgaa 1140 

gagtccatgc gtacaccgct gatcatgcgt atgccgaaag gattcgaccg tcgtggtgac 12 0 0 

atcaccgaga tggttcagaa cattgactat gcacctactt tcctcgaact ggccggtgct 12 60 

cccgttcctg ctgatataca gggtatgtca ttgctgccat tgctgaaagg cgaacagccc 132 0 

aaagactggc ggaatgcatt atactatcac ttctatgaat atccggccga gcacatggtg 13 80 

aaacgtcatt atggaatacg taccgaacgc tataaactga tccatttcta taacgacatc 144 0 

aattggtggg aactgtatga catgcaagcc gacccgacgg aaatgcacaa tctgtacgga 15 0 0 

cagaaagagt atgagcctgt ggtgaaagag ctcaaagagc agatgctgaa gttgcaggaa 15 60 

caatacaatg atccggtgcg cttctctccg gagcgggata aagaatag 1608 

<210> 33 
<211> 183 
<212> DNA 
<213> B.fragilis 



<400> 33 



18 



agttgtagtt tatctacaaa aaacgttgtg ttactctgtg ttactttgta ttactccgtg 

ttactctgtg gtgaaaaagc ttttggtgaa cttttattta tgagtcttca aggtatccgc 

cgcgaataca tcttcccgat aaccataatc cagagaagaa taaatcttca gcatctcggg 
tag 



60 
120 
180 
183 



<210> 34 
<211> 1530 
<212> DNA 
<213> B.fragilis 

<400> 34 

aaaaagaaac taatcatgaa aagaatagaa atctatatcg gactgtccgt tttcgcttta 60 

tcggccaaaa gccaggtgaa agaatctcga cccaatgtca tatatatcat aatggatgat 12 0 

ctgggctacg gggatatcgg ttgttatggt tcggagaaaa tagaaacacc gaacatcgat 180 

cggttgtata aggatggcat cagtttcaca cagcattaca caggttcacc cgtttcggca 240 

cccgcccgct gtgtgttgat gacaggtatg cactcgggac atgcgcaaat ccgggctaat 300 

gatgaaatgg cttatcgggg cgctatcatg aattacgact ccatgtatgt acatcccggt 360 

ttggaggggc agtatccttt gaaagcccat accatgactc tcggaagaat gatgcagcaa 42 0 

gccggatacg tcaccggatg ctttggaaaa tggggactgg gggctccggg cacggaaggt 480 

actcccaaca aacagggatt cgacagtttc tacggataca actgccagcg gcaggcacac 540 

agttattacc ccgccttttt gtataagaat gaagaccggg tatacttggc caataaagtg 600 

ctcgatcctc acacgaccaa gctggatgca ggagccgacc cccgtgatga agccgcctat 660 

gccaagttct cgcagaaaga gtatgccaat gatcttattt tcgatgaact gatttcgttt 72 0 

gtcgggcaga acagaaagaa accgtttttc ctgatgtgga ctactccgct accgcacgtg 780 

tcgttgcagg caccggagaa atgggtgaag tattatgtcg ggaagtttgg agacgaagcc 840 

ccctacatcg gaaaagccgg atatatgcct tgtcgctatc cgcatgcgac ttatgctgct 900 

atgatcagtt attttgacga gcaaataggc aagctgatag agaagctgaa gaaggaacgt 960 

ctgtacgaca atacggttat catgtttact tccgataatg gaccgacttt taatggcggt 1020 

agcgattctc cgtggttcga cagcggaggt cctttcaggt ctgagtatgg ttggggaaaa 1080 

tgttttgttc acgaaggagg aatacgtatc cctgctattg tcacctggcc cgggaaaatc 1140 

aaaccgtcta cccagagcga tcatatctgc ggatttcagg atgtgatgcc taccttggcg 12 00 

gatatcgtaa acattgcttg tccggagacc gatggcatca gtttcttgcc tgctttgctt 1260 

ggcgaaacgg aacgccagaa agaacacgaa tatttgtatt gggaatatcc cgatcccaca 1320 

atcggcctca aagccattcg catgggtaag tggaaaggaa ttgtcaacaa catccgtaag 1380 

ggcaactcta caatggagct ttatgacttg gagagtgatc ttagggaaga acatgatgtg 1440 

gctgccgaac atcccgatat cgtccggaaa ctgacgaggt tgatggaaaa gtcacatacc 15 00 

gagccggaga atcccaaatt caggttctga 153 0 

<210> 35 
<211> 1272 
<212> DNA 
<213> B.fragilis 



<400> 35 



gtgcacggac 
caaagtctga 
atcaccgatt 
cagaaagtgt 
tttcattggc 
ctgacccgta 
gaagaggtga 
attgacatgc 
aatgttgcgg 
gacagtacac 
gcttatattc 
tgccggtcac 
tcggcccgga 
gtgatttaca 
agaggacacc 



tcttccagcc ccgtggtgaa gacgatgatg ccggctttat ctatgccatt 
ggcaatggaa cacgggtgag gaaagaggac tgatatttcc ttgtgtcgag 
ttccacgggt gaaatggcgc agctttatgc tggattccgg acgccagtat 
ctacgatcaa gaaatatatc gacatggctt cgatgctgaa gatgaattac 
atctgaccga aggacttggc tggcgcatcg aaataaaacg ctatccgttc 
taggagcttt tgtagggcag gggccggaac agcagggctt ctactctcaa 
aagagatcat cggctatgcg gcggaccggg gcattacggt tgttcccgag 
ccggacatgc cgaagcggca cttaatgcat atccccggct gggatgtttc 
taaaggttcc ccaaagcgga tttacgcaga atatattttg tgcgggaaaa 
tcatcttcct gaagaatgtg ttggacgaag tatgccggat gtttccgtcc 
atctcggagg cgacgaagca cctaaaggga attgggataa atgtcccgat 
ggattgaaaa agaaaaacta aaagacagtc atgacctaca attgtggttt 
tggccgatta tctgaaacaa aaagggagga aggccatctt ttggggagat 
aagacggcta ttccttgccg gacaatgtgg tgatacagtg gtggaactgg 
gggatctggc cttgaagaat gccgtcagac ataattatcc ggtgatttgc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 
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720 

780 

840 

900 
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ggtacaaact 
gctcgcactt 
gaaaatccgc 
agcatgatcg 
tccggcaatc 
tttgaacagc 
aaatgggact 



attatacgta 
tcgatctgga 
ttattctcgg 
atcgtcgggt 
cggaaaattt 
aggggtattc 



tctgaacttc 
agatgtgtat 
aatgagctct 
ctttccgcgt 
tgatgagttt 
attcgggcct 



ccgcttaccc 
ttgcgtaatc 
gccttgtgga 
attctcgcac 
tatggcaaag 
gcattgaagg 



cctggaaggg 
cttcttatag 
cggacgacgg 
ttgccgagca 
tactctctaa 
aagatgcggg 



atatactcaa 
gccccgggag 
ggtgacggaa 
gatgtggcat 
gcaactgtgg 
tacaaattat 



960 

1020 

1080 

1140 

1200 

1260 

1272 



<210> 36 
<211> 1464 
<212> DNA 
<213> B.fragilis 



<400> 36 

aataggatta 

gtgcgtttgg 

ggacccaatg 

cgtgagggta 

gtgaaatata 

caacgctgga 

atcaaggatg 

gacaagaaga 

ctcctgacag 

acccgtgaat 

attcttgtgc 

gaagacctgc 

accgatgagg 

ggagcaaact 

gaccgcacta 

agtggagaaa 

cagtcgtacg 

tgggagatta 

aatctgccgg 

cgcccgcagg 

gctctgaagg 

gcccgtgctt 

gatacgtaca 

aagacgatga 

cttttcctga 



tggaacaata 
cacaacctgt 
ggggagggaa 
cattggacta 
ttgctttccg 
atgcccacga 
aaagactgaa 
tcattcttct 
ctccccgggt 
tgcttttctc 
tttcgatgct 
acgtgcttcc 
ccgaagtcct 
atgactccgg 
tcctgaagga 
acggtgccgg 
cctgtgacat 
aaaaacacat 
ccattgagat 
aaagccagat 
acaaaccttt 
ttgtgaaaga 
atcgccggcg 
ttatggtaac 
aaagaaatcg 



cacattcaat 
cacagcacaa 
aagcttgttt 
cgatttttct 
tgacacctat 
tcaggaagat 
agaggaattg 
ttccagtggc 
attgattatg 
gctgctcgaa 
cgatgatatc 
gaaaatggaa 
ggacgcactg 
ggaggtggta 
gctcgactgg 
taaatctaca 
cagtcttttc 
cggttacgtt 
tgtggcttcc 
ggctgcctgc 
tcttcagctg 
tccggaattg 
ggtgaaaaag 
ccattacgaa 
ttga 



atagcgggtg 
atcgctaccg 
gtagacacgc 
ccttcttcta 
ggggcggcgg 
gcccctacgg 
ttcgaactct 
gaattacgta 
gacaatcctt 
cgtctcaccc 
ccttcgttca 
agggaggctt 
caacagcgta 
aaattaaata 
acagtccgcc 
ctgctcagct 
ggacgtaagc 
agtcccgaaa 
ggattgcatg 
gagtggtgga 
tcgagcggtg 
cttattctgg 
attatcgaag 
tcggaactcc 



gcgtggcacg 
gcgaacatat 
ttttgggtaa 
cccggacggt 
atgccaacta 
tgcgggagat 
tccacatcga 
aatttcaact 
ttatcggttt 
gcttgtcatc 
ttacccatgt 
atctggcttc 
tagccggatt 
aggtaagtat 
ggggtgaaaa 
tggtttgtgc 
ggggtacggg 
tgcaccgtgc 
acagcatcgg 
tggatgtgtt 
agcagcgcct 
acgagccgct 
ctttttgccg 
cttctaccat 



caacccgctt 
cgccatcgta 
atatcctttg 
atatgataac 
ttactatcag 
gttgggagag 
gccattgttg 
gactaaaacc 
ggatgcacct 
cgtgcaaatc 
gattccggta 
attttgtgtg 
accttatgac 
tcgttatgat 
gtgggcattg 
ggacaaccct 
cgaaagcatt 
ctatctcaaa 
tttgtacaaa 
cggcattgtg 
ggcattattg 
gcacggactc 
tcggcaggac 
caccgaccgc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1464 



<210> 37 
<211> 1113 
<212> DNA 
<213> B.fragilis 



<400> 37 

gagaaaataa 

atgaataaaa 

aaaaccattg 

attgactctg 

cgtctggcaa 

gccggagctt 

gccattgaag 

cgaaaatatg 

tatcccaata 

gcagtggctg 

gaaatcgctg 

tatctgcgta 

accggacaag 



tctctgtgga 
taatagaact 
ataaatcact 
accgtaacat 
acaccggata 
cgttcgctcc 
gcggttgtaa 
cgcataagat 
cttacgatca 
taggtgctac 
aggctttcga 
acaacgagtt 
ccaaccgtct 



actctgtgta 
gttgggaaat 
gattcacgta 
acagactttg 
tgtatccatt 
gaatccgatt 
tgcagttgca 
accgtttgta 
ggttctgttt 
tatctatttc 
ttatgcgcat 
caagaaagat 
gggagttacc 



atctgtggtg 
caggctgaat 
ccgtcaccgg 
cgcagtttgc 
cttccggtcg 
tatttcgatc 
tccactttcg 
gtaaaactga 
ggcaccgtca 
ggttccgaac 
gaactgggta 
ggtatagact 
atcaaggccg 



aactcaaaac 
attacctgaa 
atacaatcga 
agacattgct 
atcaggacat 
cggaaaacat 
gcaatctggg 
accataatga 
aggaggcttg 
agagtcgccg 
tggccaccat 
atcatgcggc 
atatcgtaaa 



cattacaatt 
ccacacttgc 
taagatatgg 
ggggcatggt 
cgaacatacg 
tgtgaagctt 
tgctgttgcc 
gttgttgtct 
ggaaatgggg 
ccaattggtg 
cctgtggtgc 
tgctgacctt 
acagaaattg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 
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720 

780 
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ccgactaaca 
accgagctga 
atgggacgtg 
gatgctgtcg 
cgtaaagctt 
gtctatctgg 



atggtggttt 
ctacggacca 
tcgggctgat 
ttacggcagt 
tccagaaacc 
atgcgtctgt 



caaagcgatt 
tccgatcgat 
caactccggt 
agtaaacaaa 
catgaacgaa 
caccattgcc 



catttcggaa 
ctttgccgct 
ggagagtcac 
cgtgccggcg 
ggagtggagt 
tga 



agacggatga 
atcaggtggc 
atggagcgtc 
gtatgggatt 
tacttcacgc 



aagaatgtat 
caatggatat 
cgacctgaag 
gatcagcgga 
cattcaggat 



840 

900 

960 

1020 

1080 

1113 



<210> 38 
<211> 747 
<212> DNA 
<213> B.fragilis 



<400> 38 

atgaaaaaga 

accggttgga 

gaactactga 

gcggtgaaaa 

aaaagctggc 

acagccgcta 

cccaatgccc 

gtacccgatg 

ccttattgga 

gcccacggaa 

atcgtaaaac 

aatctggaaa 

gttgccaacc 



ttgtattgct 
cagatgtcga 
aagagaatgg 
cgctgaattg 
gcctgaatga 
aatacgggga 
tgtcggaaga 
cggaacttcc 
agtgtatcat 
atagtttgcg 
tgaatctgcc 
aagactattt 
agggaaagaa 



ccgtcatgga 
tctgacagaa 
atttaacttc 
cgtactcgac 
aaaacattac 
tgaacaggtg 
cgatccgaga 
ccggacagaa 
cttcccgaat 
cggcatcatc 
gactgccgtc 
cctgggtgat 
aaaataa 



gaaagtgcat 
aaaggaattg 
gataaagctt 
cggatggatc 
ggcgatctgc 
cttatctggc 
aatccccgct 
tctctgaaag 
ctgaaaacgg 
aagcacttga 
ccttacgtat 
cccgaagaaa 



ggaacaaaga 
ccgaagcctg 
atacgtcata 
aggactggat 
aaggactgaa 
gcaggagtta 
ttgagaatcg 
ataccatcga 
ctgatgaaat 
agcacatctc 
ttgagttcag 
tccgtaagtt 



gaaccgtttt 
taaagcaggc 
ccttaaacga 
tccggtagag 
caaaagcgaa 
tgatatagct 
ttatcaggaa 
acgtatcatg 
tctggttgtt 
cgatgaagag 
tgacgaactg 
gatggaagcg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

747 



<210> 39 
<211> 2307 
<212> DNA 
<213> B.fragilis 



<400> 39 

aagccactat 

ggttcttccc 

atatttcata 

gtagcttgta 

acggacctta 

agactactga 

tatccgggtt 

ggcaatctca 

acagaaacca 

gtagcctatc 

aagccggtcg 

tatctcaccg 

caacccggca 

cctttttttg 

ggcaccatcg 

ttacgaatca 

acatttacca 

cgtaatctgc 

actctgctta 

ctgatgaaag 

ggcaacaaac 

agtaagcttc 

ttcggtatct 

ccggaatggg 

cttgacctga 

acggagaatc 



ctgccggaga 
gtttcatttg 
acatcaacaa 
ttcatgtcaa 
tcctggaagt 
acgaacaaga 
cgggtggaga 
gcacgatcct 
tcatccggat 
ctaaacaaaa 
tgttatggcg 
aattcagcag 
aaaagattct 
aactcggact 
gatggacagg 
tccctgctat 
ccccggagtt 
acaattgggc 
ataattggga 
aggccaaaca 
atccgcgcaa 
ccggaggaat 
ggattgaacc 
ctatccatta 
gcaatcctaa 
ccgatgtagc 



acgcttaagc 
tgaatataat 
caataccatg 
tgcacaagag 
tgctccggac 
cctgaaaaac 
agattatttc 
gcgttatgta 
gaaagatgac 
tgtcatcaaa 
ttatgcttcg 
tgactgggct 
cgatacgaag 
ggaacagccc 
caactaccag 
caatccatac 
tatctttacg 
acgcaactac 
aaatacttac 
cctgggcgta 
cgatgaccat 
ccctgcatta 
ggagatggtg 
cccgaaccgg 
agtacaagac 
cttctttaaa 



aacagagctc 
gaagtatatc 
aaaaaattac 
tccatacaga 
ggacgtctgt 
ctttccggct 
gaaccggctg 
tcttcggaac 
caatatccgg 
acatggagcg 
acaatgcttt 
aaagaggtgc 
ttaggtagcc 
gctcaggagc 
tttactttcg 
gcctcggact 
ttgagtaaca 
caactgaaag 
ttcaccttcg 
gatatgttcc 
gccggcctgg 
gtagagaaag 
aatcccaaaa 
gaaacttatt 
ttcgtgtttg 
tgggattgca 



tgaaccacaa 
ttactttgca 
ttgcaacatt 
ttcgcatctc 
atcaatctta 
cctcacgagg 
tagccattac 
agaaagcagt 
tggacgtcac 
agatcaagca 
acttctcaaa 
agatgagtac 
gtgctgccat 
atcagggaca 
aagtggacaa 
atcaattgaa 
acggtacggg 
acggcaaggg 
atgaagaatt 
tgcttgacga 
gcgattggga 
cgaaagaagc 
gtgacctgtt 
atttccgtaa 
gtgtcgtaga 
acagtccgat 



acatccattt 
cacacaatgc 
actgattctt 
gacagatcgg 
tctgggtgac 
atgggaagtc 
gaacaacgat 
ggaaggtgga 
actgcactat 
tcaacaaaag 
ccaaaaatat 
acagcaattg 
gcacatgcaa 
agtagtattg 
tgaaggcgac 
agcaaacgaa 
tgaagccagc 
agaccgaatg 
actgggcaaa 
cggatggttt 
agcgatgaaa 
cggtgtcaaa 
cgaaacacat 
tcagttggta 
taagattatg 
tactaatatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 
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tattcgcctt 
tataatgtat 
ggtggaggtg 
gataataccg 
gccaaagcga 
accgatgttg 
gatgaactta 
ctagatggtg 
tatactgccc 
ggcgagaaac 
aaggaaatta 
tccggtgact 
cgggtaatag 



acctgaaaga 
tgaaacgggt 
cacgttgtga 
atccggtaga 
tgtgtgcaca 
ccagtatgtg 
cttattgcca 
atcaatatcg 
ccgatgcttc 
tactaccggt 
atctgatgcc 
atctgatgaa 
agttggtagc 



taaacaagga 
aaaagagaaa 
ttatgaagca 
acgcttattc 
cgtaacaagc 
taaactcggt 
ggaagcagta 
tctcgtatct 
gaaagccgtc 
aaagctccgg 
gggtcggaaa 
aataggattg 
agagtaa 



cagctctaca 
tatcctaatg 
ctgaagtact 
attcagtggg 
tggaacagca 
ttcgacatcg 
gccaattata 
ccatatgatg 
ctctttacct 
gggcttgatg 
tccaatttgt 
aatgcattta 



tcgaccacgt 
tgcccatgat 
tcaccgaatt 
gcttctcaca 
aaacaagtgt 
gactgaaaga 
aacgcttgaa 
gcaaccacat 
acgacatcca 
cccaaaagat 
cgggtaatga 
caacttcaca 



gcgcggtata 
gctttgctcc 
ttggtgttcg 
gttctttccg 
gaaattccgc 
catgaaagca 
acctgtcatt 
ggcagtgatg 
tccgcgtttc 
gtaccgggtg 
aaaaatcttc 
aaccaatagc 



1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2307 



<210> 40 
<211> 1218 
<212> DNA 
<213> B.fragilis 



<400> 40 

atgatgaagt 

tcgccattag 

gaaatggcag 

actgtacctc 

gctgcattgg 

atggggcatg 

tcctttggtc 

aataaccgac 

ggaaatatac 

gaaacggaac 

gataggcatt 

ccggcagggt 

acggtgatgg 

tataataacc 

ttgaccccac 

gtaaggctaa 

catactttac 

gactggttct 

aagcaaatgg 

aagctaaccg 

cgattgaaat 



tgttccgcga 
cttccgggga 
gctctgatag 
ttagttattt 
tatcttcttc 
tccctttcaa 
aaggtgcggg 
tttatgtgtt 
ttcaaccgat 
gaggacgagt 
cgcctatgat 
abttggccgt 
acattggttt 
aggagtttag 
acagcttcgg 
gtccacattc 
gaggcgcttt 
tttcatctca 
ttgaggatcg 
aactgcttag 
gtagatag 



gatattgatt 
gataaatgat 
cgtttgggtc 
tgtcgaggag 
caaaacaatt 
actttttact 
tgaatatggc 
atgttggcag 
tcgattggcg 
gcatgtttgt 
ttggacgcaa 
gaatgattat 
ctggtttgga 
gcttcttccg 
agagttgccg 
gtcaactacc 
tgttgaaata 
tgacggatat 
tttgtcttca 
gagtactaaa 



atttgtcttc 
gtttggggac 
tgccacttgt 
ctggaaatgg 
attggcaaac 
aaaagcggga 
ttagcttatg 
gccgaccata 
cattggtcac 
gctctttctt 
agtttggatg 
ggaaatgaaa 
ggccaatatc 
cgttttacgc 
aatcacttct 
actcctccgg 
tacaatgatt 
tatgtttgga 
ggtgagattg 
gaaaacgata 



ttgggaagtt 
ataaacaagt 
ctatgttgaa 
tcaaacttga 
aatatatttt 
cttatttgag 
atgcacagat 
tcttggtatt 
ctaaaggggt 
ttaatcgtga 
gcaagattat 
tcaaatctct 
gtaacgattc 
ttgattatgg 
ggggagaaat 
aatattatat 
tcttaggggg 
atgttgaacc 
tctcggattc 
acaattatat 



aatagcttgt 
ggctacgatt 
ggatacggtt 
taatcgggat 
agtacataaa 
ggatatcggt 
ggatgaggag 
cgatttacaa 
atttcatgta 
ctttgtaggt 
aaaagaactt 
aaataatggg 
attatatcac 
aggacatgaa 
atcatatcct 
ggttgataaa 
cattcctgct 
tgtacgattg 
ggaccgaaga 
tttctatggt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1218 



<210> 41 
<211> 1203 
<212> DNA 
<213> B. fragilis 



<400> 41 

ctgatatttg 

tttctcggga 

tgctgctgct 

tacaataata 

ctcatgtccc 

tgcaccaata 

gtagtcaagc 

acgatgttcc 

actgacatat 

cagacgctga 

tacagatacg 



ccaccccact 
ttagcatgag 
gcatgattaa 
attgcgatat 
ggagtatcga 
tgctgtgtga 
aacgaagttt 
agatcttccg 
tcagtcttaa 
ttgatagaaa 
ttcatcatgc 



cttctacgag 
gcttatctat 
tggcttcttt 
cggtcacctt 
ggaacgtaat 
tacctccggc 
taccatggtc 
gcgcatcttc 
atccaaattc 
ccatcatacc 
ttgccaattc 



ataattcatc 
tttattaata 
ggtctggggc 
tgcaccacgg 
cttacgtcca 
ttcacctgca 
tacgtgtccc 
ctcttcaaca 
ttcagccaca 
aatgctcata 
attagcagtc 



tgagccaatg 
gcaaatacga 
attacatcat 
gcacgcattg 
tcttccaatg 
attacatttg 
atgactgtaa 
atggcttggg 
agattaatcg 
caagttccga 
acaaattctg 



tctctttaat 
taggaacacc 
catcggcagc 
cagtaaatgc 
taacatgata 
ctttacgaat 
caatcggagc 
ctacttctgc 
tttctgcatc 
taacctgatt 
tcagtttcag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 
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taccttgctt 
acgtttttct 
cgtttcttta 
tttgaagcgg 
atttcctcca 
agcaacattg 
accgttagga 
gatttccttg 
ctcttcacgc 
taa 



tctgccattt 
ttacgatatt 
acctgctttg 
tctttattat 
cggttattat 
ttcacatcta 
tcaaggtttt 
atgatggctt 
tctttccgct 



cctgatcttc 
tggcaccttt 
ctacatcttc 
tgttattacg 
tcgtacgctc 
ctttttcctt 
ccttaccaac 
ccttcatctg 
tttcctcttt 



cagttcctgc 
gttcttacct 
ttcgcttact 
gtttctattc 
actgtttgga 
attattgttg 
gaccttagct 
tttcttctga 
cgatttcttc 



atacggttgg 
ttgcttgtca 
tcctgcttta 
tgtccgccgc 
gtagggtgtg 
atgcgattac 
tgtttactat 
tcctgacgaa 
ttcggacgtg 



acgccatgtc 
gacgagccaa 
ctacaggctt 
caccttgttg 
caaaattaga 
gtttcttctt 
cttctttacg 
gcttttcctt 
tcgactgatt 



720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1203 



<210> 42 
<211> 525 
<212> DNA 
<213> B.fragilis 



<400> 42 

tttaatgcag 

cggatcttaa 

ggtttctctt 

ttgacttcct 

ggttcttctt 

ttcacttcga 

ggttgtggtt 

aaatcaattt 

atcacatcgt 



ccaagtcaat 
atacgccctc 
ctttcttctc 
caaccacttt 
tcaccggttt 
ttacaacagg 
ttggctcttc 
ttccgacagg 
cagcaacagt 



ctgcccgata 
ctcttccttc 
tgttttttct 
cttttctgtt 
cggttccgaa 
cttaacctct 
tttcatcggt 
tttaaacttc 
cttctccggt 



acattaatct 
tctactggag 
ggagacacca 
tcaacaggct 
gtagcaggag 
tccgctactt 
tctttctcaa 
ggacgcacat 
tccttcttat 



tagatacaaa 
taaccggtgc 
cgacttttgg 
tttcaaccac 
tcacagtgac 
tcttttcctc 
ccttccggtt 
cttccggaat 
cataa 



ctcagtcgga 
ttctgctacc 
ttcttctttc 
caccggtttc 
ttcttctttt 
agcagctaca 
cagtttatct 
gaccgtctta 



60 

120 

180 

240 

300 

360 

420 

480 

525 



<210> 43 
<211> 1269 
<212> DNA 
<213> B.fragilis 



<400> 43 

ataattatgg 

gaactgaaga 

gtgatcgcga 

ggtgactttg 

aatatgcaaa 

gaagtaaccg 

cagacactgg 

gataaagtag 

ttgcttgacg 

ttttatcgta 

aatccgaaaa 

gaagtacctg 

gaacgtgcca 

gtaggtgtaa 

gatgtgatta 

aagatttctt 

gaagaagtat 

actgagtaca 

tatttggacg 

ggcattgata 

gatcttgaag 

aatgaataa 



ccaagaaaga 
atatcgatag 
aaatgtttgg 
aaatatggcg 
tttcgttgac 
atgaagtgat 
cttctaaaat 
gtactatcat 
atgaaggaaa 
aaggagaaac 
ttatcctctc 
aaataaacga 
agattgcggt 
agggaagtcg 
attatacatc 
ctattcgtct 
cgttggctat 
ccattgatgt 
agttcagaga 
cggctaagtc 
aagaaacggt 



agaaacaatc 
aaccacgatg 
cactgatgaa 
taaccgtgag 
tgaagcacaa 
tttcgctaag 
tcttgagctt 
caacgcagaa 
cgagttattg 
tgcccgtgca 
gcgtacttct 
tggcctgatt 
agaatcttat 
tattcatggc 
taatatttca 
gaatgaagaa 
tggtaaaggc 
gttccgtgag 
tgaaatcgac 
tgtattgaat 
ggacgaggta 



agcttgattg 
gtaagcgtgc 
aattacgacg 
gtagtggcag 
aaaatcgatg 
ttcggtcgcc 
gaaaaggaca 
gtataccaga 
ttgccgaaaa 
gtggtggcac 
ccggttttcc 
accatcaaaa 
gatgacagaa 
atcgtacgtg 
ttgtttatcc 
gaacgtaaag 
ggtttgaata 
ttggatgaaa 
ggatgggtga 
gcacctcgcg 
ttacgcattt 



atacattttc 
tcgaagagtc 
taattgtgaa 
acgaggattt 
cttcttacga 
gtgctatttt 
gtatttataa 
tctggaaaaa 
cagagcagat 
gcgtggacaa 
tgcagcgctt 
agattgcccg 
ttgaccctgt 
aacttcgcaa 
agcgtgcttt 
cagaagtatt 
ttaaactggc 
acgcgcagga 
tcgatgctat 
aaatgctgat 
tgaaatcgga 



ggaatttaag 
gttccgcagt 
cccggataag 
gactaacccg 
agtgggtgaa 
gaatcttcgt 
taaatacatt 
agagatgttg 
accaagcgat 
caaaaacaac 
gttcgagatg 
tattcccggt 
aggagcctgc 
tgaaaacatt 
aagcccggct 
cttgaaaccg 
cagtatgttg 
tgaagatatt 
caaggctatt 
tgaaaaaacg 
gtttgaagat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1269 



<210> 44 
<211> 855 
<212> DNA 
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<213> B.fragilis 



<400> 44 

cttccctttt actattacgc agacggacgg aaattccata ttacaatggt tggacgaggt 6 0 

tatttttgga aaagaataga taatgaaata attgataaga tcatgttaga gataaaagac 12 0 

ctgcatgcca gcattaacgg caaagagata ttgaaaggca ttaacctgac ggtgaagccg 180 

ggcgaagtac atgccattat gggacctaac ggttcgggta aaagtacgct ttcgtctgtt 240 

ctggtaggta atcctgcttt cgaagtgacg aaaggaagca tcacgttcta tggtaaaaat 300 

cttttggaat tgagccctga agatcgcagt cacgaaggta tttttcttag ttttcagtat 360 

ccggtggaga tcccgggcgt gagcatggtg aactttatgc gtgctgctgt caatgaacag 42 0 

cgtaagtaca aaggattacc cgctttgaca gccagtgagt tcttgaaatt gatgcgtgaa 480 

aagcgtgcag tggtcgagtt ggataataaa ttggccaatc gttcggtaaa tgaaggtttc 540 

tcgggtggag agaaaaaacg gaatgagatt tttcagatgg ctatgctcga accccgtctc 600 

agtatcttag acgagactga ttccggactc gatatcgatg cgcttcgtat tgtagccgaa 660 

ggagtaaata aactgaaaac tcccgatacc agttgtattg tcatcaccca ctatcagcgt 72 0 

ctgctggact atataaagcc ggacattgta catgttcttt acaaaggacg tattgtaaag 780 

actgccggtc cggaactcgc tcttgagttg gaagagaagg gatatgattg gattaagaag 840 

gaattaggag aatga 855 

<210> 45 
<211> 195 
<212> DNA 
<213> B.fragilis 

<400> 45 

ttgtggggct attggcagca aaagcctttg tacacctcac tggctgtgaa gttatgccct 60 

acggtgaccg actccatgac tgtggcgcag atactggcat ttatcattat ctggatcgct 12 0 

gtggccgcta atctttacat tggtggcttc agtattaacc aaggcattgg aggcggtttc 180 

acttggctgg cctga 195 

<210> 46 
<211> 348 
<212> DNA 
<213> B.fragilis 

<400> 46 

tataaacctc taaatgaaac aagaattatg ttattagcaa ccactccaat catcgaagga 60 

aaacgaataa ccacttatta tggcattgtg tccggagaaa ctattatagg tgccaatgtc 12 0 

ttccgtgact tttttgccag tattcgtgat atagtaggcg gacgctccgg ttcatacgaa 180 

gaagtgcttc gtgaggcaaa agatactgct ttgaaagaaa tgtctgaaca ggctcgccaa 240 

atgggcgcta atgctgtgat cggagttgat ttggattacg aaacagttgg gggaagtggc 3 00 

agtatgttga tggtaactgc tagtgggacg gctgtgttct tggaataa 348 

<210> 47 
<211> 1662 
<212> DNA 
<213> B.fragilis 

<400> 47 

attagcctgt atatacctac caccggacag ggatatacag gctattttac cttacaaaaa 60 

caacacctta tgaaaaagaa gaaagttact acttattgct gcctcctgtt attggcaagc 12 0 

tttttcacaa ctgtcacggc acaaaacaca aatactccca tgatggggtg gagttcatgg 18 0 

aacaccttcc gagtacatat taatgaagaa ctaattaaag agacagctga tgccatggtc 240 

aaccggggtc tgaaggatgt aggctatgga tatgtgaaca tagacgacgg atactttgga 3 00 

ggacgaaatt cggaaggacg tctttttgcc aataagaaaa aattcccgaa tgggatgaga 3 60 

gtcctgtccg actatattca ttcaaaggga ttgaaagccg gtatatattc tgatgcgggc 420 

agcaacactt gtggctccat ctatgacgca gatacactcg gtatcggtgt agggctttgg 48 0 

aaacacgatg atatagactg ccaaaccttc ctcaaagact ggggatatga tttcattaaa 540 

atagactggt gtggcggtga agcaaccgga caaagtgagc agcaacgtta tacggatatc 600 



24 



tacaaagcga tcagacggac aggacggaca gatgttcgat ataatatatg ccgttggcag 660 

tttccgggca cttgggctac ccagttggca ggttcctggc gaatccatac agacatcaat 72 0 

ccacgattca caacaatcga ccgaatcatt gaaagaaatc tctacttagc accttacgca 780 

agcccggggc actataatga catggatatg cttgaagtag gaagagggct cacggaagac 840 

gaagaaaaaa ctcattttgg aatatggtct atcttgtcct ccccgttaat gatcggatgc 900 

gatcttcgta caattcctga aaaaacttta tcgatcatta ccaataagga agtgatcgca 960 

ttaaatcagg attcattagg tctgcaggct gaagccattg aacggggaaa agactatctg 1020 

attttatcaa aagccattca gaaacgtgaa ggcaaactac gtgcagtagc actatataac 1080 

agaagcaata cagatcagca gatcagagtc gatttcgata agctctattt atcaggggat 1140 

gtacgagtga gagatctatg gaaccatcaa gaaatgggaa cattcaccga ttactatgaa 12 00 

acgctagttc ctgcacatgg aacagcttta ataagacttg aaggtagcaa acgtcacgac 1260 

cggacatgtt atgaagctga atatgctttc atgcaagaat ttctgccaga caacaaacag 132 0 

gcagctcatt ttacaccaaa atcaggagcc tcaggagaat atattatgaa aaatcttgga 1380 

aattcacctt ccaattgggc agaattcaga aacgtgtata ttagcaaagg aggagattat 1440 

caacttaagt taacttatta ttcaggtgat aaacgcgata tccaaatagc tgtaaacgga 1500 

acagaatata aacagtctaa cctttattcc ggtacatggg atcaagcagc tacaacaact 1560 

atcaaggtta aacttcgcaa aggctataac acgatacgtc tgtataattc gtacgggtgg 162 0 

gcacccgata ttgataaaat ggaaatcatc aaaggtcgtt aa 1662 

<210> 48 
<211> 1350 
<212> DNA 
<213> B.fragilis 

<400> 48 

ataacccgac aattcatgaa aaacaccaac cgttccattc tccataaaga tggagtaagt 60 

tatatcctac catttatctt agtgacctct tgttttgctc tatgggggtt tgctaacgat 12 0 

attaccaatc caatggtgaa ggctttctcg aaaatattcc gtatgagcgt cactgatgga 180 

gcactagtac aagtcgcttt ttacggggga tactttgcaa tggcctttcc tgctgcaatg 240 

tttattcgca aatactctta taaagccggt atcctgttgg gactggggct atatgctttg 3 00 

ggtgccttgc tgtttttccc agcaaagatg acaggcgatt attacccttt tctgctcgct 3 60 

tattttattt tgacatgtgg actctcgttt ctggaaacaa gtgctaatcc ttatatatta 420 

tcgatgggta cagaagagac ggcgacccga cgattgaatc tggcgcagtc gtttaatccg 480 

atgggatcat tgctcggcat gtatgttgcc atgaatttca ttcaggcgcg tctgaatcct 540 

atggatacgg tagaacgcag ccaattgtct ccggcagagt ttgaagtatt gaaagagtcg 600 

gatctctctg tgttgattgc tccttatctg attataggat tagtaattct agcgatgctt 660 

tttgtgatac gtgccgttaa aatgcctaag aatggcgata agaaccataa tattgatttt 72 0 

atacccacat tgaagcgtat ctttaaaatt ccccattata gagaaggagt catagcacaa 7 80 

tttttttatg taggtgcaca gattatgtgt tggacttttg ttatccaata tggaacgcgc 840 

ttgtttatgt cgcagggaat ggaggagaag gctgctgaag tgctttccca ggaatataat 900 

ataattgcta tgattatttt ttgcataagc ccgtttcgtg tgtacattta ttcttcgcta 960 

cctgaatccg gggatgcttc tcaagattct tgcgattgcg ggtggtgctt ttacgttagg 102 0 

tgtgattttt ttgcaagaca tatggggatt gtattgttta gtagctgttt cggcttgtat 1080 

gtcactaatg tttcccacga tttatggcca ttgctcttcg tggtttgggt gatgatgcca 1140 

aatttggggg ctgccggttt gattatggca attctgggag gctctgtgtt gccaccatta 12 00 

caggcttgta ttattgacca acatacattg ttgggtatgc ctgctgtaaa cttgtctttc 12 60 

atacttcctt ttatctgttt cgtagtgatt atcatttatg gacatcgtac gtgtgcacgt 132 0 

gtgaagaaga taaaagcagc acgaaagtaa 13 5 0 

<210> 49 
<211> 1722 
<212> DNA 
<213> B. fragilis 

<400> 49 

gcaaagcggc atataccatt aatacggctt tcgaactgga acagaaactt gacttccctt 60 

accaaaggat tgaattttaa agcactggtt tcttttaaaa actggtcgaa gacgactgtc 12 0 

aatcgctcct tttcacctta cttttatgaa ttacagaatc ctcaggagca agaagacgga 180 

agctatcttt atgattataa ctctatcagt aagggacgta ccgctcttga gacatcgact 240 
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tccactactg gcgaccgtct gatgaacctg caggctacac tgaactatca gcgcatgttc 3 00 

ggtgataaac atgatgtcgg agcaatgttg gtatatcttc agcgcgaata caatctgaac 3 60 

aatcctgaca ataactatta caatacattg ccggaacgta atcaggggct ggccggacgt 42 0 

gttacctatg cttatgacgg acgctatttg gctgaattca atttcggcta caatggtagt 480 

gagaacttcg aaaaaggaag ccgttacgga ttcttccctt cactcgctgt cggctatctt 540 

atctccaacg agaaattttt cgaaccattg acaaaagtta tctccaactt aaaaatacgc 600 

gcttcgtacg gattggtagg taatgcggat atcggctcca accgtttccc ctatcttact 660 

aaagtagatt tgggtggagc cggatttgta ttcggtgacc agtggcaaac ctcatctaac 72 0 

ggagctacca tcactactta cggagctgaa aaggtgacat gggaaatcgg taaaaagtat 780 

aatgtaggat tcgacctggg attattcaac aaattaagcc tcaacgtaga tttctttaga 840 

gaagaccgta aagacatctt ccttagacgt aatacaatcc ctgcagaaag tggtatcacc 900 

ggagatctcc gaccctatgg taatctgggt aaggtacgca atcaaggcgt tgacatgtca 960 

ttggactata atcacgctgt cagcaaagac ttcatgatct ctgccaaagg tactttcaca 1020 

tacgctaaga accaatatat ggaaatagac gaaccggact acgaatatgc atacatgtca 1080 

caagtaggac gccccctgaa tcagtataaa ggctatattg cattaggact cttcaaagat 1140 

caggaagaga ttgacaacag tccaaaacaa atactaaccg gagttgtgca accgggtgat 12 00 

attaaatatg cagacctcaa taatgacgga aagatcgacg gaaacgatca aacttacatt 1260 

ggtaatccgg aattacccca aatcagctat ggtctgggag tcagtatcca gtacaaaaaa 132 0 

tgggatgctt ccatcttctt tcaaggagta ggcaaaagaa gcatcatgtt gagcgacatc 1380 

catcctttcg gtggagaatc gtatggtgtc atgcaatttg ttgccgataa tcattggaca 1440 

gaggcaaacc cgaaccccga agcaatgtat ccgagactga caaacgggaa aaacaacaat 1500 

aataacccca actctactta ctggctgaga gatggttcgt atatccgact taaaaacgtg 1560 

gaattaggat actcttataa atttttacgt gcctatatca gcggacaaaa cctgctgaca 162 0 

ttctctaaat ttaaattatg ggatccggag ctctatacct caaacggatt aaaatatccg 1680 

acacaaatca tgggttccat cggtttacag ttcacttttt aa 1722 



<210> 50 
<211> 1668 
<212> DNA 
<213> B.fragilis 



<220> 

<221> unsure 
<222> (1640) 

<22 3> Identity of nucleotide sequences at the above locations are unknown. 



<400> 50 

aatcaatgta aatgtatgaa aaagaaagca attccttgtc ataaggcagg gaggattacg 60 

tccttttttt tattaattag tattttttta cttataccga gtatcactac tccggtttat 12 0 

gctgtagaaa cttataccca gcaaactgtt tttacgcttc acgcaactaa taaaacagta 180 

aaagaagtgt ttgaatacat cgaaaaaaac agtgaatttg tcgttttgta ttcaaaagat 240 

cttttacctg tactgcagaa gaaagtgtct gtttcgatag ataaacagaa tgtagaatcg 3 00 

attctgaata tcttgtctaa agaagcggga ttgaagtaca acatcaacga ccgtcagatc 3 60 

acaattacca aagttacggc agaagcacct caacaggaaa aaaaaatcaa aatcaccggt 42 0 

caagttcttg acgaaaacgg agaagggatt ccgggagcaa atatcgtaat aaaaggcaat 480 

agtacattgg gaacagtaac caatgtcgaa gggaacttta cattaatggc tccggaaaat 540 

agcacattag tagcctcctt tatcggatat acccctgttg aaattccgct aaaagggaaa 600 

aagatagttg ttttcaaatt ggtacctgac gcccagagtc tggaagaagt agtggtagta 660 

ggattcggaa cacagaaaaa agccagtgtt gtaggtgctg tacaatccat caaaccggct 72 0 

gaacttcgag taccttccag taacctgagt acatcatttg ccggacgtat agcaggcgtg 780 

atttctatgc aacgcaccgg tgagccgggt gccgatggag caaacttctg gatacgcggt 840 

gccgcaacct tcagcggaac gactgatcct ctgatcttca tcgatggtgt cgaagtttcg 900 

gcaggagata tgaacgctat tccctcggaa gctatcgaaa acttctcaat attgaaagat 960 

gcctcggcta cagccctcta cggagcacgc ggtgccaatg gtgtcatcct gatcactacc 102 0 

cgaaccggta aagatcttga aaaagcacgc atcaacgtac gcatcgataa tacatttacc 108 0 

gcaccgacac gtacactcaa actggcagat gcagtaacag ccatgaaatt gagaaatgaa 1140 

gccattctga cccgtaaccc ggatggtaca ccggctttct cagatgataa aattcaagga 12 0 0 

acgcttgaag gcagaaatca gtatgtatat cccaacgttg attggttcga ctatatgttt 12 60 

aaagactact ccatgaacca atcagccaac ctgaatgtaa tgggtggtac aaagaaagta 132 0 
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gactatttca tcagcgcctc catcaataat gataatggta tgctgaaaaa agatccgaat 13 8 0 

aacacattcg acaacaatat acagaatctt cgctactcgt tccaaagtaa cgtgggagca 1440 

tggttgacat caagtaccaa agtaaatgtg agaatcaact cgcaaatagt caattacaat 1500 

ggtccgtcaa ccagtatgga cgatttgtat aaatacgtaa tggaagctcc gtcaatgtat 1560 

tttgcacctg tatatccgaa tatcaaccgt gaagatcaca ctatattcgg aaacaaatca 1620 

ggtggtccta tcggttccgn aggattcagt atttatcgca acccttaa 1668 



<210> 51 
<211> 411 
<212> DNA 
<213> B. fragilis 



<400> 51 

atattaagaa aagaagttta tat tt tat at ttttgcagcg cacatatggt aaccattact 60 

ctatatatga acaacaacat agaatatatc agcaagataa agaaaggaga agagacttct 12 0 

ttccgtcatt ttgttaatag ctattcgaaa gacttgttct actatgcaca gtgtttcgta 180 

cgaagcaaag aaaccgctga agaagtagtc agcgacgtct ttctggatgt atggagacac 240 

cgcgaagaaa tagatgaaat caagaatata aaagcttggt tgctcacatt aactcataac 300 

aaagccatct tctatctgag aaaagcggaa aattcaagtg aaattgcttc atgggaagaa 360 

atagatgatt ttcaaataat cggaaatctg caactcccca tgaagagatg a 411 



<210> 52 
<211> 1851 
<212> DNA 
<213> B . fragilis 



<220> 

<221> unsure 
<222> (920) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 52 

ataattatga aactaaaaaa tataattgta gctttactaa tcggagctag cttacactct 60 

tgtgattatc tggacattgt acccgatgac acccctattt tggctgatgc gttcaagaac 12 0 

gaacagactg ccgagaactt tgtcttcgcc tgctattctt tcattcccaa ttatctgaac 180 

ttccgtcaga acttcagttg gtgcacaact ccggaaactg tcggatctgc ccactggacc 240 

actacttggt tcacctttat gagaatgcaa caaggattgt acaattctgc tgatccaatc 3 00 

attgatgtgt ggcaaagttc atacaacggt atccgccaat gttatacgtt cttggataat 3 60 

attgatgatg taaagccatc acaaatctca gaggcagacc tcgcagccaa gaaagtactt 42 0 

tggaaaggtg aagtaaaatt tctgattgcc tactaccact acctgctatt acagaactac 480 

ggtcctatag tcatactgga cgaagcaatc cctcttaatg cacccaaaga agaacttttc 540 

aagccgcgtg taccctatga tgaatgcgtt agccgaattg ctcaaatgtt cgataatgcc 600 

tctgccgacc tgcctatgac agtgaaagct tccaactacg gtcgtgctac aaaagtcatt 660 

gcacaagcac taaaggcaag aatgtacttg tacgcagcca gcccacagtt caatgggaat 72 0 

gctgatatgt ataagaattt caagaacaag gacggacagt tgctcatgaa cctgacttat 7 80 

gacaagaata aatggaaaac tgccatggac gaatgtaaaa aggcaatcga catggcacat 840 

caagccggag cagaattgta taagtataca aagaaaggta atctgccgga attcaaccaa 900 

gccattgcca atgcacgtan acctgttgta gacgcatgga ataaagaact gatctgggga 9 60 

tatagtggct ggaaagaaac atgggccgat ggaaactcta ttcaaacaca cgtaattccc 102 0 

aaaggtatca gtacttcctc gggagcacct tatggagctt taggtgcaac ggctttcagt 10 8 0 

gcggacatgt atctgaccaa gaacggactt ccgatagatg aagatccaga gtttgattat 1140 

gcacatcgtt tcacagtagc cgaaggggat tcggtagcag tgctccatcg caaccgtgaa 12 00 

ccacgtttct atggttctat cggcttcaac cgcggggact acctgatcaa cggagacacc 12 60 

attaacctca aaatgcgctt caaagagcaa aatggaacac gtgatgcggg aagtgaccaa 1320 

ttatatggat cgtatgctat cgccaaactg gctcatccag aaacttttgt tagtggtacc 13 80 

agcaactctc tggtagcttt ccctttccct atcatccgct taggagaatt gtatttggac 1440 

tatgcagagg cttactttga atacaatgga acactggaag gagatgcact tacttacttc 15 0 0 

aacctgatcc gccagagagc cggtattcct aatgtagaag tttcctacaa aggacttccg 1560 

tccggagaca aacttcgtga ggtaattcat cgtgaaagaa ccatagagct gatgttcgaa 162 0 
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ggacatatgt catacgacta tcgccgttgg ctgattgccc tgaaagaatg gagcggtatg 1680 

gaaaatggta tgatcggatt gaactcttac ggtacaacca acgaagagta ttataaaaat 1740 

gcacgtttgg atgctcaacc attcatcttc agggatgaac agtatttgag tccaatcaaa 1800 

caggattacc tgaatgtaaa ttcaaatctg gtccagaatc cgggttggta a 1851 



<210> 53 
<211> 339 
<212> DNA 
<213> B. f ragilis 



<400> 53 

acgataaaga 

gtcgttggaa 

acagacctcc 

caagccccta 

atctgtcaga 

agcatcaact 



aagaaaaagg 
gtaatactgt 
tccccaaaaa 
tcaaacataa 
atacacatgc 
tcctcttttg 



ttgcaggaat 
acggtacttg 
agttaaagac 
aaaaatgtcg 
ctcagaacat 
tattatacac 



ccttcattta 
ctccgccttc 
agagccctaa 
caaaagcaac 
cttactgacc 
ggcaaatga 



ttatctattt 
ccttggttga 
agtcattcaa 
aactttcacg 
cgttcgatac 



atacggatcg 
cggaggaaaa 
cacatttcag 
acacttcaat 
cagctacaag 



60 

120 

180 

240 

300 

339 



<210> 54 
<211> 1134 
<212> DNA 
<213> B.fragilis 



<400> 54 

aagcagcacg aaagtaatat tgagaatcgg atgcggtgtt tgacgattct tctgggcaac 60 

tgttttcttc tgcttgtgtc attagcctct tgcgggaaag tgtcattagc ggaagaagca 120 

gtgttttcta taccggtgga tacgacattt atgaggcttc gtcaatggga gtggtattgt 180 

cagaaacggg ctgacagttg tctgacagag aataattatc agggagcttt atcttggctg 2 40 

gattccgctc gtatccaagt ggaacattac ggacgtcctt attatatatt ggcacgcggg 3 00 

gacgtatatt attccatcca tcaatatgat tctgcccgtc gttattttag tatggcagtc 3 60 

cattccattc atccacatat tgctatcgaa gcttggagga aacttgcaga actggaactt 420 

atggaaggaa atgagaagca agggttctat tctacgcaga aggcagatgc acttttccgg 480 

gtggagatag gccatgtgca gagtgataac agtgaagctc tatatcagga agagaggttg 540 

aaaaacgagt taaaccaatt gaagattgcc aaacagaata gggaaattgc catgttaact 60 0 

ttgagccttt gtctgattat actgattgct ttgtttattt tctaccggca aaataagata 660 

aagcgtgaaa aagagcgtct gcttcttgaa gagaaagcca agttggagca agagaaccaa 72 0 

atactgaaac aaactgaaga gttaagtgct ttgagagaaa aagaggcggt tttgcgagag 780 

tctttgttcc gtaaggtcga tgttttgcgt aaaataccct ccctcaatga agaagaacag 840 

gagagtggtg aacatcgcat agctttgtcg gaaagggagt gggaggaaat tcgtcagaca 900 

gtggataatg cttatgatgg gttttcacaa cggttgcttg cacgctttcc tttgttgacc 960 

ttaaaagata tttatttctg ttgtctggtg aagatcaatg tcagtataaa ggacctttcc 1020 

gatatttatt gtattagtcg tacctcggtt agtaaaaaga aatttcgcat caagcgagag 1080 

aagcttggag cagaggattc ggactcttta gatgactttt tacgtggttt ttag 113 4 



<210> 55 
<211> 471 
<212> DNA 
<213> B.fragilis 



<220> 

<221> unsure 
<222> (228) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 55 

tcaatgatag aaaaaagaac tgtttgtcag attgttgaag aatggctgga ggataaagac ' 60 

tattttctgg tagaagtgac cgtcagccct gatgacaaga ttgtggtcga aattgaccat 12 0 

gcagaaggtg tttggattga agactgtgtg gagttgagtc gcttcattga gtcgaaactg 180 

aaccgtgaag aggaagatta tgagctggaa gtacgttctg ccggaatncg acagccattt 240 
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aaagtattgc aacagtacta taaccacatc ggcctggagg tggaagtgct gactaaaggc 3 00 

ggacgcaaac tgagcggggt cttgaaagat gctgatgaag aaaagtttgt tgtgaccgta 3 60 

caaaagaaag taaaacccga aggagccaaa cgtcctcaat tggtagaaga ggatgaaacc 42 0 

ttcacctatg atgatataaa atatactaaa tacttaatta gttttaaata a 471 



<210> 56 
<211> 1566 
<212> DNA 
<213> B. fragilis 



<400> 56 

ccaaacaaag aaggagcagt ccttgttata ttatcttatg gaaagctttg cggggatctt 60 

ctttcctgca gcaaaagagg ttacacaaca atatatattc aaataaaaat gatgcaacaa 12 0 

gaagaaccca ataaatatgt aaaagaactc acgcaggaga agtataaata cggcttcact 180 

acggaggtac atacagatat catagagaag ggactcaatg aagacgtggt acgtttgatc 2 40 

tcgtctaaaa agaacgagcc ggagtggttg ctggagttcc gtctgaaagc ttatcgtcat 3 00 

tggttaacgc tggagatgcc tacttgggca catttgcgta taccggaaat tgactatcag 3 60 

gcaatctcat attatgccga tcctacgaaa aagaaggagg gcccgaagag tatggatgaa 42 0 

gttgatccgg aattgataaa aacattcaat aaactcggca ttccactgga ggagcagatg 480 

gcattgagtg gtatggctgt ggatgcagtg atggactctg tgtcagtgaa aacgaccttt 540 

aaggaaacac tgatggagaa aggtattatt ttttgctcat tcagtgaagc tgtgcgtgaa 600 

catcccgact tggtgaaaaa gtatctcgga tctgttgttg ggtatagaga caacttcttc 660 

gcggcattaa actcggctgt attctcagac ggttcttttg tctatatccc caagggggta 72 0 

cgttgtccta tggaactctc tacttatttc cgtattaatg ctgccaatac cggtcagttt 780 

gaacgtacat tgattgtggc tgatgacgat agctatgttt cctatctgga ggggtgtaca 84 0 

gctccaatga gagatgagaa tcaattacac gctgctattg tcgaaatcat ggtacatgat 900 

cgtgcggaag tgaaatatag caccgtgcag aattggtatc cgggcgatgc cgaaggcaaa 960 

ggtggagttt ataattttgt gacaaaacgt ggcaattgca aaggagtaga cagtaaactt 102 0 

tcatggaccc aggttgagac aggttcggct attacatgga aatatccgtc ttgtattctt 1080 

tccggggata attctactgc agagttttat tctgtagctg tgacgaataa ttatcagcag 1140 

gcagatacag gtactaaaat gattcattta ggtaagaaca cccgtagtac gattgtcagc 1200 

aagggtatat ctgccgggaa gagcgagaac tcttaccgtg ggttggtccg tgtagccgaa 1260 

aaggctgata atgcccgtaa ttatagccag tgtgactcat tgctgttggg tgataagtgt 132 0 

ggtgcacata cttttcccta catggatatc cataatgaaa cggcagttgt ggagcatgaa 13 80 

gcgactacca gtaagattag tgaggatcag atattttatt gtaatcagcg tggtatttct 1440 

acagaagatg ccattggatt gatcgtaaac ggctatgcta aggaggtact taataaactt 1500 

ccaatggaat ttgccgtaga agctcagaaa ctacttacga tctctcttga aggcagtgta 1560 

ggataa 1566 



<210> 57 
<211> 246 
<212> DNA 
<213> B. fragilis 



<400> 57 

ccaaggcatt ggaggcggtt tcacttggct ggcctgaatc gcatgcttga agccgggctc 6 0 

ggagctttaa aatatctttt attggtgagc ttggttatat gtgtcattca gtttatagac 12 0 

tccgatagtc agttgattag ccaaacaaag aaggagcagt ccttgttata ttatcttatg 180 

gaaagctttg cggggatctt ctttcctgca gcaaaagagg ttacacaaca atatatattc 240 

aaataa 2 46 



<210> 58 
<211> 1341 
<212> DNA 
<213> B. fragilis 



<400> 58 

ctttttacgt ggtttttagt tctcggaaac ttgtttgtcc atgctaattt aagaaatatt 
ttatataata tgttgatata tagtgtggta tcgtatttct ttttgaagta ctttgtctat 



60 
120 
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atcccttttt gtttttcctc tgcccgatgg ttaattttgc aatatcaaaa ctttaaaaat 180 

gtacgtgata tgttgaatcg actgaactat ttcattatgt tggcagggct acttgtttta 240 

gtagcatgtt cgtctaattc cggaaaacag gtagaggttg caaatactcc ttttgtttac 300 

gatggattaa aagagtatcc ggtaaaagag ttgaaactgt ccgatctggc agttagtgat 360 

tatgtgttat taaaagacga tgaaaattca ttgttgggta ggttaccgac aaatccctgt 420 

atgcaggtca ctgaagaccg gatttatatt caggatgaag agcaacaggc tatatttatt 480 

tttgaccgtc agggaaatcc attattgcaa atgcgtcata aaggaggtgg tcctcaggaa 540 

tgggcgtctc tgaactcttt ctatgtggac agccctaata aagaaattat agtgttggac 600 

tgggctaaaa agtttatagt ttatgatttg aatgggaaat ttaaacgaag cttcccaacg 660 

cccggttgct cttggaagtt tgctaatttg aatgatgagg ccgtactgat atattgtcct 720 

tttacgaatc gtaataacgg agaggcagtt tgtatccttt ctaaaaaaga tggcaaaaag 780 

ttgtatgtgt gtcctattac gatagataat tttgtgtggg atagtgaggg acgtattggc 840 

tatgaaccat tgaagccagc ttatggtggg attttatttt cagatttatc attgaaaggc 900 

gtgtatttta ttgatgctga aacatatgaa gtaaaacagg ttattgatga agtgacagaa 960 

tataaatttg aaaatgcaga gttcgttaag ctacatccgg cgatcgatgc taaagactat 1020 

actttgtata ctacgctcgg tacgaaatgg ttgactccag atatgccaat gaactattat 1080 

tattttgata agaaggagca aaaaatgtat actttgaaaa atgaaaccgg atgggctgtt 1140 

ttaaaagata tctgcaatgt acagagaacc cgtacaacga atactccagg tatcggcatt 1200 

ggttattatt ggccttctac tatgaaagga gagtcgatgc aagctgaaaa agagcagttt 1260 

gactctcgtt tccgggcaat aatggactct atacctgaag aaggtaaccc tgtattacaa 132 0 

attatgaact tcaataaatg a 1341 

<210> 59 
<211> 270 
<212> DNA 
<213> B.fragilis 

<400> 59 

ttacaactcc gtgtctccgt ttttgtttta catagaatga cgaccataga tattatcata 60 

ctgattgctc ttggagccgg tgttattgtc ggatttatga agggctttat ccggcagctt 12 0 

gcttccattc tcggattaat tgtggggcta ttggcagcaa aagcctttgt acacctcact 180 

ggctgtgaag ttatgcccta cggtgaccga ctccatgact gtggcgcaga tactggcatt 240 

tatcattatc tggatcgctg tggccgctaa 270 

<210> 60 
<211> 1371 
<212> DNA 
<213> B.fragilis 

<400> 60 

gaaggaatta ggagaatgag tttaattatg aatgcagaac agcaatatat agatctcttt 6 0 

tctcagtgtg aggcgatgat ctgtcgtcat agcgctgagg cgttgaatgc cccccgggca 120 

acagcttttg ctgatttcga acgtcagggg tttcctacac ggaaacaaga gaaatacaaa 180 

tatacggatg tcagtaaatt ctttgagccg gattatgggt tgaacttgaa tcggctgccc 240 

attccggtga acccttatga agtgtttaaa tgtgatgttc cgaacatgag cacttcattg 3 00 

ttttttgtag tgaacgatgc attctacaat caggtgcttc ctaagtccgg attgcctgaa 360 

ggagttatct tcggtagttt gagaaatatg gctgaacagc atccggaact tgtgaagaag 42 0 

tattatggta agttggctga tacttcgaaa gatgcggtta cggcttttaa tacagctttt 480 

gcacaggatg gagtattgat gtatgttccg aagaatgtga tcgtcgatag acctattcaa 540 

ctggtcaata tacttcgtgc ggatgttaat ttcatggtaa accgccgtgt gttgattatc 600 

cttgaagaag gggctcaggc ccgtttgctt atttgcgatc atgcaatgga taatgtaaac 660 

ttcttggcta ctcaagttat tgaagtgttt gcagaagaga actccgtttt cgatctttat 72 0 

gaattggaag agactcatac cagtacagtg cgtttcagta atttgtatgt gaaacaggga 7 80 

gcaaacagca atgtattgct taatggaatg acacttcata acgggacaac ccgtaatacg 840 

acagaagtta cccttgccgg tgaaggtgcc gagatcaatc tttgtggtat ggccattgct 9 00 

gataaaaacc aacacgtgga caataatacc tcgatagatc atgccgtgcc gaattgtacc 960 

agcaatgagt tgtttaaata tgttcttgac gatcagtctg tgggagcttt tgccggtttg 102 0 

gtactggtac gtcctgatgc gcaacatacc agttctcagc agacaaaccg taacctctgt 1080 

gctactcgtg atgcccgtat gtatactcag ccgcaactgg agatatatgc cgacgatgta 1140 
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aaatgctctc atggagctac tgtaggtcaa ctggatgaaa acgctctttt ctatatgcgt 12 00 

gctcgtggta tcgccgaaaa ggaggcccgt ctgttgctga tgtttgcatt tgtcaacgag 12 60 

gtgattgata ccattcgtct gaaggcgttg aaagatcgtc tgcatttgtt ggttgaaaaa 1320 

cgtttccgcg gtgaactgaa taagtgccag ggatgttcta tttgcaaata a 1371 

<210> 61 
<211> 762 
<212> DNA 
<213> B.fragilis 

<400> 61 

aggcagtgta ggataatgaa gacgaaacgt gttggttggc tattgatatt cctgtcctat 60 

gttggtgtag tactggcaca aaaccttgac gatcaagaaa ggaggtgggc gatcagtggc 12 0 

tcttggggag gaaattggcc gatagtcaca aagaatacac tttcgggaaa agctgtttct 180 

gcaggacata tacatacttt aatgttggag tattatattc cttatacccg tttctccctg 240 

aaaggaggat atacaggtga agaaataggt ttgaatccag gtatttctgc ctcaatgagt 3 00 

aatctggaaa taggagggcg gtattatttc ttaccacaac ggtttgcaat ccaaccttat 3 60 

gggggacttt ctactggatg gaacctctct ccacgaaggc aggaggggat gggcagtagc 42 0 

agttattacg atccttcaag gcaagagttt cgtaaagatt acgattatcg ataccgaatt 480 

aaagaaccat tattcacagt ttctcctgtg gtgggagctg atatatattt tctttcttgt 540 

cttgctctca ccttggaata taatttccgg atgggcattg ccggaaagat aagtggagag 600 

atagagaaga ccaattctcg tggaaccgga tttgtacgta gcaatgggat gcggcagacc 660 

gtaagtgtag gggtaaaggt taacttccct tttactatta cgcagacgga cggaaattcc 72 0 

atattacaat ggttggacga ggttattttt ggaaaagaat ag 7 62 

<210> 62 
<211> 879 
<212> DNA 
<213> B . fragilis 

<400> 62 

gaagacggtg gcggatcttc tatggatacg gcaaaggcga ttggtattat taccaataac 60 

ccggaattca gtgatgtcgt ttcattggag ggagtggcag ataccaagaa gaaatctgtt 12 0 

cccatcatcg cgttgcctac tactgcagga actgcggcag aggtgactat caactatgtg 180 

ataacggatg aaaagaacca gaagaagatg gtttgtgtag atcctaatga tattccgtct 240 

attgcgatag tggatgctga gttgatgtac acacttccta aaagtctgac tgcagctacg 3 00 

ggactcgacg cactgactca tgctattgaa ggtttaataa ccaaaggggc atgggagatg 3 60 

agtgatatgt tcgaaattaa agctattgaa atgatcaatc gttatcttgt gactgccgtt 42 0 

gaagaaccat cgaatgcaga ggcacgtaac ggtatggcag tggctcaata tattgcaggt 480 

atggcttttt cgaatgtagg tttgggagtt gtgcatggta tggcacatcc gttgggagct 540 

attttcgata ttcctcatgg tgtggccaat gctctattat tgcccattat tatggagttc 600 

aatgctcctg cagctcttga caaatatgtt gagatagcta aagcgatgaa tgtgtattct 6 60 

actgacatga ctaaagaaaa ggcggcagaa gcagcagtcg aagctgtaaa aacattatct 72 0 

ttgagggtca atattccgca acacttgtcg gacttgggta ttcaggaaag tgatcttgac 780 

cgtctggcca cagcagcgtt tgctgatgta tgtacgccgg gcaatccacg ggaagtaaca 840 

aaagaaatta ttcttgattt atataagaaa gcattatga 879 

<210> 63 
<211> 648 
<212> DNA 
<213> B . fragilis 

<400> 63 

gaaagcatta tgataaccaa tgaacatatc gagcaatacc ttgctcaggc acatcgctat 60 

ggcgatgcca aattgatgtt gcgcagtagc ggtaaccttt catggagaat cggtgaagaa 12 0 

gcgcttgttt ccggaacagg ttcttgggtg ccgaatttgc agaaagagaa agtatccatt 180 

tgtaatattg ctacgggtac gcctcaaaac ggtgtgaaac cttccatgga aagtaccttt 2 40 

catctgggga ttcttcgtga gcgtccggat gtaaatgtcg ttttgcattt tcagtcggaa 3 00 

tatgctacgg ctgtttcttg tatgaaaaat aaaccatcta acttcaatgt aactgcggag 3 60 
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atcccttgtc atgtacgtaa agagattcct attattcctt actaccgtcc cggttctccg 42 0 

gcgcttgcca aggctgttgt ggaagcgatg aaagaacata attctgtatt gctgactaat 480 

catgggcagg tggtatgtgg caaggacttt gatcaggtat acgaacgtgc tactttcttt 540 

gagatggctt gccgtatcat agttcaatcc ggaggggact attcggtctt gacgccggaa 600 

gagattgatg acttggaggt atatgtattg ggaaagaaaa caaaatag 648 

<210> 64 
<211> 1167 
<212> DNA 
<213> B.fragilis 

<400> 64 

tgtaatccta ttaaaataat gagaaagaat aagtttaaat catttgcttc acgcctgaat 60 

aaggatggcg atcatccgga aaagatatca tttgaatctc cggaagaaca agccgaatat 12 0 

gataagctcg actttctctg gaaccgatgt ctccccgaag aaacgggtga accggatata 180 

tgggcaaaag tgcaggcaaa aataaatgcc gacaacaccc cggtccgtct tgccttgaag 2 40 

agcaataaga cggcaaggtt gttcagtatt ctgaaatatt cggcagttgc agcttctgta 3 00 

gccctgttaa taggagccgg ctgttttctt ttattgaatg atgaagagag acatgatctg 3 60 

aataaaatag cacaaagtct gcaaacagaa attccacagg atataaaaga agttacgctg 42 0 

gtggtttcgg atcaaaagaa gatagaattg gacaataatg cccagatcgt ctattcggca 48 0 

acaggtcagg tgcaggtcaa ctctaataaa cttgtggaag atgacattaa agaggaatac 540 

aatcagatta ttgtcccgaa aggtaagcgt tcacagattg tcttagccga taacagtaaa 600 

atatggatca attccgggag taaagttatc tatccccgtg catttgaagg gaaatacaga 660 

gaaatttatg tggaaggaga agtgtatctg aacgtaacac atgatacttc gaaaccgttt 72 0 

attgtgaata cttccggatt tgaagtacgc gttttgggta catccttcaa catatcggct 780 

tataaaaatc aggaaaaagc cgcagtcgta ttggtagaag gttcggtcaa tgtaaaagac 840 

caacaaaatc atcatataaa gatggtacct aacgagaaag tagaacttaa tcaggaaggt 900 

atatcaggaa aagaaaaagt aaatgcccgt gattatatca gttggattga cgggatatgg 9 60 

accttgcagg gagaaagcct aaagcaagtt ttgttacggc tgcaaaatta ttacggacaa 102 0 

aacatccggt gtgatgctgc gatagagaac gaacaaatgt ttggtaaact ctttttaaat 1080 

gatgatttaa atcaggtaat gaagtcaatt ctatctatct tgcctgccga atacacaatg 1140 

aaaaacaatg taatctatat agaataa 1167 

<210> 65 
<211> 1467 
<212> DNA 
<213> B. fragilis 

<400> 65 

cttggaggta tatgtattgg gaaagaaaac aaaatagata atgactacca gcttatgagt 60 

acttacttag cagctgactt tggtggaggt agcggtcgga ttatggccgg tacccttacc 120 

gaaggtaagc taaaactgga agaggtatat cgttttgcca atcggcagat aaaacttgga 18 0 

aactgtgttt actgggattt tctttctctt tttgaagaaa tgaaaaacgg acttcgtgtc 240 

gctgcccgga aaggctatga agtaaaaagt atggctattg acacctgggg agttgatttt 3 00 

gggttaatag ataaggatgg taagttgctg ggcaacccgg tctgttatcg tgattcccgt 3 60 

acggatggta tacctgaaag agtgtttaaa cagattgatc agactgttca ttacgctgaa 42 0 

atcgggattc aggtgatgcc tatcaatact ctgtttcaac tttatagtat gaagcagaat 480 

gatgatgtgc aactccgggt ggctgataag ctattattta tgcctgacct gttcagctat 540 

tttcttaccg gagtagcgaa caatgaatat tgtatcgctt ctacttcaga gctactggat 600 

gctcgtcagc gtaattggtc ggataacttg atcagtgagt tgggattacc ccgtcagctt 660 

tttggtgaaa tcgtttttcc cggaactgtc cgtggcaaat tgaagcagga aatagcagat 72 0 

gaaaccggtt tgggatgtat caatgtcgtt gctgttggtt cgcatgacac agccagtgcc 7 80 

gtatttgccg ttccctccaa tgaacccaat cgggcttatc tcagttcggg aacctggtct 840 

ttactcgggg cagaggtaga tcaaccgatt ctgacagaag aagcacgtgt ggccggattt 900 

acgaatgaag gcggaataca aggtaagata cgttttctac aaaacataac tgggctttgg 960 

attttacaac gtttgatggc tgaatggaaa gaacagggaa aggaaatcag ttatgattgt 102 0 

gcaatagctg aagctacagt gtcggatatc cgttcggtga ttgatgtgga tgattctgct 1080 

ttttgcaatc ccgaccatat ggaagagtcg atcattaagt attgtcataa gcaccattta 1140 

cggacaccag tctctcaagg agaatttgtt cgttgcgtta tcgagtcatt ggcatatcgt 1200 
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tataaattgg gagtagagca gatgaatcga tgtctgccgg caccggtcaa acagcttcat 

attattggag gaggctgcca gaaccgtctg ttaaatcagc ttactgccaa tgctttaggt 

attcctgtgt atgccggtcc ggtagaagcc actgctatcg gcaatatttt agtgcaggca 

aaagcccaag gcgaagtcga ttcttgggaa gaattaaaag aaattatcat aaacagtgta 

gaacctcagg tatattatcc tgaataa 



1260 
1320 
1380 
1440 
1467 



<210> 66 
<211> 3051 
<212> DNA 
<213> B. fragilis 

<400> 66 

aatatgacga taaggttaaa caaagttaca agagatttaa atgtaggaat cgcgacggta 60 

gttgagttct tgcaaaagaa ggggtatacc gttgaggcaa acccgaatac gaaaattacc 12 0 

gaggagcagt atgctatgct cgtgaaagag ttcagcacag ataagaacct tagacttgaa 180 

tcggaacgtt tcattcagga acgtcaaaac aaagatcgca acaaggcatc tgtttctatc 240 

gatggttatg ataagaagga accggagaag actgttgctg acgatgtgat taagacggtc 3 00 

attccggaag atgtgcgtcc gaagtttaaa cctgtcggaa aaattgattt agataaactg 3 60 

aaccggaagg ttgagaaaga accgatgaaa gaagagccaa aaccacaacc tgtagctgct 42 0 

gaggaaaaga aagtagcgga agaggttaag cctgttgtaa tcgaagtgaa aaaagaagaa 480 

gtcactgtga ctcctgctac ttcggaaccg aaaccggtga aagaagaacc gaaaccggtg 540 

gtggttgaaa agcctgttga aacagaaaag aaagtggttg aggaagtcaa gaaagaagaa 60 0 

ccaaaagtcg tggtgtctcc agaaaaaaca gagaagaaag aagagaaacc ggtagcagaa 660 

gcaccggtta ctccagtaga gaaggaagag gagggcgtat ttaagatccg tccgactgag 72 0 

tttgtatcta agattaatgt tatcgggcag attgacttgg ctgcattaaa tcagtcgaca 780 

cgtccgaaga agaaatcgaa agaggaaaag cggaaagagc gtgaagagaa ggaaaagctt 84 0 

cgtcaggatc agaagaaaca gatgaaggaa gccatcatca aggaaatccg taaagaagat 900 

agtaaacaag ctaaggtcgt tggtaaggaa aaccttgatc ctaacggtaa gaagaaacgt 960 

aatcgcatca acaataataa ggaaaaagta gatgtgaaca atgttgcttc taattttgca 1020 

caccctactc caaacagtga gcgtacgaat aataaccgtg gaggaaatca acaaggtggc 10 8 0 

ggcggacaga atagaaaccg taataacaat aataaagacc gcttcaaaaa gcctgtagta 1140 

aagcaggaag taagcgaaga agatgtagca aagcaggtta aagaaacgtt ggctcgtctg 12 00 

acaagcaaag gtaagaacaa aggtgccaaa tatcgtaaag aaaaacgtga catggcgtcc 12 60 

aaccgtatgc aggaactgga agatcaggaa atggcagaaa gcaaggtact gaaactgaca 132 0 

gaatttgtga ctgctaatga attggcaagc atgatgaacg tatctgtaaa tcaggttatc 1380 

ggaacttgta tgagcattgg tatgatggtt tctatcaatc agcgtctgga tgcagaaacg 1440 

attaatcttg tggctgaaga atttggattt aagactgaat atgtcagtgc agaagtagcc 1500 

caagccattg ttgaagagga agatgcgccg gaagatctgg aacatcgtgc tccgattgtt 1560 

acagtcatgg gacacgtaga ccatggtaaa acttcgttgc ttgactacat tcgtaaagca 162 0 

aatgtaattg caggtgaagc cggaggtatc acacagcata ttggtgcata tcatgttaca 1680 

ttggaagatg gacgtaagat tacgttcctc gatactccgg gacatgaggc atttactgca 1740 

atgcgtgccc gtggtgcaaa ggtgaccgat atcgcaatta ttattgtagc tgccgatgat 1800 

gatgtaatgc cccagaccaa agaagccatt aatcatgcag cagcagcagg tgttcctatc 1860 

gtatttgcta ttaataaaat agataagcct catgctaatc ccgagaaaat taaagagaca 192 0 

ttggctcaga tgaattatct cgtagaagag tggggtggca aatatcagtc acaggatatc 1980 

tcggctaaga agggtctcgg agttcctgaa ctgatggaga aagtacttct tgaagcagaa 2 040 

atgctcgact taaaggcaaa tccgaatcgt aatgctacgg gttctatcat cgaatcaact 2100 

ttggataagg gacgtggata tgttgcgact gtattggtct ctaacggtac gctgaaggtg 2160 

ggggatattg tacttgccgg aacaagctac ggccgtgtaa aagccatgtt caatgaacgt 222 0 

aaccagcgtg tagcccaggc agggccatcg gaaccggtat tgattctggg tttgaatggt 22 80 

gctcctgctg caggtgatac tttccacgtg attgagactg atcaggaagc ccgtgagatt 2340 

gccaataaac gtgaacagtt acagcgtgaa caggggctgc gtactcagaa actgttaaca 2 400 

ctggatgaag tgggacgtcg tattgcgctg ggtaacttcc aggaactgaa cgtaattgtg 2 460 

aaaggtgacg tggatggctc tatcgaggcc ttgagtgatt cgttaatcaa gctgtctacc 2 52 0 

gaacagatcc aggtaaatgt gatccataag gctgtaggtc agatttcgga atcggatgtg 2580 

acattagcag ctgcttcgga tgccattatt attggattcc aggtacgtcc atcggcttcc 2 640 

gcacgtaagt ttgccgaaca ggaaggtgtg gacatacgtt tgtactctgt tatctatgca 27 00 

gctatcgaag aggtgaaggc tgctatggaa ggtatgcttg ctccggaagt gaaagaggta 2 7 60 

gtaactgcta ctatcgaagt gcgtgaggta ttccacatta ctaaggtggg tacagtagcc 2 82 0 
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ggtgctgttg tgaaagaggg caaggtgaaa cgttcggata aggctcgcct gatccgtgat 

ggtatagtaa tcttctcagg ttccatcaat gctttgaagc gctttaaaga tgacgtgaag 

gaagtaggta caaacttcga atgtggtatc agtcttgtta actacaacga tttgaaggta 

ggtgatatga ttgaaactta cgaagaagta gaagtgaagc aaactttata a 

<210> 67 
<211> 1251 
<212> DNA 
<213> B. fragilis 



2880 
2940 
3000 
3051 



<400> 67 

gtgccaggga 

gaggactttc 

ggtgcgacta 

gtcaatgcga 

gaggcttcac 

ttcactcgcg 

atgcaagagg 

tggcagttgt 

ggtgagctgt 

gtcgcccaag 

gcacatgctc 

gtggatgtac 

cctaccggaa 

cagggtggag 

ccatttaaat 

ctggactatg 

gtctatgcta 

cataaaagca 

ttgcttgatc 

cgtcgtttag 

gaagttgatg 



tgttctattt 
ccatacttag 
ctcagaagcc 
atgtacatcg 
gtgagactgt 
ggacaacgga 
gtgatgaggt 
tggcggcccg 
tacttgaaga 
tgtccaatgt 
atggagtacc 
aggatttgga 
taggcgtctt 
gagaaatgat 
tcgaggctgg 
tcacaggtat 
tgcaacggct 
gtgtaatttc 
gattgggcat 
gcattgaagg 
ctcttgtagc 



gcaaataatc 
ccgtacggta 
tcgcctggtg 
gggagttcat 
acgtcagttc 
gagcattaat 
gattgtttca 
taaagggatt 
gtacgaaaat 
actgggaaca 
tgttatgatt 
tgctgacttc 
atatggaaag 
tcagtctgtc 
aaccccagac 
tggcttagac 
gaaagaaatt 
attcttggta 
tgctgtccgt 
tacggttcgc 
tggtatcgaa 



agtgataaaa 
tatggcaagc 
atcgattcga 
ttcttgtctc 
attaatgccc 
ttaattgtct 
gtgatggaac 
gcgattaaag 
cttttttctg 
atcaatccgg 
gacggtgctc 
tttgttttct 
gaagattggc 
tcatttgaaa 
tatattgcca 
ccaatagcat 
ccgaacatgc 
ggcgatatac 
acgggacacc 
gcctcatttg 
cgggtcagta 



tgaatattca 
ctttggttta 
ttgtagacga 
agcaggcgac 
gtagtactcg 
ccagttttgg 
accacagtaa 
tcattccgat 
aacgtactaa 
tgaaagagat 
aatccattcc 
caggtcataa 
tcgaacgtct 
agactgtctt 
ctaccgggct 
tacatgaaca 
gtatttttgg 
accatttgga 
attgtgcaga 
ccgtgtataa 
agatgttctg 



taagatacgt 
tttagataat 
atattattcg 
ggagctgcat 
tgaggtcatt 
tgaagagttt 
tatcgtacct 
gaatgataaa 
gattgtcagc 
gattgccacg 
tcacatgaag 
aatatatggg 
tccgccttat 
cggtgaatta 
tgccaaagcg 
tgagcttaca 
tgaggctgag 
tctgggtacc 
accgttaatg 
tacaaaagag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1251 



<210> 68 
<211> 204 
<212> DNA 
<213> B. fragilis 

<400> 68 

gcccgtttcg tgtgtacatt tattcttcgc tacctgaatc cggggatgct tctcaagatt 
cttgcgattg cgggtggtgc ttttacgtta ggtgtgattt ttttgcaaga catatgggga 
ttgtattgtt tagtagctgt ttcggcttgt atgtcactaa tgtttcccac gatttatggc 
cattgctctt cgtggtttgg gtga 

<210> 69 
<211> 2088 
<212> DNA 
<213> B . fragilis 



60 
120 
180 
204 



<400> 69 

tatcaaaaat 

aaacgacacc 

atttatgccc 

atgaacgctg 

acgcaagtac 

atccacgaaa 

ggactggaag 

agagagagag 

gagaagtatg 



ctaattgtat 
attcacaaag 
aacaaggttt 
aactcccggt 
cggatatata 
aaggtacggt 
cgagaaaaca 
tattcctcaa 
aacaatatgc 



gaacgaaaga 
gagaacaccg 
atctcctgta 
tatacttccg 
cacccctgaa 
atgtaatatc 
ggagatccga 
tagcatgtac 
actgctcaat 



attaactatc 
tcaagtatcg 
gaacgtacta 
ggtgaaaaga 
gaatggaacg 
tcccccaatt 
aagcgtcagg 
caatgcatca 
aacgagacag 



taaaaacgta 
ggctggacaa 
cagcctgttt 
tcgtttttac 
aaataaaaaa 
acgcttatac 
aaaatccctc 
tatccctaca 
aaattgcgca 



tattttagat 
attgaacaca 
cgcagctttg 
ccgtaccctt 
caaatattac 
cattcaacac 
cttaaatgaa 
gaaactaatt 
cactttacat 



60 

120 

180 

240 

300 

360 

420 

480 

540 
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accataaaga 
catttctcca 
atgtatcctt 
gat ctgcttg 
caacagggtg 
ttattcaacg 
cctaaaatca 
cgtctgacca 
ggacttatac 
tgggaattta 
atcggatgtg 
tatacattgg 
aat ctctaca 
gccaaagata 
gcaaccgcta 
gat tatacaa 
aaaaaattaa 
gccaaagatc 
ggggtttacc 
cgtgccaccc 
ctgtttttaa 
gaccgtgttg 
gacgaaacca 
caactacaac 
caacacaaaa 
gaatgttacc 



ccgaaggcgc 
tttgggaagc 
tctatcagag 
aagaattttt 
acaatgggca 
acctgtcccg 
acattcgtgt 
aaatagggct 
ggaaaggcta 
tcatccctaa 
tagaccggtg 
tggagcagga 
tcatcccttc 
tttcggaagg 
ccgataccct 
ccctgcttac 
gagaagaagc 
ttctcgactc 
gggcaggtac 
ccgacggacg 
aacaaaaagg 
tcaacggtgg 
ttgaaaagct 
taaacacagt 
atttgattgt 
aaaatcacgt 



tcaaaacttc 
aggcaactat 
ggacctggaa 
ccttgtatgc 
gagcctggtt 
catgtgcctg 
agccccaaag 
gggtttcccc 
ttccaaagaa 
ccgagccatg 
ccttgaaaaa 
aatacaaaag 
tccaatgatg 
ttcctactac 
tgctgccttg 
tgccataaga 
gcccaaaatg 
ttttgatcgg 
tggcaccgct 
taatgatggc 
tcctatatcc 
tcccctcacc 
gggtatgctc 
cagccgggag 
cagagtatgg 
tatcaatcgg 



agacaagctc 
cacaataccc 
aatggaacac 
aataaggaca 
ttgggaggac 
caggcaagtt 
acccctgacg 
caatacagca 
gacgcataca 
gatataccta 
ctaaatacct 
gaggtcaatg 
tctttactga 
aacaactacg 
aaaaaatact 
agcaacttca 
ggacaggata 
tcattggccg 
atgtactaca 
gagatgattc 
gtcataaaat 
ctggagttcg 
gtgaaaacct 
acactgctac 
ggatggagcg 
atcgaattcg 



tacagctttt 
tcggccgctt 
tcaccaaaga 
gcgacctgta 
gggatccgga 
acgaattaaa 
aaatattcac 
atgatgatat 
attacgtagt 
acattgatgc 
gttcgaacta 
cgatctgcga 
tggacggcac 
gaatccacgg 
atttcgaaga 
aaggctacga 
atgactatgc 
ataaacgaaa 
ttttccattc 
ctgctaatta 
cttttaccaa 
atcaatccgt 
acatcgtttt 
atgcccggaa 
gatattttgt 
gtctataa 



gagaattctc 
cgatcaatat 
agaagcattc 
tccgggtatg 
gggcaaatac 
actaattgat 
tttaggttcc 
catcattccg 
ggctgcctgc 
cgtttcttta 
ttcatccttt 
gaaacaccgc 
catcgaaaga 
aacaggcatt 
gcaaagcctg 
agagttacaa 
cgacttgata 
tgaacgcgga 
caatcaatta 
ctcccccagc 
acaacatctg 
attcagcaat 
aggcggccat 
acatcccgaa 
ggaattagac 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2088 



<210> 70 
<211> 2085 
<212> DNA 
<213> B. fragilis 



<400> 70 

aatcaatgta 

atgcaaatgg 

ctaaacgaag 

ataaccgcta 

accgacacgc 

aaacagcacg 

gcctactttc 

gaccgcatct 

cttatggaaa 

gtatacaatg 

acacgtttgc 

gaaccacgca 

tactggttgt 

tcagattatt 

ccattatggg 

caccaagctg 

atccctgtac 

accggtctaa 

cgtctttggg 

cacgaagacg 

tgcgcagctg 

cgctatatgg 

tccggtgata 

tggcatgtat 

tatatctatg 

cgaattccgg 



caatgaaaac 
ggtgtcagtc 
tccggataga 
atgatgtatt 
gtaatgcttt 
atggtcccga 
ttgcaagcca 
atgccgctca 
acaaccatag 
ccgggatgct 
ttgaaattgc 
agaatatcgt 
ataaaaacga 
ataatctggc 
gaacgtgggg 
aattcggcac 
tcgaacaaaa 
ccgcggccgc 
agaatatggc 
aaaagtttgg 
tcggagccgg 
acgaagtgga 
aatacaccta 
gtccatgttg 
cctatcaggg 
tcggtgacaa 



cacaaaacat 
tcacacagac 
agatgctttc 
aaacaaattc 
ccggaatttc 
atggtatgac 
ccccaataaa 
gcaaacagag 
atggggagac 
gatcgaagcc 
aacccgcttt 
tcccgcacat 
accggaactg 
cactttttgg 
ataccggaaa 
acactctcgt 
aacaattgaa 
acttgagaac 
aggcaaacgg 
tccggactac 
ttttttcagc 
aagagtgctt 
tcaaaatccc 
cccaccgatg 
agataatgtc 
cagcgtccga 



ttatctgtcg 
aatacccggc 
tggagtccga 
gaagggaaat 
gatcgtgtag 
ggactcgttt 
gaactggaaa 
ccgaccggat 
aatgggggac 
ggagtacatt 
gccaattaca 
tccggccccg 
aaagataaac 
atagaaaaca 
tcggaaaaat 
cccagttggg 
gggcatgctg 
caatcacccc 
atgttcatca 
tttctgccca 
caacgcatga 
tacaacaatg 
ctgaatactg 
ttcctgaaaa 
tatgtcaatt 
ttgaaacaat 



cagcagtact 
aaacactcca 
aactcgatat 
acactccttt 
ctgaagggca 
atgaaagtat 
aacgaatcga 
acatcaacac 
tccttcgtgg 
attatcaggc 
tggcagatta 
aggaagccgt 
tttccatacc 
gagggcatca 
ggattaaaga 
gagagtattc 
tgcgagctac 
aatacatcga 
caggtggagt 
ccgatgcata 
accaattaac 
tactgactgg 
ataagcccga 
tcatggctgc 
tattcatagg 
taacctctta 



aaccgtactg 
cttgcccgag 
ctggaggaaa 
tcccggatcg 
gagagatatc 
ccggggtatc 
cggatatgta 
ccatacccaa 
acaacacgat 
aaccggcaaa 
tatgggtccg 
aatggcatta 
ggtccgggaa 
ctgtggcttt 
cgcttgttat 
acaagattcg 
cttgatggca 
aacagctaaa 
cggcgctatt 
tctggaaacc 
ttgcaatgcc 
cgtttctttg 
caggtgggaa 
catgcccggt 
aagtgaagtg 
tccctggcac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 
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ggcgctgttt ccattcaagt caatcccgac aaggcaagca ccttctctat gaaagtccgt 162 0 

attcccggat gggcacaagg tacagaaaat ccatacgacc tttaccaatc gaatctaaaa 1680 

gcaccggtca aattaaaagt taatcaagag gatgtacttt tgaggatcgt agacggatat 1740 

gcggaaatca accgggaatg gaaaaaaggc gatcacattg agcttgaact acccatgcaa 1800 

cctcgcctga tcactgcaaa taaagcagtc gaaaacttac ggggacaagt cgcattggcg 1860 

tcagggccta tcatttactg ttttgaggat gccgataatc cggaactgca gacattcaaa 1920 

cttcaggcac aaacaccttt ggaactctcc catgacagta atctgctcaa tggagtcaat 19 80 

atcatcaaat gtcagggtga tattccggca aaagccatcc catattatgc tgtggccaac 2 040 

agggaagaga gccatagcta taaagtatgg attcctcaga aataa 2 0 85 

<210> 71 
<211> 783 
<212> DNA 
<213> B.fragilis 

<400> 71 

aaaaacattg cagcattcgg catatctctg cgtctaaaag acataaaaga cttaactatg 6 0 

caaaaattca gattgacaat gctatttatc atttgcggca acggatttgc ttatgcgcaa 12 0 

acattcaacg aaacgcccat acctgccttt acactgcaca aagaaatgaa aacccctcaa 180 

atcttcaaac tacccgaaat aaagaatact ttgtcagaaa ccaaccctgc tttcaataac 240 

agtatgccgt tagtcaaaca atacgaactg agaaaaaaat tctcctatct ggatcccgtc 3 00 

ttcaccggtt attttaatca gcaacagtac cgattgttca attcccgcta tttcggatac 360 

gaattatacg gttccagtta ttcactccgg ggagtaggta cacagaatat ggcaggtggc 42 0 

agattggtat atcgtcttaa cagacaactg gctatccgga taggtggcaa tgcctatcaa 480 

taccgctcta acggacggat gtttaatgat tttaccctta acgcagacct tacctatcgt 540 

ctgaataatt ggctgaccgc ttatatttac ggacaatacc ggctggactg taatcccaac 600 

tccggtgtac aaggtttccc gttatctcca caatcccatt acggcgcttc attccggata 660 

aacctcctgg aaaggaaaga atatggtctc gacctgaatc tgggtaccga cagaagttac 72 0 

aatgctgcta cccggcaatg ggaaaatact tataagatag gcccaaccat acgattaaaa 780 

taa 783 

<210> 72 
<211> 792 
<212> DNA 
<213> B.fragilis 

<400> 72 

aatattatgt atatatttgc cattgttaat ccaaacacaa tgaaaacagg aacaatattc 60 

agtgtcgagg aatttgccat ccatgacgga ccgggaatcc gaacaacgat atttctcaag 12 0 

ggatgtcctc tacgctgtgc atggtgccat aatcccgaag gtatatcgcc acagccgcaa 180 

tacatgatta aaaaaggagt taaaagtatt tgtggatatc agataactgt ggaagaattg 240 

gttaccatga tcgaaaagaa ccggtccatt tatacgctca accggggagg agttacacta 3 00 

accggcggag aacccttatt tcaaccggat tttgttatcg aactgctccg acaacttccg 3 60 

gacatacata cggctatcga aacaagcgga tacgcaaaca ctcacatttt caatgaggtt 42 0 

acttctttag ctgatcttat tttattcgac atcaaacata cggacccgga aatgcaccgg 480 

aaatatacag gagtggataa tacgattata ctggaaaatc ttgctttact ctgtaattcc 540 

ggacgagatt ttatcattcg gataccttta atcccgggtg ttaatgatac ccgggaaaac 600 

atgagtgcca ttcttgaaaa aatcaaagat gccaggaacc tgatacgtgt cgaaatcctt 660 

agatatcacc gtacagcagg tgccaaatac gcaatgatcg gagaaacgta tcatcctccg 72 0 

ttcgataccg gaaaggcgcc acaaatctat aatgtatttg aagaaaataa tatcaaaaat 780 

ctaattgtat ga - 792 

<210> 73 
<211> 231 
<212> DNA 
<213> B. fragilis 



<400> 73 

gcatatattc atataatatt aagaatatta atagctcaaa ttacctttat aggtaatgtt 



60 



36 



tatgggagta aagacgatta ttcgacattt acaaataatg gatgtctcag atcactgtta 120 
aggcgtaatg gaccgaataa aatggcaaaa cggtctgtct attggctttt accgttttta 180 
aattcattag gtgtcactcc acaaaaacgc ttgaatgtct tactgaaata g 231 

<210> 74 
<211> 708 
<212> DNA 
<213> B.fragilis 



<400> 74 

actgaaaaac 

tgttgtgtgg 

aatatgaact 

ttctggggac 

agtgatttca 

gtgatgggac 

gtgattgttc 

gaatggattg 

atacgggaaa 

gtggacggta 

attgatgatg 

gttgaaggag 



atcatatgaa 
tttgtggagc 
taccccgtac 
gtattccggt 
gacgtatctt 
gctatatggc 
ctgtaccttt 
cccgtggtat 
agaatacaga 
ttttcaaatt 
tgctgactac 
tacggatcag 



tacttggttc 
tccgctgtcg 
cggatttcat 
tcttgaaaga 
gcatctgctg 
ggcagaatta 
gcataagaaa 
atcttctgtg 
gactcaaacc 
gtgtgatgta 
cggatctaca 
tgtgttgaca 



gactcttttt 
aaagaagagg 
ctgcgaaagg 
gcttcttcct 
aaatatagcg 
atttcttgtg 
aaacagaagc 
acggggattc 
cgtaaatcga 
gcctgtttcc 
actgtggcct 
ttggccgtgg 



ggtctctttt 
agtgcttgtg 
ataatccggt 
ttctctttta 
gatataaaga 
gtttctttga 
tgagaggata 
cgctgaatgc 
cttttgaacg 
aagggaaaca 
gtgcatctac 
ccgaataa 



gttcccacgt 
tattcgttgt 
tgagtgtctt 
tcgtaaaggt 
actgggcgaa 
tcatgtggat 
caatcagagt 
gaagagcgtg 
ctcggaaaat 
tgtactgatt 
gctttttgaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

708 



<210> 75 
<211> 267 
<212> DNA 
<213> B.fragilis 

<400> 75 

attcttttta aacgtatgaa aattacaagt tatgtttctt ttttatattg cttatgtctg 60 

atgctggcga gtccgacagt acaggcgagc gaagtgcgaa ccgctatttt tgaggggaaa 12 0 

ccgtgtatta atccccctca tgtggtggga aactatcctg ctaccccttt tttgttttat 180 

attcccacct ccggagaacg accgataaag tggcatgctg aaaatcttcc caagggattg 240 

aagctggata aaagaaacgg gggataa 2 67 

<210> 76 
<211> 4107 
<212> DNA 
<213> B.fragilis 



<400> 76 

atatcagttt 

tacgtaatta 

gttatcctat 

cttacctata 

accctcaaag 

gacggactct 

atgggtacca 

gttccacaaa 

ttcattgcgt 

ctccataaag 

agtccaaata 

actgaagttt 

actctatggc 

ttcgtatccg 

caagaagata 

tatacagata 

accgataatg 

ttcttcggag 



gtgaccataa 
agagacgaat 
atgcacaaca 
gtgccgtacg 
gactgaacag 
cttcaaactg 
atgaaggact 
ccaaggcccc 
ccgattccgg 
gattgattgt 
caatttattg 
ctcctgatta 
tgggaacgac 
tggagttcgc 
tgcgtggaaa 
acagctacat 
cgatctatac 
gggtcagtta 



gtatcttttt 
gaagaagagc 
gaacgaactt 
ggatattctg 
gtatgatgga 
catcgagaaa 
gtgcttgtat 
actatatgta 
gctatacgta 
aaaagtaacg 
tttccgtcca 
tcctgtcgag 
cgaaaacgga 
ttcacaagac 
tttatggatc 
acaatatcgg 
tatttataaa 
caccagcctg 



gattcgtttt 
acttttaccc 
atgttccact 
caggattcaa 
tacaacatca 
ctcctactcc 
gatatgatga 
ttggacatgg 
tataacaaga 
ctggatataa 
aacggacaaa 
tttacttcta 
ctgtaccgat 
agaaaggata 
ggaaccgaaa 
cagcatgcaa 
agccggggtg 
accgaaaata 



tatttcagac 
tgattttatt 
cc ctgggtag 
aaggatatat 
aacaatatta 
tcggtcagga 
gagagaagtt 
cctacgacgg 
cggagcaaag 
atggaaatgt 
tgaccagaaa 
tttataaaga 
acaacaaaaa 
tgcgttatat 
acggactttt 
aagatgtcca 
acattatgtg 
atttccacta 



ttttgtgttg 
tttttcttct 
tcaacacgga 
ttggattgca 
taagtcagat 
tactctattg 
caccactatt 
acgttcagtt 
tatgccacta 
atgggcagtg 
aatcacagcc 
ttctcagggg 
ctataatcaa 
tcgctgcatc 
catctatgac 
atccggactg 
gatcggcact 
cctgatagca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 



37 

gataatggga aacaatacct gaaaggaaaa gcaatcagca acatcattaa agataaaaac 1140 

ggagctttgt ggtttgcatc cgaagatcat ggaatttcta ttctttaccc ggatggtcac 1200 

atcaggtatt taaacaaatc gacacacccg tcattaaacg gagataatgt ccatgcatta 1260 

gccgaagatc actccggtaa tatctggatt ggcaacttta tcgatgggtt gcaaaaagtc 1320 

gatttagcaa aaggctatat tcgttcatat aaaaacatag cagggggaca cgcgggacta 13 80 

tccaataatt caatctataa actttatgtt cataatcctg atactatgtt tataggaacc 1440 

agccaaggtg tcaatatcta tcatttccgg accgattcat tcactccctt ccttccggat 1500 

gtattccggc ttatacgtat tgacgacatc acacgtgatc tcaaaggaaa tatctggttt 1560 

tccacacatt tcaacggtat ttttcggtat catattccga cccatagtat ccatcgatat 1620 

caaaaaggag tgacaggctg taaaacaatg accagcgata atatttattg tagttttgtt 1680 

gactccaaag gagaggtctg gttcggtacc agcaacggag gattgatgaa atacaatgca 1740 

cgtgcagaca gtatacaggc attcggaaag gagaatgaac tccggcaaag agatatttat 1800 

tccatacagg aagatagctt tggctattta tggatgagta cggataacgg tatcttctct 1860 

ttcaatccgg aaagtcgtag ttttgcccat tataaagtat ccgataatct ggtttccaac 1920 

cagttcaatg cctgtccggg ctataaagat cctgacggta cccttttctt tggcagtatc 1980 

aatggggtat gcttcttccg accggaagga ctgaaccata acagtcctac aaacgatatt 2 040 

catctgactt tctcggattt caggattttc aataaacacg tacaaccatc accggacggc 2100 

attctacaga ataatatcga cagcacatct gccattcgct tacctcacgg catgaatacc 2160 

ctgacttttg attttctggt gatcaactat aatgaaaatt gccaatcaca actttcctgt 2220 

gaatactatc tggaaggtat ggagaccgaa tggaatgcaa cacaacaaat cccacaatca 22 80 

gtcacatata ccaaccttga tccgggcacc taccaatttc acgtacgggt cataggaaaa 2340 

aacggagttg tattcgaccg tcggaaaata accattaaca ttcgtcctca ctttttgctg 2400 

agtggtttca tgatcactat ttattctctt atcggacttc ttatcagttt tataattgtc 2460 

cgcttctacc aagtgcgtat gcgagacaaa atggatatcc gcatcgaaag aatggaaaaa 2 52 0 

aacaacctgc gcgaactgaa taaacacaag ctgaattttt tcacttatat cacccatgag 2580 

tttaagactc ctttatctat ccttatggct gtattcgaag atatatcaat cggacgaaac 2640 

aatacaatta ccggtgaaga aatgaaaatc atcaatcgga atatccaacg gcttcaattc 2700 

ctgatcaatc aacttttgga atttcgttct gtagaaaccg accatgcacg catcgaatat 2760 

gtcaaaggag atattatgac ttatggacgc agcatcttcg aactgtttat tcccgtcttc 2 82 0 

agacaaaagc agattgtttt tcaatatgca acttcagccg actcttatta tacggtattc 2880 

gatagagaca agatagagaa aatcatcagt aatctgctca gcaatgcttt caaacattct 2940 

gatcctcaaa gtgaaataaa cttcaggatt gatgtagaca aagcttccgg acaattgatt 3000 

ctctcctgtc ataacagcag ttcatatatt catccggaac agcgggaagc tgtcatgcag 3 060 

ccttttcaca aaaccgattc atccgatcaa aagtattcca atacgggtat tggactggct 312 0 

ttggtgaatg gtctagttca gctgctttca ggaacagttg agatagaaag tcatcaaaac 318 0 

agcggtacga cctttaaagt aaaattacct ttggtcgaag actctaagga tatgattgca 3240 

ccggacgaaa ctttggatat cgttaactca cccgacgtag tggcagatac tgtatacctg 33 00 

ctcaataact ccggactaaa agaggatatg aatgctgcaa atgccgagaa aaagatgact 33 60 

gtacttttgg tggaagataa tccggatatc aataacattt taaaaagtaa gctactccgt 3 42 0 

ttatataagg tgaaaacggc ttataacgga caggaagctg tagagttgct aaaaacacat 3480 

atcatcgaca ttatcatcag tgacattatg atgccttata tggacggata tgaattgagt 3 540 

aaatatatta aaacttctcg tgaatactcc catatcccgg tcattctcat cacttcacag 3600 

ccttcgaaag aaaacgaatt gcaaggttta tctgcaggag ccgacgccta tatcgaaaaa 3 660 

ccattcactt tcgatgaatt gaatcttaga attaccaact tgcttaaagc caaaaataat 3 72 0 

atccgcgaac attatcacga catgaaaata ttccaactca atgaagaact caacaacaaa 3780 

gacgaggaat ttatcaaatc attgacacaa ttcgtcatcg aacacattga ggacccggaa 3 840 

ttgagtgtcg accaactgac cactcacatg aatatcagtc gaactcaact atacaataaa 3900 

ctgaaaaaac tattaaacct gagtgcaacc gagtttatca ataaaatcaa gatcgatgtt 3 960 

gctaaagtaa agattataaa gactaatctg actattgctg aaatctcatg gcaactcgga 402 0 

ttcaataatc ccagctattt cagtaagaca ttcaagcgtt tttgtggagt gacacctaat 4080 

gaatttaaaa acggtaaaag ccaatag 4107 

<210> 77 
<211> 210 
<212> DNA 
<213> B. fragilis 



<400> 77 

cattggccgt ggccgaataa atgcagtgat aaatatgagg gtgcattgaa gcaccctcgt 



60 



38 



atctatttaa aaagtttcaa ttatcaaatc ggatgccagc atttccttga ttgcctttcc 120 
aaaagcctgg aaatgggggg atgccgcatg tatatccatg gcacgctggt ctttgtactc 180 
ttcgtagaaa atgaaagcta cgggttgtga 210 

<210> 78 
<211> 1149 
<212> DNA 
<213> B.fragilis 



<400> 78 

aagataggag 

tttccactat 

acacaaagtc 

ttagaaacta 

tcgattattg 

tttatgaata 

ttttttactg 

tatggcttca 

ggagctgtag 

gataatcccc 

ggatatatgg 

cgttatgggg 

gctaacagaa 

tctgccaatg 

gagtcagacc 

tatgataaac 

tttgatttta 

ggttatctga 

ctggatgaat 

ttaaaataa 



gtattatgaa 
ttgctcagaa 
ctaagacggt 
ctgataattg 
cgggtgacgc 
agataggccg 
accctgataa 
atggtaaatt 
acggtcaggg 
aacaattgtt 
aacctggaaa 
gcgatattta 
aaaagacatt 
aaatagatcc 
gttatttctt 
agaagaagag 
ctcctcccgg 
gtaaaggtaa 
tgataaaccg 



atcatttaca 
gaatgcagct 
attaatgagt 
tctgctgggg 
acagaccaga 
gcaaggacag 
ccaaaaactg 
tctccgtcgt 
atctattctc 
ccttatcgat 
gaaatatggt 
ctttaaacct 
ggcatggaaa 
cggcaaacgt 
tgtgctttat 
tttttcaaat 
aacagggttg 
acgctattcg 
attggatgaa 



ttttgcattt 
gccgtcacgc 
gaactggcat 
aatgaatgta 
agtttttatc 
gggcctgaag 
tatgttcagg 
ataccggctc 
tattgtgata 
gaaaatggga 
gttaacctat 
gcgttggaga 
tttgattgtt 
tttcaatcaa 
gtcctgaaga 
gtgattataa 
ggaagtcagt 
aaagctttac 
gaggataatc 



tattagccca 
tgaatttggc 
ctgatgtacg 
gtattattta 
ggttcgataa 
aatatgcggt 
atttccagga 
ctcacctgaa 
ataattattt 
agaaactaaa 
ctacccgcga 
atctgattta 
cggggaagga 
ttgcagtaca 
atgaaagctt 
aagatgattt 
tggcaaatgc 
tgccggaaag 
cggtaatggt 



tgtgttggct 
aaaggctgtt 
ttatttccct 
tgccggaaac 
aaatggtaag 
aggattattg 
tataatctgt 
tatgggtacc 
tatgagaaag 
gatctggaaa 
tgtaatgtat 
taaaattgat 
tgtggatgta 
acaggttttt 
tgtgggattg 
ggcggcagga 
ccggatggta 
aaaaaaagaa 
tgttgtaaca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1149 



<210> 79 
<211> 1257 
<212> DNA 
<213> B.fragilis 



<400> 79 

aagaaacggg 

aaagctgaga 

ttgttgctga 

gaggagctac 

tatgcttata 

atccaaattg 

gagcgaggat 

tgtggcagtt 

cttttgaagt 

gagaagatgg 

tggggacagc 

tccggtgata 

attttgaaca 

aatgatccgg 

tcggaaggtt 

tctccattac 

ttgaataagg 

attcgtgccg 

gcttgtctga 

ggattgtcat 

accggatggg 



ggataatcaa 
atgcccttgg 
ctcctccgat 
tgttgcaaac 
ttaatatcga 
ataaaacaaa 
tcaaacttgg 
atggatatga 
atgactattg 
ggagggcatt 
gtgaaccttg 
ttggcgactt 
ttcttgaaat 
atatgcttgt 
gtacgaatga 
tctgtggcaa 
atctgattgc 
atcattatga 
accggatatc 
tggatcgggt 
tcgtaaagct 



gggaaaagta 
aaccgataca 

ggggtggaac 

tgcagatgca 
tgacttttgg 
gttcccgaga 
aatttattcg 
agaaattgat 
caatgcacct 
gagagcgacc 
gaaatgggcg 
gtggaatcgt 
aaatgcaccg 
tgtggggatt 
acagtatcag 
tgatgtacgg 
tatcgatcag 
tgtctgggtg 
cggacccgta 
atatgatgtt 
tgctcccggt 



gtagaaaagg 
caggaactac 
agttggaata 
atggtagaaa 
cagttgcctg 
ggcatcaaat 
gatgccgctg 
gcacgggatt 
gccggaagag 
gaccgttcta 
aagaaagtcg 
tcgacagacg 
cttagtgaat 
ggcggaaaaa 
tcccactttg 
cagatgaatg 
gatccgctgg 
aagccgttga 
gacgtggagt 
atagaaggca 
gagtgtaaag 



gaacgtacaa 
tgattaatat 
cattcggacg 
acggaatgcg 
aaaggggagc 
atgtggccga 
ataaaacttg 
tcgcatcctg 
tagaagcgat 
tcgtcttttc 
gcggacattt 
aaaagggagg 
atgcaagacc 
gtaagagtat 
ctctctggtg 
atagtacgtt 
gcattcaggc 
gtgacggaag 
tgaatgtaaa 
gccttgtggc 
tgtttatatg 



agtaatgctg 
aggagatgag 
tcacctgaca 
tgatttgggg 
cgacggtcat 
ttacttgcat 
tggtggggtt 
gggtgttgat 
ggaaagatat 
aatttgtgag 
atggcgagta 
tttacgtggc 
cggcggatgg 
cggttatgaa 
catgatggct 
acaaatactt 
agagcgtgcc 
caaagcaata 
gacggtagaa 
tgaggcttct 
taaataa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1257 
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<210> 80 
<211> 291 
<212> DNA 
<213> B.fragilis 

<400> 80 

attatcatta tggaaaagaa aacaatcgta gcacgtgtag aagtactacc cggcaaagaa 60 

caagcatttc tacaggcggc tgatgctcta atcaaaggta caagggcaga agaggggaat 12 0 

attagttata atttatatca aaacccgtca caacccgtag ctttcatttt ctacgaagag 180 

tacaaagacc agcgtgccat ggatatacat gcggcatccc cccatttcca ggcttttgga 240 

aaggcaatca aggaaatgct ggcatccgat ttgataattg aaacttttta a 291 

<210> 81 
<211> 3183 
<212> DNA 
<213> B.fragilis 

<220> 

<2 21> unsure 
<222> (2747) 

<223> Identity of nucleotide sequences at the above locations are unknown. 
<400> 81 

attactaata ttatgaatct aaaagatctg aacaacctta gagcagatac ggaaggcagg 60 

ataaaagccg tcttcttaat atgcatgttt gtgctggtgt ctgcaggtgg atttgctcaa 12 0 

aacacaaaga gcatttcggg tacggtgaga gagaaaggca gtaatgaaac tgttattggt 180 

gccactgtac aagtgaaagg aacacacaat ggggtgatta ctaatgagaa tggggagtat 2 40 

acaattaaaa atgtatctcc gggacaagta cttgttttct caatgattgg tatgaatacg 300 

gttgaaaaaa ctgtaggtag ccaaaatcgg atagatgtac tgatggatgc gggagtattg 3 60 

attgacgagg tggtagtgac cggttatcag actcagcgta aagtggactt gactggatct 42 0 

gtatccagtt tgagttccga tcagttcatg caaaccaacc cgttaagtct ggagcaggct 48 0 

ttgaaaggaa aaatatccgg tgtgcaggta atgaataatg atggtgcgcc gggtggtgga 540 

attacgatta agattcgtgg agccagttct attacggcag gtagttcacc tctgtatgtc 600 

atcgatggtt ttcctctccc tatttcggac gatcctctgg aaagtccttt ggctactatc 660 

tctcctgatg caatcgagag tatctctatc ctgaaagacg tatcatcaac tgccatttat 72 0 

ggggcacagg gggctaatgg cgttgtgttg attactacga agaaaggatc ggccggtatg 780 

agtgaaatct ctgtgaaggc tacttacggt atcagtaaac tggcaaattc tattccaatg 840 

ctgggtgcgg aagactatat gcgtgcgtat atgcgtgata tgattatgag cggacgctgg 900 

caaaatgctg atttctatca ggaatataaa gatcagatat ggaataccaa tccttcccgt 960 

ttccaattct atcccgatct ttgtttgcag aatggtacta aacagaatta tgaagtttct 102 0 

tacagagggg gaacagaccg catacagaac tctacaatct tttcgttgat gaatgaagac 1080 

ggtattgcca tcaataccgg atttaaacga ttctatttcc aaacgaataa tagcattaaa 1140 

ttgcttccgc aattgacttt gaacacaaat ctttcatatg aacacaatat tcgtagcggt 1200 

gctttctgga cagaagggaa tatttttaac gaaatacaga ctttctctcc gcttgttcct 1260 

aaagaatgga cttttcagga gatagatgat aacctttact atacaggtaa gatggataat 132 0 

ccttatagaa aattaaaaga tattgattac tctaacaaga ataatacttt cttcggtcag 13 8 0 

gcagagttgg tttacaatat caatgataac tggtttgtaa aaggaggtat tggtgtgcgt 1440 

atacccaaag gtgaagtgaa agaatttatc ccgaaaacca ttcagagagg ctacgataat 1500 

aacggattgg ccacatacgc tacgcaaagt ggattaaata tgcgtggagt agtccaggcc 1560 

ggatttaata aagtgttcaa caaagtacat agtctctcgg taaatgccgt atacgaggct 1620 

aataccaaca agtatgaaac ctttaatcag gaatattctc agtttaatac cgatttggga 1680 

tgggaaggta tttatgatgc aaaaagtggt aatcatgtga aatctcccgg tgtatcttac 1740 

gagaagatag caatgctttc gggggtattg atggctaact actcttataa agggcgttat 1800 

ttattgaaag catctatgcg tgctgacggt tcttctaaat tcagccctga taatcgttgg 1860 

ggatttttcc cgtccggagc attgggttgg agagtttcag aagaagaatt ctttaagaat 192 0 

gtatcttggt tggagaagaa cgttaataac ttgaaactac gttttagcta tggtcaggta 1980 

ggtaacgatc agattgctcc gtatgcttat gcacagacct tatcttccag ccagagacaa 2040 

gccatttttg gtgacggagc tattccggcc ttgttcacca gccgtatggc aaaccccgaa 2100 

atcagttggg aagtaacgga agagtttaac ggcggtctcg atttagatat gtttaataac 2160 



40 



cggctgaata 
aatctgccac 
cgtggatttg 
gcaacagtga 
atgttggaag 
ggatatcctc 
tctgactata 
atgccatacg 
gaccgtgtcg 
cgctggaaat 
aatggtaatg 
tataaagatg 
gattggtccg 
ctgaagatga 
aaaattaagg 
tcgggatatg 
gtcgatatct 
taa 



tttcattgga 
gtacttccgg 
aaatcagtgt 
actttagttc 
ggcgtccggt 
tcggactgtt 
atggtatagg 
gattcccatc 
tgataggaga 
tcattgaact 
tataccatct 
cttggtttgc 
ggtatatgtg 
ataacttggc 
acctggctct 
atcctgaggt 
cggcttatcc 



tctctatacc 
ttttggaaaa 
aggcggtgtg 
caatcagtcc 
gggttctgct 
ctatggtctg 
cagtgctgat 
ttttgcagat 
tgtgaatccg 
ggctatggag 
gatgaccaat 
taataatcct 
ggctgcttcc 
tgtaactttc 
gacatatacg 
acgtagtggc 
atatgcccgg 



aagacaactc 
gtaaccagga 
ctgattgata 
aaagtactga 
tcgggttcgg 
caaatggaag 
tctccttggt 
accaatggag 
gtatttatcg 
ttcagctggt 
ggcgatattc 
accggtactt 
aactctgaaa 
cgtatgccga 
attaacaatg 
agctctgtaa 
tctcatatat 



gtgatatgct 
atatcggttc 
agaaagactt 
gtctaggtgc 
aaaatgtcct 
gcattcgcag 
ggtatgccac 
acgggaaagt 
gtggcttgaa 
catacgnaaa 
gtaataagtc 
tacaaggtcc 
tggtggaaga 
aaaatatatt 
ttttctgttt 
ataatcgtat 
tctcacttaa 



gcttgaacag 
ggtacgtaac 
tacgtggaat 
tgagacacag 
gataaaaaag 
taattggcat 
tgaaagagag 
ggatatgagt 
cacccgtttg 
tgatatatcc 
ggccgtatac 
tggtgcaatt 
tggttcgttc 
aaaggcttgg 
aacgaactat 
tcttccggga 
ttttaaatac 



2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3183 



<210> 82 
<211> 1149 
<212> DNA 
<213> B.fragilis 



<400> 82 

actaaaaata 

atacacacaa 

acaattgacg 

gccatcacca 

tttgaagaca 

agatttttgg 

ctgaacggac 

tttaaaggat 

ctcacgtatt 

agttacaagc 

gagacggtca 

gatatcaatg 

ctaatcccca 

acacctatcg 

gaccataatt 

tatgaacccg 

tatggcggta 

cgagcttccc 

ccgtcaacaa 

gtacaatga 



tgaatcgtat 
acaccgtaaa 
gaaagcaaac 
atttcggagg 
tcgtcctcgg 
gagccactat 
aaacttacca 
ttgatatggt 
tatccaaaga 
ttacggataa 
tcaacctcac 
accatatact 
ccggtattct 
gaaagcgagt 
gggtattgaa 
cctcaggaag 
acttctttga 
tcgcattgga 
ccctcttacc 



gaaaacaaaa 
tgcccaacat 
cgatctctac 
acgagtcgtt 
acacgatcat 
cggacggtat 
gttacccatc 
agtatgggat 
tggtgaagag 
aaacgaattt 
ccatcattcc 
catgatcaat 
ccaggatgta 
aaatgattcg 
tcgcaaaacc 
atatcttgaa 
tggtacaatg 
aacccagcac 
cggagatact 



ctgatcattt 
tcacaattga 
tttctgagaa 
gaattctgga 
gtggacaaat 
ggcaaccgga 
aatgatacac 
gttgaacagc 
gggtatccgg 
attatcactc 
ttcttcaacc 
gcggataagt 
gaagggaccc 
ttcgagcaac 
tccaacaccc 
gtatggacca 
accggaaagc 
tatccagata 
tacaaacata 



tattaatact 
aaagagccga 
acaaaaacgg 
ctccggataa 
atctccatta 
tcaacaaagg 
ccaacagtct 
cggacagcca 
gaaatctcca 
accaggctca 
tgcacggtgc 
ttactcctgt 
cgatggattt 
tggagttcgg 
ccgaactggc 
ccgaacccgg 
acgaaaagaa 
gtcctaacca 
tatgcatcta 



cacaaccatg 
ttttcaacaa 
cattgaaatt 
aaaagggcat 
taaaggtgaa 
aaagtttacc 
acatggtggc 
gactttacaa 
agtatccatg 
aacagacaaa 
aggcaataag 
cgatcagatc 
tcgtcgccct 
tcacggctac 
agcaaccgtt 
actacaattc 
atacaactac 
accggcattc 
taaaatcaat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1149 



<210> 83 
<211> 816 
<212> DNA 
<213> B.fragilis 



<400> 83 

aaatcgaata 

gaagccggga 

aagcatgcgc 

caactgtctg 

aagaatgagc 

gacaatgctc 

gtggtgtatg 



tggaactgga 
attttttgag 
atgactatgt 
cgttgctccc 
cttattgctg 
cttattgtgt 
aagtttgcag 



tcttcaacaa 
aaaagaacgg 
gtcgtatgtt 
cgaagccggg 
ggttattgat 
cagtatcgca 
ggatgaatgt 



cttactactg 
aggagcttta 
gataaagaat 
tttattgcgg 
ccgttggacg 
ttgaggagct 
ttttacgcct 



aggtatgccg 
gtcgggagcg 
ctgaacgctt 
aagaaggttc 
ggacgactaa 
gtacagaatt 
ggaaaggtgg 



gatagcaact 
tgtggtggaa 
gttagtggca 
tgccgtttat 
ctacattcat 
acttttagga 
gaaggcttgg 



60 

120 

180 

240 

300 

360 

420 



41 



atgaacggag 
actgaacttc 
ttgtatggag 
gtggcggccg 
get gcggcac 
tattttattg 
eggctgetga 



atgaactgea 
cttataacca 
tggtaggagg 
gacgttttga 
tgattgtgtt 
aaggacatca 
aagagatgee 



tgtctcgaaa 
teggcaatae 
aattcgtatg 
tgcctgggcg 
ggaagcegge 
tatcattgeg 
tccactggaa 



atagaaaaca 
aaacggactg 
aatggctcgg 
gaagctttta 
ggaaaagtaa 
aegaatggee 
atgtaa 



tagaagaggc 
eggaatattt 
ctgcatcggc 
tegggaaatg 
ctgatttctt 
ctttacatcc 



gtttgtaatc 
actgaaacaa 
tctttgttat 
ggattactcg 
tggaagtgag 
tgtctttcaa 



480 
540 
600 
660 
720 
780 
816 



<210> 84 
<211> 288 
<212> DNA 
<213> B.fragilis 



<400> 84 

ggtactgeca tgaaaaaaat attattagee cttttaacct cttgcgcctt agtgtcgtgt 

gaggggtatt tcgaccagtt accgaaaaca gaacttccgt ctgaaacttt ctatacttcc 

tatgatgetg etttaegtaa tgtagctata ttgtatgcta atgeagggea tgtcaatgat 

ggaattatga ccactgaccg gtttatgatg ccttcattga tgaatgaagg tccgttcgac 
ctgacttcga categgtett caccacgggg ctgcaaggtt gcacatag 



60 

120 

180 

240 

288 



<210> 85 
<211> 1332 
<212> DNA 
<213> B.fragilis 



<400> 85 

tacatctgea 

tcattgggag 

atccaggtgg 

ateggtacaa 

aaagcactga 

atcgattggt 

teegtagteg 

gtegcattet 

tggattcacg 

ategcatteg 

gategggaag 

gaaattcatg 

cagcataaat 

tcaggtatca 

accgactcag 

ateggaatga 

ggtatgacct 

tactatatgc 

gtcatttggg 

ttaggcagca 

ateegtacag 

ttctttgetc 

ttgaccaatt 



ttatgaaaaa 
gattactttt 
tatatgatct 
teateggage 
aaattattgc 
attccttttt 
gtcccatgta 
tccagtttaa 
geattgegea 
ctcttctatt 
ccgaagcccg 
aaatcaaaga 
accggaaacc 
atgecattet 
ecatgatgea 
tcctgataga 
tctctttggc 
ttatctgect 
tattaatctc 
tgacacattg 
gaggtacctt 
tcaggctacc 
aa 



tacagcaaag 
tggattcgat 
atccgacttc 
tttcgtctgt 
ttttctctac 
attcttccgt 
cattgecgaa 
tattgtactg 
tgattggcaa 
atacacegta 
acaegtcata 
atccctggta 
gatcctctat 
etattatget 
gtctattgtt 
tcaggtagga 
ettagtagee 
gatggggttt 
cgaagtcttc 
ggtgtggtcc 
cattttcagt 
cgaaacaaag 



aatttcatgt 
accgcagtga 
agecatgget 
agtaaacegg 
tttgtttctg 
tttgeeggag 
atatccccct 
ggtattgtac 
tggatgttag 
cctgaaagcc 
aagaaggtca 
acaataggag 
gctttcctta 
ccgaggattt 
ateggactga 
cgaaaaaaac 
aaaggtttct 
ategctttet 
ccaaacaatg 
gccctccttt 
ttctttgeca 
aacaagtctc 



tttacgttgc 
tatceggage 
tcaccattgc 
tagaaaaaca 
cagtgggcag 
gtttagctgt 
cgcgttggcg 
tggcttactt 
gagttgaagc 
ctcgttggct 
geaatgeaaa 
ccagtggcga 
tagctacttt 
ttgagatgtc 
ccaaccttac 
tcctctatat 
accaaggcgc 
ttgecattte 
tgcgctccaa 
catggatgtt 
ttatgatgtt 
tggagcagat 



cttcgtggca 
cgagaaatcc 
tategctttg 
eggaegtctg 
tgctgccatc 
cggagcctca 
cggccgtttt 
ttccaactat 
cattcctgcc 
ggtaaagcaa 
tattgaacag 
aaaactgttt 
caaccagtta 
aggtgtattt 
tttcactatg 
cggttccatc 
attttcaggt 
actgggtgcc 
agggcaagta 
tcccgttttc 
cctaagcttc 
acaaaaggaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1332 



<210> 86 
<211> 198 
<212> DNA 
<213> B. fragilis 



<400> 86 

gecagtacaa tacccagtac aatattaaac tggaagaatg cgacaaaacg gccgcgccaa 

cgcgaggggg atatttegge aatgtacatg ggaccgacta eggatgagge tccgacagct 

aaacctccgg caaaaeggaa gaataaaaag gaataccaat cgatgatggc agcactgccc 



60 

120 

180 



42 



actgcagaaa caaagtag 



198 



<210> 87 
<211> 207 
<212> DNA 
<213> B.fragilis 

<400> 87 

tctcaaagat ctctatcaca aagatactgc atattttcga atgcaacaaa tcaatcgatt 60 

ttttttattg aggactggca ttatttgctg gttctttttt attggaaatc tgaaggtata 120 

tcgatacctt tttttaatcc gatagcaata cccaatcctg ccacacccta ttcggacaaa 180 

caaagaatga aagataatag atcttga 2 07 

<210> 88 
<211> 240 
<212> DNA 
<213> B.fragilis 

<400> 88 

aaaaactttc catactatgg gtgggatgct ttcgctaatg acaaatcaaa acaagatgct 60 

attgttcctc tacctatgat attacccgat tttgactcgc aggaaagatg ctattattat 12 0 

tctgcacaac cggttatatc agatgtttgt gaaatcagta gagattattt caataaagac 180 

ttttctaaaa attataaact tgaatttaaa ttgaagatag taaattattt ttttaattaa 240 

<210> 89 
<211> 489 
<212> DNA 
<213> B.fragilis 



<400> 89 

atcaccatgt 

ctccatttag 

gatgtgtatc 

gcttctcttt 

aaggaagata 

gtctctctgc 

ctggcggcgg 

gagtttgaga 

atagaataa 



tgtctttgca 
gtctggatgg 
atcgttgtga 
gtatcgcttt 
aaatccagtc 
tgaaatgtcg 
aggcacatgc 
ttgtggagca 



atcagaaatc 
cgaacccatc 
gcatcttttc 
gttgaccggc 
ggtcctcaat 
tttgttggta 
aataatagac 
tttaaagagt 



gattctttgt 
tattccgacc 
ggttcacatg 
tataatgcta 
cgtagttggg 
gcttgttacg 
ggttggaagg 
ttggaggaaa 



gcgccgtttc 
gtttccgtca 
gacgtaccct 
ccatttataa 
atctcttgga 
ccgaggtctt 
atcgggaatt 
atccgtatcc 



gcacgaactt 
gttgaatacg 
cgaagaggaa 
tcacggtgat 
tacacttccc 
tgacgaagag 
gacgagggaa 
gaatacggat 



60 

120 

180 

240 

300 

360 

420 

480 

489 



<210> 90 
<211> 630 
<212> DNA 
<213> B.fragilis 



<400> 90 

cgagaccgac 

aagatattca 

ttgctgaagg 

aaactgttcg 

ggt cagacta 

gagaattggg 

tacatgttca 

gatattacgc 

ttcggcttcg 

acagcattcc 

gaagaagagg 



ctttcaacaa 
acctttgggc 
cgttctgcga 
gcgcaggata 
aagaccttgt 
gaacggcaga 
tggctcttgt 
cccgcaaagt 
atttcatgca 
tacaagtttt 
aagcagaatc 



tacgaaccat 
gaagcgcagc 
ctacggtaag 
tgaattgtat 
ggcagataaa 
ggctcgtggc 
ggctcgtact 
ggttgacttg 
ggacaagctt 
cctcaacttt 
tttggattaa 



cataactcca 
cctgagtggg 
ggttcgacaa 
attctggcgt 
gccaagcgga 
ggtcgcaagc 
ggaatagact 
ctcattgaca 
gaagacaatc 
atgcaaccat 



ataaaataat 
aaacgaaata 
gctatcaaga 
tcttcattgg 
aagactttgg 
agtatggtca 
ggattgccct 
aaatggagaa 
ccgattactt 
caacctctga 



ggcagcaaca 
cgaagacacc 
aacccgtgca 
tctgtatcat 
ctgggcgatt 
gattcgcgaa 
tgacaaaggc 
gtacgcaaac 
ctacaaagaa 
aaatgcagaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

630 



<210> 91 



43 



60 



600 
603 



<211> 603 
<212> DNA 
<213> B.fragilis 

<400> 91 

tatttttgca cgttattctc acaagaaata gatcaagaaa tgatagaaga cattaaaaaa 
gcttgtcaag tgatgagcga aggcggggtg attctctatc ccaccgatac ggtttgggga 12 0 
attggctgtg atgctaccaa tgaagacgct gtgcgccggg tgtatgagat aaaacgacgt 180 
gctgacagta aggcgatgct ggtattggta gactcgccgg tgaaagtgga attctatgtg 240 
caggatgttc cttcggtagc ttgggatttg attgaggttg ccgataagcc attaactatc 3 00 
atttattccg gtgcccgcaa cctggcttca aatctgcttg cagaggatgg aagcgtaggc 3 60 
atccgggtga caaacgaggc gttttcccgt cgtttgtgcc agcagtttcg caaagcgatt 420 
gtctcgacat ctgccaatgt cagcggacaa ccgggagcag ccaattttaa tgaaatcagc 480 
gaagaaataa aatcgtcggt ggattacatt gtcaattttc gacaagatga tatgagtcgt 540 
cctaaaccat cgagtatcat taaactggat aaaggtggag tgatcaagat tattcgcgaa 
tga 

<210> 92 
<211> 1923 
<212> DNA 
<213> B. f ragilis 

<400> 92 

agatatcaaa ccgatagagt gacttaccaa gagattttaa agcaatactg gggttatgat 
tccttccgcg acctgcagga ggacatcata accagcattg gcaatggaaa agacacactg 120 
ggactgatgc ccaccggagg cggaaagtca attacgtttc aggttccggc ccttgccaaa 180 
gagggattgt gcattgtcat caccccactg attgctctaa tgaaggatca ggtgcagaac 240 
ctgaaaaagc gcggaatcaa agcgatagcc atctattcag gaatgacacg gcaagagatt 3 00 
gtggtggcat tggagaactg catcttcggc gactataagt ttctatacat ctctcccgaa 3 60 
cggttggata ccgaaatctt ccgggccaaa ctccggtcca tgaaaatcag tatgattacg 42 0 
gtagacgaaa gccattgcat ctcacaatgg ggatatgact ttcgtccggc ttatctgaaa 480 
atagcggata tcagggatct cgtaccggat gctccagtct tggcactgac cgccacggcc 540 

600 
660 



60 



840 
900 
960 



actcccgaag tagtgaagga catacaggag cgcctccgct tccgggaaga aaacgtgttc 
cgtatgagct tcgaacgaaa gaatctggca tacatcgttc gccccactga taataaaaac 

ggggagttgc tgcacatact gaaccggata caaggcagcg cgattgtata tgtacgaagc 72 0 

cggcgaaaaa ccaaagaaac aaccgagctg ctggtaaacg aaggaatcac ggccgacttt 7 80 
tatcatgcgg gactggataa cgcaaccaaa gatcttcgcc aaaaacgatg gcaaaacgga 
gaaagccggg tgatggtagc taccaacgca ttcggtatgg gcattgacaa accggacgta 
cgtatcgtca tccacctgga cctgcccgac tcaccggaag cctactttca ggaagcggga 

cgggcgggac gagacggaca aaaggcatac gcggtgatac tctatgccaa gtcggataaa 102 0 

acaacgctca gcaaacgtat tacagatact ttcccggata aagactatat aaaagatgtg 1080 

tacgagcatc tgcaatatca ttatcagatg gcgatgggcg acgggctggg atgcatgtat 1140 

gacttcagcc tggaagaatt ctgccgcaag ttcaaatatt ttcccgtacc tgcagacagc 12 00 

gcactgaaga tattgacaca ggcaggatac ctggaatata ccgatgagca agacaatgcc 12 60 

tcacggatta tcttcacgat ccgccgggat gagttatata aactccgtga gatgggagaa 132 0 

gccgcagaga aactgataca aatgattctg cgatcttaca cgggtgtctt tacagactat 13 80 

gcctacatca gcgagcagac tctggcggta cgtacgggac tgacccggca acagatttac 1440 

gacttgctgg tgatgctgag caagcgccgt atcgtcgact acatcccgca caaaaaaaca 15 0 0 

ccttacatta tatatacacg tgagcggata gacctccatt atctgcaaat accccgagca 1560 

gtatatgaag aacggaaaga acgctatgaa acccgtatcc atgccatggt ggaatacgtc 162 0 

acttcggaga atgtctgccg tagccggatg ctgctccgct acttcggaga gaagaacgaa 1680 

cataactgcg ggcaatgtga cgtctgcctc agccaccgcg ccgaaccgga tatatcacaa 17 4 0 

agcaccttcg acggactgag agagcaaata tgtgctctgc tgaaagagca tccgatgact 1800 

ccggcggaga tagcttcaca cataaataca gataaagagc agttgagcga agtgatacgg 18 60 

tttatgctgg acgagggctt actgagctct gagaacggac tgcttactga aaaaacttcc 192 0 

tga 1923 



<210> 93 
<211> 1740 



44 



iOO 
660 



<212> DNA 

<213> B. fragilis 

<400> 93 

aaagaaaaaa caatatataa aatgaatcac aaatggaatt atcgacccat cacacaagaa 60 

caggcagaga taagccgggc attggctcag gaactaggca ttagccccgt cctgggacga 12 0 

cttttggtac aaaggggaat tacgaaggca caggatgcca agaaattctt ccgtccgcaa 180 

ttgcccgatt tgcatgatcc attcctaatg aaggatatgg acatcgcagt ggaacgcctg 240 

aacatggcga tgggaaagaa agaacgcatt ctgatttatg gagattacga tgtggacggt 3 00 

accacggctg tggcactggt ctacaagttc attcaacagt tctattcgaa ccttgactat 3 60 

tacatccctg accgttataa cgaaggatac ggaatttcca aaaaaggagt tgactacgcc 42 0 

gctgaaaccg gagtagggct tatcatcgta ctggactgcg gcattaaagc cgtagaagag 480 

attgcgtatg ccaaagagaa gggaattgac tttatcatct gcgaccatca tgtaccggac 540 
gacgtattgc cccctgccgt tgccatcctg aatgccaaaa gactggataa tacataccca 
tacactcatc tttcaggatg tggcgtaggc ttcaaattca tgcaggcttt tgccatcagt 

aacggcattg agtttcatca cctgattccg ttgctcgacc tgaccgccgt aagcattgca 720 

tcggatattg taccgatcat gggcgaaaac cgtatcctgg cctatcatgg gttgaaacag 780 

ctgaacggca atccgagcgt aggactgaaa gcgattatcg atgtatgcgg attatcggaa 840 

aaagaaatta cggtgagcga cattgtattc aaaataggtc cccgcatcaa tgcttccgga 900 

cgtatacaga acggaaaaga agcggtagac ctgttgattg agaaagattt ctcggcagca 960 

ctcgagaaag ccggacaaat caaccaatac aacgaaaccc ggaaggatct ggataagagc 102 0 

atgacggaag aagccaataa aatcgtagcc gaactggaag gcttggcaga ccgtcgttcg 1080 

atagtgcttt acaatgaaga ctggcacaaa ggagtgatcg gaatcgttgc ctcacgatta 1140 

acggagattt actatcgtcc ggcagtcgta ctgacccgga cggatgatat ggcaaccggt 1200 

tcggcacgtt ccgtatccgg tttcgatgtt tacaaagcta tcgaacattg ccgtgacttg 1260 

ctcgaaaact tcggagggca tacctatgct gccgggctat cgatgaaagt ggaaaacgta 132 0 

caggcattca ccgagagatt cgaaagtttc gtgtcggaac atatactgcc ggaacagacc 13 80 

agcgcagtga tcgatatcga tgccgaaata gattttaaag atatcacgcc gaagttcttc 1440 

aatgaattga aacgattcaa cccgttcggt cccgacaacc agaaaccggt gttctgcaca 1500 

catcacgtgt acgattatgg aacaagcaag gtagtcggtc gcgatcagga acacatcaaa 1560 

ctggaactgg tagacaacaa atcgaacaat gtgatgaacg gcatcgcctt cggacaaagt 1620 

tcacacgtga gatatatcaa aaccaagcga tcatttgaca tctgctatac cattgaagag 1680 

aacacccaca aacgggggga agtgcagttg cagattgaag atatcaaacc gatagagtga 1740 

<210> 94 
<211> 1203 
<212> DNA 
<213> B. fragilis 

<400> 94 

actaatactg cgattgttat gaatactaca gaatatttac agacttggtc tgactcttat 60 

aaaaatgaca tgataagcaa tatcatgccc ttttggatga aatatggttg ggatcgcaag 12 0 

aacggaggtg tttatacctg cgtcgaccgt gatggtcagt tgatggatac caccaaatct 180 

gtttggttcc aagggagatt tgcttttaca tgttcatatg catataatca cattgagcgt 240 

aatactgaat ggttggcagc tgcgaaaagc actctcgatt tcatagaagc acattgtttt 3 00 

gatacggatg gacgtatgtt ttttgaagta accgagaccg gattacctat tcgtaaacgt 3 60 

cgttatgtct tttctgaaac atttgctgct attgcaatgt ccgaatatgc cattgcatca 42 0 

ggagatcata gttatgctgt aaaagctttg aaattgttca atgatatccg tcacttcctt 480 

tcgactccgg gaatcctgga gcccaaatat tgtgaacgtg tacagatgaa gggacattct 540 

attattatga ttcttatcaa tgtagcttcc cgcattcgcg ccgctattaa cgatccggtt 600 

ttggatcggc aaatagagga gtctatagcg attctgcgca aagactttat gcatccggag 6 60 

tttaaagctc tgcttgagac tgtaggtccc aatggagagt ttatagatac gaatgccact 72 0 

cgtaccatta atcccggcca ttgtatcgag acctcatggt ttattctgga agaagccaag 780 

aaccgcaatt gggataagga aatggttgat acagcactta cgattctgga ttggtcgtgg 840 

gagtggggct gggacaaaga atacgggggt attataaatt tccgtgattg tcgaaacctg 900 

ccttcacagg attatgccca tgacatgaag ttctggtggc cacagaccga agcgattatc 960 

gcaactctat atgcgtatca agctactaaa aatgaaaaat atctggctat gcataaacag 1020 

atcagtgact ggacttatgc ccattttcct gacgcagagt ttggtgaatg gtatgggtat 1080 

ctccatcgtg acggaacgat ttctcagcct gcgaaaggaa atctgtttaa gggaccattc 1140 



45 



cacattccta gaatgatgac gaaaggctac gcactttgtc aggaattact gtcagaaaaa 1200 
taa 1203 

<210> 95 
<211> 258 
<212> DNA 
<213> B. fragilis 

<400> 95 

ttggttttcg ctacctttgc aaactgtgaa aaaacactta gttttaaagg ttggaactgc 60 

cagtttatga tacaattcta taatgaatat aaccagcaat ttacaaatac gaaacagcct 12 0 

gtttcgtatt tggatgatgt ttctttatac cttcctgtca tgcatttgag ttggtcgcac 180 

aacatcgtat tgatgcaaaa agtaaaagac ctcaaagcac gtaactggta tatgattcaa 240 

agtctgaaaa atggttag 258 

<210> 96 
<211> 1320 
<212> DNA 
<213> B. fragilis 

<400> 96 

atatcagaag ccatgaatac aaaatattgg gaagaagaga tagagaccat gagtcgcaag 60 

aagctacagg aattacaact ccaacggctt aaaaaaacaa taaatatagc agccaatgcc 12 0 

ccttattata agaaagtatt tcaagagcat ggcattactc cggagagtat ccagtctcta 180 

gacgacatcc gtaaattgcc ttttaccaca aaggcggata tgcgggcaaa ttatcctttc 240 
ggacttgttg caggaaatat gaaagaagac ggagtacgca tccactcttc aagcggcaca 
acgggaacac cgacagtcat tgtccattca cagcatgact tagattcatg ggccaatctg 
gttgcgcgat gcttatattg tgtaggtata cgtaatacgg atgtttttca aaacagttca 
ggttatggta tgtttaccgg cggactggga ttccaatacg gagccgaacg actgggagca 

ttgaccgtac ccgctgctgc cggcaacagt aagcgtcaga tcaagtttat caccgacttt 540 

aagacaacag ctttgcatgc gatccccagc tacgccatcc gcctggccga agtttttcag 600 

gaggaaggta tcgatccgcg cagtaccacc cttaaaacgc tcgtaatcgg tgctgaaccg 6 60 

catacagacg aacagcggaa aaagatcgaa cgcatgcttg gcgtgaaagc atacaatagc 72 0 

tttggcatga ctgagatgaa tggtccgggt gttgcatttg aatgtaccga gcagaatggc 780 

atgcattttt gggaagattg ctattatgtg gaaatcatta atcccgagac aggtgaacct 840 

gtacccgaaa gagaaatcgg tgaacttgta ctcactactc ttgatcgtga aatgatgcca 900 

ctgatacgct atcgcacacg tgaccttacc cgcattttac cgggaaactg tccttgtggc 960 

cgtacccata tccggataga ccgtattaaa ggcggtagtg atgatatgtt cattatcaag 1020 

ggagtaaata tattccccat gcaagtagaa aagatattgg tacaattccc cgaactagga 1080 

agcaattatc tcattacgct cgaaactgtg aacaatcaag acgagatgat tgtagaagta 1140 

gaactgagtg atctttctac cgacaattat atcgaactgg aaaagatacg caaagacatt 12 00 

acccgccagc taaaagacga gatacttgtt acgcctaaac tcaagttggt aaaaaaaggc 12 60 

tctttacctc agagcgaagg caaagctgtc agagtaaaag atctgagaaa caataaataa 132 0 

<210> 97 
<211> 840 
<212> DNA 
<213> B . fragilis 

<400> 97 

aaaacaatgg caatagcata tgacgggatc aactatttcc cggtgggtgt aaacttcatg 60 

gaagagaacg caatggaagt gatagaagct aaatatggaa taaagggttc ggcaatcgta 12 0 

ctgaaactgc tgtgcaaaat atacaaagag ggatacttca tccgttggga tgaagagcag 180 

tgcctgatct ttgccaacaa ggcgggaaga gaggtgcagg ccgctgaggt acaggggatc 240 

attgagatcc tcttcatcaa agggatattg gacagaaaca gttatctggc aaacggaata 3 00 

ctgacttcgg caaacataca gaagatatgg atggaggcaa caaagcgaag aaaaagggat 3 60 

ctgaaagcat tgccctatct gctggtgaac gacttgactc agcaggaaac agaagcgccg 42 0 

gaaggtgaaa atgtaaccat tagcccggga aatgtagtac atgatgtagc cgttaacgca 480 

aaaaatgcat gcaattccgg acaaagtaaa gtaaaagaaa agaaagcaga ggaaaataaa 540 



300 
360 
420 
480 



46 



gaattacccc 
cctctcccga 
gatacactca 
tcggattatg 
gacatagggg 



cctcagctcc 
tacccggata 
aaagactggg 
gaaggaaagg 
caaaaggaag 



ccccaagggg 
cgccttcaac 
gattaccgaa 
aacacgggta 
gtatctgata 



aaggagaaag 
acaatgacac 
gtaggagagg 
tggcaactga 
gcagcactga 



aatgggagga 
acaattatcc 
tgaatgccat 
ttgccaatac 
ataaggcaaa 



ggtttctgct 
gggactgacg 
actcaggcta 
ttgctggagt 
aagaaaataa 



600 
660 
720 
780 
840 



<210> 98 
<211> 636 
<212> DNA 
<213> B.fragilis 



<400> 98 

ggacgcaatt 

gctcataact 

aaagacccct 

gagttgggct 

ttcatgggca 

atgtgcaacg 

cccgaatata 

gcaggagaca 

gaatacgccc 

gcgttgccca 

agtcgggagt 



ttctgattga 
tcgataccac 
atatcttcga 
tggtaaaaca 
gacaatacta 
catttatgca 
tcggtaagtt 
atcagaccat 
tgcgtgatgt 
aagatattaa 
tggaggataa 



agccatcaat 
tctccccgaa 
tatgcttaca 
tatcgagaag 
tatagaggtg 
caggtattta 
gaacttctat 
cgggttgctt 
gcataagccg 
gtcagggttg 
cacgcaaagt 



caggattact 
attcaagcca 
ttcacggacg 
ttccttgtcg 
tccggcaatg 
gttgtggaat 
tgttcggtgg 
ctttgccaga 
ataggtattt 
ccttccattg 
ctataa 



accatgtcca 
agcaagtgaa 
agtatgacga 
agatgggagc 
acttttatat 
tgaagcgggg 
tggatgacat 
acaagaaccg 
ccgattatga 
gagagttgga 



tggggcactg 
agagacattg 
acgggatgtg 
cgggttcgcc 
cgatatattg 
agagttccaa 
cctttgccgg 
catcatggcg 
attggggaaa 
aagcaaactc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

636 



<210> 99 
<211> 1923 
<212> DNA 
<213> B.fragilis 



<400> 99 

atctactttt 

gcttcaaaat 

gcaagccttt 

acaacactgt 

ggggtcattc 

ggatatttgg 

ttgccgatta 

cgtattttgg 

gtgtttgttt 

agaagtaatc 

cagacgatgg 

gagagaattg 

atgattgacc 

ctgggaaagg 

cctattcatg 

acgggagctg 

tataaattaa 

ttggataatt 

cgcatggaat 

aaacatgtgc 

actcgtatta 

acggataagg 

tatgttcagt 

tttatcacaa 

aaacaacaga 

tttatgacta 

ggtgaaatct 

atgatctatt 

gagaagctat 



atattactat 
ggctgatact 
tgcaaatagg 
tttgtgtaat 
gatattcttc 
ttacctgtgt 
gtgttattct 
tgaaaatgat 
atggttcgaa 
attttagatt 
ggtgtagggt 
aagctattat 
ggttgatagc 
aaggtatgca 
tagatattcg 
ccggttcaat 
ttttggttga 
ggcgggatat 
caatctttaa 
ccatgatgga 
tggctgactt 
ctgtcaatcc 
ctctcgcaca 
ctcgttttgg 
tagagaaagg 
ttccggaagc 
atatttttga 
tgagtgggca 
acgaagagtt 



gaatatacga 
tgctgtcgat 
tctgtcggca 
attcaatgtg 
ctttattgat 
aggcaatttg 
tacagcttat 
ccatgagttg 
gggatctggt 
aaaaggattt 
atatgcgaat 
tgtttcttct 
cgaggatatt 
aataaaggat 
aaaaatatct 
tgggcgtgaa 
tcaggcagaa 
tgatgctaaa 
agactatcgt 
agataatgta 
agctgtcaaa 
gactaacgtt 
tcaattatct 
gaatgtgctt 
agggccggtt 
gtgccaattg 
tatgggcaat 
gaaaaatata 
attgaatgtg 



ttttattata 
gttcttttag 
ttagtctttg 
tgcttctttc 
atttctcgta 
ctttggatgg 
atcgtcaatt 
atgactttcg 
attaatattg 
atttcagatg 
aatgaatctc 
gagaaagtgc 
cgtattctta 
attcagatag 
tcccatatag 
atggtgaggc 
tcacctttgc 
atgctggttg 
ccgcagtatg 
tctgaagcca 
tatggtgtgg 
atgggatgta 
aaatatgcca 
ggctcgaacg 
actgtaacgc 
gtattggaag 
cctgtgaaga 
aaaatagagt 
aaagagttca 



agtatctttc 
tgatcttttc 
agttttcgtt 
atctgaatcg 
ttttcatttc 
ggtggagtgg 
tctccttgat 
atcgtagaca 
ctaaatcatt 
atacaggctt 
tgtttgatat 
accgacttga 
cagttcctcc 
aagatttgtt 
aaggaaagag 
agatagccgg 
ataatgtaca 
ccgatgtgac 
ttttccatgc 
tacaggtgaa 
aaaagtttgt 
gtaaaagact 
atgatggggc 
gatctgtgat 
atcctcaggt 
cgggaagtat 
ttgttgatct 
ttaccggctt 
cttgtcctac 



atcgcgagtt 
aatgtttctg 
gtgggtgtgg 
tacttatgta 
cttaaccttg 
acgagaagta 
ggtctgtttg 
tagtattcgg 
acgagttagt 
tatagggaag 
tttagaagaa 
aacctccggt 
attcaatgat 
acagagagac 
aataatgatt 
attgaatcca 
attggaactg 
taaccaaacg 
tgctgcctat 
tgtgctgggt 
gatggtctct 
tgctgagatt 
attagtgaaa 
acctagattt 
tatacgttat 
gggtaatggg 
agccagaaga 
gcggcatggt 
ctaccatgaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 
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aaaataatga tagccaaagt tcgtgaatat gattatgaag aggttaagca agaaattcaa 1800 

aagttaatag atttgagcta tacttctgac accatgggaa ttgtcgcttc tatgaaaaag 1860 

atagttcctg aatttgtaag caagaattcg gagtttgaga tattagataa agcctctttt 1920 

taa 1923 

<210> 100 
<211> 306 
<212> DNA 
<213> B.fragilis 

<400> 100 

ataatttctc ttaataacta tataagttat gatgtggttt ctaattcttt ttggataaaa 60 

tattttaatt tatatattaa aaatatgtat cagtatgctt tttctcccaa taaatctcat 120 

ttgatttatt ctacaatagc ttttggagat gaaccggaaa ttattattat gggaaaagga 

cagataacaa atgatgatga aatacaaatg tatccttcta ttaatgataa tggagatgtg 

gagatattgt ttatcaagca ggaaataaaa aaactttcca tactatgggt gggatgcttt 3 00 
cgctaa 



180 
240 



306 



120 
180 



420 
480 



<210> 101 
<211> 2118 
<212> DNA 
<213> B.fragilis 

<400> 101 

tttactgaat atagtaacct tacacaaaca gacaaacatc gtcttatgaa gagaaatgta 60 
tcattgctga agtatgcact gctgatagca ctttgctgtg tagcatgtgt aaatgagaaa 
gatttgtatg aaccgtcggg ggaggatcct ggtgaaacgg aagaattgga cttatcgttc 

aaattcgctt tgagagccga taaacagatt catatatctg ttacccgggc agatggaaaa 2 40 

gctgctgagg ggataggggt tggagtgtat cttcaacagc cttatgaaga agacgggatt 300 

atttccggta agcctctcta tatgggctat acagatggga atgggcagat tgatgctact 3 60 
atttctgttc cggcaaacag tgataagttg tatgtcgctt cgttgacagc cggttatccc 
ggagtgcagg agatggatgt gcaaccttcg atgacgtgca acttgactgc aacagccttt 

caaatcaaga ctgctactac ccgtatggtt gctacccgga gtgaaacagg attggatgtt 540 

cccgtcggcc agaaactgag caatctttat gaattgtata gcccttatac tgattcggag 600 

attggaaaag acggtatacc acttttgaat gcttctccgc ttgttacgaa agaggaatta 660 

tctgctaagt ttctaaattt aatgaatagt tggtatccgg aacagaagaa tgtgcaggat 720 

gtggatttga aaaagagctc tgatctggtg gtgactgatg aattgggagc ggaagtgtgg 780 

gctacgtatg tcggtgatgg tggattttat gtaaataatg cgaccgtcta caatgtgttg 840 

gcttattata gctaccagga aggggagctt ggcagacgtg aagatataca gggacatcgt 900 

atgactttgt tacttccgaa tactcatcag caaaagtgtc cttcgggttt aaaagtacaa 960 

ttgttgtatt gggacggaaa acaatatagt aaggtattcc cgaaaggtgc acgtatcggt 102 0 

tttgctgtgg cacgtgacgg attgaatata gctaatgtaa atgctgccaa tggaggagtg 1080 

aattcaaaaa gttcctataa gttcaagaat cagaccttcc cgaatggaga tgttaatggc 1140 

ttttattact ctaccccatc tttgaatgca acgaaaagga cgaatgcggt gattcgaaat 12 0 0 

gtgcccgatt acaactgttg cattatgggc ttcgatattc gcccttatga tgatccaaaa 12 60 

gcagattatg attttaatga tgtgatgata aagcttaccg catcaccggt atctgccata 1320 

aaaccggaag aagacattcc ggtgatcgat gaatttactc catcggaggc tgtttacggt 13 80 

acattggcct ttgaagacca gtggcctaag atgggggact atgacttcaa cgattttgtg 1440 

atgaattaca gttatgagtt ggagaaaggg gataataata tgattactgc tctaaagttg 1500 

actttcacgc cgattgcaaa gggagcagct tcatggacgc atatcggtgt aggcatcgaa 1560 

ctgccgcttt cggctgacaa tatcgacaaa gcaaagtcgg aaggtgctac acttgaagag 162 0 

ggtaatgacc gggccacttt tattgtctgg aatgatgtta atactgcttt cggtacgact 1680 

gaaggatatg tgaatacgga gggtgcggtg gtcggagttt ccgctattcc ggttgaagta 1740 

accgtacgac tgaagactcc tgtcagcagc ttgttaactc agaagtttaa tccgtttatt 1800 

tttgtcaaca gccgtcaaag agaaatacat ctggtagatt ataagccgac aaaacatgcc 1860 

gacacttcac tcttcgggac agaaaatgac agatcggatc ctggggctga agtttattat 192 0 

cgtatggata accgatatcc atgggctctt gatttcccac ggaaggaaga ctcttcaccg 1980 

gcctggaatt atcccaagga aagagttatc attacgaaag catatcctaa ttatgagaaa 2 040 

tgggtgcttg atcaatccaa tctttcctgg tttgacgcga gtgtgtcggg gaacgtgaat 2100 
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agggaattct tatattaa 2118 

<210> 102 
<211> 1386 
<212> DNA 
<213> B.fragilis 

<400> 102 

aaaggagatt ttagagggag cttatcgttc aatattgatg gatatatgag agtaatgatc 60 

cgaattaaaa gaaagattga cagctcaccg tttttgaaaa gcgttgtagt cttgttttca 12 0 

ggaaatgttt ttgctaattt aatttcactc ttatcaattc caattcttag ccggatttat 180 

tcggatatag cttttggaga ttatgcaatt gttatttcta ctgctacaat tgttaacggt 240 

atttcaacat taggattaac ttcagccata atgataccgg tggaagaaaa taaagccaaa 3 00 

tcagttttta ctacagcatg gatttctcat atattggtta gtactttttg ctttgttctt 360 

gcactgattt tattacctgt ttattctatt tattctatta cagggtctta ttcttgttct 420 

ttactattga tgtatcttta tgtacttctt gttggtacct tctctttgtt gtctgtttat 480 

gcaaatcggt taagaaaaaa tcggatctta ttttggaatg caatgataaa ttcattggca 540 

ttgctctgtt tagcaattcc ttttggctta tgggggtggg gagggactgg cttcttgatg 600 

gcatctaccg gtggatactt agtggcaaat atacaaatgc tatatcatat gaatccattt 660 

aagaaaatag cttataggga ttgtgtatct gtttataaag attttaagga ctttattata 72 0 

tatcagtttc cttctaattt aatatcaact tttacgattc agttacctaa tcaattgttt 780 

tctgcctatt ttggtaatgc ttcattagga ggttatgcta tgtgtgaaag aatattgggg 840 

gttccgatgc gtttgatagg tgctcctatt acaactatct attttcgtca ttcttctgaa 900 

tgcataagag agtgtaagga tatatctggg tttacttata ttttgattac acgtattttg 960 

atattagctt ttttacctgt attgatttta ttttcttgtt cggaagtatt atttaccttt 1020 

attttaggag attcatggtt gcttgttggc aaaattgtat ctattttaat atttccgtat 1080 

gtgctgttgt tttgttcaaa ctgtgttagc tattgtttgg ttgtaattgg aaagcagaaa 1140 

ataaacttgt atctttcttt actttattta atgttgatcg ttgcatctgt tgtgtctgga 1200 

ttttatgttt ttagtgactt tgtttcggtt gtgatatgct ttgcggtagc attgattgta 1260 

tttaatctat tgaatttatt agttatattt tattatctta ggaaagattt tggaaggttt 13 2 0 

gtaagattca ttggaattta tttgctgtta atatacttag gtcttatttt aataaaatat 1380 

ttatga 1386 

<210> 103 
<211> 2571 
<212> DNA 
<213> B.fragilis 

<400> 103 

tcagaatatc ttatgcgtag atttattaca ctattcttct tgatttttac cttgtccgga 60 

gtggctgtag ctcagcaaat gtccgatgat caggtcgtgc agtatgtaaa agatgctcaa 12 0 

aagatgggta aaactcagaa gcagattaca acagagttga tgagaagggg cgttacgaaa 180 

gaacaagtcg aacgcattca ggaaaaatat gaaaatggaa gtggcagtac cggtacacag 240 

aacaaccaga actcaacaag gtcgcgtacg cgtactcagc aaaatgatga aagtgattac 3 00 

tctaatcgct ctcaaaaaaa tctgaaagat cagaaaaatc aaaagaacca gaagaaccag 3 60 

aagaatataa aagggcttcg tcagtcgaac aaccagaaaa acaagcgtgg aatgggagat 42 0 

gagaatctgg aaatgacaga tgaagacatg atgaatgagg aagactggtc tgacgagtac 48 0 

accgtgaagc cggaagagga tccgactcaa caaattttcg gacataatat ttttacgaac 540 

gagaacctta catttgaacc caatctaaat atagcaactc ctgtaagcta tcgtttggga 600 

cctggagacg aggtgattat agatgtgtgg ggagcttctc agactacaat cagacaaacc 660 

atttctccgg agggtagtat tttagtcgat aatcttggtc ctatttacct aagtggaatg 72 0 

actgttcgtg aggctaataa tgcggtacgt cgtgaatttg cgaaaatcta cgcaggtata 780 

tccggcccga atcctaatac ttcagttgat ctgacgttag gcaatatccg tactattcaa 840 

attagtatta tgggagaagt tgctgttccg ggtacttatg cgctctcggc attctcttct 900 

gtattccatg ctctctatcg tgccggtggt gttaataaga taggtagttt acgtagtatt 960 

aaagttgtgc gtaacggcaa aaaaatagca gatctggatg tttacgattt cataatgaag 102 0 

gggaaactga atgacgatgt tcgtttgcaa gacggtgatg tggtcattgt tgatccatat 10 8 0 

gaatctttag tgcagattac cggtaaggta aaacgtccga tgttttatga gatgaagcct 1140 

tctgagacaa tggctactat tttaaaatat tcaggtggtt tcaccgggga tgcttataaa 12 0 0 
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aaagctatac gtttaattcg taaaacaggc cgtgagcatc aggtttataa tgtagatgaa 12 60 

atggattatt cggtatttaa actggatgat ggagatgtgc tggctgtgga ttcggtattg 132 0 

gagcgttttg aaaaccgtgt tgaagtccgt ggtgctgttt atcgtgccgg tatgtaccaa 13 80 

atcgatggaa ctgtaaacac agtaaaacaa ttaataaaga aagctgaagg agtgagaggg 1440 

gatgctttct taaaccgtgc tatcatcgat cgtgagaatg atgatcttac tcatgagatg 1500 

attcaaattg atttaaaggg attgttgaat ggtactgtag ctgatattcc tcttcagaaa 1560 

aatgatatcc tttatattcc gagtattgaa gatttgaagg aggaagcaac ccttacgatt 1620 

catggtgaag tagccaatcc gggtacttat ttgtattcat ccaatatgtc ggttgaagac 1680 

cttgttctac aagccggagg attattggag gcagcttcga cagcccgtgt ggatgtgtcc 1740 

cgacgaataa aaaattcaaa aagtactgaa ttgagtaatg tagttggtaa gactttctct 1800 

tttgaattga aagatggttt tcttgtagga ggtgatcagg atttccattt ggaacctttt 1860 

gacgaagtat atattcgtcg tagccctgct tatcatcaac aacagaatgt tacagttgga 1920 

ggcgaggtct tatttggtgg acgttatgcg ctatcaaaga agaacgaacg tcttagtgat 1980 

ttgatttcta aagcgggcgg tattactcaa gatgcttatg tgaaaggtgc ccgtttgatt 2 040 

cgtaaaatga cagaagaaga gttgcgccgc aaggaggatg cacttcgtat ggctaataag 2100 

ggtggagctg attccatttc tgttaagacc cttgatgtct ctgatactta ttctgtcggt 2160 

attgagttgg aaaaggcttt ggctaatccg gggtcagact ttgatatggt attacgtgaa 2220 

ggcgatattt tgtttgtgcc ggagtatgta agtaccgtca aaatcaatgg tgctgtgatg 2280 

tatcccaata cagtattata taagaaaggc gaaagtttga aatactatat taatcaggcc 2340 

ggtggttttg caagtcttgc aaagaaaaaa agagcttttg tggtttatat gaatggaaca 2 400 

gtgtctcgtt tacgtacggg aaattctaaa gcgatagaac cgggctgtga gattatcgtg 2460 

ccaagtaaag atccgaagaa gagaatgtcg gcagccgaaa ttataggaat gggtacttct 2 52 0 

gctgcttcat tagcaactat gattgcaacg atggttaacc tctttaagta a 2571 

<210> 104 
<211> 2898 
<212> DNA 
<213> B.fragilis 



60 
120 



300 
360 



<400> 104 

atgaattttc aagatttaca tatactcggt gaattaaagg aagaattgct ttatcgtatt 

ctttattcaa ctgatgcttc tgcttacaga gagatgccta ttgctgttgc atatcctaag 

gattcttctg atgtgcagaa gatcagtaat tttgccaaaa aaaatcaaat taatttgatt 180 

cctcgtgccg gaggaacttc tttagcggga caggtagttg gtaaagggct tgttgttgat 240 

atttccaaat atatgaacca tatattggaa atcaatcagg aagaacgttg ggtaagagta 

caaccgggag ttgtattgga tgagctaaat ctttattgta agccttatgg gttgtttttc 

ggaccggaaa cttctacttc taatcgttgt tgcttaggag gaatggttgg caataattcg 42 0 

tgcggttctc actctttagt atatggtagt acacgtgatc atttgcttga agctaacgtc 480 

gttttaagtg atggttctga agtagtattg aaaggaatga cttctaagga gataaacgag 540 

aaatgtaaat tagactcatt ggaaggacgt atttatagcc aaattattac gttattatcg 600 

aattttgaaa accaaaaaga aatcgtcgat aattatcctg atgtatcttt acgaagacgt 660 

aactcaggat atgctattga cgaattattg cgtagtaact attttgataa gaattgttct 72 0 

gagtctttca atctttgcaa attgttagcg ggttcggaag gtacattagc cttaatcaca 780 

gagttaaaac taaaattagt tcctcttcct cctacggaaa aagccgtgat atgtgtacat 840 

tgttctacat tggaagaatc ttttgctgca aatcttgtgg ctttgcgaca tgctccggtt 900 

gcaattgagt taatggatag tacaatactg gagttgagca aacagaatat ttcacaaaat 960 

aagaatcgct tttttattca gggagatcct gctgctatcc tcattattga gttagctgag 1020 

caaacaaggg gtgaggttga taaaaaggct aatgaaataa ttgatgattt aaaaatacat 10 80 

cattatggaa ctcattatcc tcttgtatat gggaaagata ttagtcgtgt atgggcttta 1140 

agaaaatctg gattgggatt gctttctggt atgcccggaa gtgctaagcc cgtttcattg 1200 

attgaggata ctgccattgc tcctgagcgt ttagctgctt ttatcgctga tttgaaagtt 12 60 

atgttaagta aatatggtct ggattgtatt tatcatggac atattagtac tggggagttg 13 2 0 

catttacgcc cggtactcaa tttgaaaaag gagaaagata agaaactatt ccgtttagtt 13 80 

gctacggaaa cggctgagtt agtgaggaag cacagaggct cattaagtgg tgaacatggg 1440 

gatggccgtt tgagaggcga gtttatccct ttgttgttgg gtgataaaat ttatagtttt 1500 

ctgcgagaca taaaggagac atgggattta cctcatatat ttaatattgg taagattgta 1560 

gatacacctt ttatggatat taatttacgt tatgaacagc acaatcttgg ggttaagaca 162 0 

tattttgact tttctaaaca aaagggttgg ttgtgcgcca tagaacaatg taatggttct 1680 

ggagattgcc ggaaatcaaa tctctttggt ggtacaatgt gccctactta tcgggctacc 1740 
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agagaagaaa agaatacgac acgggcacgt gccaatactt tgagagagtt gttgatacat 1800 

cctgcacatg atcgaatatt tagtcaaccg gaaattttgg aagtattgga tacatgcgtt 1860 

tcgtgtaaag cttgcaaatc ggagtgtcca tcgaatgtgg atatggctcg ttataaggct 192 0 

gaatatttgc agcatcacta tgatgaaaca tttgtttctt tacgttccag attaatagct 1980 

aatttgacta aggtgcaaaa attggggatg gttgctcctt ggctgtataa tgcttttgtt 2040 

actgcccaat ttacttcctc attactaaaa cgtatattaa agtttgcacc tcaacgttct 2100 

attcccagac tttataaaat aacattgaag agttggttat acaataatcc agatatgaac 2160 

aaatgtaata gaaaagtgta tttgtttgca gatgaattta ccaattatat ggatgtagag 222 0 

attggtataa agttcatcaa attattgcgt acattgggct atgaggttat tataccaaag 2280 

cacttggaaa gtgggcgtac tgaaatatcg aaaggacttt tgaagaaagc taagaaaata 2 3 40 

gcagaaaaga atatattatt tttaaaagat atagtgacag aagaaattcc tttagttgga 2400 

attgaacctt catgcatact ttcgtttcgt gatgagtatc cggatttggt ggatgaggaa 2460 

ttacaaggat atgctcgtaa attatcggta aactgtctgt tgtatgatga gtttattgtt 252 0 

cgtgagatgc gtaagggtaa tattaaacag aaacaattta ctcaatcata tctttatata 2 580 

aaattacatg ggcattgcca tcagaagtcg ttagcgtcta tagagccttc taaagagatg 2 640 

ctttcactcc ctaaaaacta tcaagtggat attataccgt cagggtgttg tggtatggca 2700 

ggagcttttg gatatgaaaa agagcattat gacttatcaa tgcaaatagg tgagcaggtc 2760 

ttgtttccag caattcgtca agctaaagaa gatgtatgta tttctgctcc tggaaccagt 2820 

tgcaggcagc agataaaaga tggtacggga aggcgagctt atcatccaat tgaagtgtta 2 880 

tatgatgctt taatttaa 2898 

<210> 105 
<211> 3255 
<212> DNA 
<213> B.fragilis 

<400> 105 

gcaactcttc accatgaaag agcggataga caagtggcag gttctcaaca agtattatca 6 0 

agcaacatga gtttcacaac ttcaatcaac atagagcgtg actttggcaa gataccccac 12 0 

tatatcgtca ctgccaatgc tcgccagacc atcggcaaaa ttatcaatca ctttgccagt 180 

ggcattcatt cgttttgcct catcggctcg tacggcactg gcaaatccag cttcatcctc 240 

gccttggaaa actgcctgtg tggaaagact gttggaaaaa atgtcttact gagtcaacgt 3 00 

ggtcaattca atagttttga gcaattctcg tttataaaca tcgttggcga ctacgcatcg 3 60 

ttggcgaatc tacttgcgtc acatcttaat gcagaaagta agaacgttat ctctgtgctg 42 0 

gacaaccact ataacagact tcagaaaacg aatcaattct tagttattgt tattgacgag 480 

ttcggtaaag tgcttgaaca tgcagccaag aacaatcctg aaaaagaaat gtacttttta 540 

cagaagttct gcgagtatgt aaatgacaca agtaagaaca ttctcttcct tacaacgctg 600 

caccaaggtt ttggagccta tgccaaggga ttgaaagcag agcagaaaca ggaatggacg 660 

aaggtaaaag gtcgcattca ggatatcgtc tttgctgaac cgatagagca acttctcaac 720 

cttaccgcca ctcatatatc ttcggctgac aaaaagccaa cactcaacac agacaagatt 7 80 

tacaacctag ctgtcgcttc taagtttgcc gccagcacac ttgatgccaa cgttgcccgt 840 

gctctctatc caatggatat tgtctcggct tacgttttta cccaagctaa tcagagatat 900 

ggccagaatg agcgtacctt gttcacattc ttggagacgc gcggtgaggg aactgtcaac 960 

gattttgaag cttcaatcaa tcgtttgtat agtcttgccg acgttcacga ttatattgtt 102 0 

tataattttt attcttattt gcaagaggcc cacgaagact cggcgaattg gtcggctatc 1080 

aagattgcca tcgaaagaac agagggactc aatgcagatg ccacaaccat aaccgatgcc 1140 

atcaagattg ttaaggccgt cggccttctg aacatctttg cctcgtctgc tgctagcatc 1200 

gacaagcagt ttctgatagt ctatgctagc tatgctatgg acgtttgtca agtgggctcg 12 60 

gtgatagacc tccttgaaaa gaaccagata ctgcgtttcg ctaaatacaa gtccaaatat 132 0 

atcctctttg agggaactga tgtagacttg gaagcaggcc tttatgaggc tgctcgcgaa 13 80 

tgcaagcgtt ccgatgttat agcagaaaag gtgtgtgaat acttcgacga caagatagca 1440 

cttgccaatg cgcactattt tcgcactggc acgccacgat acttccagta ctgtctcacc 1500 

tcttcgccta ttgaatacat cgtcagtggc gagactgacg ggattatcaa tgtgatactg 1560 

acccgtcagg aagaccttgt tgctgtcaaa gctgcttgca cggacataaa tggtaaagcc 162 0 

atcctctatt gcatttttga gaacacaacc gaaatcgctg accatctctt tgagatagac 1680 

aaactccatt gggtgcgcga ctattacgtg gccgacgaga acgacaaagt agccaaccgc 1740 

gagatagcta acctgctggt tcacgaacag tcaatgttga acaaaaccat tatggagagc 1800 

ctcttctctg acaacgtgac gtggattttt aacggcgaaa tccttgcgtc aatcacatcg 1860 

cgtaagatgc tcgcacagca actctctacc atatgtgatt ctgtctatta tgccactcca 1920 
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atctatcgtt 
cagtcgtatt 
gataagtatc 
acaactgccg 
gagaacttcc 
ttagaggctg 
ctgattatta 
atcaacaagg 
tttgctgttg 
ggtagctcag 
tataagaagt 
ttccgtgatg 
gatgagttgg 
gttgctgtca 
aatatcgagc 
catcatctga 
cgtaattttc 
gtgtcatacg 
ctgctggcta 
actgcaagtg 
acgcaagtca 
gagtctattt 
aagaaattga 



ttgagctcat 
tgcaggcatt 
cacccgagaa 
gactgggctc 
tacgctccac 
cgccttttcg 
aacgagatga 
aggtgcttga 
acggtgtccg 
aactctcgac 
tgaatagcta 
tcatagcaaa 
gctttaagga 
tacaagaggc 
aatatttgct 
tagcagaacg 
aagcccggct 
tcgccctcaa 
cattgaaaga 
aggatgtcat 
ttctttccga 
tgagtggtga 
aatga 



taacaagcac 
gcttgaccac 
atcgctatat 
accgaccgaa 
tatcggcaaa 
tctgaagcag 
tttcgcgctc 
ccttatactg 
ccgtacattt 
tcaatcgttt 
tgctcgacgt 
ggcgaccgac 
aattacactt 
tatccgcgaa 
taagacactg 
atataagtcg 
tgttggcaac 
caagcctttg 
tatgctgttt 
aagactgcat 
agcgatgcgt 
caactcgctc 



cgccccactg 
tcttccgagc 
ctcacacttc 
ccttcattcc 
ccgcacaagt 
gggttgctct 
tataacagcg 
cgctcaccca 
ttcgacaaat 
atcgaaacga 
acgaaggata 
ccagagaaaa 
agccaaaacc 
ctccgcaact 
cgacttgaag 
gttaaaacgg 
tatgatgaca 
acagaaattc 
caattggatg 
atcacacaga 
caagaggtta 
gacgttgctg 



gtaatatgtc 
cataccttgg 
tcaagaatac 
aaccactttg 
tgggagaact 
attgttggat 
atgggaccta 
atggttttct 
atagagaggc 
ttcgtccgtt 
tttcgcccaa 
cattctttga 
cagaagccat 
gctactctga 
aagtcggttt 
aattgatgcc 
agaccgcatg 
gcgacacaga 
actatgtgga 
ataagagtaa 
acagccttga 
cactgatagc 



acttgcccgt 
ctttgagagg 
aggtattcat 
ggacgcttgt 
cttcactctg 
tcccacatat 
tgtaccttac 
cattaaggcc 
tatcaatatg 
cctcaccttc 
tgctcgtaag 
ggtgttgccg 
tgagagcttt 
attggttggc 
ctctgactat 
tgtaaatatg 
gattgaagcg 
taaatcgttc 
gatgcacaag 
ggctgtaact 
aaacaaactg 
aatccttaaa 



1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3255 



<210> 106 
<211> 267 
<212> DNA 
<213> B.fragilis 



<400> 106 

ttttattcac attgcgttga tcaacattgc aaaggtgaac atttgcagga agatgacaaa 60 

agaaaacaag gcatccggat caagttatac agaataagtt ataatcatca cattatcagt 12 0 

gctattactt attttttctt atacatctac ttcacatact ctgctattgt ccgattattc 180 

atattatacc cgtttaacga aatatccccg gttttatacc tggcatggca taaaaccggg 240 

gacgaaacaa aaatctatac ccattag 2 67 



<210> 107 
<211> 432 
<212> DNA 
<213> B.fragilis 



<400> 107 

aaaaacggac aaaacttaca atcggatggt tctatgagct tctggaacga ggcaaatgtg 60 

atattatgga tttcaaggaa cagtttgaag ataagtcttg cggacataga gtcgtcacca 12 0 

tacataagca agcagttgct tcgccacaca ttagaacatc tacaagaact ggactttata 180 

gagtcaactg gtcgcgcatc aggtttgcgt tacatcttgc ataagtctaa gatacaaaca 2 40 

actggtgaga aaataaaata ttcgcaactg aagaggcagg gcaaggcaaa acagagagaa 3 00 

gccgtcatac ggtatataaa cacagtcggc actataacta atgcggaggc tcgcgaaata 360 

ctcaacttga cagagacatc gcagtcatac gtgccaaggt gttatccgaa ctatggcgtg 42 0 

aaggacatat ag 432 



<210> 108 
<211> 876 
<212> DNA 
<213> B.fragilis 



<400> 108 

atgtttacta acctcatcaa aagagtaatt atgaagtatg cattctctgg tcatgagtcc 
tttcaatgca agggcttgtg gttgaaaaaa ggatatgact acgctaaggc gggattgtcg 



60 
120 
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ttcacagatg 
cgctattggt 
aaatatcttc 
ctgctacact 
gagtacaaca 
atgtttgccg 
atcgacacaa 
tctgcactgc 
aactgttccg 
acgcagggca 
ggtatgtcag 
ataaccttct 
tggcaggttc 



actacgctgt 
tgagagcttt 
ttgatgacaa 
atatgctcgt 
aaacacgcaa 
acaaatgctt 
tgctaaagaa 
taattgacct 
ctcgcgccaa 
aacagcaagt 
tcaatgaact 
gcaatactgc 
tcaacaagta 



tgtagaactt 
cggcatcact 
cggcgcagac 
gacatcacga 
agagttcacg 
tgacagcaca 
ttatgttacg 
caaactgata 
gatggagccg 
gatagagttc 
atatgacgtt 
cggtgagcaa 
ttatcaagca 



ggtgtgggca 
aacgacaatg 
ccttacatcg 
gtagcaacac 
aaagcagatt 
ccctacaacg 
cccgactcta 
ggcaagactg 
ctcgttttcc 
gaggtattgc 
ttcgaccaac 
ctcttcacca 
acatga 



agaatatggt 
gtgtgccgac 
aagacacgac 
tctacaacat 
tggcaaatgc 
agaagacagt 
tcaaggcgtg 
gccacgagga 
tctttgccgt 
tgcgacttgc 
tgcacactat 
tgaaagagcg 



agcctcaata 
cgagataggt 
cacactatgg 
cgttttcacc 
agtaagacgg 
ttggcgtgac 
cgatgacttc 
ttataccttc 
actcgacatc 
caatatcttc 
cgacccccat 
gatagacaag 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
876 



<210> 109 
<211> 330 
<212> DNA 
<213> B.fragilis 



<400> 109 

aaaaacgcta 

agaaaagcag 

ctgatagaat 

ttggcacgct 

ccattagttt 

aagaaaaaga 



acactaaaac 
gagcgcagat 
cctacatatt 
acatagaggg 
taagcacaat 
accaagaagg 



acaacctcct 
accgggaatt 
ggacgcaaaa 
aaaaaaactt 
cagaacttac 
aaagcgctga 



atcacagagc 
atctccaaca 
gaacaaaata 
tttggaaaaa 
gtcaacgaaa 



ctattaaaga 
atgaaggagt 
tcaagacatg 
taagaaatgg 
tctggaataa 



gacaagaggc 
tataaaagcg 
caaagattca 
agtattcaag 
gatggaaaga 



60 

120 

180 

240 

300 

330 



<210> 110 
<211> 195 
<212> DNA 
<213> B.fragilis 

<400> 110 

acgtttcata aggtaaagcc tcgacttaac attgaaagca gaatctttca aactcttcac 60 

tactacatat tgactttaat gtttcgtagc aaaggtaaaa ttagaattct aagctttttt 12 0 

tcgagtggtt acgagaatcc gcaaaagggg aaagaaaaca tcctccccct tataatttta 180 
ttttcaatca aataa 195 

<210> 111 
<211> 195 
<212> DNA 
<213> B.fragilis 

<400> 111 

aggggaaata aaagggaaaa tgaaaccttt tctcttctta attgtctgac attaaatgaa 60 

atagttctta aaaaagtgag tgtttcaata aacgatcgtt taattgaacg caaagataga 12 0 

gggtattttc ataactgcaa aatgtttaat aaaaaaaatg ttttccttca tggcttgatc 180 

tatttaattg tttga 195 

<210> 112 
<211> 1596 
<212> DNA 
<213> B.fragilis 

<400> 112 

gatatgagca aacaactctt acttggcgac gaagccattg cgcaagcagc attggatgcc 60 

ggactttcag gcgtttacgc ttatcccggc actccatcta ctgaaattac cgaatatatt 120 

caaatggctc ctattacgag cgagcgtaac atacacaacc gttggtgtgc caacgaaaag 180 

acggcaatgg aagctgcctt aggtatgtct tttgttggca aacgtgcatt agtctgcatg 240 
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aaacatgtag gaatgaacgt agccgccgac tgttttatca attcggccat cacaggtgta 3 00 

aaaggcggac taattgtagt agcagcagat gaccccagca tgcattcatc gcaaaacgaa 3 60 

caggatagcc gtttttatgg cgatttctct ttgatcccga tgtacgaacc gagcaaccag 420 

caggaagctt atgacatggt gtacaacggt tttgagtttt ctgaaaagat aggtgaaccg 480 

atactaatgc gtatggtgac acgtctggct cactctcgtt caggtgtaga aaacaaagca 540 

caaaagccac aaaacgaaat ttcgttcagc gaagacccac gccaatttat cctgctcccg 600 

ggcaacgcac gcaaacgtta taaagtacta ctcacacgcc aggaagaatt catcaaagca 660 

tcagaagagt caccatataa cagatatatc gatggcccca ataagaaaac tggtattgta 72 0 

gcttgtggaa tcggttacaa ttatctaatg gagaattatc cggaaggttg cgaatatccg 780 

gtattaaaag ttggacaata tccgcttccc aagaaacaat tgatgcaatt aatcgacgct 84 0 

tgcgacgaaa tccttgtttt agaagacgga caaccatttg ttgaaaaaca attgaaagga 900 

tatctgggta tcggattaaa agtaaaaggc cgtcttgacg gtacattatc acaagacggt 960 

gaattgaatc cggacacggt tgcacgtgcg ctcggcaaag agaacagctc ggaattcaat 102 0 

gttccgaata ttgtagaaat gcgtccgccg gcattgtgtg aagggtgcgg gcacagagac 1080 

atgtatatta cactgactca agtgctaaaa gaagaatacc ccactcacaa agttttcagc 1140 

gatatcggtt gctacacttt aggagcaaac gccccattca acgcaatcaa ttcatgtgtg 12 00 

gacatgggag cctctattac catggccaag ggtgcctccg atggaggact ccatcctgct 12 60 

gttgccgtaa tcggagactc aacttttact cattcgggca tgaccggact attggactgt 1320 

gtcaacgaaa atgccaatgt taccatcgtc atttcggaca acgaaacaac agcaatgacc 13 80 

ggtggacaag attctgccgg cacaggtcgc cttgaagcca tttgcgccgg attaggtgta 1440 

gatccggctc acattcgcgt agtagttcca ttgaaaaaga actatgaaga gatgaagcaa 1500 

atcatacgcg aagaaattaa ttataaagga gtatccgtta tcatcccgcg cagagagtgt 1560 

atacaaacat tagcacgtaa aaaaagaagt aagtaa 159 6 

<210> 113 
<211> 429 
<212> DNA 
<213> B. fragilis 

<400> 113 

attttatcaa accggaatac attcgatccg acttaccttt ggggcgataa tttatctatt 60 

aaccctttaa atcatatacg tatgaaacag aagaaaagac cggcatcaca aactgaagcc 12 0 

atgaaactga gatggaaaaa acggattgtc tttgagaaag gatacactga aatgtgtgcc 180 

gaatggatgg cggagcgcct ggaagcgttg accgaccacc tgcaatacgg gcacgcagcc 240 

atcgcttatc agaagcagaa cggagacttc aggttggtaa aagcgacact gatctactat 300 

gaaacggaat tccacaaaaa gtatgatccc acacaaatag aaggcgccgt agtctactgg 360 

aatgtggatg aacagcgatg gacgacattc cagatggaga acttcatgga gtggagaccg 42 0 

atcgtatag 42 9 

<210> 114 
<211> 1233 
<212> DNA 
<213> B. fragilis 

<400> 114 

atattaccaa aattattaat ctatatgaaa caacatcttt taaaagaaat agaactaggt 6 0 

accaaaagcg ctcttctcaa aaagaaaatt attacacatt atatatataa tggcagttca 12 0 

acaattaccg acctgtctaa agaattggat cttagtgtcc ctacagtcac taagtttatc 180 

agtgaaatgt gcgaagaagg ttatatcaac gactatggta aattggaaac aagtggagga 2 40 

cggcacccta acctatatgg cttaaatcct gaatccggct actttatagg agtcgatatc 300 

aaaagatttg ccattaatat cggcttgatc aacttcaaag gtgatatgat ggaacttaaa 360 

atgaatattc cttataaatt tgaaaattca atagaaggat tgaatgagtt atgcaaactc 42 0 

atttcaaatt ttatcaagaa gttgacaata gctaaagaca aaatattaaa tatcaatgta 480 

aatgtttccg gacgcgttaa tccggaatcc ggatatagtt ttagtcaatt caattttgaa 54 0 

gaacgcccat tatctgaagt tttagctgaa aaattagggt ataaggtaac aatagataat 600 

gatacgcgcg ccatgaccta tggagaatac ctaaaggggt gtgtaaatgg cgaaaaagat 660 

attatcttcg taaatatcag ttgggggcta ggtgttggaa tcatcatcga tggcaaaatt 72 0 

tatacaggaa aatccggatt ttccggagaa ttcggccaca ccagtacctt tgacaatgaa 7 80 

attatttgcc actgcggcaa aaaaggctgt ctcgaaacag aagcttccgg atctgcgcta 840 
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caccgcatct 
ggagacatta 
cttttatgta 
ctt atcaatc 
gactacatta 
aaagactcgg 
atgcttgcaa 



tgctggaacg 
acaatcctat 
ttgaaatagt 
tttttaatcc 
ctcaaccaat 
caatcgtcac 
ggagcagaat 



tatacaaaat 
aaccttggat 
ggaagaaatc 
ggaacttgtc 
aaaaacagct 
ttcaaaacta 
gtttgagtgt 



ggtgaaaact 
gaaatcattg 
gggcagaaat 
attatcggag 
gtccgcaagt 
aaagacagag 
taa 



caatcttatc 
cttctgtaaa 
taggcaaaca 
gaacaatttc 
actcacttaa 
ccggtattgt 



caatcgtata 
caaggaagat 
aattgccgga 
gctgacagga 
tctggtcaat 
cggagcttgc 



900 

960 

1020 

1080 

1140 

1200 

1233 



<210> 115 
<211> 285 
<212> DNA 
<213> B. fragilis 



<400> 115 

ataaggcaaa aagaaaataa tccggatagg aaagtacagt ttgcagttga taaaaaagca 

agtataccgg tttccatacg gaaggttctc tgtcgggaat ggaaaacggg acgtacattt 

aaacaaatga tctacagtca cttcagagca caatatttca atctgcagaa gctttatttt 

aacgttacgt taaagttcgt ttttcgtctg caaatagcaa attctttaaa tgcaaaagac 

ttgttatttg cagacaagaa caaagctatc ggaaaatggc attaa 



60 

120 

180 

240 

285 



<210> 116 
<211> 588 
<212> DNA 
<213> B. fragilis 



<400> 116 

gccatgaaaa 

acagtaatcg 

ggcatgagcc 

gcttccgact 

ggactcagat 

ttcgttaata 

ccacacaaag 

gccaacatcg 

caagatagta 

aaagctttgg 



aagatatcat 
gaaaggccgc 
agagaggtgg 
tgattccttc 
atctgcccta 
tccccaatta 
tcgtattgaa 
ttctattggg 
tccgtgaaat 
ccgccggaaa 



attatcaggt 
tcttaaagat 
agatgtacag 
gggtaaatgc 
cctcggtcat 
tccggctgaa 
cgtagataaa 
tgccactatc 
attccagcgg 
agagatcgca 



gtaggcggac 
ggtctgtata 
tcaaatctcc 
gatttaatca 
gagggttggt 
tcagatgtta 
gtagctaaag 
ccgtttttag 
aaaggcgatg 
gaaaaaacga 



aaggcatcct 
tgaaacaggc 
gaataagcga 
tttcactcga 
tggtcacgaa 
tggcagaaat 
aattagggtc 
gcattgatta 
caatagtcga 
tgaaataa 



gtctatcgcc 
agaagtacac 
tcagcccatt 
acccatggaa 
tgaaactccg 
taataaactg 
tacacgagtt 
tgaaaagata 
attgaattta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

588 



<210> 117 
<211> 969 
<212> DNA 
<213> B. fragilis 



<400> 117 

tcaatgaaaa 

gctatcaaag 

ataatggata 

cattgtacaa 

aattatttac 

gaaaagcctt 

acagggcatc 

aagaagaaga 

acttctcgtg 

attgctacta 

aaaaagaata 

aaagcacgtg 

caaggtgaga 

aagggattta 

ggcattgaag 

attggtttaa 



attttgcatt 
atacgggaaa 
gtttctttcc 
aattaaaagg 
atgatgcgca 
tggtcttgaa 
atatttatac 
tagagaatgg 
gcaattggta 
atattggtgt 
tagttcatgt 
ttcgttattt 
aacttaccta 
ctgaactaca 
atgcgcgtaa 
aaggagatta 



aatcggagca 
ccgtttggtt 
ggaatcttcc 
tactgacaaa 
catgcgttat 
tccatggaat 
tattctccaa 
ccctaaagat 
ttatacaagt 
tcatttttat 
atatacacat 
tttgagtatc 
ccgtactatt 
cacagaaagc 
tgctattaat 
tcatcctttg 



gctggctata 
gcagcttatg 
ttttttgtgg 
cagattgatt 
gggcttcgat 
gttgatgcac 
ctccgtttgc 
aagatttatg 
tggaaagggg 
gatatgcttt 
gatcgtgctg 
aattctgaaa 
aatattgatg 
tataaagata 
attgtttacg 
gcaaaacttc 



ttgctcctcg 
atacttttga 
aacaagaact 
ttctatctat 
tgggtgctga 
ttcaagaagt 
atcaatctat 
atgtagatct 
atatgcataa 
cgtgggtatt 
ctggttatct 
atcttcctga 
gagaagagtt 
ttttggctgg 
atatccgtca 
ctttgtcaaa 



tcatttacgt 
tagcgtcgga 
tttcgaccga 
ttgcactccg 
cgtaatttgc 
cgaaagagaa 
catagattta 
aacttatatt 
aagtggtggt 
cggtcctgtg 
tgaattggaa 
aaatgcagta 
tgagtttagt 
taatggcttt 
tgctgagcca 
gcatccgttt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 
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ggctggtaa 969 

<210> 118 
<211> 270 
<212> DNA 
<213> B.fragilis 

<400> 118 

aagtacgaaa aaaaccggaa gtgtggggga tatactgaaa aaaataattg ccattgtccg 60 

agatacaaaa tactatttgc gcacatttta gaaagatatt atcgggactt cgaggctttt 12 0 

ataccgatat gggcgggatg tccgggcatt catacgcctt ggaaaagaga agtgatgcag 180 

gaaagcggtt gttgcaaacc gtatctgccc aaaaaactgc ctgactcctc acgtatcgaa 240 

ttctgttttg acgtatttgt aatttgttga 270 

<210> 119 
<211> 1131 
<212> DNA 
<213> B. fragilis 

<400> 119 

tttattatga acaaacgaat ctggctttcg cttgctcaca tgggtggccg tgagcaagac 60 

tttataaaag aggcttttga tacgaactgg gttgtccctt tgggacctaa cgtggatgct 12 0 

tttgagcaat ctttggccga atatttgcat gaagaccgtc gtgtagtggc tttgagtgct 180 

ggaacggctg cacttcactt gggcttgatt cttctgaatg tgaagcccgg tgatgaagtg 240 

atctgccaaa gctttacttt tgccgcctct gccaatccga tttcctatct ggaggccaaa 300 

cctgtttttg tggacagtga gaaggatacc tggaatatgg atccggtatt gctcgaggag 3 60 

gctataaagg accgtttgcg caagacgggt aagctcccga aggctattat tcctgtccac 42 0 

ctttacggta tgcctgccaa gatggacgag atcatggata ttgcgggtcg ttatggtatc 480 

cccgtattgg aggatgccgc ggaggctttg ggttcggaat tgaacggacg gaagtgtggc 540 

acattcggtg aactggccgc tctctctttc aatggcaaca agatgatcac gacttccggt 600 

ggaggtgctc tgatctgtcg tacggaagag gaggcccgac agacaaagtt ctacgctacg 660 

caggctcgtg atgccgctcc gcattaccag catacccata tcggttacaa ttaccgtatg 72 0 

agcaacatct gtgcgggtat cggtcgtggg cagatgtttg tcctcgatga acatattgcc 780 

cgtcgccgtg ccattcactc tttgtatgtt gatttgctga aagatgtggc gggtattacg 840 

gtcatggaga accctgattc gcggtttgct tccaactttt ggcttacttg tattctggtt 900 

gatccgaagc ttgcgggtaa gagtcgtgag gatatccgtt tgaagctgga ctccgagaac 960 

atagagacac gtcctttgtg gaagccgatg catcttcagc ctgtgttcac ggatgctccg 102 0 

ttctatggga atggtacgag tgagaggttg ttcgatatcg gcttgtgtct gccttcggga 10 8 0 

cctacactga cagatgagga tatcaggaga gtggtggata tgatccgata a 1131 

<210> 120 
<211> 1569 
<212> DNA 
<213> B.fragilis 

<400> 120 

aaaaagatga agcaaaagca gttttacttt atttatgttt tccttctgtc aatgactttc 60 

ttgggtgcat gttccaaaga ctctccaaac gaattaattc ctaatacaat agtaaaaatc 12 0 

gagattgatg aactacctgg aaaaagaata tatttcatag gagaagaatt ggatgtat cc 180 

gatatgacat tgaaagtatt ttattcaaac gaaacgtctg aaatagttcc tgtaaaaaaa 240 

gacgaagtca ctggattcaa cagtacggta cccgaaaacg atcagatttt agaggtacac 3 00 

aaaggcagtt ttaccgttac ttttaaaata caagtactga ttaatgatat tcaagcgatt 3 60 

tcaattaaga ctttaccttc aaaaaccgta tatacattgg gagagcctct ctccctcagt 420 

aatatggtac ttgaaataaa ctatgccgat ggtacgataa aagaaaattc agctccatct 480 

gctgattggg tacaaggttt caattcttcc gtaccggcac aacttcaaat agtgacactt 540 

gaattggatg gtaaacaagt atcttttgat gtgcaaatat tacctgtaaa agtagacgga 600 

gataaagttg taagtgtcat tgattccgac tttacatcaa taaccttccc ggatggtatt 660 

cgcacaatag gatcaaaggc cttcgaaaat aagaatatca aagcgagtga acttctgttc 72 0 

cctgcctctt tgagtacgat tgagcaggca gcatttgctt attgcagaaa tctgaaaatc 7 80 
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gtcgatttaa 
ataaaaaaaa 
tgtactgatc 
gctttcggga 
acatcggctt 
gtgattgacc 
atttaccaca 
ggaacccgga 
aaccattctc 
agcgctctga 
gacttcaatg 
atggttactg 
aaccttgtcg 
gccctttga 



gccacacatc 
tagcactgcc 
tgaatgttat 
aatccggtat 
tcatagagac 
tggaggcttt 
ttgaccgctc 
caacaccttc 
ctaaactcac 
acaagtgtca 
ctttcggaaa 
ccgactatta 
aaacatacaa 



gattaaggaa 
tgcttctttg 
cgacataagc 
atcttctata 
aaagaacttg 
ttccggcagt 
tttctacaac 
gcctgttgac 
tgttcttaaa 
agtaaaaact 
tgctgtttcg 
ccccgtagcg 
gcagaacaaa 



ttgccggaag 
gaaattgttg 
catacttccg 
tctttgcctt 
aaagaattga 
tccattcaga 
tgccccgaac 
aggacggcag 
attcccgcaa 
cttattttac 
ctggacgaaa 
ccaggaattc 
gcctggaagc 



aggccttttt 
gaaaggaggc 
tcaaagagct 
ccacttttaa 
ctctgcccga 
aagtaaccct 
ttactaccat 
caatagtgag 
gcatagctaa 
ccgcaagtgt 
tttcattaat 
aaaagataag 
cattcgctga 



attttccgga 
attttacggg 
acagaacgga 
gattgtaggt 
aggaagtgaa 
tccgaatact 
cgaaacttac 
cgaatgtttt 
aataggaata 
gaaggcatta 
gtcgcctacg 
agttccccaa 
aaaaatcgtt 



840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1569 



<210> 121 
<211> 978 
<212> DNA 
<213> B.fragilis 



<400> 121 

att atgaaga 

gccctgcaat 

aaacctgccg 

catcaactgc 

cgtgagtgga 

tggaatatgc 

ggagcagcgc 

tttttcctga 

gccgatacgg 

gttatagaaa 

atggcagtgg 

gactggaatc 

gctgcatgga 

agtgagaagt 

aactttttga 

ggtaaaaaga 

aagatgaagg 



aagaagattt 
gtctggttga 
gtcgcggaca 
ctttgctgca 
aagccgatct 
cacgtctggg 
ctataaactg 
aacatgaaat 
acaatgtaga 
ctgtagacgc 
caggcgaact 
agccggtgaa 
gtgagttggt 
tgccaaaggt 
gagtggctgt 
gactgaagac 
ctgtgtaa 



aagaattgta 
aggcggttat 
taaaattcag 
accggaaaaa 
acagattgtt 
aacttttaat 
ggcagtgatc 
agatacgggg 
gattgtgcat 
tattctggag 
tcgtccggct 
acgggtttat 
gaatccggaa 
gcatacgttg 
gccggatgga 
agatgaactt 



tatatgggga 
aatgtggttg 
tattctccgg 
ctgaaagatg 
gtagcttttc 
ctccatgctt 
aacggagaca 
gaagtaatcc 
gacaagttga 
ggaaaggtga 
ccgaagattt 
gattttattc 
ggggaagcgg 
gctcccggaa 
tttgtaaatg 
ttacgtggat 



ctccggactt 
gagtgattac 
taaagcaata 
aagaattcat 
gtatgttgcc 
ctctgcttcc 
ccgaaacagg 
agcaagtacg 
tgcatttggg 
aatcaatacc 
ttaaagagac 
gtgggctttc 
ttgtcgtgaa 
gcattgttac 
tcctttcgtt 
tccatctgac 



tgccgtggaa 
gatgcccgat 
tgcactggat 
tcaggcgtta 
ggaagtggta 
gcaataccgt 
tattactact 
tattcccatt 
cggtcggttg 
tcaggaagaa 
ctgccgaatt 
gccttatccg 
aatctttgag 
tgatggaaaa 
gcagttgcca 
tgaagcattt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

978 



<210> 122 
<211> 546 
<212> DNA 
<213> B.fragilis 



<400> 122 

agatttattg 

tatgctgtat 

ggcgttgaaa 

aagattttca 

gctcatgtag 

tctataccgg 

gtcgagtttg 

ttacaaggat 

gttgaagggt 

gaataa 



aacggatgga 
atacagcacc 
actatttgcc 
ttcctgttgt 
ccggtattca 
aagttcaaat 
cgccaaatga 
tggaggctga 
tgggatgtgc 



agaaacagcc 
gagagcagag 
tcttcaaccg 
tccgggatgt 
tggagtagct 
ggagactttc 
gtttgttcct 
gctagttgat 
tttggttaca 



agaaaaataa 
aagaaagtga 
gtagttcgtt 
ctatttgtgc 
tttttactga 
aagactatga 
ggaaccatag 
tgccaaggaa 
gtctcaacgg 



aagaaaatac 
aggaacagct 
tgtggaacaa 
acatctcctc 
aggaaaaggg 
tagagcactc 
tgcgagtaat 
ataataagtt 
attgtgtagc 



ttcttgctgg 
ggataagata 
tcgcaagaaa 
tgaggagatt 
acaatatgtt 
ttgcgaactg 
aagtggacag 
gttactgcga 
ttcaaaagag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

546 



<210> 123 
<211> 1026 
<212> DNA 
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<213> B. fragilis 



<400> 123 

aaagaaattg aaatgagtga agtaagacac gtgttaggta tatctggtgg aaaagacagt 6 0 

gctgctcttg ccatctacct aaaagataaa taccccaatc ttcatattga gtattatagc 12 0 

agcgacacca aatgtgagtt ggatgaaacc attcagttca ttgaccggtt gcgctcttac 180 

ttaggacaca taacgacctt aattgcggca gaaggaagtc ctgaacctac tccttttgac 240 

cactttctga aggtaagcgg tggctatctg ccatcggtac aagcaagatg gtgtacgcag 3 00 

aaaatgaaac tcgccgagtt tgagaaattt gttggcgaca ccccaaccgt ttcatacgtg 3 60 

ggtatccgcg gcgatgaaga ccgtgaaggc tatgtatcga caaagccaaa tatacaagcc 42 0 

atattcccgt tccgcaagaa tatctggagt atggatgtta ttcacgaggt gctgcatgat 48 0 

aagaacattg agaattttgc agaatgctat cgcaacgttg cagacgatga gacctatcaa 540 

acagttgaag cggctctcac ttcaaagctt accaagcact tctactactc aaagaaattg 600 

aatatgctac ttgatgctga tgtcatcacc tttaatcacg ccgtattcag ttttctgaag 660 

caatacacag attaccctgt gggaaagttg gactatttcc cattgattga caatgatgag 720 

gttttggtga gagaagaaat ctttcgcatc cttgaagata gcggcgtagg cataccagca 780 

tattacaacc ttatcgactt tgaggtggat ggaaagaaag gacagtattg ccgtagccgc 840 

tctggatgtt atttctgctt cttccagcag aagatagaat ggatttggct ctacgagcag 900 

catcccgacc ttttcaaaaa ggcaatggag tacgaaaaag acggatatac gtggattcaa 960 

ggcgagcctt tgagcgaact gatacgatcc ggagtcgtgt gcggcaaatc aagcttgacc 102 0 

agataa 1026 

<210> 124 
<211> 1182 
<212> DNA 
<213> B. fragilis 

<400> 124 

atgggcaatg aaaagaaaaa agttgtaaaa atagttccta cctattttga gcatgaaact 6 0 

cgggacctaa aagagatttc agttttaaat agtttaggat gtaatgttat tgtagtggcc 12 0 

aaaggagata atgctgtaat aattgaagag tcttgttata ttctgcatag attatgttct 180 

aggcctttga tgccttttgt ctcaaatctg tttctgaata gacttttttc tctttatata 240 

tgggttcgat acgtcaggaa gttgcatgga gaattgctga gttgccatga tttattttgt 300 

ttgtgcattg gttggttatc tacccttggt ttgcgtaaaa agcctttcct ggtctatgat 3 60 

tctcatgaat ttgagtatgg acgaaactgt aaacgaaatt ttgtttcaaa attgtttatt 42 0 

aaaactttag aaaggttctt gtgtaaaaaa accgctctta atattgttgt aaatgaatct 480 

attgcagatg cagtacaaac tcttcatggt ttgaataata gacctttagt agtccgaaat 540 

600 
660 



gtccctttat actggaatat agatgttaat aaatgtgtat tgagacggaa aaaaatatgt 

gaagcatatg gtattccaat tgatagtttt atcataatgt atcatggggt gattgcagct 

gggcgcggca ttgagaatgc aatttatgct gttgagaatg ttgaaaatac ttgtttgttg 720 

attttaggaa atggggaaaa aagctatatt gcgttattgg aaaaaatgat ttcttcttta 780 

cgattagagc aaaaagtgtt ttttcacaca gctgtggaac attcaatatt gtgggaatat 840 

attggtagtg ttgatgtaga actctctgtt atcttaaata cttgtataag ttattattat 900 

gctttaccta ataaaatttt tgaatctatt caagcgatga tcccactcat agtcagtgat 9 60 

ttccctgaga tggaaagggt tgtaaaaatg tatgatattg gagtttgttg taaatcagat 1020 

gatgtgaata gtttagtaga agctatacga ctaatgaata aggataaagt attatattct 1080 

cgttttaaag caaatatgca agatgccaag aaagaattat gttgggaaaa tgaaaaggag 1140 

attttagagg gagcttatcg ttcaatattg atggatatat ga 1182 

<210> 125 
<211> 1821 
<212> DNA 
<213> B. fragilis 

<400> 125 

aggtggagtg atcaagatta ttcgcgaatg atggaaaaag aaaagataag tttattacag 60 

cgctttatta tctggcgcga gaataaaatc aaagaaaagc agtttattct cattttaagt 12 0 

tttctggtcg gtatttttac tgccattgct gcactgctcc taaaattctt tattcatacg 180 

atacagaatt tcctgacaga taactttaat acgacggagg ccaactacct gtatctggtt 240 
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tatccggtgg 
atcagccatg 
agacataata 
gtaggagccg 
atgttcaaga 
atcggaggta 
atcgacctta 
gtttcgtata 
gagttggagc 
tatttcactc 
aagaagttgg 
tatggtgaag 
gatacggtac 
gtgctgatca 
ggcggtattt 
tttagcaatg 
atggcagggg 
gagctaacgg 
ctgacaatca 
cagttgctga 
gttgaaactg 
atttcaactt 
gtcgtgctac 
accgttagta 
caggtgatgc 
ggcaaatacc 
gtacatttct 



tcggtatttt 
gagtcacgaa 
tctggtcgtc 
aggcacctat 
tggagcaccg 
tttttaaagc 
ccatgtcgtc 
ttacgaccgg 
gtattcctta 
gtgctatgaa 
cattgggagg 
gttacgatac 
tgaataactc 
ttctgttgaa 
ttgcaccttc 
attttgactt 
ttatgagtgg 
gcggatatga 
tcgtgtttga 
cccatcacaa 
actttgtcag 
cgcatcgtaa 
tggacgatat 
aactgatgac 
agacttttga 
tgggatttgt 
cggaagattg 



tctggcagga 
gattctttat 
gaccattgcc 
tgtgttgacc 
tacactgatg 
gcctattgcc 
tttattacca 
acaggaggct 
tgtgattctt 
ctctgtggaa 
tgtgatgctg 
gatcgaacta 
gttgttttat 
agtctttgcg 
gctgtatctg 
tacctcgact 
agtcatgcat 
cctcttcctg 
accgcatagt 
ggataaagct 
cgtgcgtccg 
tatgtttcct 
caggaacatc 
ctcggtccct 
cgatacaaaa 
atccaaatct 
a 



tggtttgtac 
gcaatttcga 
agtgccatta 
ggatcggcaa 
ttgctggtag 
ggactggtgt 
ttgctgattt 
atgtttaaat 
ttgggaatct 
ggagtatttg 
agtgtgctca 
ttgttgaacg 
ggatacggta 
tcgagtgcga 
ggatgtattg 
ttgcccgaaa 
gcacctctga 
cctctgatga 
atctactcta 
gtattgacac 
gaaatggatc 
gtgacggata 
atgttccgtc 
gcccgtctgt 
gcatggaact 
aagatattta 



gcaatatcgt 


aaaggatgat 


3 00 


ggaggcaggg 


gcgtatcaaa 


360 


ccatcggttt 


cggcggatcg 


420 


tcgggtcgaa 


tttgggaagt 


480 


gctgtggagc 


ggcgggtgcc 


540 


ttacgcttga 


agtactgatg 


600 


cggctgtcac 


ggctgccact 


660 


ttcatctgga 


tcagcctttt 


720 


tttgcggatt 


ggtatcgctt 


780 


gcaaactctc 


caatccgtat 


840 


tcttcctctt 


tccacccttg 


900 


gcgtgagcaa 


tgccgactgg 


960 


atctgttgct 


ggtctatttg 


1020 


ccaacggtgg 


aggcggatgt 


1080 


ccggttttgt 


gttttcgcac 


1140 


agaactttgc 


gttgatggga 


1200 


ctggagtatt 


cctgattgcc 


1260 


ttgtttcggt 


cagttcgtat 


1320 


tgcgtttggc 


taaaaaggga 


1380 


tgatgaaagt 


tgaaaatgtg 


1440 


tgggcgaatt 


ggtgaaggcg 


1500 


aagacggggt 


cttgctgggc 


1560 


aggaacttta 


tcatcgtttt 


1620 


atgatacaga 


tagcatggaa 


1680 


tacctgtggt 


caatgaagag 


1740 


attcatatcg 


ccaggtattg 


1800 
1821 



<210> 126 
<211> 252 
<212> DNA 
<213> B.fragilis 

<400> 126 

aatcacggtg aaaagtcagg aggaagtgaa tgctattcct gtgggtattc ctctctctct 60 

ttggatgctt gtctgattaa agccaatgac tctgatccgg tgtatctgag tacgaacggt 12 0 

gtgaaaagtc atattaaatc ggtagaagat tttaataagg tcggttttga ttgggataaa 180 

atcaaggtga tgtctccggc agaagtggat gcaatcccta ctgctccgga atatgagatc 240 

gccaattggt ga 2 52 

<210> 127 
<211> 936 
<212> DNA 
<213> B.fragilis 



<400> 127 

tataaaaacg 

ttttatgaaa 

aagaacggac 

gatgaagaac 

gtgattgtac 

caaaaaatcg 

gtcgaagagc 

tattattatc 

gcagtagacg 

gaatacaatc 

gaaacgatcc 

aactacaatg 

aaagcacgtg 



aaattatcat 
atggagaggt 
tgcaaggagt 
gtatgaagct 
atgtaggtag 
gtgcatgggg 
tggtgaagta 
atattcctgc 
gtcgtattcc 
agtgtcgttt 
ttccatgcct 
gtgtaaatct 
aattacagaa 



ggaaaagatt 
taattatgaa 
atttattaat 
tgctgaacgt 
ttgctgtgta 
aattggtgcc 
ttgtgaagaa 
atttaatgga 
taactttgcc 
gtataaaggg 
agctatggga 
ggttggtatt 
tttctctcag 



attggattga 
ccaattgaag 
ggatcttccg 
tgggtagaag 
aaatcaagtc 
atggctcctc 
atcgcttgcg 
gcattcttgt 
ggaataaaat 
ggtaagtttg 
ggtgcccagg 
atagaagcat 
gaagttatta 



tcaatgcccc 
cgtatgctaa 
gtgaaggata 
tttcacctaa 
gcaagcttgc 
cttttcctaa 
gtgctcccga 
caatggttgc 
atacttttga 
atatgcttca 
gaggtattgg 
ggaaagcagg 
atgtcatttg 



ttttactccg 
gatgttagta 
tatgttgacc 
aggatttaag 
cgaacacgct 
agtaggtcgt 
tcttcctttc 
tttcttggaa 
aagtatgtat 
cggacaagat 
cggaactacc 
tgatcttgag 
tcatttccgc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 
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ggaaatatcg taggtggaaa acgaatcatg aagttgatag gattggattt gggtaaaaat 840 

cgtactcctt tccagaatat gacggacgat gaagaagtac gtatgaaggc tgaactggaa 900 

gctattcatt tcttcgatcg ttgcaataag ttttaa 936 

<210> 128 
<211> 1113 
<212> DNA 
<213> B.fragilis 

<400> 128 

tataagatgg aagaatataa aagatgtacg cgttgtgtaa tggataataa gtcagatgaa 60 

actataacat ttgataagca tggacgatgt aattattgca cagatgcatt aaatctgatt 12 0 

ggaaaggtct actttcctaa tgcggaaggc gaacagaagt tgcgtcaaat gattgaaatg 180 

cttaaatatg aaggaaaggg aaaacaatat gactgcttaa tgggaatatc cggagggtta 240 

gattctgcat atttagccta tttaggttct gtgaaatggg gattaagaat attggctgtc 300 

catgtggacg atggctatga tacagagtta gcaacatcta atataaaaaa cttatgtgaa 3 60 

gcctgtggta ttgaactgat ggtagaagct cctgattcgg agcaatttaa tgctatgaca 42 0 

aaggctttta taaaagctga ggttcctaac attgcaatac ctcaagataa tattttgttt 480 

gcttgcctat acaattatgc acgtaaatat aaggtttaca attttttatc gggtggaaat 540 

tttgccttgg agtgtgtgtt gcaaaaaggt aatacttatg aggtttttga tatgatccat 600 

aatagggata tacagaaaaa atttggttcg aaacctattg ataaactgtc gttcttatca 660 

tcttatcaaa agattgtgga tacgtattta tataaaataa aaagtttacg tcctttaaat 720 

tatattgatt ataataaaga atgtgcaatt catgaattga atgatttttg tggatttact 780 

tattatgaag caaaacattt ggaaaatata ttaacaaaag tgactcagtt gtactggttt 840 

tatcataagt tccatgtaga taaaaggaca tctcacttat ctagtttaat tgtttctgga 900 

caaatgtcta gagagcaggc tctagcagag ttagagaagc ctgtttatga taaaaataag 9 60 

atggaaaaag atattgagtt tgttttgaag aaaatagaga tgtctcgaga agagtttgaa 102 0 

gaacttataa atagaccagg gaaacaacat tcagattata gaatggacaa atttctacct 1080 

tttttacata aaataaaaac attttttgat taa 1113 



<210> 129 
<211> 1473 
<212> DNA 
<213> B. fragilis 

<400> 129 

gaaagatttt ggaaggtttg taagattcat tggaatttat ttgctgttaa tatacttagg 60 

tcttatttta ataaaatatt tatgacggca atttttatag tcgttttttc agttatttat 120 

ttattggtgc tatataactt ttatatagcg atttgtggac gaattagggt ttttactatt 180 

acatcttttt tttgtttatg ttacatatct tttgcttata tcggtagtat tctattgaat 240 

attatgcatt ttgaggctga agattatttg ggtatgtatg cccgtcctga tatttttttc 300 

cttgtttggg tatttacttt gttaggttta ctgttcttgt tattaggctt tgcgatagca 3 60 

aatatcgtat ttaaaaatat ttgttatccc agaaaaaata gagatctaca attaattaaa 42 0 

gtttcaatta gctgttttga taattcaaat aaaaatttct ttgttatttt atttcttttt 480 

attttaagtt tctttgtttt gcttgtttat agaaatgcaa ttggaggatt tccattggaa 540 

tctgttttct ctgctgataa tggaactgca cttgcctttt tgagaagtga ggctactaat 600 

aatttttctg ggaaatttta tagatatgta atgtttatgg agacattacc tttgttttta 660 

tttatagttg tttcttttat aaaaagttgt aagaagaaaa aatggaaata tttatatata 72 0 

gctttgtttc tttataatct tttttattca ttatctacta tacaaaaggc gcctatcctt 780 

aaatttttat tgttatgttg cattatcttc ttttataaaa atggatttat taataagaag 840 

ataatattaa aattggtcgt tttttcgtgt ggtttagttt tggtaatgta tatgtgtttt 900 

atggggttgg aggatgctcc tattgaagtt attattgaag gggctctaca tcgggtcttt 960 

attggcgcaa ttcatccttt ttattggtat ataaagtatg cggaagagtt cggatttttg 102 0 

tatggaactt ctttcccaaa tccagcggga atatttcctt ttgaatcatt tcgcttaact 1080 

gtagaaatta tgaattatgc gaaaggagat cttttagggg atttagtagg ttcaatgcct 114 0 

actgtttata ttggagaaat gtatataaat tttggactgt atgggttggc tttagctagt 12 0 0 

ttaatgtttg ggtttatatt acaaacatta gatattttat ttgttaggta tcttttagtg 12 60 

aataagagtg ttttagtttc aagtttatat atatatatga tttattattt ctcacagttt 1320 

acagaaacag gaataagtgg aataataata gatacagatc tttatatagt cttatttatt 13 80 



60 



tcatttattt attgtttgat aaatagatat aatttgagaa gatatgggaa aaaaaagggt 1440 
ttgccatgtt acaagtgtac atcctgcaga tga 1473 

<210> 130 
<211> 378 
<212> DNA 
<213> B.fragilis 



<400> 130 

taccgccttt 

tttgaacttt 

tcgatttatt 

aaatatcttc 

ggcaacgatt 

gttttgggga 

catcgtaggc 



gtaaaacttt 
ttttgaataa 
ccctcgtttt 
gaaaacaaat 
ttttcagcga 
actcttatct 
gacattaa 



tacgcagacg 
acttttatat 
tattcatcac 
ctcaaaacag 
atggcttcca 
tttgaattcc 



gtttatgcat 
aggttttatc 
ataacccgct 
cttcgagtga 
ggctttgttc 
tggcgctacg 



tagcggatat 
gatctgtttt 
atgcagctca 
aatttaaaca 
tgcttgtatg 
gggtaatagt 



tatgactctg 
gagttccttt 
caaaacggga 
gacctcaaag 
tttcgacaag 
cggcagtaac 



60 

120 

180 

240 

300 

360 

378 



<210> 131 
<211> 213 
<212> DNA 
<213> B. fragilis 



<400> 131 

tctatggatt gtatcatgca aaacaatata ttcgactatg ccgcactgct tagacaggtc 
aaagcaagag tggcactcgc ccagaaaaag gctatctatg ccgccaatgg agaaatgctt 
tctatgtatt gggacatcgg caagttattg tccgaaagcc aaacacaaat tggctgggca 
acaatacgtt ggagcagttg cccggtgatt taa 



60 
120 
180 
213 



<210> 132 
<211> 498 
<212> DNA 
<213> B.fragilis 



<400> 132 

aaaaggagat 

aagtccggtg 

cagcaattag 

gtggttcgta 

ctggacggat 

gaggaagagg 

actcgtcctg 

gcgagtggga 

gatgatccga 



gcaggaatat 
ataagaaatg 
ctgagatgat 
acttgatgac 
tgggtacttt 
tgaacccgaa 
cggctatcgg 
tcggtgcttc 
ctgcgtag 



ggcattattt 
gcacttgaac 
tgcggaaaaa 
agcgatgcgt 
tacgatgaaa 
tcaggtggca 
tactacccgg 
gggcaataat 



tataaagcag 
ttggtgaagg 
tcgtcgttga 
agtgcattgc 
gcgcgtacac 
gcccttcttt 
gctttgtttc 
ggttccggag 



taaaatcgac 
taggcaaggt 
ctccgggtga 
tggacagtaa 
ggggaagggg 
gtcattttac 
agggtgttga 
gtggtgatgg 



tatggcaact 
ggtatctact 
tgtgcataat 
gacggtacgt 
agtggacaaa 
tcctgaatat 
gttccagaaa 
agacatagta 



60 

120 

180 

240 

300 

360 

420 

480 

498 



<210> 133 
<211> 1251 
<212> DNA 
<213> B.fragilis 



<400> 133 

cctcttattt 

tggggggtag 

atgcaggtag 

ttcctttgga 

cgtaagtggc 

attgcagaaa 

ctttatattc 

ttagcggttg 

gctactgttg 



ttatcatgaa 
ccctactcaa 
atattgtgga 
tttatggcct 
tgattgtcgg 
catttaatca 
cggccggtct 
gtatccacat 
ccgctgcttt 



aaactcaaaa 
ttatatggac 
acttcagtcg 
tatgagcccg 
cagtcttttt 
ggttttttgg 
ttctcttatt 
gactggtctt 
ctcatggcat 



atttatcctt 
cgacaaatgc 
gcaaccaatt 
atttccggta 
gtctggtctt 
ctgcgtgcat 
gccgattatc 
tataccggac 
accacattcc 



ggatagtggt 
ttagcacaat 
ttggccgttt 
tgattgccga 
ttgtaaccta 
taatgggagt 
atactgaaaa 
aagctattgg 
attggttcgg 



tgccctcctt 
gaaagatgct 
aatggctgtt 
tagattgaat 
tttgatgggt 
gagcgaagct 
gtcacgttct 
tggatttgga 
tattgtaggc 



60 

120 

180 

240 

300 

360 

420 

480 

540 



61 



attgcctatg cattggtctt gattatattc cttcgtgaga atgaagaaca tgccaggggc 60 0 

attcgggcca tgcatacaga taaatcaaaa aagattccgt tgtttaaagg agtgactctt 660 

ttattcggta atattgcttt ttggattatt ctgttctatt ttgcagctcc cagtcttccc 720 

ggatgggcta cgaagaattg gttgcctacc ctgtacgctg agaatctcga tatccctatg 780 

gctgaggcag ggcctatatc cactataacg attgctgtct cttcttttat cggagttatt 840 

ctgggagggt tattgtcaga ccgttgggta tgcaaagaca tacgcggacg tatctataca 900 

ggcgcaatcg ggttagggtt gaccatacct gcgcttcttt tattgggctt aggcaatggt 960 

ttcatcagta tagtaggtgc aggatttctg tttggggtcg gtttcggtat gttcgatgcc 102 0 

aataatatgc ctattttgtg ccagtttgtt tcggccaaat atcgggcaac ggcctatggt 1080 

ataatgaata tgaccggagt ttttgccgga gcagttgtaa caagcttgtt tggaaaatgg 1140 

acggacggtg gcaatctggg attgggattt gctattctgg gaggtattgt attgttggct 12 00 

ttgggcatgc agttgtgctt tcttcgtccg cacacggata atatggaatg a 12 51 

<210> 134 
<211> 684 
<212> DNA 
<213> B.fragilis 

<400> 134 

ttcaaatatc gcaggttcaa cttgccagtt tcaatttgta aatatcacaa atatggtttg 60 

tttattccag agatttctgt aagtttgtat cgtactataa ctatcataat tatgcaagat 12 0 

ataataaacg ggcgttgcgg ttggtgcgga agtgacgaac tgtatgtgaa gtaccatgat 180 

caagagtggg gaaaattggt gaccgatgac aagacgctgt ttgagtttct tgtgttggag 240 

agtgctcagg ccggtttgag ttggataacc atccttaaga aacgtgaggg gtatcgcaaa 3 00 

gccttttgca atttcgatgc tgagtcggtg gcacaaatga ccgatgaaga tgttgaacgg 3 60 
ttgatgcact ttgatggcat tgtgaaaaat cgtctgaaga tcaaatcgac catcacaaat 
gcaaggtcat ttctcgccgt acaaaaggag ttcggtagtt tttatgacta tactctatca 

ttctttcccg acagaaaacc gattgtcaat acatttcaat cattgagtga gattccggta 540 

tcatctcccg aatctgatgc catgagcaag gatatgaaaa aacggggatt taaattcttt 600 

ggaactacga tttgctatgc tcacttgcag gcctccggat ttatgaatga tcatctggtg 660 
gattgcatct gccggaagag gtaa 



420 
480 



684 



<210> 135 
<211> 222 
<212> DNA 
<213> B.fragilis 

<400> 135 

cacccatgcc gcactggaat gggtaggggg attgtacccc taaatcaaag cttaaatgaa 60 

aaagccgtgg tcattaccga cttcaccgat gaaaacggta tcgaccggat gaaggagcag 12 0 

atacaggaga agtacaaccg tatcaaagcc gacgtgcgtc agattgtcgc cgacgaattg 18 0 

caacgcatcc agaacgatcc tgcattggca catctcattt ag 222 

<210> 136 
<211> 630 
<212> DNA 
<213> B.fragilis 

<400> 136 

aatgatatac gctgcaaggc aaacaacaga attagcaaac ttgaccggaa ggtgtttcac 60 

taccccggtg tcccccaatt atatcctttt gtgaacaata gcattaataa atcctggtac 12 0 

gcccttcgta tcacctatag ccgtgagctt gcctttaagg aatacctgga ctcccgcgga 180 

gtgaggaatt ttcttcccat gcgctatgaa tacgtattcc gtggtgagcg taagatccgt 240 

aaattggttc ctgttgttca caacttggtt tttgtttatg ccactcgcag tgaggttgac 300 

gaaatgaaat ccactgtcgg ggcttctctt cctattcgtt atatcatgga ccgtgagacc 3 60 

cgtcagccta ttaccattcc tgaagtccaa atgcgtagtt ttatcgccgt tgccggtaat 420 

tacgatgaac aggttgttta tctggatcct tcagtcgttt ccatgaaaag gggagaccgt 480 

gtccgtgtca ccggtggcat cttcgaggga gttgagggtg agtttgtccg tatcaaaggt 540 

gaccgccgtg tggtggtttc catccagggg gttatggcag ttgccacggc cttcattcat 600 
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ccctctttaa tcgaattaat aaagaattaa 63 0 

<210> 137 
<211> 1236 
<212> DNA 
<213> B. fragilis 

<400> 137 

aaccggggac gaaacaaaaa tctataccca ttagggcata ttttactcat tttatcggat 60 

catatccacc actctcctga tatcctcatc tgtcagtgta ggtcccgaag gcagacacaa 12 0 

gccgatatcg aacaacctct cactcgtacc attcccatag aacggagcat ccgtgaacac 180 

aggctgaaga tgcatcggct tccacaaagg acgtgtctct atgttctcgg agtccagctt 240 

caaacggata tcctcacgac tcttacccgc aagcttcgga tcaaccagaa tacaagtaag 3 00 

ccaaaagttg gaagcaaacc gcgaatcagg gttctccatg accgtaatac ccgccacatc 3 60 

tttcagcaaa tcaacataca aagagtgaat ggcacggcga cgggcaatat gttcatcgag 42 0 

gacaaacatc tgcccacgac cgatacccgc acagatgttg ctcatacggt aattgtaacc 48 0 

gatatgggta tgctggtaat gcggagcggc atcacgagcc tgcgtagcgt agaactttgt 540 

ctgtcgggcc tcctcttccg tacgacagat cagagcacct ccaccggaag tcgtgatcat 600 

cttgttgcca ttgaaagaga gagcggccag ttcaccgaat gtgccacact tccgtccgtt 660 

caattccgaa cccaaagcct ccgcggcatc ctccaatacg gggataccat aacgacccgc 72 0 

aatatccatg atctcgtcca tcttggcagg cataccgtaa aggtggacag gaataatagc 780 

cttcgggagc ttacccgtct tgcgcaaacg gtcctttata gcctcctcga gcaataccgg 840 

atccatattc caggtatcct tctcactgtc cacaaaaaca ggtttggcct ccagatagga 900 

aatcggattg gcagaggcgg caaaagtaaa gctttggcag atcacttcat caccgggctt 960 

cacattcaga agaatcaagc ccaagtgaag tgcagccgtt ccagcactca aagccactac 102 0 

acgacggtct tcatgcaaat attcggccaa agattgctca aaagcatcca cgttaggtcc 1080 

caaagggaca acccagttcg tatcaaaagc ctcttttata aagtcttgct cacggccacc 1140 

catgtgagca agcgaaagcc agattcgttt gttcataata aactatttat tataatttca 12 00 

ttttcttacg taatagggaa tcaagaggaa aagtaa 1236 

<210> 138 
<211> 2316 
<212> DNA 
<213> B. fragilis 

<400> 138 

aatcaattgc gccaactgct ttatataata tataataagg tatgtcctat gctgaaatct 6 0 

gatgtgatat ggccaaatag ccgacgattc aagtcgcgga cagaatggga gcctttgggc 120 

ttcttctccg aagctttgtg taattccacg caatttgatc tgaagctggg cttcttttcc 180 

tcatcggcca tcaatgtact ggcagacggt ttcgctacgt ttctctataa tggaggaaag 240 

atgcgcatga ttatcaacga tattttatct accgaggata agcgtgcgat aattgtagca 300 

gactcgtgcg acgatgtgga ttacttcaac ctgcaagatt tgggtggtat gagcgacacg 360 

ttgtctaagc gcaaccagca tttcttcgag tgccttgctt ggctgattcg tcataaccgc 42 0 

attgagataa aagtggttgt accaaaagct ggagagggca tagcccattc caagtgcggc 480 

gtgttcttcg atggactgaa ccgtgtggca ttcgatggct catgcaactt ctcgaagacg 540 

gcacttattg ccaacatcga gagcatcact gctttctgcg attgggacgg gcaaagcgat 600 

gtgtgtcgca ttaaagatgt tgtggacgat ttcgaacgca ctttctctgg taacgacgag 660 

agcgtgactt atcttaatac agaccatatc cgcatacata ttactgacac ctacaaaaac 72 0 

aaagatatac aagaactgct ggcagacgaa gcacaactca tcaatgaccg attggagaat 78 0 

gacttgccta aaactgtgac cgcgttcctc ggtcgggcca agaataaggt gaaaagcatt 840 

atcgagcgaa tccatcaaaa tgagatacaa aggggaaagg aagctgcacc tcggtttccc 9 00 

tactcgcaag gacctcgcga ataccagcaa cttgcgtttg agaactggaa agcaaacaag 960 

caaaaggggc tgtttgcaat ggcaacaggt acaggcaaaa ccatcacgtc gctcaactgt 102 0 

ttacttgaaa tatataagcg gtgcggctat tacaaagcca taatccttgt gcctacgatt 1080 

acgcttgttg gccaatggga agaggagtgc aagaaattca atttcaagaa cgtcatacgg 1140 

gtttgttcta agaactccaa atgggcggag cagatagaaa cgattacatt aagcgaacga 12 00 

ttgaaaggga gtgacaacaa tctatcatac ataatcattt ccacatacgc ctcgtttatc 1260 

aaagacaagg tcttcaagtc gttgtctgtg ttcccgaaga caaagttgct gctgattgcc 132 0 

gacgaagccc acaatatggg ctcacgccgg atgttgaaca tcttggatgg catcccttat 13 8 0 
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ttgcggcgca taggtttgtc ggctactccc gaacgccagt ttgaagaaga agcgaaccag 1440 

acgctttatc atttctttgg cgcagaaaat ggctttactt acgagtattc gatgcaggag 1500 

gccatagaca agggtgtttt gtgccgatat tattattatc cgcatgtcgt gcgtctgacg 1560 

atgtcggaga tggaagaata catgagaata tcggtacaat tggctaaatt cttcaataac 162 0 

aaccattttg cggatagtaa tgagatactg accgcactgc tactgaaacg caagcggatt 1680 

attcataagg cagagaacaa attggaggtg ttccggaata tacttgaaca gcgtttccaa 1740 

gaaaaaggta acttgaaata tacgctggtc tatgtcccgg agggattgaa accagatacg 1800 

gcagatgcag acgtttacga tgatacagac cagttacaag acgatgacta ttccgaaaag 18 60 

ctcatcaatg aatataccgc tgtagtgagt ggcattgaca gcaaagtcac agtacgtaag 192 0 

tttacatctg gcattaaaga acgtgaagaa ttgctgaagg gatttgccga cggtgatata 1980 

gaggttctga cctcgatgaa atgtttggat gaaggcgttg atgttccgcg gagtgaactt 2040 

gccattttct gcgcaagcac aggtaatccc cgacagttta tacagcgtcg aggaagaatt 2100 

ttgcgaaaac atcccgacaa gcacatggct gtgatacatg atttggtggt ggcaccagaa 2160 

gttaatatcg gtgaaggctc atatgctatg gaacgtagtc taatggcaac ggaattacgt 2220 

cgagtcagga atttctcgtt gctttcggaa aacagcgacg acactatcaa cgaattggag 22 80 

gatataatga attactataa cttatcattg ttttaa 2316 

<210> 139 
<211> 279 
<212> DNA 
<213> B.fragilis 

<400> 139 

ataaaaaagc gattcttatt ttgtgaaata ttctgtttgc tcaactccag tattgtacta 60 

tccattaact caattgcaac cggagcatgt cgcaaagcca caagatttgc agcaaaagat 120 

tcttccaatg tagaacaatg tacacatatc acggcttttt ccgtaggagg aagaggaact 180 

aattttagtt ttaactctgt gattaaggct aatgtacctt ccgaacccgc taacaatttg 240 

caaagattga aagactcaga acaattctta tcaaaatag 2 79 

<210> 140 
<211> 597 
<212> DNA 
<213> B.fragilis 

<400> 140 

aagatttttt ttaataaata ttccatgcaa gattattttg ctcatgaaac agcaactgtc 60 

gatgacggtt gccgaatcgg tgcaggcaca aagatatggc attacagcca tataatgacg 12 0 

ggatgtgtgc ttggtgaacg atgcaatatc ggtcagaatg tggtaatttc tccagatgtg 180 

gttttaggaa ataatgtcaa ggtacagaat aatgtatcgg tttatacagg tgttacttgt 240 

gaagatgatg tttttctcgg tccttcttgt gtctttacca atgtgataaa tcctcgtagt 3 00 

gctgtcaatc gtaaatcaga atatgctaag actcgtgttg gtaaaggagc tacaataggt 3 60 

gctaatgcta ctattgtatg cggacatgat attggtgaat ttgcctttat tggtgccggt 42 0 

gcagttgtta ctaaaactgt tcctccttat gctctcttgg tgggtaatcc tgcccgtcag 480 

ataggttgga tgagtgagca tggatatcgt ttagaatttg atgagagagg gatagctgag 540 

tgtttggaaa gtaaagaatg ctatcagctt agagatggca aagtattcaa aatgtaa 597 

<210> 141 
<211> 225 
<212> DNA 
<213> B.fragilis 

<400> 141 

atccggctaa gaattggaat tgataagagt gaaattaaat tagcaaaaac atttcctgaa 6 0 

aacaagacta caacgctttt caaaaacggt gagctgtcaa tctttctttt aattcggatc 12 0 

attactctca tatatccatc aatattgaac gataagctcc ctctaaaatc tccttttcat 180 

tttcccaaca taattctttc ttggcatctt gcatatttgc tttaa 225 



<210> 142 
<211> 534 
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<212> DNA 

<213> B. fragilis 



<400> 142 

aaatcctcaa 

cgttgggccg 

aaatgtcata 

aagatagagg 

gatacggtaa 

ctgatagatg 

gctgacaatg 

ccggatgttt 

gaagtgaatg 



gagacaaaaa 
cagagtataa 
cagattccat 
gtgatcattt 
tgcattatat 
gacttcatcc 
cggatgcggt 
attttgcctt 
ctattcctgt 



tcgcttcttc 
agacggtgtc 
cttgaattac 
tgtccatgct 
cggatgggac 
tgatctggct 
ttatttggtt 
atttcctgcc 
gggtattcct 



gtagactatc 
ttttatcatt 
atatcggatg 
cccaatggtg 
ggtcgtaaat 
tccgactgtc 
caattcggta 
tgggataaaa 
ctctctcttt 



ttacctggaa 
atgaaaatgg 
cgggtgagaa 
actattcacg 
ggcgtgccga 
cagaaggtat 
gcctccatca 
tcacggtgaa 
ggatgcttgt 



caccaatgga 
tgatacgact 
ctggcagatg 
tgcgcatact 
acttttgact 
gctcctcaaa 
cattcccaat 
aagtcaggag 
ctga 



60 

120 

180 

240 

300 

360 

420 

480 

534 



<210> 143 
<211> 183 
<212> DNA 
<213> B. fragilis 



<400> 143 

atctcagcaa gtcttttact acatcccata acgttagtcg gattgacagc cttatccgta 60 

gagaccatca caaacttttc cacaccatat ttgacagcta agtcagccat aatacgagta 12 0 

cccagcacat tcacctgtat ggcttcagat acattatctt ccatcatggg cacatgttta 180 
tag 183 



<210> 144 
<211> 1341 
<212> DNA 
<213> B. fragilis 



<400> 144 

agatggcaaa gtattcaaaa tgtaatattt aattttaaaa ggccaagagt tatgtataat 60 

aaattagtaa ataaagaagc taaattagct ttggtaggtc tgggttatgt aggacttcct 12 0 

atagccttgg agtttgccca aaaaatatca gttataggtt ttgatataaa cgaggaccgt 180 

ttggcgaaaa tgcgtgaagg aattgatccg tgcggagaat tggatagttc tgcttttgaa 240 

aatgtagata tcgaatttac ttcctctatt gaaaagttga aagaagcttc tttcttcata 3 00 

gtggctgttc ctacaccaat tgataaatat aataaaccgg atttaactcc attgctgggt 3 60 

gcttcccgtt ctgtagccaa agctttgaag ccgggagatt acatagttta tgaatctaca 42 0 

gtttatccgg gttgtacgga agaggattgc cttcctgttt tagaagaagt tagtggcttg 48 0 

aaagctggta tcgattttaa atatggttat tctcctgaac gtattaatcc tggtgagaaa 54 0 

gtacatacgc ttcctaatac tattaaaata gtttccggtt gtgatccaga ggctttggat 600 

acagttgcta gagtttatga attagttgta aaaccaggag ttcatcgtgc tccaaatgta 660 

aaagttgctg aagctgctaa aatcattgaa aatactcagc gtgatgtcaa tattgctttg 72 0 

atgaacgaat tatctattat tttcagtcgt atcggaatta atacttacga tgtattggaa 780 

gcagccggta ctaaatggaa tttcttgaaa ttttatccag gattagtcgg aggacattgc 840 

attggtgttg atccttatta tttggttcaa aaagccagtg aactgaagta tcattgtcag 900 

ataatcagtg caggtcggta tatcaatgat agtatgggag gatatattgc caagaagctt 9 60 

gtgaaacgtt tgatttcttt aggtaaaggt gtattaggtg ctcgtgttct agtgatggga 102 0 

gttactttta aagaaaatgt agcggacatt cgtaattcta aggttgtaga tattgtcaat 1080 

gaattgaaag attttggttg cgatgtggac gttgttgatc catatgcaga cagtgatgaa 1140 

gtacatagag agtatggatt tcgtttagta gagaaaccga gggataatta tgatgcagta 12 0 0 

attgttgccg tagcacatga tgaatataaa aatttagagg agaagtattt taaaaatatg 12 60 

acctatgatc atgccgtact tgtagatatt aaggggatgt atcgtgatag gattcataaa 1320 

ttaaagtatt ggagtttgta a 1341 



<210> 145 
<211> 1113 
<212> DNA 
<213> B. fragilis 
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<400> 145 

gaagatatgg gaaaaaaaag ggtttgccat gttacaagtg tacatcctgc agatgatatc 60 

agaatattac acaaggaatg tgtctcgtta agtaatgctg gttacgaagt ttatcttgtg 12 0 

gctcctgagg tgtcgaatca gttaaaaaat ggaattcaaa ttataggggt actcaataag 180 

cctgtcagtc gatttcatcg tatcttattt tatattagat atgtctataa gaaagcatta 240 

tgggttaatg cagatatata tcatttgcat gatccggaat tacttcttta tgcattgtta 3 00 

ttgaaaaaaa aaggaaagat agtcattttc gattctcatg aagatattcc tcgtcaaata 3 60 

ttgtcaaaag aatggattcc tttctttatc cgtaaattta tatctttctc atatactaaa 42 0 

tatgaaaagt ttatattgaa acaacttgat gctattgtaa ctgtaaatca agatatagct 480 

tctagattgg ttcaatataa taagcgtaca tatgttgttt ccaattatcc tgtatttagg 540 

aataatgtag aaagaagttc cgtgatggaa aggactattg gttttgcagg taatataaag 60 0 

caagagtata tgcatgagaa tatccttatt gcattaacta atttgggaaa tgtccgttat 660 

ttattggctg gtaatgctga ggagggttat ttaaaacaac ttcaaacttt taaaggatgg 72 0 

gattttgtcg atttctacgg acggatatca aaagaaaaag tattgcttct ttatgataaa 780 

gttgctattg gtatggccat tcatgattat actttaaatg ttggagggaa gaagggaggt 840 

ctaggtttta ttaagaattt tgaatatatg gaagcaggaa tacctttaat ctgtacagat 900 

tttgatattt ggaaagaaat agttgaagag tattattgtg gaatatgtgt aaatcctcat 960 

gatgtaaata gtataactgg tgctatacaa tatttaatag ataatcctgt tattgctcgt 1020 

aaaatgggag ataacggtcg tagggcagtg aaagaaaaat ttaattggga aacacaagag 10 80 

gagatacttt tgcaattata tgatagttta tga 1113 

<210> 146 
<211> 543 
<212> DNA 
<213> B. fragilis 

<400> 146 

tgtgacttta tgaatgatgg tgagcggaaa gaaactgttt tatctttttt ttataggaaa 60 

attcttaaaa aatcatct cc tccatattat tgttattatt ctttattgac tatttgtgcg 12 0 

aaacctattc gcaagtggtt ctcagtagtg gtaataccca tcattccttt ttctaattta 180 

cgtgtacagt gttatcggtg gtgtggttat aaaatagggc gtcatacttt tattggtatg 2 40 

cgttgttatt tggatgatat gtgttatgat ttgattgaaa taggtgagaa tgtgaccata 3 00 

tcttatggcg ttttttttgc atgccatggt cgtaaacagg ggcataatag aattattata 3 60 

aaagatgggg catatattgg catgaatagt tctattatat ctcggagaga agaaggtttg 42 0 

attattggaa aagaggcaat agtgggtgca tgtagtttag taaatagatc tgtaccagat 480 

aataagactg tagttggtgt acctgctaaa gaattaaatg ctgttctaca cgggaataaa 540 

tga 543 

<210> 147 
<211> 1200 
<212> DNA 
<213> B. fragilis 

<400> 147 

aaggagatta tcatcctttg gcaaaacttc ctttgtcaaa gcatccgttt ggctggtaat 60 

aaagtaacag atataattat gaaacttcaa atggttgatc ttcacggtca atatcttaat 12 0 

attaaaccgg aagtggatgc cggtattcgg caggtcattg aaacttccgc ttttatcaat 18 0 

ggtccgcagg tcaaggagtt tgcggagaac ctgaaggctt acatgggtag caagtatgtg 2 40 

ataacttgtg gtaatggtac agatgcactt caaatagctt taatggcatt ggatttgaaa 3 00 

cccggtgatg aagtgattgt tcctgctttt acctatgttg cttctgccga ggtgatcgga 3 60 

ttattagggc tgattcctgt gatggtggat gtggattatg ctaccttcaa tgtaacggtt 42 0 

tccaatctgg aaaaggcttt gagtcctaaa actaaagcga ttattccggt gcatctgttt 480 

ggccagtctt gtgatatgga acctattatg cagtttgcca aacagcatgg tatttatgtg 540 

attgaagaca atgctcaggc tattggagca gtatatactt tctctgatgg tagtaagaag 600 

catacgggag ctatcggtca cataggctgt acttcttttt tcccttctaa aaatctggga 660 

tgttatggtg acggtggagc tatttttacg gatgacgatg aattggcaga acgtttgcgc 72 0 

atgattgcca atcatggaca acaagtgaag tatcatcata aagtcatcgg atgtaattcg 7 80 

cgtttggata ctcttcaggc tgcgatactc aatgttaaat tgaaacactt ggatgaatat 840 



66 



agccatgccc gtcatgaagc ggcacaatat tacactttcc agttacaggg ggtgaaaggg 900 

attattactc ccgaggaact tcctttaagt actcatgtct atcatcaata tactttaaaa 960 

gtactggatg gcaaacgtga cgtgctaaag cagcatcttg ctgatgcggg tattccgagt 102 0 

atgatttatt atccgttgcc tttgcagcaa caggaggctt ttcagactat cgcacgtgca 1080 

gcagaaccat tagatactgc tgaaaaactg gcatattcag ttctttctct tcccattcat 1140 

accgaactat ctactgaaca acaggattta gtcatcaata gtataaaaga ttttttttaa 12 00 

<210> 148 
<211> 1122 
<212> DNA 
<213> B. fragilis 

<400> 148 

gtaataacta tgactgaaga taagaatata aataaaacga ctccgcaatc tgaggaacaa 60 

gaaattgatc tgatagagtt ggctcagaaa gtttgggccg gtcgtaaact agtattaaag 12 0 

gtttgtggtg ttgccgtgtt agtaggactt gtagtggctt ttagtattcc taaagagtat 180 

tctacaagtg taacactggc accggaaaca ggtagcaagt cttctactgg aggcatgggg 240 

gcattagccg ctatggacgg tattaatctt ggcagttcaa ccggagaaga tgcactttct 3 00 

cccgaattgt atcctgatat tgttagttcc acaccttttc tattggaaat gttcgatgtg 3 60 

aaggttgctg atcagaaagg taagattaat acaactttgt atgagtactt ggataaatat 42 0 

caacgggctc cttggtgggg agcggttgct tcagctcctt tcaaagcatt aggttgggtt 48 0 

gtatctttgt ttaaagatgc accggaggaa cagggagatg caaagataga tcctttctat 540 

ttgactgcag atcaagcagg aatagcagat gctttgagtc atcgtatatc tgtttcggta 600 

gataagaaaa caggagtgac tacacttact gtgacaatgc aggatccatt aatttctgca 660 

gcattaacag atacggtaat gcattgtttg caaaattata tcacagatta tcgtaccaat 720 

aaagcgcgtc atgatttggc ttttactgag aaactattta atgaagctca ggagaactac 780 

tatgaagcgc agcagaaata tgctcgtttt atggatggta atcaaaatat cattatgcaa 840 

agttttcgta cagagcaaga gcgtttgcag aatgagatga atttagctta tggagtattc 900 

actcaagtgt cgcaacaatt gcaattggcg aaagctaaag tacaggaaat aactcctgtt 9 60 

tatactgtag tacaacctgc tacagtccct ttgagaccgg ctaaacctaa taaaatcatg 102 0 

attttaattg gttttgtatt cttagcgggt gtaggtagta taggatggat tctctttgtt 1080 

aaagatttat tgaacggatg gaagaaacag ccagaaaaat aa 1122 

<210> 149 
<211> 681 
<212> DNA 
<213> B. fragilis 

<400> 149 

ttttgtgcta ttattttaag aatgatgaat atgaaaccaa ttatatcccc ttctatcctt 60 

tctgcagatt tcgcatatct ggcaaaggac attgagatga tcaaccgtag tgaagcagac 12 0 

tgggtacaca ttgatattat ggacggagta tttgtgccga acatatcttt cggctttccg 180 

gtactgaaat atgtagctaa gttaacttca aagccgttgg atgtacatct gatgatagtc 2 40 

aatccggaaa agtttattcc tgaagtgaag gcattgggtg cccacatcat gaatgtgcat 3 00 

tacgaggcat gtcctcactt acaccgggtc gtgcaactga ttcgtgaagc aggtatgcaa 3 60 

ccagcggtca ctatcaatcc ggccactccg ataaccctgt tgcaggatat tatccgggat 42 0 

gtatatatgg tgctggttat gagtgtgaac cccggatttg gcggacaaaa atttattgaa 48 0 

cactcggtag agaaagtgaa agagcttcgt gaactgattg agcgtaccgg atctaaagca 54 0 

ctgatcgaag ttgatggagg ggtaaatctg gaaacaggcg cccgtctgat agctgccggg 60 0 

gcagatgcat tggtggcagg aaatgctatc tttgctgctg agaatccgga aggaatgatt 660 

cacgccatga aagggctgta g 681 

<210> 150 
<211> 1047 
<212> DNA 
<213> B. fragilis 



<400> 150 

atattctata tgaataagaa aagaaagaaa atatttctca gcatactggc tacttttttc 



60 
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ttcatttgta 
ccaagtaaga 
aaaataaaaa 
cgtgaatata 
tatcaattat 
agtgtccgaa 
gctgaaattg 
gccaccatcc 
gcagactttt 
tcaaaggccc 
gaagaagaaa 
ttacatgccg 
gggttacgca 
aatgcgggac 
ttgaattatg 
cataattttg 
ctgaatgaaa 



tcgccggtgc 
caacttatat 
agcaaggaaa 
gtaaaaatat 
acagtagatt 
cacttgacag 
ccatggcact 
cctgcttatt 
tagcccgaat 
aagcaatagg 
ccaacaacaa 
gcatgcccct 
gaatcaccaa 
tgcctccggg 
taaagcataa 
cctccaacta 
gaaagatttt 



aggaacagtc 
ctatatagac 
ccctcatagc 
ccataccgga 
atcaagaggc 
attagtccgc 
atacgactct 
tattcccgaa 
gaagaaagag 
gatgactcct 
tgcagaaaag 
gcaagccgac 
tcaacactta 
tcctatccgg 
ctatatctat 
tgcagatcac 
taagtaa 



tattactacc 
cgcgatgata 
tttaatggct 
cgttatgcca 
tatcaaactc 
agcgtaggga 
atttttctgg 
acatatcagg 
catgataaat 
gaagaaattt 
cctatggttg 
ccgactatca 
gacgtacagt 
ataccatcac 
atgtgcgcaa 
atggttaatg 



tattttaccc 
ctacagactc 
tcaaatggat 
tcaaacccgg 
ctgtcaacct 
aacagttaat 
aaaaaatggg 
tatattggga 
tttggaacaa 
gcacgctggc 
caggattgta 
aattcgcact 
ctccctataa 
ccaaggggct 
aagaagattt 
caagaaaata 



ccagtttcat 
catcttcaat 
gtcccatttc 
agatagcact 
gacaattgga 
gatagattcc 
atacacagaa 
tgtcagtgca 
agaccgactc 
ctccatcgta 
catcaaccga 
acaagatttc 
cacttacctg 
ggacagtgtc 
ctccggtacg 
ctggaaagcg 



120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1047 



<210> 151 
<211> 891 
<212> DNA 
<213> B. fragilis 



<400> 151 

attttcaata 

tttaagagct 

ctctgtacac 

gttgcacgtc 

ttaacagagc 

atccccaatt 

aacgaacgat 

tcccgttccg 

gaagccatga 

cgtcggaaat 

atccttatta 

gtgactgttc 

gctatcggat 

tctattgaca 

ttagaaaatg 



tcatgaaaaa 
tttggataat 
cggaaattca 
ccaattattc 
agataattca 
gggtagctta 
tcatcgccga 
gatttgacaa 
aagaatcctt 
ggaaaacact 
tttgcggccc 
catcaaagtt 
ttatttttaa 
gcattgaaca 
aaatagaaag 



taagcgaaaa 
agctctattt 
agctgtattc 
tcacgatgaa 
tcacaaaggt 
cgaactcacc 
tcctgtcgtg 
aggtcatatg 
ttatttcagc 
ggaggacaag 
ggtcacgaat 
ctttaaggtc 
gaatgaacgc 
actcaccgga 
tcggatagat 



agaccatcta 
gcgattttac 
tttcaggcaa 
aatctgaaga 
tatactgtgt 
aagcaaaaga 
aaaggaggta 
gcacctgctg 
aatgtatgtc 
gtccgcgaat 
aaaaaatctc 
atcctctctc 
gcaatagcac 
ctggatttct 
accaccttat 



aaaaacaaca 
cattaatcta 
ccaaagtatc 
ttccggtttc 
cttataataa 
cacaagggaa 
tggcaaacaa 
ccgacatgaa 
cgcaacatcc 
gggctgtagc 
cggtaatcgg 
ttcacggctc 
ctttacgaaa 
tttcttcact 
ggagcatcta 



ccacaattcg 
cggagtctat 
aagaccgaat 
ccaattccca 
ggataaaaag 
tataaaaaga 
ttctgattat 
atggagtaat 
cgaacttaac 
cgatagtgca 
caaaagccgg 
cactcccaaa 
ttatgccgtc 
ccccgattct 
a 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

891 



<210> 152 
<211> 1233 
<212> DNA 
<213> B. fragilis 



<400> 152 

tttatgaata 

cgtccttatt 

gctaattatt 

atacattatc 

tcaatgtttt 

attccagatc 

atagcccaat 

ccaattgaaa 

gaaaatgatg 

cat atggtct 

ccggaagagt 

ttgaaaaata 

ttggattatt 



tacttcttat 
atctttctaa 
ctcatttaag 
gttggatatc 
gttttgttct 
ttgtaattgc 
attatcatgc 
taggaggata 
catataaatt 
ctcatggaat 
ggacatctca 

aggggaaaaa 

tgttggaggc 



taatcattat 
agaatgggta 
gattaagcaa 
agcaggaagg 
taaattaagg 
atcatctact 
aaaacttatt 
ttctaaatat 
tagtgataaa 
ggatgcaaat 
atgtgatctt 
ggttattggt 
aatgaaaatt 



gctggatatc 
aggatggggc 
cctttagata 
tatagcggaa 
ttgtatttcc 
tatccattag 
tatgaggtac 
catcctttca 
gttatatcgt 
aaatttgttt 
tcgcctttac 
tatgcaggtg 
gtctttgata 



caaatttagg 
atcaggttag 
gctttagtgt 
atggtgctaa 
ggaactatct 
atatctatcc 
atgacttgtg 
ttgcactgtt 
tacttccaaa 
atatccccaa 
atatgcaatt 
gccatgcgaa 
aaaaccagaa 



aatggaatat 
agttttggcc 
aatagatggt 
gcgtgtatgt 
cgatggtttt 
agctcataaa 
gccattatct 
gcaaaaggca 
tgcatgttca 
tgggtatgat 
tatatccgaa 
atcgaatgca 
tatagtatgt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 
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cttttggttg gtaatgggca agaaaaggga cgtttagtag aacgtgttca aaaggaaggc 840 

attaagaata tttattttct ggatccagta cctaaaaaaa aaatacctga attattaaat 900 

cagatggatg tattatatat tggttgggag aaaaacccat tatatcgttt tggtatatct 960 

cctaataagt taattgatta tatgatgtct cagaaaccga tattgcattc ggtttgtgcc 102 0 

gcaaatgatt gggtaaagga agccgattgt ggaattacgg tgaatgcgga gtcgccacaa 1080 

gaaatagcgg caggtattat agaaatattt tcgttttcag atgtagagtt aatcaataaa 1140 

gggggtaggg ggagaaaatt tgcagaagag aatttaagtt atcctttcct tgcaaagaag 12 00 

ttcatcgaag aatgcataaa caatagagtg taa 1233 



<210> 153 
<211> 1002 
<212> DNA 
<213> B.fragilis 



<400> 153 

aattactaca tttgcaacca aactaaaaaa actttgatca tgccgaactt ctttaaatct 60 

ttttttgcgg ggaaaacaga aaaccctgag gaagaaaaac aaaaaaacgc caaaaagaac 12 0 

tttgagatat ttaaatatga cggcctgcgt gcccaacgta tgggacgtcc ggactatgcc 180 

attaagtgct ttaacgaagc gctggccatt gaagaagatt tcgaaacact gaattatctg 240 

agccagcttt acatccagac cggtgaattc gggaaagcac atgagttgct ggaacgtatg 3 00 

atcgcactgg aaccagaatt gacaagcacg tacctgaccc tggccaatct ctgcttcatg 360 

caagaagatt atcaggagat ggccgatgcc gcccagaaag ccatcgcact ggaagaagga 42 0 

aacgcaatgg cacactacct gttgggcaaa gccaatcatg gattggataa cggaataatg 480 

accatcgccc acctgacaaa agccattgtg ctgaaagatg atttcacgga agcccgactg 540 

ctccgtgccg aagcactgta taagatgcag caatttgcag aggctatgga agatattgaa 600 

gccatactta cacagaatcc ggacgaagaa gctgccctcc tgctacgtgg caaaataaaa 660 

gaagccaccg gaaaggaaga agaagcagag acggactatc tccatgtgac agagataaac 72 0 

cctttcaacg aacaagctta cctatatctg ggacaactat ttatcacaca gaagaaattg 780 

acagctgcta ttgagttgtt tgacgaagct atcgagttga atccaaactt tggagccgcc 840 

tatcatgaac ggggacgtgc caaactatta aacggggaca aagacggttc gattgaagat 900 

atgaagaaat cgctggagct gaacccgaaa gagggagaga acctgaacgg acagttcaat 960 

aatcagcaag cagaaacaac cccaaacgta ttgggactgt aa 1002 



<210> 154 
<211> 810 
<212> DNA 
<213> B.fragilis 



<400> 154 

tacatataca ttatagttat gatattctat ttttcaggaa ctggaaattc taaatggatt 60 

gcggagcaga tcgctaaggc acaaaacgaa gtgcttgttt ttatgccgaa tgccatcaga 12 0 

gacggaatag aagagtttgt gttggcggat gatgaaaaag taggttttgt tttccctgtt 180 

tattcatggg gacctccgtt gagcgtattg cggttcttgg attggattac tttatctaat 240 

tatcattctc aatacgtctt ttttgtctgt tcctgcggag atgatacagg gctgacggaa 300 

gaactctttc gccgggcatt gtctcgtaaa ggaatggagt gtaatgccgg tttttcagtg 3 60 

gctatgccta ataattatgt tttgcttccc ggatttgatg tggataagaa ggaactggag 42 0 

aaaaagaagt tggatgaagc agttggcagg gtagaagaga ttaatgattc gataaccgga 480 

aagaaaatag gttttcattg taatgaggga agttttccat ggtttaaaac caaagtactc 540 

aatccgctct ttaatcgttt tatgacctcg gcaaaaccat tttacgccac tgatgattgt 600 

atcgggtgta aacgttgtga aaggatatgt ccggttggga acgtggtgat gatagggtgg 660 

aggcctgtgt ggggaatgga ttgtacatcc tgcctggctt gctatcatgt ttgtccgaag 72 0 

catgctgtgc agtacggaag aaggactaaa cgtaaaggac agtatttaaa tcccaatgtg 780 

agtatttcac atgaggcggc cgcccaatag 810 



<210> 155 
<211> 2175 
<212> DNA 
<213> B.fragilis 
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<400> 155 

tatatgataa 

tttgagtttt 

ttctttgagg 

atatcggaaa 

atgttttttg 

aacggagtat 

cgtatgcaac 

tattgcctgt 

acgcttgtag 

gaacttgaag 

tcgcagaggg 

ataaaatgcg 

gatttggaga 

acgcaacgcg 

ctactcgatg 

aaggtatcgg 

gcggcggcaa 

ttcacacgtt 

gaagtatgca 

gagaacaagc 

gaattaccag 

atcagttttg 

aaactggaac 

gaactggaag 

cttgataaga 

cgcattagcg 

gaagaatttg 

acgttgctcg 

ttcctcgcaa 

ttccgtggcg 

agt tctaacg 

tcactactct 

atattcgatg 

atcgacaaaa 

ctgacgggca 

cgcatcgaga 

actccaataa 



tcaagagcgt 
caaagggact 
cgttggagtg 
tgcgcaagtc 
agcataatgg 
gtcaagtgac 
gaaagggcag 
tcaaaggtga 
ataaattctc 
caaaatccga 
tttcggaact 
acatcaggaa 
agcaccaggt 
agaagctggc 
agttctgggc 
cgttgagcaa 
aggctaagaa 
tgccttggta 
aggtatgcgg 
tgagggaata 
acactccgct 
gaggcatgac 
ttgtggcaag 
acgagaaatc 
atttccgcga 
attaccgcga 
agcaactgaa 
acaaagtgat 
gccttgagga 
taatccgcat 
gcacgccaat 
ttgctatctc 
cgccaacttc 
tccagaaaca 
agaagactct 
aacagacggg 
aataa 



aacaataaat 
gactctgatt 
gttgcttgac 
ggaattggac 
agaaaaagag 
aaattttgct 
ctcgctaatt 
gagccaattg 
cgacatccgc 
tcgagcatac 
ccaatgtaag 
acaggaagat 
caccagcgaa 
taaactccgt 
attgatgcca 
agagaagcga 
ggaagtcgtt 
tctgcccgat 
ccgaccagca 
cttggaacat 
ttttggtact 
ggcgagagat 
aatcaaacgg 
ccgtttgctc 
catcaagggt 
gcggttggtc 
tccgacaacc 
gcgggctttc 
gcgcaccaat 
tgtacagaca 
caaaaatccg 
cgatctcact 
atcgtttgag 
gtgcattatc 
gaacgaagcg 
ctataacgag 



aatttccgca 
attggtggca 
acagctcatg 
gaagatgaag 
gtgtcgaaga 
ttcaaaggtt 
gacgtgtgct 
aatgttttta 
aagtttgaag 
gcaaaggagt 
aaagagcatc 
gtggtgagca 
agttatcagg 
tccatgacga 
taccagaatg 
cgtctcagcg 
gatgaactga 
ggggaaacga 
aagaagggca 
aagtcgcagg 
caatatatcg 
atttctaaga 
gacattgcag 
atccaagccg 
ttctatgagc 
aaggtccaga 
ggcatggcga 
gttaatgcaa 
agttatttcg 
gcgagcgatt 
ggcggtgcgc 
acgctgaagc 
aacttcaaag 
gtgacgaaag 
caaatagaag 
accgaccttt 



gttactacag 
atggtgatgg 
agacgaaaga 
ccgacaccat 
gtctgacctt 
acgagactaa 
ttgatgcgtt 
atgagaaaga 
attatgtcgc 
gccagtcgga 
ttggacaaca 
cctattctgt 
acatcaagaa 
tggtgcgcta 
tgtttgagga 
accttgacat 
catcaagtct 
tgcaagagat 
cgccggagta 
aacttgcggc 
aggagcttca 
aatatcgcga 
agaaagaggc 
acggacttac 
agcgagaccg 
tggaatacga 
aagtgtataa 
agagcgaaaa 
agaaactgaa 
cggcagagat 
aggagacaac 
gcgaagagga 
agaacgtctt 
acttgcttga 
ctttgacctg 
caacaatacg 



ggagaatacc 
taagaccaca 
cccttcgctc 
gtctgtttca 
cgaaaagaga 
tggcgctgag 
cattcgtaag 
ggcgttgaga 
tgttgctact 
taagaagatt 
gatagacgag 
gaaacttgaa 
gcgcatagac 
caatacgaac 
atttcagaaa 
acaggaaaaa 
gcaaagcgac 
gcttgacgaa 
tcgtttcatg 
caagcaagaa 
ttcgctttct 
ggttgtggat 
tgagttgcta 
ggaagcgatg 
agcaaaaaac 
caaggtgaag 
ccgtgtgcat 
cctgcgccgc 
caagaatgac 
aaagctattt 
catgtatatg 
ttatccgctc 
ctacaacatc 
agtggacaaa 
ttctgtatat 
aaccatcata 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2175 



<210> 156 
<211> 471 
<212> DNA 
<213> B.fragilis 



<400> 156 

ttttgcccaa 

agtttaccta 

gtctattttt 

gtcgactggg 

attttttcat 

tttcatctgc 

tcggtcatgg 

atagaggcaa 



taataacaaa 
tacaactacg 
cattttacga 
aaaaagacgg 
cggaccacat 
tgcaacgggt 
tgacattcga 
tctgtcggtt 



ggttaagaga 
attcaacgac 
tctcggcaaa 
cattgtagtc 
cgccgtacaa 
catcgacacc 
tctggagaga 
cgaagggaga 



aaacaaatgg 
gtagacaaat 
acagaatatt 
gtacacattg 
accgcagttt 
gaaacaatgg 
cacgaatcca 
gacttaagaa 



aagaaatcga 
tcggacacgt 
tcgcttctgt 
aagccgactt 
gcgaaatcgg 
aagtgaaatg 
agccactgac 
agaaaaaata 



atttcatcac 
caacaacacc 
atgcccggga 
tctggcacag 
aaccaaaagc 
catctgtcgt 
cgaagaatgg 



60 

120 

180 

240 

300 

360 

420 

471 



<210> 157 
<211> 216 
<212> DNA 
<213> B.fragilis 



70 



<400> 157 

cgaaaaccaa ttaacaatca aatagttatt 
gcgagcatta tggggcatgt tttgctgcga 
acggatattt caaggagtat aaacgattat 
ttcaaaaagc tcaatattta tttagtttgt 



aatttctatt tttggttgtt ggtagagaaa 60 

cttccgttac ttatccgtta ccttgcaagt 120 

ttttcaatgc tttgcgtcac ttttcataac 180 

aactaa 216 



<210> 158 
<211> 525 
<212> DNA 
<213> B.fragilis 



<400> 158 

agggttggga 

atgaaactga 

cgcctgcgga 

aatgctttgg 

gaagtttatc 

gtaacggaaa 

tgtgtatggc 

gggccttttg 

gaggaggcgg 



tgtgctttgg 
ttaccgaagg 
tgaattacaa 
agccgggaac 
ttgtattgag 
aggttcattt 
atactatcgt 
ctcctcttat 
cccgagtatt 



ttacagtctc 
gcttcttgat 
tttccatgac 
ttacttgccg 
aggtagcttg 
gaatccagct 
tgtcttagaa 
tcctgaaaat 
tatgcagcga 



aacggattgt 
aaagtgactg 
tccatggatg 
ccacatcgtc 
ttggctatcc 
gagggacatt 
tctggaaccg 
ttagcatctt 
atgcttgagc 



gtagcttcaa 
atcaggcaaa 
ctcctattca 
ataagaatcc 
tgtttgatga 
atggaattga 
ttatttatga 
gggcacctcc 
tttaa 



aagaggaata 
agagaattca 
caggatgttg 
ggataaggaa 
tgagggtaat 
gattcctccc 
aataaagcaa 
tgcaactgat 



60 

120 

180 

240 

300 

360 

420 

480 

525 



<210> 159 
<211> 975 
<212> DNA 
<213> B.fragilis 



<400> 159 

agccgtcttg tgaactttca gtatttacac cgatatcctt ttatacgcct gctattccct 60 

ctgatagcag gctttcttgt tggcaatggg ttgtttttta ggggagtctg tgtttcgaag 12 0 

ggcgtgctgg caggagggct ggcaggatta tttcttctgc tcctagtcgt ttatttttct 180 

caccgttact ctttacgttg gatgttcggc tgtattttgt acctgttcgt gttttttggc 240 

ggagcaggtg gaataaatca ggctttgcaa cagacgcttt attctttttc ggaacaaaaa 300 

tgtgtttacc gggctgtagt gttggaacaa ccggagccga aggaacatag cttcctttgt 3 60 

cgggcatttt tggaggaaag gcaggattca gtgtgcacca tgccggtaaa tcgaaaagtt 42 0 

ttgctttata tatcgaagga ttcattgtcc gaagggttac gtagtgggga tgagttaata 480 

ttttttgccc atgtatctcc accttcaaat aatggtaatc ccgatgaatt tgattatgcg 540 

cgttatctgc gctacaaagg gattagcggg attgcttttg ttgcaagtgg gaattggaaa 600 

attaccggat atcggttttc ccgatcatgc aggcagattg cattggaata ccgggatcgg 660 

attcttgacc aatatcgtgc tttgaagttt aatccggatg aatttgccgt acttgccgca 72 0 

cttacggtag gttataagga ggagttgagc gaagatattc gggaaactta ctctgtatcg 780 

ggagccagtc atgtactggc actttccgga cttcatatcg ggtttctgta tatgatgctt 840 

ctgttttttc tgaagtggct gccagggaat gcttttggtg tgagactttt tcgtgcggta 900 

gtgataatca ccgcattgtg gggattcgct ttttttaccg gtctctctcc ttcggtcgtc 960 

cgttccgttg tcttc 975 



<210> 160 
<211> 252 
<212> DNA 
<213> B.fragilis 



<400> 160 

cttatcattg ttttaacgat ggcacattac aacaataaca gcaacagaat cttgcaggct 60 

gttttggccg atgagaaact gatagagttt ggcgagtaca atcccgctga ctatcaaagc 12 0 

ttggacgagg ctcttgtgtc tgataacctt gtggtgaata ctgtggcaag gattatcaac 180 

gaggtaaatg aggagagcag ctcacgggaa atatataata tggtaacaac ctatctaaag 240 

aataatatat ga 252 



<210> 161 



71 



<211> 615 
<212> DNA 
<213> B.fragilis 



<400> 161 

aaaatgaatg taaatattac tgcggtgcta ttgaaatctc 

tttctcggtc tcctttttct ttctccaatt ttattagtaa 

aagatgcctg gaggtcctgt tatattcaaa cagaaaagag 

tttaccatgt ataaatttcg ttctatgacg gttgggcatt 

aaaggagaaa gccggatcac gccattgggg gccaaattga 

cttccggaac tgtggaatgt gctgatagga gatatgagtt 

gttccgggat atgctgacaa tttgctggga gacgatagga 



gggaaagata tgtag 



tttttgacca 


tatcgttgct 


60 


cagctattct 


tattcgtgtt 


120 


ttgggcggta 


tggtagatta 


180 


ccggtggttc 


tgtttctgta 


240 


gaaaatataa 


gattgatgaa 


300 


tggtcggtcc 


tcgtcctgat 


360 


gaatgttgct 


tttaaaacca 


420 


aagaattgct 


ggcagggcag 


480 


ataaagtgcg 


aataaatata 


540 


tcatcgttta 


taccgttttt 


600 
615 



<210> 162 
<211> 927 
<212> DNA 
<213> B.fragilis 



60 



<400> 162 

gaaccaatcg ttgaaagatg gcaaggtgcc cattatggga cgtataacga tcaacaagac 

caccgcctgc ttcagttgca agcggaagtt tcactggcat tatgggatgc caaggccaag 12 0 

agggcgaaag ggaaatccga cgaggccaga cggctgaatc aggagcttga caatgtcaag 18 0 

gcccagatca caaggcatta ccagtatgtc tgcgaccatg acagcctggt gacagctaaa 240 

agtgtctaca accgctatct tggtttcggg gacgattatc acacccttat gggactgttc 3 00 

agggagcagc ttgcctccta caaggaaaag ataggcaagg aaaaggcggc aagcacctat 3 60 

cgcgggctgg tggccgacta caagaatctg cagctttt cc tcaaagagaa gaggcgcatc 42 0 

gaggatatag ccatcgccga gcttgacaag aagttcatcg aggactatta caactggatg 480 

ctcgggacat gcgccctggc gagttcaacg gctttcggcc ggggcaacac cctgaaatgg 540 

ctgatgtata ccgcccagga aagaggctgg ataaggcttc atccgttcat cggtttcgac 600 

tgcctgtccg aatacaagtg gcgttctttc ctcaccgagg aggacttgca aagcgtcatc 660 

catgtcaagt tgaattacaa gcgccagcgg gctatccgtg acatgttcct gttcatgtgc 72 0 

tttacaggtc tggcgtacgc ggatctgaag gagatcacgt acaagaatat ccatacggat 780 

tccgagggtg gtacatggct gataggcaac cgtataaaaa ccgacgtggc ctatgtggtg 840 

aagctgcttc ctatcaccat cgaactggtc gagaggtaca gggggacaat gaaaagaaaa 900 

gttcgcctga caaggtgttt tccgtag 927 



<210> 163 
<211> 249 
<212> DNA 
<213> B.fragilis 



<400> 163 

aatattttat taataaaaag agattccaaa gatctactta ataaaattca ttcactatta 6 0 

ttattaatta aaaacaatag agaaacatct tttcacctta taaaccctaa attaataaac 12 0 

aaattaacta tctttgtaga tattaccaaa attattaatc tatatgaaac aacatctttt 180 

aaaagaaata gaactaggta ccaaaagcgc tcttctcaaa aagaaaatta ttacacatta 240 

tatatataa 249 



<210> 164 
<211> 573 
<212> DNA 
<213> B.fragilis 



<400> 164 

atcatgcaat tattaaaaaa aagaatccta caggacggaa aatgttatga ggggggaatt 



60 



72 



ctcaaagtag acagtttcat caaccaccag atggacccag tgctaatgaa gtcaattggc 120 

gtagaattcg tacgtctctt tgcagggaca aacgtcaata agatcatgac cattgaagcc 180 

agcggaatag ctccggccat aatgacggga tatttaatgg acttgccggt cgtttttgcc 240 

aaaaaaaaat cgcccagaac aattcagaat gcgctaagta ccacagtaca ctctttcacc 3 00 

aaagaccgtg attatgaagt agtcatcagt tccgacttcc tcactccgaa agataacgta 3 60 

ttattcgtcg atgatttttt agcttatgga aacgccgctt taggtgtcat tgatttgatc 42 0 

aaacagtccg gtgcaaatct ggttggaatg ggattcatca ttgaaaaagc atttcaaaat 480 

gggcgtaaaa cacttgaaga aagaggagta agagtagagt ctcttgccat catcgaagat 54 0 

ttatccaatt gccggattac aataaaagat taa 573 

<210> 165 
<211> 204 
■<212> DNA 
<213> B.fragilis 

<400> 165 

gacatttttt ccaacagtct ttccacacag gcagttttcc aaggcgagga tgaagctgga 60 

tttgccagtg ccgtacgagc cgatgaggca aaacgaatga atgccactgg caaagtgatt 12 0 

gataattttg ccgatggtct ggcgagcatt ggcagtgacg atatagtggg gtatcttgcc 180 

aaagtcacgc tctatgttga ttga 2 04 

<210> 166 
<211> 372 
<212> DNA 
<213> B.fragilis 

<400> 166 

tattttgcat cgttttattt ttttagggat aagatatatt ttattttctt tcttagaact 60 

tatgaaggaa tacccaagag ggcgattgaa ccggaattca gttcatttcg gaatagttat 12 0 

aaatcggagg agcataataa gtttcaagaa ctggttaaga aatatggttt ctatcctgag 180 

ttgtgcgata cctgtagaaa agggaatcta ctgaagataa aatcaaaaag gcggttttat 240 

aagtcactct gtgggggcat gacccgcgat ttgcttataa aaccgttttt cgtttataaa 3 00 

ggcttaagtt tcaactcgat ttgtgtaact gaacccgggc gtacggtacg tcctgcagac 3 60 

catatttcat aa 372 

<210> 167 
<211> 1008 
<212> DNA 
<213> B.fragilis 

<400> 167 

ataaataaat caatcatggt aaaaataatc ttaggtgttc tgtcactgct tgtcatgttg 60 

tcgtgcagca ctgccgtgaa agagaacact acacaacccg atataatgga gacaaacaag 12 0 

aaaaatctcg gaaatctgtt ggcactctat cccaaaccaa tgacggttgt cggggcggag 180 

gtcgaaggga aagtaaactg gcttgtggta ggacacacgg gagtcatcgg ccatgaccgg 240 

atactggtca gcatgagtaa aagtcattat accaatcaag gtgttaaaaa atcaaaacga 3 00 

ctttccgtca atcttgtgag tcgtgagatg ttaccgaaag ctgactatgt aggaagtgta 3 60 

agtggtgcga cggtcgataa gtcggaggtg tttgcttacc atatcggaga gaacgatacg 42 0 

cccgttatag acgcatcacc actcacgatg gagtgtgaag tggtggacat ttatgaaacc 480 

gacggtttcg acaatttcat ttgcgcgata gtcaatacat acgctgcttc cgatgtgctt 540 

gacagcgatg gcaaactcga ctatacgaaa ctaaaacccg tattattcga gttcccgacc 60 0 

tactcctacc ttgcgacagg agagatcatc ggcaaatgtc tgaatccgga taagccgggt 660 

atgtgcgtta aagagccgat gacgaccgat ggtatcgtac ggctgtcgaa aatagaggtt 72 0 

tatccgcagt atcttgacga gtatatgaac tatgcaaccg aggtaggtga aatctccctg 7 80 

cgtaccgaac cgggcgtact gacgatgtat gctgtcggcg aaaaggagaa tccctgtaaa 840 

gtaacgattc tcgaaaccta tgcgagccgt gaagcatacg agcagcatat cgcttcggaa 9 00 

cactttcaga agtacaagca gggaacgttg catatggtca aatcgttggt attgtccgac 960 

cagacaccgc tcaatccggc caacaaactc aataacttca tgcaatag 10 0£ 



73 



<210> 168 
<211> 1248 
<212> DNA 
<213> B. fragilis 



<400> 168 

aagaaagaat 

gaacgccatg 

aactttccgg 

acggcggaaa 

tacaccatat 

aaaaagggaa 

ggttttctgc 

ggcatgttgc 

gtcttattgc 

tcgtctcccg 

aacttcggaa 

caaggtgcat 

ggattgttct 

aatcgggtac 

gttctctatg 

atctatacgg 

tacctgcatg 

ctgactaatt 

ggactggggg 

gtccagatgg 

ggctggcgga 



caatgaataa 
tcatcctcga 
aattctcgct 
tagataaggt 
tctcgctgct 
cggacggatt 
atctgatgtt 
tccctctttt 
tacttcctat 
cagtgcgaat 
tctggctacg 
gggtgcgctt 
tattgggctt 
tactgaaaaa 
cctggagtgc 
caagtgtcta 
gtagagagtg 
acgtgggaca 
ccggcattgg 
cattcagtgc 
tgctgactta 



ggaaatagac 
tgctctaaga 
ttacactttt 
aattcgcttt 
gttcggtatc 
ccgtatcttt 
tatctggagt 
ccggcatgtt 
tctgattgat 
gcaacagcac 
cgacgcggaa 
gcaggaattt 
ctacatcgga 
aacggtgaca 
ggtaaacggg 
tcctttaggt 
gcgcttgtgg 
gtcggtatgg 
attgacagga 
cttatggctc 
tgggaagtgg 



ataaaagaca 
ggatttgcat 
caaaaaccgg 
cctcaatacc 
ggattttcaa 
taccggcgga 
ggggacatct 
tcggacagag 
tggttggccg 
tattgcaatt 
aactacggag 
atcgacggca 
cgaaagcaaa 
tacggttttc 
catcctttcg 
tttgcatacg 
cgctgtcttg 
ggcatggttc 
acagaatcca 
tcctatttcc 
ttaaaaataa 



tggcacccgt 
tgctggtaat 
aaattacgga 
ttttcgtgga 
tcattatcag 
tgattgttct 
tgttgttgta 
tgttgctggg 
gtacattcgg 
tatatggtat 
gggtctttca 
atcgctattt 
tatacgccga 
tgctgggact 
gaacggctgc 
tttccgctat 
ccgctccggg 
tcttctacgg 
tagctttcta 
gctttgggcc 
ggaaataa 



gaaggcctcg 
ctgctttgcc 
ggctatgcct 
tggtaagttc 
caacgcggcc 
ggccgccatt 
tgccttattg 
gacttccgct 
agtgtcccgg 
aacggaatat 
attcctggta 
taaggtattg 
tcttgaggcc 
tcccctatcc 
acacaccgcc 
ctgtcttctg 
gagaatggca 
tatcggcttc 
cgtctttctt 
tctggaatgg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1248 



<210> 169 
<211> 228 
<212> DNA 
<213> B. fragilis 



<400> 169 

gccgtttgtc cgttgttgga agttctgccc cacaacctgt cagtatggtt cctgtcagga 60 

aacacttttg taggatatgc cgaccagcat cctttggtca gcgtgataga ttacgatacc 12 0 

gtcacgccca tcttgcatac gcacagccaa ttttcagtct atgcacttta taaagggatg 180 

ggcgttactc ttctctccgg acttgttgca tggaactccg ttgggtaa 22 8 



<210> 170 
<211> 237 
<212> DNA 
<213> B. fragilis 



<400> 170 

attacgatac cgtcacgccc atcttgcata cgcacagcca attttcagtc tatgcacttt 60 

ataaagggat gggcgttact cttctctccg gacttgttgc atggaactcc gttgggtaat 12 0 

cacatcaaag acttctcttt cttttcttat cagtccaatg aagccggaga gcacatccgt 180 

tgtcatatca tcgatatggc caaaagtaaa cttatcaacg gagaacaaat catgtag 2 37 



<210> 171 
<211> 627 
<212> DNA 
<213> B. fragilis 



<400> 171 

attttaaaac caacaagaaa tatggaaata accaatgctg aatttgtaat tagtaatacg 60 

gacgtgaaaa aatgtccggc aggcactttc cccgaatatg cctttatcgg ccgatccaat 120 

gtaggaaaat ccagcctcat caatatgctg accggacgaa aggggctggc catgacttcc 180 



74 



gctactcccg 
gttgacttac 
accatcatcg 
gacagccgtc 
ggcattcctt 
ataaatatca 
ttcatcactt 
atcaataaag 



gtaagaccat 
ccggatacgg 
aagattacat 
tggaacccca 
ttgccattat 
gcgcttactt 
catcagaaga 
aacttaattc 



gcttatcaat 
atatgccaga 
cctcgaacgc 
gaaaatagat 
cttcaccaaa 
gagagaatta 
gcgccttggc 
aaaataa 



cattttctga 
cgaggtcaga 
gaacagatga 
cttgaattca 
gcagacaaac 
cggaaacaat 
aggacagagg 



tcaacaacag 
aaggacagga 
ccaatctatt 
tggaatggct 
tgaaaggggg 
gggaagaact 
tattaaacta 



ctggtacctg 
acagatacgc 
cgtattgata 
gggtgagaac 
acgactcaaa 
ccctccctat 
catcaagtca 



240 
300 
360 
420 
480 
540 
600 
627 



<210> 172 
<211> 528 
<2»12> DNA 
<213> B . f ragilis 



<400> 172 

aaaaataaaa 

ctttccttat 

tatggtaaag 

ctctatgaag 

ataatgggct 

acagtaaaac 

gtaagacacc 

tgcaatgcct 

cgtatccgac 



cgatgcaaaa 
cctactccat 
ttaattcgct 
catacacttc 
ttgcttatgc 
tttatatcat 
ccaattacta 
ggtatacatt 
aagaggaaag 



tatcattatt 
tcgtaacgag 
attactgaca 
gggaactacc 
tatgctattc 
tcccgatcat 
tctgaatatc 
actcattgga 
ggccatgaaa 



acatttattg 
aaacgtcttc 
ttagcacata 
ttcaactact 
tatgtgatct 
cgcattgaaa 
atacctgaac 
ctccctatct 
gaactattgg 



ccttttttgt 
tgaaaagtgg 
tcgtctacta 
tctctgtttg 
ataaactcca 
aaagcttcct 
taattggaat 
acgcttgttt 
agaattaa 



actcagatta 
agcggtacaa 
tttttcggcc 
tggtgttttt 
tgatgtatgg 
tttcagaaca 
tgctttactc 
gctcgctata 



60 

120 

180 

240 

300 

360 

420 

480 

528 



<210> 173 
<211> 1488 
<212> DNA 
<213> B.fragilis 



<400> 173 

aactctggga 

acaggaaagg 

ctggggcagc 

tcttccatcg 

gctgcttcca 

gccgcaacgg 

gcacgaaaga 

gctgtcggca 

cggagcgatt 

aatttccttg 

aacgtgctga 

caggtggaat 

gcaatattgg 

tgccgtcgct 

gagacactga 

tgtggggcgc 

gccaactcgt 

gaggctgcca 

cgttttgcca 

atgtatatgg 

ttgggaatcg 

gtagcctacg 

ggcagcattt 

cgcggcgtgt 

agactttggg 



cgagaaaact 
gaaagacaga 
agttgcgcct 
ccatgcagta 
tcggattggt 
gcttctccgt 
tactccgcca 
tttccatcag 
catccctcta 
cgggtggcat 
tgtgtcttct 
ggttcggagt 
gaacggtgct 
cgcccatgct 
gtaaggcttt 
agatcgcaac 
tcgccatcac 
cgacgttggt 
atattaccgt 
cggctccgca 
agattctgcg 
gtatattcgt 

ggggcgtacg 

ggtttgccat 
gcagtaactg 



tgtttcacag 
ttacctgcta 
gactgcatac 
tattgatgcc 
ctcgaccacg 
tcaagtagcg 
gtcgattgct 
tggtatgctt 
cttttggata 
gttgcgatgc 
ggatatcgtg 
gacatttacc 
ggccgagctc 
gagactgtcc 
ccgcatctcc 
gacggtgatt 
tgccgaaagt 
cggccagagc 
ctggtcggga 
aatcatagga 
gatagaagcc 
gggtgtgggt 
gctgacactg 
gtgcatcgag 
gatttataaa 



actaatgccc 
tcgcttatcc 
ctcagtgtcc 
tcgatggtgg 
acatggctgt 
cataaaatcg 
gccacattgg 
cccggctggt 
ttcgcacttt 
agtggaaata 
ttcaacttct 
actcccggcg 
atcactgccg 
ggagaacggg 
ctgccgatgg 
gtcgcaccac 
ctctgctata 
ctcggagcaa 
atgctgatta 
gtgatgaccc 
tttgcagagc 
aatacattcg 
gcggcatggc 
ctttgtttcc 
ttacgaataa 



ttgtacataa 
gcgaagggaa 
ctgccattat 
gcagcctggg 
tttgggagct 
gagccgggga 
ttttcagctc 
tgggcggtga 
tccttcctgc 
tgcgtgtgcc 
tcctgatttt 
caggcttggg 
gcgggatgat 
gaagtttcct 
gattcgagca 
ttggtatcat 
tgcccggcta 
accgtatccg 
tgggtgtcat 
cggtagagga 
cgatgttcgc 
tacccagtct 
tcgcccccac 

ggggggtaat 

atagataa 



cctgatggga 
gcagatgaca 
ggcacagata 
cgcgaatgcc 
gtgtgcagcc 
tttcgtggga 
attgttggcg 
tgaagtaata 
cttgcagttg 
cagtatgctg 
cccttcgagg 
cgtggaaggc 
gtggtatctt 
gcctcggaaa 
catggccatt 
tgccattgcc 
cggtatctcg 
gttgctccgt 
gggaacgctg 
aattcgcacg 
ggcgtctatc 
gatgaacttc 
gatgggacta 
cttcctcgcg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1488 



<210> 174 
<211> 1083 



75 



240 
300 



<212> DNA 

<213> B. fragilis 

<400> 174 

aataatacgg ctatgaagtt acaagcaatc gccatactga cattcctgac ttttgcgaat 60 

gtcatggcac aagaaacgac aacaacaaaa tatataaatt caaccgatat ggaagcattg 12 0 

aaattgacgc aggaatggga taagaccttt ccgcagagcg ataaggtgga acatacgaaa 180 
atcacgtttc acaaccgtta cggtattacg cttgccgcag acctttacaa gccgaaaaat 
acacaaggac gtctggcagc cattgccgtc agtggccctt acggtgcggt gaaagaacaa 

gtgtcaggcc gttatgccca gacacttgcc gaacgaggct ttctgaccat tgctttcgat 3 60 

ccctcctatt acggcgaaag tggtggtaca cctcgctatc ttacgtcacc cgaaatcagc 42 0 

acggaggatt tcagcgcggc agtcgattat ctgacatccc gtgcggacgt cgatccggaa 480 

cgtatcggaa tcttaggcat ctgcggttgg ggcgggtttg cacttaatgc tgcggccaat 54 0 

gaccctcgta tcaaagcgac ggtaacatcc actatgtatg atatgagccg ggtaaatgcc 600 

aacgggtatt tcgacgccat gagctccgat gaccgttaca aattgcgcga acaactcaac 660 

gcacagcgta ctgaggatta tcgtgatgac agctatgtac gcgatggtgg cgtacttgac 720 

cccgtaacgg acgatactcc gcaattcgtc aaggagtatc acgactacta caagacggaa 780 
cgaggctacc atcgccgttc accgaactcc aacgagggaa tcacgaaaac aagcgtattg 
gcattcatca atatgccgct gctcacctat atcagcgaaa tccgcagtgc cgtgttgatg 
attcatggag aaaaagctca ttcccgctat ttcagtgagg atgcctacaa acggctgacg 

ggtagtaaca aggaactgtt gattataccc ggagccaacc atgtcgattt gtacgataat 102 0 

ctcaacgtga ttccgttcga caagatagat gctttcttta agaatgcctt aaaggagaaa 1080 

tag "83 

<210> 175 
<211> 642 
<212> DNA 
<213> B. fragilis 

<400> 175 

ttaaatagat atacttgttt gtacatgaat caacaatatc catcgacgtt acttgaaaag 60 
gccgtcggag agttttctaa attgccgggt atcggacgga aaacagctat gagactggtg 12 0 
cttcacctgt tgcgtcagga tacctctgtg gtggaagctt tcggaagttc tattataact 
ttaaagcatg aggtgaaata ttgcaaggtg tgtcataata tatctgatac ggaaacttgt 
cagatttgtg caaatccgca gcgggacgcg tctatggtct gcgtagtgga gaatatacgg 



840 
900 
960 



180 
240 
300 



gatgtgatgg ccgtagaggc cactcaacaa tatcgtgggt tgtaccatgt tttgggggga 3 60 
gtgatttcac cgatggatgg ggtaggaccg ggcgatctgc agatagaaag tctggtgcgc 
cgggtagccg aagggggaat aaatgaagtg attcttgctc taagcacaac catggaaggg 
gataccacga atttttatat ttaccgtaaa cttgagaaaa tgggtgtcaa attgagcgta 54 0 
cttgcccgtg gggtatccat tggtgacgag ctggaataca cagacgagat aacgttgggt 600 
agaagtattg tgaaccgtac gacttttacc ggtaccgttt aa 



420 
480 



642 



<210> 176 
<211> 1167 
<212> DNA 
<213> B. fragilis 

<400> 176 

gttatgagat acgatttcga tacgattgtc ccgcgtcgcg ggacgaactc ctacaaatgg 60 

gacactcccg aagagaaaaa tgtgctacct atgtgggtag cggatatgga tttccgtacg 12 0 

gcacctgcca ttgtagaagc cttgcaaagg cgggttgcac acggtatttt cggttatacc 180 

aaagtacccg aaacctatta cgatgcggtc gtccggtggt tcgagagccg tcatcgctgg 240 

cagatagatc cccggtggat tatctataca agcggtgtcg taccggctct gtcggccatt 3 00 

atcaaagccc tgaccgcacc gggcgataaa gtaattgtcc aaactccggc atacaactgc 3 60 

ttctattcgt cgattcgcaa cgacggatgc gagctatcgg ccaataatct aatttatcgg 42 0 

gacggtcgct atatgataga cttcgacgac ctcgcagcga aagcggctga tccgaaggcg 480 

aaaatcctgt tactatgtaa tcctcacaat ccggtcgggc gggtctggac accggaagaa 540 

ctgcggcata tcggcgacat ctgtttgcgc aacggagtgt ttgttgtggc agatgaaatt 600 

cattgcgaac tgacctacga gggacacgac tatacgcctt ttgcctccct ctccgaacgc 660 



76 



420 
480 



ttccaacaaa attccgtgac ttgcatttcg ccaagcaagg cgttcaacct tgccggactg 72 0 

caaatcgcca atatcatcgc cttggacgaa gaggtgcgtc gccgcatcga ccgtgctatt 780 

aacatcaacg aggtgtgcga cgtcaatcca ttcggcgtga tcgctacaat tgccgcttat 840 

aatgagggtg gcgagtggct cgatgctttg cgaaaatacc tgcgagggaa ttatgaatat 900 

ctatgccatt ttttcgccga aaggctgcct caatatcccg tattgccgct cgaaggaact 960 

tatttggtct ggatagactg ccgagcactc ggtatcggtt cggacgccac gaccctgcat 102 0 

ctgcaagagc agcagaagct gatggtcaac tccggtacga tgtacggacc cagtggagag 1080 

ggattcatcc gtctgaacat tgcctgtccc cgcacattac ttgccgatgg tctggagcgg 1140 

atggcccgtg tattggaatg ctgttaa 1167 

<210> 177 
<211> 615 
<212> DNA 
<213> B. fragilis 

<400> 177 

aaacaaggat atcaaatgaa aagaaaacta ttatcatttg cagttcttat cacactactg 60 

cttgtaccga ccgtaaaccg tgcacaatct atcaaggact tattcaataa agacaatatc 12 0 

tccaaagttg tcaacgctgt cacaggacat accgaaacag tggatatgac cgggacctgg 180 

cgttataccg gctcagccat tgagttcgag tctgaaaacc tgctgaagaa agccggagga 240 

accgtcgctg cttccgctgc cgaacaaaag ctggacgaac agctggccaa agtcggcatt 3 00 

aaagaggggc aactgagttt tacattcaat gcggacagta ctttcgtaag cactttaggc 3 60 
aaacgcaagc tgaacggaac atactcttac gatgccggca cccagatgct ccacctgagg 
tatatgaaat taatccccat gaatgcaaaa gtcaattata ccactcagca gatggatctt 

ctgttcgaag cagacaaatt gctgaagcta atcactttct tatccagtaa gagcagcagt 540 

gccaccctca aagccatcag ttcattggca gatagctatg acggcatgat gctgggatat 600 

gaattgaaac gatga 615 

<210> 178 
<211> 330 
<212> DNA 
<213> B. fragilis 

<400> 178 

aaacaatatc aaaaatttgt cacaattctt gtactattag ccggcattgt ccctgtctat 60 

gccatcatga acatcgtatt cgatcctaat gacgatggaa atctgttaat aacactcggc 12 0 

actctgacac ctatactggg tgaccttttg atggtatatg ccttcaaaga caaatatcaa 180 

attttaatta gcaatcatcg tttgcaaaat aagtgttacc tttgcgctcg ttatgatgat 240 

acttgccact attgtatgct actttgccat tctcttgctg atagcccgta tcaccggacg 300 

gaaaggaggt tcgaatgcag cgttttttaa 33 0 

<210> 179 
<211> 540 
<212> DNA 
<213> B. fragilis 

<400> 179 

atgatgaagc aatctttctt agccaacgag cgaatatatc tccgtgcagt ggaaccggag 60 

gatttggatc ttatgtacga aatggaaaat gatccttcta tgtgggatat cagtagtttc 12 0 

acagttccct attcgcgttt tgtactcaaa cagtatattg aaggatcgca aagtgacatg 180 

tttgccgata aacagttgcg gctgatgatt atgcgtcgga aagataattg tactttgggt 240 

acggtcgata taactgattt tgtaccttta cattcaagag gggcagtcgg aattgccgtt 300 

cacagcaatt atagacagga ggggtatgct tccgatgcat tgaaactgct ttgtgaatat 360 

gctttcaact ttttatttat aaaacaattg tatgcccata tagctgtgga taatgaaccc 42 0 

agtttgcgat tgttcaattc ttgtggattt acccaatgtg gagtattgaa agaatggctg 480 

ttaacacacg aaggttataa agatgccgtg cttgtgcaat gtatgaatcc caaacgatga 540 



<210> 180 
<211> 450 



77 



<212> DNA 

<213> B.fragilis 



<400> 180 

atggaagagc 

attattccgt 

gcagacaatg 

tgtatacctc 

acgtttccgg 

gaatttgtag 

ttatgtgcac 

cgtgccgaac 



aaataaaacg 
tattgttggt 
tacgggctgt 
tttctcttaa 
tggccctgag 
tggtgtttaa 
ttataggact 
tgcatattga 



cattgtgaaa 
gctattggga 
ttatgttttc 
actatttagt 
ccgttatatg 
tctggccggt 
gacagcctct 
taaagaataa 



agccagaagg 
gaagccggcg 
gaaacagtag 
tttgttctga 
ctttgggggg 
tactacttta 
tttttctgtc 



tacagtatat 


ttctttttgg 


60 


tgttgcctgt 


aggaataaaa 


120 


gtattttgat 


gactgccgtc 


180 


caaagaaaat 


agatcagctg 


240 


ctgttcggct 


ggctttactg 


300 


cacttagtag 


tacaggtgcg 


360 


ttccgggaga 


aaaaagattg 


420 






450 



<210> 181 
<211> 213 
<212> DNA 
<213> B.fragilis 



<400> 181 

cacagagtta aagcttgttc tttggatgtg aataagaagt tctttaaatg caaaagactt 60 

gttatttgcg cacaagaacc tgacaacctg caaaaggcgt taacaatgtt aattgaaaaa 12 0 

aggtacaagg atgaagatac cggttcagac ggcgtaaact cacttccgaa acttaagtta 180 

tcttattcag cctgtgtcta ttttttctta taa 213 



<210> 182 
<211> 693 
<212> DNA 
<213> B.fragilis 



<400> 182 

ataaaaacca agaatatgag accatatata atcagtcaca 

cgcatcgact gcccgatggt cgggcaactg agtacggatg 

aagctggggc cttgctcgaa actgtcagga cggataacta 

gtcaaagagg aaagtactcc gatggaggga actccgatag 

gccagtaaat cggacgaata tacgatcatt gtcgatacct 

gagggtgaag ctgacggtca tcctctactt tgtattgtca 

tatctggaaa cgctgcgcac attgggtatt tcatggattg 

gacttgccgc aagctatgga gctgcttcac gaacatttcg 

gtcgggggcg gacatatctg cggcggtttc ctggaggccg 

attatggtag ctccgggtat tgacgggcgt aagggacaga 

tcccgtatgg aatgtaaccc gtacaaactg aaattagaga 

ggtattgtct ggctccgcta taaagtaaaa taa 



tgatgacttc 


ggtcgatggc 


60 


agtattacat 


agccttggaa 


120 


ccgcactcga 


atgttctgcc 


180 


gtcataaatc 


cgtatatgtc 


240 


atgggaaact 


gcgttggcag 


300 


gtgaacaggt 


gtccgaggaa 


360 


cggccggtgc 


ggaacgcatt 


420 


gcgttgaacg 


cttggcgatt 


480 


gactgattga 


cgaagtgagt 


540 


cggcggtttt 


cgatggaatc 


600 


gtgtggaaca 


atgggaaaca 


660 
693 



<210> 183 
<211> 1221 
<212> DNA 
<213> B.fragilis 



<400> 183 

aatataacaa aaatgaaaat atatatattt attatactgg ctgcggccac ttcaatctcc 60 

ctgatatctt gcgattcgaa acagagtgac acccgctcgg cctcttcctc agaggttcac 12 0 

cggaatgacg acggtcatga tcatcgggaa agtgatggag acaaccatag tgaaatagag 18 0 

aactccggca agggacatga ggacgaaatc attttcactc ggcaacaggc ggaagctatc 240 

gggttggaga tatataatgt ggtacccgga tcttttgcac aggtaatcag aaccagcgga 3 00 

cagatacagg cagcccaagg agatgaagaa actattgtcg ccacgaccaa tggtgtcgta 3 60 

tcttttcccg gacaaaacat catcgaagga gcaactgttg gcgtgggaag tactattgta 42 0 

accatttcag ctaaaaatct ttatgaagga gatccggtgg caaaagccaa gattgcctat 48 0 

gaaactgcct tgaaagagta tcagcgtgca gaaggtctgg taaaggataa gattatttcc 540 

gctaaagagt tcgaacagac tcgtatgaaa tatgaaaatg ccagaactgc ttatgaagcc 6 00 



78 



caagctgcca atgtaactgt ttccggggta aaagttactt ctcccatcag tggatatgtc 660 

aaaaacaggc tggtgagtca gggggaatac gtgactgtcg gacagcctgt tgctacaatt 72 0 

tccaagaacc ggagattgca actgcgagcc gatgtttcag aaaactattt caatgaactt 7 80 

aaaaaaatca ggggagccaa cttcatggta tcctacaata acaaggttta taggttggaa 840 

gatcttcacg ggcgtttatt atcctttggc aaagccgctg ctgaatcttc tttctatatc 900 

ccgattactt ttgaattcga taatatcggt gatttcattc ccggttctta tgtagaggta 960 

tacctgctca ccactcccca aaataatgta ttttccattc ctgttactgc attgacggaa 1020 

gaacagggta tctattttgt ctacctgcaa atagcagagg aggagttcgt gaagcgtgaa 1080 

gtcggtatcg gagagagtga cggtaaaaac gtgagaatac tttctggctt gaaagagggt 1140 

gagagagtgg tcgttaaagg tgcttatcag gtaaagctgg cttctagttc atcggtgttg 1200 

cccgaagggc atagtcatta a 1221 



<210> 184 
<211> 372 
<212> DNA 
<213> B.fragilis 



<400> 184 

ataacgaaaa aagaaattgg atatggaaaa attaccatca atagcattag caacgacaac 60 

cgtcagacct tgccgcgttt ccagccggaa gcgatgcgtg cgaatacccg cattgtaaat 120 

gcgctgcaag ctttcgggcg tacacggagc atgacctcgg cacaggtggc tcttggctgg 180 

ttgcttcaga aagcaccgtg gattgtaccg attccgggaa cgacaaaact gtctcatctg 240 

gaggaaaacc tgcgcacact cgacttcaac atcagctccg gggagtggaa agagttagag 3 00 

gatgccgtgg ctgctattcc cgttgtggga gaccggtaca atgcggaaca gcaacgtcag 360 

gtaggccgat aa 372 



<210> 185 
<211> 1140 
<212> DNA 
<213> B.fragilis 



<400> 185 

aagattatgg atcgcagaaa tttcttaagg acggcatcaa gttttgcact actcgcggcc 60 

ggagctacaa cgggtgtttc ccgtgtgttt accgaacccc ctatctcttc tttatcagga 12 0 

aatttatctg ataaaaatac gccaaatgcg ggcgatacga tggagtatcg caagctcgga 180 

gagctggacg tatcggctat tggtctgggt tgtctgccaa tggtgggata ttacggtggg 2 40 

aagtacgaca aaaaggatat gatcgctctg attcgccggg catacgacaa aggtgtcact 3 00 

tttttcgata cggcggaggt ttatggccct tacatcagcg aagagtgggt cggcgaagca 3 60 

ctcgctccgt ttcgcgacaa agtgaaaatc ggaacgaagt tcggcttcgg tgtcgaggag 420 

aaacaaccga ctgctatcaa tagccgtccc gatcatattc gttgggcggt ggagggctct 480 

ttgaaacgcc tgcgtactga ccatatcgac ctcttgtatc aacaccgtgt cgatccgaaa 540 

gtgccgatgg aagaggtggc cggaactgtc aaggatttga tgcaggaggg caaagtgctg 600 

cattgggggc tgtcggaagc gagtgccagt tccatccgtc gggcgcatgc cgtctgcccg 660 

ctttccgccg tgcagagcga gtatgccatt tggtggcggg agcctgaaac caaaatcttc 72 0 

ccgacattgg aaaaactcgg tatcggcttc gtgccttatt gtccgctggg gcgtgcgttt 780 

ctcactggga taatcaatga aaacagccgt ttctacgagg gagaccggcg ttggaacttg 840 

ccgcaattca cgcccgaagc tttgaagcac aatatgccgc ttatcgcctt ggttcgcaaa 900 
tgggccgagc gcaagggagt gacactcgcg caattcgctt tgctatggat gttatctcgc 
aaatcgtgga ttgctccgat acccggaacg accaatccgg cacacttgga tgacctgctc 
ggtgcgggaa cggtccgtct ctcagcttgg gagatggagg agtttgataa ggagtatgcc 



960 
1020 
1080 



aaaatcgatt tgatggggca tcgtgccgat ccgttcaccg aaagtcaaat agataaataa 1140 



<210> 186 
<211> 678 
<212> DNA 
<213> B.fragilis 



<400> 186 

tctatcacgc tgaccaaagg atgctggtcg gcatatccta caaaagtgtt tcctgacagg 



60 



79 



aaccatactg acaggttgtg gggcagaact tccaacaacg gacaaacggc tcaaacggcc 12 0 

gatacactgc ctgccattct ccgtgtcgtg ctgaacaacg ggatagagat gccgcagttg 180 

ggtgttggca cgtctactct caaggagact gccgcagagt gtgtgaaaca cgccatcgga 240 

ctgggatacc gtttggtcga tgtggcgcaa ggctacgaca acgaggccga agtgtggtac 3 00 

ggaatcaagg aaagcggtat cggccggagt gaagtgttca ttatttcgaa agtctctccc 3 60 

gatgccgtgc gtagcggaaa ggtacgcgag tcgctcgacc ggactattga agcattcggg 42 0 

ggaacgtatg ttgacctgat gctgattcat tggccggtag ctagaaaggt caaggagaga 480 

tggagaatca tggaaaagta tgtcgatgtg gggaagatcc gtgccatcgg ggtgagcaac 540 

ttcaatccgc atcatgtgga cgaattgctg gcatacgctc gtatcaagcc tgtcgtcaac 600 

cagatcaaga ttcatcccta catggaacat caggaggtcg tgggcaacac ttttgccaaa 660 

ggtattcaag ttcagtga 678 

<210> 187 
<211> 1029 
<212> DNA 
<213> B. fragilis 

<400> 187 

aaaaataata gtatggataa aaggaaatta ggacagctgg aagtatctcc gataggaatg 60 

ggatgtatgg gattcagcca cggttacggg caagtgccac ccgaagcgta tgccatagaa 12 0 

gccatccgcg gggcatacga ctacggctgc acgcatttcg atacggcgga agcctatggc 180 

aaagaacaat tctacgccgg acataacgag gaattggtgg gtaaggcgat tgaaccgttc 2 40 

cgtaagaagg tggtgctcgc caccaaattt catattggtg aactctcgaa accggacgag 3 00 

acgaatctct accgggaggt acgccggcat cttgaagatt ccatgagcag acttcgtacg 3 60 

gattatatcg acctgtatta cctgcaccgt atcagtgagg cagtccggct tgaggatgtg 42 0 

gcaaccgtca tgggacggct tattcaggaa ggactgatac gtggttgggg attgtcgcaa 480 

gtatcggccg accagatacg ggcggcacat aagattactc cattatccgc cgtccagaac 540 

atctattcga tggtggaacg cgattgcgaa acggagattt ttccggtatg ccttgaaaaa 600 

ggaatcggag tcgtaccgtt ctcgccgatt gcaagcggat tcctttcggg caaggtaacg 660 

ccacaggatc agttcggctt cgatgacgtg cggaaattcg tcccccaatt atcgaaagag 72 0 

aatatcgagg ccaaccagcc catactcgat ttgctgcatc ggttcgctgt ggagaaacat 780 

gctaccaacg cccagatatc gcttgcgtgg atgctccata aatatcccaa tgtcgtacct 840 

attcccggtt ccaagaatca ggaaaggatt ctggagaatc tgggagcttg gaatgtcacg 900 

ctttccgatg atgaattccg gcagctacaa tcagcgttgg atgaatgtaa ggtacacgga 9 60 

catcgtgggt gtgtggaaac ggaacagacg agtttcggta aacaatggag tgaagaaaca 102 0 

gataagtga 1029 

<210> 188 
<211> 879 
<212> DNA 
<213> B. fragilis 

<400> 188 

aataaggaaa gtatgaaagt aatatcaaat gcagaattcg gaggtgaaag acctttgttc 60 

gaatcacatg acttacgttt ggagaatgta attatccgtg ccggagaatc agccatcaag 120 

gaatgcagca acatcgaagc cgttgattgc cggttcgagg gaaattatcc cttctggcac 180 

gtgcacggtt tcgttatcga ccgttgtttc ttcgatgtcg gcgggcgttc ggctctgtgg 240 

tactccgata atctgaaaat gacgaacaca cgtatcgacg cccccaagat gttccgcgag 3 00 

atgcacgaca tcgaaatcga gaacgtagag ataaacgatg ccgacgaagt gttctggcgt 3 60 

tgcaagaatt tggacatcaa aaatctgaaa ctgcatggcg gcacttatcc gttcatgttc 42 0 

agcagcaata tccgcataga cggattggag agtgacagta aatacgtatt ccagtacgtg 480 

aagaatgtgg aactgcgcaa tgccaaaatc accacgaaag atgccttttg ggaagtggag 540 

aatgtgacaa tctacgattc agaactcaac ggtgaatatt tgggttggca ttcgcacaac 600 

cttcggttgg tgaactgtca tattaccggc gagcagccgc tctgctatgc ccacgacctc 660 

gtattggaaa attgtacgtt cggccccgac tgcgaccggg ctttcgagta cagttcggtg 72 0 

caggcgacca tcaaaggcgc aataggtggg gtgaagaatc cgcgaacggg ctgtatcacc 780 

gccgagagct acggggagat tatcctcgac gagaatatca aggctcccgc cgattgcaag 840 

ctgaaactct gggacgagaa aacttgtttc acagactaa 87 9 



80 



<210> 189 
<211> 864 
<212> DNA 
<213> B.fragilis 

<400> 189 

cgatattatg gtatggattt caaagaattg aataacggag taaagatgcc gatacaaggc 60 

tttggtgtct ttcagatacc cgatgccacc gagtgcgaaa gagttgttac cgatgcgctt 12 0 

gccgtcggct atcggctcat cgacaccgct tcggtctatg gaaatgaacg ggcggtcggt 180 

atggctattc ggaaaagtgg tattccgcgt gaggaactgt tcatcacgac caaagcatgg 240 

atttcagaaa tgggttatga acggacattg cgagcattag acacttcgct cgcccgtttg 3 00 

ggattggatt acctcgacct ctatctgatc cacatgcctt tcggcgacta ttacggagca 360 

tggcgggcta tggaaaaact ttatgcgaaa ggacgtgtgc gggctatcgg ggtatgcaat 420 

ttcgagccgg acagattgct ggatttatgc cataatgcta atgttattcc ggcggtcaat 480 

cagatagagg tgcatcctta tactccgcaa accgatgcga tacggaccat gcaggaactc 540 

ggcatacaag cagaggcatg ggggcctttg gccgaaggac ggaatggatt gttcacggac 600 

gatattctga ccggtatcgc tcgcaaatat gataaatcgg cagcacaggt cgtactgcgc 660 

tggcacttac agcgcggagt tgtcgccatt cccaaatcgg tacatcggca gcggatgcaa 72 0 

gagaatttca acatcgggga tttcatgctg acaccggagg atatggccgc aattgcttcc 780 

atgaatatgg gatacgatat gattctcgac ctacacgctc cggaagaagt acagcgactc 840 

tatggtattg agtgtcctgc atga 864 

<210> 190 
<211> 684 
<212> DNA 
<213> B.fragilis 

<400> 190 

ttggagatta tgataaaagc aattggattg actaagatat tccgtacaga gagtgtacag 60 

actattgcat tgaatgaaat cagtatcaat atatcggaag gcgaatttgt agctataatg 120 

ggaccctcag gatgtggcaa atcgaccttg ctgaatatac tgggactatt ggacaatccg 18 0 

acttccggtg agttgtggtt catcggtaaa gaagtttccc gctactcgga aaatgatcgt 240 

acagacatgc ggaacggcaa tatcggcttt gtatttcaga gctttaacct gatagatgaa 300 

ctgactgtat ttgagaatgt agaattaccg ttgctatatg ccggtgtgcc ggttcgtgag 3 60 

cgtgtagatc gagtgaacaa agcgttagaa aggatgcaga taagccatcg tacggagcat 42 0 

tatcctcaac aactttccgg aggtcaacaa cagcgtgtgg ctattgcccg ggctattgtg 480 
acgaacccga aaattatatt ggctgacgaa ccgacgggta acctcgattc taccaatggc 
aacgaggtga tgcttttatt gaaggagtta aataaagatg gagctacagt cgtgatggta 

actcactctg aagaaaatgc ccaggaggca ggccgtattg tgcggatgat ggatggttgt 660 

atcctgacgg agaacagacg atga 684 

<210> 191 
<211> 1368 
<212> DNA 
<213> B.fragilis 

<400> 191 

gtagattata tctatctttg tgacacaatt tataaagcaa caaaaagaaa catgaaagat 60 

acacctatca aacggcatct aattgatgaa actatcgaag aatttcaaat tacagatttc 12 0 

tcaaaagcaa ccattcgtga agtaaaagcc atagcagcta aagcagaaac agcatccgga 180 

gtcgaattta taaaaatgga aatgggcgta ccgggtctcc ccccttctac tgtaggagta 2 40 

aaagccgaga tagaagcatt gcaaaatgga atagccagtt tgtatcccga tattaatgga 3 00 

ctaccggaac taaaatcgga agcctccaaa tttataaaag catttatcga tatagatctc 3 60 

aaaccggaag gttgcgtacc tgtcacggga tccatgcaag gtactttcgc atctttcctt 420 

acttgcagtc aatgcgatga aaaaaaagat actattctgt tcatagatcc tggctttccg 480 

gtccaaaagc agcaattggt ggtcatggga cagaagtacg agacatttga tgtatacgat 540 

tatcggggag acaaattaaa agaaaaactc gagagctacc tgaaaaaagg aaatatttca 600 

gctgttatat actcaaatcc gaataacccc agctggatct gtttaaaaga tgaagaactg 660 

aaaatcatcg gtgaactagc cacccaatat gatgtaatcg tccttgaaga tttagcttat 720 



540 
600 
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tttgccatgg 
gtggcacact 
gcaggccaac 
ggattcgata 
tatgccctct 
gcagcgaacg 
cgtaaattga 
gaagatcctg 
gagctggcaa 
agccaacaac 
ctggatgaaa 



acttccgcca 
atacagataa 
gtattggtgt 
aacgctacgg 
cttcagggac 
aaggtaaata 
aagaaatatt 
ttgccgacgg 
aagagttgat 
agggactacg 
gaatgaagtt 



agatctgagt 
ttatattttg 
cagctgtatt 
aggcggtact 
gagccattcg 
caatttcctg 
cttgcgttac 
tttctatttc 
gtattatggg 
tgcatgcact 
atttgccgaa 



actccgtatc 
cttatatccg 
tctgataaat 
tttggcactg 
gcacaattcg 
aacgaagtga 
ggattccatc 
accataggct 
gtcagtgcaa 
tcctttatca 
aatcatccta 



atgcacctta 
gttccaaagc 
tataccatcg 
tatttatcca 
ccatggcagc 
ggatatatgg 
tggtatacga 
atccgggaat 
tttccttggt 
aagagcacca 
tatcttaa 



tcagccttcg 
cttcagttat 
ccattatccc 
tcgtgtgctt 
tatgctgaaa 
tgaacgcgcc 
caaagatctt 
gacaagtgga 
tactacaggt 
atatgctcaa 



780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1368 



<210> 192 
<211> 1497 
<212> DNA 
<213> B.fragilis 



<400> 192 

ttagcaatca 

actattgtat 

ggttcgaatg 

ggaatgattg 

gcgatggata 

gcccatatac 

gatacccgta 

atgctgggca 

cgtgatatgg 

tacactcaca 

ctgatcgccg 

agcggagtta 

tggatgtcgc 

atgaccggac 

gcgcagaaga 

ggccttggca 

ggtgacgaca 

ctatttacaa 

atgacaacga 

cgtagaaaac 

cttgtcgatg 

acatacggac 

gaccgatggg 

tttgcccggc 

cttacttttg 



tcgtttgcaa 
gctactttgc 
cagcgttttt 
gcgcatctat 
tgacgtatat 
tcctcccact 
tcggaaagcg 
cagctgcaaa 
gtatcccatt 
aaagtggcat 
cactgatcag 
tacaaacaat 
gccagaattt 
tcgatcagga 
acatgtattg 
ttttattact 
tcctgccgct 
tcggtatcat 
gcttttgcat 
gaaaccgggt 
cattgaataa 
ctcttctggg 
tgccgtttat 
aggaaaccgg 
caggaatatg 



aataagtgtt 
cattctcttg 
taaaggagaa 
ttcgggagta 
gcaaaccgta 
ctactataaa 
tgcctaccgt 
actatacctt 
ctggagtatt 
taaaacaatt 
catccttgtt 
cagcagcaat 
cttcaaacag 
tatgatgcag 
ttatggcttt 
ggtccttgct 
gtttgctacc 
tgccgccgct 
cgacttgctc 
acatatagga 
ccaaagcgtc 
aatgtttgct 
tgcgatagct 
ctatcagttc 
gatcgtatca 



acctttgcgc 
ctgatagccc 
aaccagtctc 
acctttgtat 
ttcggctttt 
ctcaacctga 
acaggagcct 
gtctgtctga 
gctgccggat 
gtctggacgg 
tttgtcactg 
gaacacagtc 
tttctaagtg 
aaaaaccttt 
gcattcgctc 
caagagatgc 
cagggttatc 
ttcagcaatt 
gacacaggca 
ctatccgtcc 
atcgatgcta 
ttcggattat 
tcaccactga 
ggatacgaat 
aagaaacaac 



tcgttatgat 
gtatcaccgg 
catggtacgt 
ccgtaccggg 
tcttcggtta 
ccagtatata 
cttttttcct 
ttctatatac 
cggtagcttt 
atactttaca 
caaagttaaa 
gcatctttgt 
ggatttttat 
cctgccgtag 
cgctcaacct 
agttggaact 
tgggcgaagg 
cggattcagc 
aagacacaga 
tacttatttt 
tttacatcat 
tcacccaacg 
tttgttacgc 
tattgatgct 
taaaaaatga 



gatacttgcc 
acggaaagga 
cgttgctttc 
catggtaaag 
tctggctgtc 
cacttatctg 
tctttcgcgt 
ctacgtattt 
agtatggata 
gactttctgc 
tcttgacttc 
atttgatgac 
tgttattgtc 
tctgcgtgac 
gctgtttctg 
tccggctgcc 
agtacttatc 
cttaaccgcc 
ggaagaagcc 
ctttatctgc 
agcctcctat 
aaaaacaaac 
agccgataga 
gaacggcatc 
attttaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1497 



<210> 193 
<211> 426 
<212> DNA 
<213> B.fragilis 



<400> 193 

tatcccacca 

cgatactcca 

gaagagatag 

agtgcaaaac 

attgttcatt 

acggatattt 

tatttcagtt 

aattaa 



ttggcagaca 
tcgtatcgcc 
ggggttcggt 
ttgatgccgt 
ttatgattgc 
atacccgaat 
ttctttttaa 



acccagacca 
cgcatttggc 
aaacacacgg 
ccttaagaaa 
aaatttaccg 
cctcgaaata 
gaagataatg 



atagccgata 
gtatttttat 
gaaacacccg 
tttctgcgat 
cgattgaata 
gtgcatgagc 
gggcttgctc 



cgtccagctc 
cagataaatt 
ttgtagctcc 
ccataatctt 
agtcggcttg 
tcagatttcc 
ctaatgaata 



tccgagcttg 
tcctgataaa 
ggccgcgagt 
ctattttttc 
tatacgattt 
ttcgatacaa 
tcggttaata 



60 

120 

180 

240 

300 

360 

420 

426 
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<210> 194 
<211> 495 
<212> DNA 
<213> B.fragilis 



<400> 194 

cattgtaaca tgtttgcaac ttggttgcaa 
aaaaccaaac agatgaagcg tatctttttc 
ttaatagtgc tttccgctat tccgcatcac 
gaattatgtg aacaagacga tatctacaat 
gatgcacata atgaaaacac ctgtgtatca 
gataaaagta atctgcatga tggaagcctg 
tttgctgaca ttctgactat tcatttcgat 
tatgttgtct cttatacgtc cgtagtgctg 
tattttttct cttaa 



gaaatatact 


ctatttttgt 


cccgaaagta 


60 


gtatatccac 


ttgccatagc 


gacactattc 


120 


catcataaag 


agatgatgtg 


tacggtgatg 


180 


gatgggcata 


cggatcatga 


ggcggggcaa 


240 


caagctggtt 


atatttttcc 


ttccagcgtt 


300 


atgaatatcc 


acttgccggt 


tctatatctg 


360 


ataccaatct 


ctgaaaacac 


atacgatagg 


420 


ggtgagagca 


gcggattgcg 


tgctcctccc 


480 
495 



<210> 195 
<211> 600 
<212> DNA 
<213> B.fragilis 



<400> 195 

attaggggca tgaacatcaa tactgacata tttaagatac aatcgaataa tgtaatgccg 60 

tcgagaggaa agattctgat atcagagcct ttcctccacg atgtaacttt cggaaggtca 12 0 

gtagtattgc ttgttgatca tactgaagaa ggaagtatgg gattgattat aaataaacca 18 0 

ctcccattga tgctcaatga tatcattaaa gaatttaaat atatagaaga tattccgtta 240 

cacaaaggag gtcctatcgg aactgacact ttgttttatc tgcatacttt acacgaaata 3 00 

cccggaaccc ttccgatcaa caatggatta tatctcaacg gagatttcga tgctatcaag 3 60 

aaatacattt tacaaggaaa ccctataaaa ggaaagatac gctttttcct cggatattcc 42 0 

ggctgggaat gcgaacaact gattcaggaa ataaaggaga atacctggat tatttcaaaa 480 

gaagaaaata cctatttaat gaatgaagat ataaaaggta tgtggaagga agccttaggg 540 

aaattgggca gcaagtatga aacctggtcc cgcttcccac aagttccttc tttaaactaa 600 



<210> 196 
<211> 228 
<212> DNA 
<213> B.fragilis 



<220> 

<221> unsure 

<222> (10) , (11) , (13) , (14) , (15) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 196 

tgcactggtn ntnnnaatag cgccaccgtc ggatccttcc agccccgtgg tgaagactat 60 

ctttttatgt atcactggct ggatgagttt gcttaccgga ctactatgag ctggtggctg 120 

tttttggggg gcggactgat tattgcgggg attacgttat taactgttat cggacaaacc 180 

tggcggacgg cttcacagaa tccggtgaga tcattgagat atgaataa 22 8 



<210> 197 
<211> 249 
<212> DNA 
<213> B.fragilis 



<400> 197 

aatccatgct ttggtcgtga tgaacagttc ctcacgcgga ataccacttt tccgaatagc 60 

cataccgacc gcccgttcat ttccatagac cgaagcggtg tcgatgagcc gatagccgac 12 0 

ggcaagcgca tcggtaacaa ctctttcgca ctcggtggca tcgggtatct gaaagacacc 180 

aaagccttgt atcggcatct ttactccgtt attcaattct ttgaaatcca taccataata 240 



83 



tcgttataa 



249 



<210> 198 
<211> 423 
<212> DNA 
<213> B.fragilis 



<400> 198 

aaaactaaat 

atgttaattt 

gtagtcagtg 

acttatgaag 

tccaccggag 

cgatttttca 

gcaggacgta 

taa 



ttatggactt 
cctgtagcaa 
ctgatggcaa 
agttcaaaaa 
ttgccacttt 
ctttcgctgt 
ccgtacgccc 



aaaaaagaca 
cgatgatgaa 
acctctgccc 
agacaatcga 
catttttact 
ccaatatggc 
gggttcagtt 



actttctact 
aacaaaaatg 
aacgaaattg 
acaactccta 
tatgataagt 
agtggtacag 
acacaaatcg 



tatttacgct 
atgcgcaggt 
tgcaaatgtt 
cggcatatgc 
ggttcgaatc 
aaaattatga 
agttgaaact 



ctttagtttg 
aacagttact 
cgatgaaaag 
attaactaac 
aaacaaagac 
aatatggtct 
taagccttta 



60 

120 

180 

240 

300 

360 

420 

423 



<210> 199 
<211> 186 
<212> DNA 
<213> B.fragilis 



<400> 199 

acttcatcat cagatatttt atttttaaat tttataagaa gtacatgtgt gtttttcata 
tcatttgtaa ttgttatggt tgtaatgata gcaatattcg gtaataaaaa gcagaaaagc 
aagaaaatcg atgtttattt tcttgctttt ttacatgggg acgattcttg tcacggtata 
ccgtga 



60 
120 
180 
186 



<210> 200 
<211> 384 
<212> DNA 
<213> B.fragilis 



<400> 200 

gtgaaagagt 

acagcttttg 

attacagcgt 

ccgattgaaa 

ggattggcat 

aacgagtatc 

aataagaagc 



cggtacgcat 
ttatttggtt 
acatagtagc 
ataaaaagaa 
atagtgccca 
tggcacaatt 
ttacattcag 



ttttagattt 
gatgatggat 
ccaaattcat 
caacatttgg 
gttcttgttt 
cctggggctg 
ataa 



gccgtaattg 
gaattgtcat 
aactttattt 
aagcagatgt 
ttagtcacac 
tttatctacg 



gtacgctcaa 
acgattacat 
ggagtaaata 
tgtttttctg 
ttgtagagtg 
gaacagtaaa 



tgcattaatc 
tccggccaat 
ttggatcttt 
ttctgctttc 
tggagatgta 
cttcatcgtt 



60 

120 

180 

240 

300 

360 

384 



<210> 201 
<211> 3177 
<212> DNA 
<213> B.fragilis 



<400> 201 

aagtcaagcc 

tcaaagttct 

gccggattgg 

acagtacagg 

ggcattccca 

gcgtccagct 

atggccactg 

gtcatcgttc 

atgcaggcac 

ctggttgacc 

aattacagca 



cgtatcagtc 
ttatcaatcg 
taacattaaa 
tatctgcctt 
tagaacagca 
cgggtgccta 
tacaagttca 
agggagtaac 
aagactctgt 
aattgacacg 
tgcgcgtctg 



aatcacgagt 
acctatattc 
tatattgcct 
ctatccgggg 
agtaaatggc 
ttcgttgacc 
aaaccgggta 
ggtacagaag 
atacgacggg 
tgtacccggc 
gctcgatccg 



caccaacctc 
gccacggtac 
gtcgcacagt 
gcaaatgctg 
gtagacggta 
attacttttg 
agcgtagcac 
caatcgtcca 
ctttacctta 
gtaggggctg 
gaagcaatgc 



taatcgttaa 
tggcattgat 
ttccggagat 
agaccgtagc 
tgctgtatat 
ctgtcggtac 
aatcttcgtt 
atattgtgat 
cgaactacgc 
tcaatgtaat 
gcatccgtaa 



ctgcatgttt 
catcgtggtg 
aactccgcct 
ccagactgtc 
gagctctaca 
agacatagat 
accggaacct 
gtttctcacg 
tcagttgaat 
gggagcgggc 
cctctcgccg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 



84 



gcacaaatct atcaggctat ccagtcacaa aacatagagg tcagtgccgg ttatatcgga 720 

cagcctattg gcaaaaacaa caataatgcc tatcagtata ccttgaatgt acaaggtcgc 780 

ctgacgtctc ccgaagagtt cggcaacatt attatccgaa ctgaagaagg agggaaaatg 840 

ctccggctaa aagatgtggc gcgcatcgac ctcggcagtt cttcgtacaa cgtagtgtcc 900 

aaactaaagg gacaccctac tgctgccatc gctatctatc aacaaccggg ttcgaactcg 960 

ctcgatgtct ctaaaggagt caaggcaaaa atgcaggagc ttgcacaaaa cttcccggcc 102 0 

ggagtcagct ataacgtgac cttggatacg accgatgtca tcaatgcatc cattgatgaa 1080 

gtactcgtta cttttctgga aacaacctta ttggtggtac tcgttatctt cctgtttcta 1140 

cagaactggc gggctgtcat cattccatgt atcaccattc cggtatcact gatcggtaca 1200 

ctggcagtca tggcggcact tggattttca atcaatactt taactctatt cggattgata 12 60 

cttgccgtag caatagtggt ggatgatgcg attgtggtag tagaaaatgc ttcacgtttg 1320 

ctggagacag gacagtattc tcccaaagaa gccgtcacca aagcaatggg agaaatcaca 13 80 

ggaccaattg tcggagtggt attggtatta ttggcagttt ttatccctac cacattaatc 1440 

agcggcatct ccggacaact ttataagcaa tttgccctaa ccattgctgc atctaccgta 1500 

ttaagcggta ttaattcgtt gacactgacc ccggcattat gcgcactgtt tctggagcat 1560 

aacaagccat ccaatttctt catatacaag ggattcaata aggtatatga taagacacag 162 0 

aatctatatg accgtatcgt gaagggatta ctcgtccgtc ccggccttgc gttgatctct 1680 

tatggtatta ttacggcagt ggctgttatc ctgttcatga aatggccttc aaccttcgtc 1740 

cctgatgaag atgacggcta cttcatagct gtcatccagt tgccaccggc ttcaagtctg 1800 

gaacgcacac aggctgtggg tcggaaagtc aatcagattc tggacagtta tcctgaagta 1860 

aaagactata tcggtatcag cggattttct attatgggag gtggcgaaca gtccaacaca 192 0 

ggtacttatt tcgttgtctt gaaaaactgg gaccaacgga aaggaaaaga gcatactgct 19 8 0 

gcggctgtgg tcgaacgttt caacgagatg gcttatggca tccaggaagc acagatattc 2 04 0 

gcaatggtac ctccggccat tcccggatta ggagcttcag gagggttaca gctacaattg 2100 

gaagatcgca ataacctagg gccgactgaa atgcaacggg ctgtcgaaac cctgatggct 2160 

acttatcaca ctcaacccgc tctcgcatcc atatccagta tgtaccaagc caatgtacca 2220 

cagtatttcc tgaatatcga ccgcgataaa gtacagttta tgggcattca gttggataac 22 80 

gtattctcta cactgagtta ttatatggga gcggcctatg tcaatgactt tgttcaattc 2340 

ggacgtatct atcaggtaaa gatagaggcc ggagaacaag ctcaaaaagt aattgacgac 2 400 

gtgctgaaac tcagcgtccc caatgctaaa ggagatatgg tcccattttc atcctttacc 2460 

aaagtcgaag agcgcttggg aatggaccaa atcagccgtt acaacatgta ctcgacagca 252 0 

tctatcacct gcaacgtggc ttcgggaagc agttcgggtg agggaataca gcaaatggaa 2580 

gacctgatta aggagcaact gggtaacgag tttggctacg aatggacctc ggtagcctat 2640 

caggagacgc aagcaggcaa cacaaccacc atcgtattca tcatggcatt attggtggca 2700 

ttcctggtac tggcagccca atacgaaagc tggacaagcc ccttatcagc aattatggga 27 60 

ttgccaatgg ctttattggg agcaatgata ggttgttctg tcatggggac ccctgtgagc 2 82 0 

atttatactc agatcggcat cattttactg attgcccttt ctgcgaaaaa cggaatcctc 2880 

attgttgaat ttgcacgcga cttccgtgcc gaaggtaact ctattcgcga tgccgcctat 2940 

gaagccgggc atgtccggct gcgtccgatc ctgatgacct cttttgcatt cgtattggga 3000 

gtgatgcctc ttctgttcgc cacaggggcc ggggcgcaaa gccgtatcgc actcggcgca 3 060 

gctgttgttt tcggtatggc cctgaacacg ttactggcaa cgatatatat cccgaatttc 3120 

tacgagctga tgcaaaagtt ccaggaaaac atattggatc gcaagaaaaa gaaatag 3177 

<210> 202 
<211> 450 
<212> DNA 
<213> B.fragilis 

<400> 202 

atgttatcgt taaacctacc agtatttgac actaaaatcg ccactcgaaa tggaaaaaat 60 

gttattttcg atgtgattcg ccgtcgttat gtcgcattga cccctgaaga atgggtccgt 120 

cagcactttg tacactttct tattgttcat aaggggtatc cgtcgtcttt gatggcaaat 180 

gaagtgctgc tgaacctgaa cgggactaaa aaacgatgtg acacagtgct atataaacgc 2 40 

gatcttagtg ccagaatgat tgttgaatat aaagctcccc acattgagat tacgcaggct 3 00 

gtttttgatc agatcacccg ctataatatg gttttgaaag ttgattatct ggttgtcagt 3 60 

aatgggatgc aacactattg ttgccggatg gattatgata ctcaaagtta ttcgtttctg 42 0 

tcggatattc cggattatga cgctttataa 450 



<210> 203 



85 



<211> 426 
<212> DNA 
<213> B. fragilis 

<400> 203 

agactcaaac caatgaaggc atttttaccg ttacttctct cttttttctt tattatttca 60 

tgccagcaac acaaagaagc tactatatct cctatcgatg aagaagatga attgcaggaa 12 0 

gaggccgata gccttccccg tgcgacagcc attttttggc ttgataaata tcatatgaaa 180 

gagctgaaaa aggacgatgt gcttactttc cgtacggcta aggctaaagt catcattcgg 240 

aatgatggga caatcgagct tctgtcgttt gtggaacaac agcctgggaa tgcacaacga 300 

tatatccgtt accgactgaa agatttcaag gttaagaaaa tcttgatgga taacggctat 3 60 

atcaatccgg gtgaacaata cgtccaactc cgttatatac ctgcacttgc aaggcgcgtt 42 0 

aaatag 426 

<210> 204 
<211> 1062 
<212> DNA 
<213> B. fragilis 

<400> 204 

atgatggaac caacttgcat gagcgaaaac aagaaaaaaa taatattcat cgttaatcca 6 0 

atttcgggta cacaaagtaa ggaacttgtt ctgagtctac tggatgaaaa gatagataag 12 0 

gaaatgtata cttgggaaat tgtgtatacc gaaagggccg gacatgcaat cgaaatagca 180 

gcagatgcgg cagataaaaa tacagatata gtagttgctg taggaggaga cggaacaatt 240 

aatgaaattg cccgttcatt ggtacacacc aatacagcat tgggaattat cccttgcggc 300 

tctggaaacg gattagcacg acatcttcaa atttcaatgg atccgcgtaa agcacttgaa 3 60 

attttgaatg atgggataat cgatatcata gattacggaa aaataaatgg cacagacttt 42 0 

ttttgtactt gcggagtagg gtttgacgct tttgtaagtc tgaaatttgc taatgccggc 480 

aaacgtggac tgctgactta tctagagaaa accctgcagg aaagtctaaa gtatcaacct 540 

gaaacttatg aattggaaac agaagacggt acttccaaat ataaagcctt tctcattgct 600 

tgcggcaacg cttctcaata cgggaacaat gcttatatag ccccacaggc cactctgaca 660 

gatggtttgt tagatgtaac cattctcgaa ccgtttacgg tattagatgt tccggcacta 720 

gcctttcagc tcttcaataa aacaattgac caaaacagtc gcattaaaac tttccgttgc 780 

aaaaagttat gtattcatcg cagttcgccg ggtgttgtcc attttgacgg cgatccgatg 840 

caggctgacg aagatatcaa aatagaactg attcagaaag gactgcgggt cgttgtacct 900 

ggtgataaaa aaaaagataa tcccaacgta ttacaaaaag cacaagaata cgtaaacggt 960 

attaaattga taaacgaagc tatagtagaa gatatagcac ataaaaataa agttattctg 102 0 

aagaagaata agcagctgat acaaaaactt actaaaaaat ag 1062 

<210> 205 
<211> 951 
<212> DNA 
<213> B. fragilis 

<400> 205 

atgattatgc ctaaaaacta tactttacaa aacgcctcca atttaggttg gctattctat 60 

aaagactatt atagacaaga accgaatgta gatttcattt ctacacaagg aaaagaaagt 12 0 

gatacaactg ctgatttttt cagaaaaacc aatcagagaa tcactgctta tcaattaaat 180 

tccgaatcac cattagttgc agcattcaac aaccattttg gtacaccgtt gcaactaaaa 240 

accatttatc cgggtttaat aacaggtagc ggacttccgc atcagacagg tagtaaagga 3 00 

gaatttaaat taggatttca atttgattat actaccggac ttccctatat tcccggatca 360 

tctatcaaag gaactttgcg cagtatgttt cctttttcat tgaaagataa aggctctact 42 0 

aaacgtattc taccggaata tagaaaagaa cgtatggaat atatccgaga cttaataata 48 0 

gaagtaacca atataaatga aatttcagac acagaaattc aggcattaga atatgccata 54 0 

ttcactaaca gtactccatc tggcaaaacg atagaattct ctcttgaaga aaaagatgtc 600 

ttctatgatg cttttgttgc agattcaaag gatggagtaa tgttaagcga tgactatatt 660 

actcctcatg gcgagaatcc attaaaagat cccaaaccta ttttgttctt aaagatcaga 72 0 

cctgatgtaa caataaactt ttatttcaaa ttgtgtacta ctcacttata caaagaaaag 780 

gtatgtagtt caaaacaaat agaagagatt aaaaaacaaa atgatttctc ttcttcggac 840 
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tacaaaatga ttacggcaca ccagaagcga aacctatttg agaaaattct cctttgtatc 900 
ggaatcggag ctaaaaccaa tataggatac ggacaattaa agaaactcta a 951 

<210> 206 
<211> 282 
<212> DNA 
<213> B. fragilis 



<400> 206 

ggggagaagt 

aacattggga 

gtaactattt 

cggggctgtc 

tatttcggtg 



tcagacataa 
aaatgatcaa 
taataagctg 
caaaaagcaa 
aaaagcctca 



cggcttggat aagatcgtca tggacttcgg tattgctttt 60 

taagcaggaa aagaagaaga gaggcagaac taatttattg 12 0 

tggaatagct taccaaaaat acacaaaggc gataatacta 180 

agtgcccccc aaaagtcgga tagccccttt taccattgtt 240 

tattaccgtt gtgaaaaatt aa 2 82 



<210> 207 
<211> 405 
<212> DNA 
<213> B. fragilis 



<400> 207 

ttaatcgata 

gaggaactta 

aatcctaaaa 

gaaattatgc 

catttttcct 

aaggatggtt 

gctattcttg 



ctatcaggaa 
gaaatttcta 
aaggatttga 
aaagagaaga 
tttctgtcgg 
ttgttatcga 
atcctgaagg 



tatgcacatc 
tatcacttat 
atcttatttc 
tatcacaaca 
tagcaaagaa 
gagtgagcca 
aaacatagta 



agccacattg 
ttcaacggaa 
atcagttttg 
cctgcattaa 
gctgtattgg 
cgaaccaccg 
gaaatcacta 



ccatctggac 
caagtaatga 
atcagggatt 
aagactgcct 
aactcacaga 
gagacggcta 
tttaa 



tacccgttta 
aaagtatatc 
tgcttctctg 
cgggttagct 
acaactccgt 
ttttgaaagt 



60 

120 

180 

240 

300 

360 

405 



<210> 208 
<211> 711 
<212> DNA 
<213> B. fragilis 



<400> 208 

ttaagagtaa 

gataatgtgc 

atacctctgg 

ttactggatt 

acagaatact 

gagttgggag 

caggattggc 

cctccccggt 

aattcgagcc 

ttccttgtat 

acttgcgaga 

aaagcattgg 



cattggatag 
gtgcaaaacg 
gagttacgat 
ccaggcagaa 
tcaaattatc 
agatgcggat 
ttcgaaaagt 
tgtatatgct 
gtgggcgatg 
tgttgccaaa 
tgaatcatgg 
aactacgcga 



ggttatcgaa 
gcttgttttt 
gcgagaggta 
gttggtgcgc 
actggttagt 
tatctgtcct 
gattgaagaa 
ttcagagaag 
gggaagctgt 
acatctgata 
agatcgcttt 
agagttgaag 



gataaagagt 
cgtacgaaag 
aaagaggcaa 
cctttgattg 
ggtaaacgag 
ccaacagctg 
gctttgcgac 
caccgtttac 
tcctctcgta 
gattacgtcc 
tgggacttgc 
aggtacaaga 



tagggcgttt 
cggatgctat 
tagagaagtt 
acctgaacta 
agaggttttt 
attttacaga 
ggaatgcaaa 
cctacgagag 
aaaagataaa 
ttttgcatga 
taaatgggct 
ctgagatctg 



ggttgtacgc 
ttacattagt 
gcgtccccga 
tcggattgag 
ggcacattca 
ctcgaatttg 
gattatcttg 
cgtgcagata 
tctctcttat 
actttgccat 
taccgatggg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

711 



<210> 209 
<211> 249 
<212> DNA 
<213> B. fragilis 



<400> 209 

ccgttaaaaa caaatcggag tatgagaaat ttttttgtaa gtgccttttt attattagtc 60 

ggtattgccg ttatgactgt ttgccgaatg aataataagc aatgtttgag tgaattggct 12 0 

ttagtgaatg ttgaagcgct tgctacaggt gaaggagatg ttcctacaag ttgttatggc 180 

agtggtaatg tagattgccc tataagcgat agcaaagttt cctatgttat gaatgggcgc 240 

agtttttga 249 
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<210> 210 
<211> 1506 
<212> DNA 
<213> B.fragilis 



<400> 210 

catagtcgaa cagataaaac tattagaatt atgatatata gttatcacat attttatttt 60 

ccatttaaat gggaaattat gggattagaa aatcaagcat tttctgacca agttaatttg 120 

gacaacattc aatataaccg gaattcttat tgggaacgct cacaaaagcc agatcctgga 180 

gaagaggagt cattatataa cgaaaagaac tattattata catttgtaca caatatatta 240 

tatgatgaag agcacagtcc attaaatcta attcaccatt tcgaacgcaa agaacctaag 3 00 

ctaagtaatc acatttacta ttatataaag aaaaaagggc gtaataatcc atataaactc 3 60 

attgtagacg cgatgaatat taatctatat gctacaggtg tcggattctt gtcattttat 42 0 

ctaaaaaatg aagattgcac tcaaaacagc ccggaagaca tattggctat caatcaatat 480 

gggcgccgta tcatgccccc ctttttcaat gatacaagac tacgaaatga gatttcagaa 540 

tacattcgga tagaaggttt aaatcaaaca gtttattttg aagatttcaa atcatatact 600 

ccctatgaca gctggcagcc ttcctcgtcc ataaaaaagc taatttgtga attagttacc 660 

aatttatcaa ttgaccctat tatagatgat cgtatgtttg tggcaacatg gtacaaaaac 72 0 

aatcagctat ctcaacaatt tacaaataat gcgaaagctt actttgatag ccaggatcca 780 

ttttcagatt actggtatcg ttttctgttt atagatggaa gtaatgccac ttgccaaaat 840 

gagaaaatga aaaaagaact attggaggaa catacctatt atcgttggca acaatggagt 9 00 

tcactttatg gtatcagtaa atattcatta gtatacctta ctaataatga agtacccgat 960 

tacctgatag aatattttca aacgatctat gcacgtatgg ccgaactagt attagttcaa 102 0 

cgtgcttcca tgttaagatt ttccggagaa atcactaaag taagccaatt atccaatcag 1080 

gatgtagaag ccgtatctaa acgggttagt tctttatata aagaatatat tcgtttcgta 1140 

aatcaaatct atttccgtga gattacagct caagaccaag gaatcgaaat gtacaacaag 12 00 

cttcactctt gcttgcaaat ggaaagttat ataaaggatt tagatggaga aatagaagaa 12 60 

ctgcatcaat acatttcttt aatggaagat cgggagcgaa acaaaaaagc aagtttgctt 132 0 

aatgatattg ctactttatt tttacccatt acagtaatta ccggtttttg gggaatgaat 1380 

caaatcagtg aagtgatgga agaaaatgga gaactctcga ccggctttat cattcaatct 1440 

ctattattaa taataggtac actttgtgcc atatgtataa tctataaaag aaaaagaaaa 1500 

ctatga 1506 



<210> 211 
<211> 798 
<212> DNA 
<213> B.fragilis 



<400> 211 

tatatgggaa ctattgatat atcttacttt aatctgctta tagggctact gttattggta 60 

atcccacttt tttatctttg gaagttcaaa accggattac tgaaagccac cctgataggg 120 

acagcacgca tgatcgtgca actcttcctg ataggtatgt acctgaaata ccttttcctg 180 

tggaataacc catggattaa cttcctgtgg gttatcatca tgatttttgt agccggacaa 240 

acagctttgg tacgtacagg acttaaacgt gaaatactcc tgatccctat atcagtaggt 3 00 

ttcctctgta gcgttgtgct ggtgggcatg tactttattg gcattgtatt acaactggat 360 

aatgtattca gcgcccagta ttttattccc attttcggaa tcttaatggg aaatatgtta 420 

tcaagcaacg tgattgcctt gaacacttat tatagtggat tgaaacgtga acagcaattg 480 

tactgttacc tgttgggcaa tggtgccact cgtcaggaag cacaggcacc attcatacgg 540 

gaagcgatta tcaaatcttt cagcccactg attgccaata tcgcggttat gggattagta 600 

gcacttccag gcacgatgat cgggcaaatt ttgggaggca gcagtccgaa cgttgccata 660 

aaatatcaaa tgatgattat ggtcattact ttcacagcct ctatgttatc attaatgatc 720 

accatctcgc tggcatcccg taaatcgttc gatgaatacg gacgtatttt gcaagtaacc 7 80 

aaagaatctc aaaagtag 798 



<210> 212 
<211> 2004 
<212> DNA 
<213> B.fragilis 
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<400> 212 

gaatatatga ctgtaaaaga aaaaatagaa caactccgtc tccaactcca tcagcataat 60 

tacaattatt atgtgctgaa tgccccggaa atctcagata aagaattcga cgatttaatg 12 0 

agggaacttc aggacctgga acaggaacat ccggaatata aagacgaaaa ctcgcctact 180 

atgcgtgtag gtagcgatat caataagaat tttacccaag tagcgcacaa atatccgatg 240 

ctttcattgt cgaatacata ttcggagaat gaagtaaccg acttctatga cagagtgcgt 3 00 

aaagctttga atgaagattt tgagatttgt tgcgagatga agtatgatgg tacctctatc 3 60 

tctttaactt acgaaaatgg taaactgatc cgcgcggtaa cccgcggtga cggtgaaaaa 420 

ggggacgatg taacagacaa cgtaaagacc attcggagta ttcctctcgt cctacatgga 480 

gataattatc cggaagtttt cgagattcgt ggagaaatct tgatgccatg ggaagttttc 540 

gaagcattaa accgggaaaa agaggcccgc gaagaacctc tctttgcaaa tccgagaaat 60 0 

gccgcatcgg gaacattgaa attacaaaat tccgccatcg tggcttcccg taagctggat 660 

gcctatctct attatctgct tggcgataat ctgccgactg acggacatta tgaaaatctg 72 0 

caggaagcag ccaaatgggg atttaagatt tccccgttaa tgcgtaagtg ccagacacta 780 

caagaagtct tcgactttat caactattgg gacgtagagc gcaaaaacct gaacgttgct 840 

acagacggaa tcgtactgaa agtaaacagc ctcaagcagc aaaggaatct tgggttcaca 900 

gccaagtctc cccgctgggc cattgcctat aaatttcagg ctgaacgtgc actgacccgc 960 

ttgaacatgg taacctatca ggtagggaga accggcgccg taacaccggt agccaatctc 102 0 

gacccggtac aactttcggg cacagtagtg aaacgcgcat cattgcataa tgcggatatc 1080 

attgaaggac tcgatttgca tataggcgat atggtctacg tagaaaaggg aggagaaatc 1140 

atccccaaaa taaccggtgt ggatacgtcg gcccgcttca tgatcggtga aaaggtaaaa 12 00 

ttcatcactc actgtccgga atgtggcagt aagctgataa gatacgaagg agaagccgcc 12 60 

cattattgtc cgaatgagac cgcctgtcca ccacaaatca aaggaaaaat agagcacttc 132 0 

atcagccgga aagcaatgaa tatagacgga ttaggacctg aaaccatcga catgttctac 13 80 

cgtttaggac tgattcgtga cacggccgac ctctatcaac tgacgacaga tgacatcaga 1440 

ggcttggacc gtatgggaga caaatctgcg gaaaacatca ttaaaggaat catgcagagc 1500 

aaagaggtac cttttgaaag agtaattttt gcattaggta ttcgttttgt aggcgaaacg 1560 

gtagctaaaa aaatagccaa atcttttaaa gacatagaag agttggaaaa tgcagatctg 162 0 

gaaactctga tcaatatcga tgaaatcggt gaaaaaatag ctcggagtat ccttaactac 1680 

tttgcgaatg aatcaaatcg taaattggtg gaccgattaa aaacagcagg attgcaacta 1740 

tacagacctg aagaagactt gagcggacat accgataaat tggccggaca atccattgtc 1800 

atcagtggag tattcaccca ccattcaaga gatgaataca aggatcttat cgaaaaacac 1860 

ggtggcaaga acgtgggaag catctcttct aaaaccagtt ttattctggc cggagacaat 192 0 

atgggacctg cgaaattaga aaaagcaagt aaactgggaa ttaaaataat gaacgaagag 19 8 0 

gaatttttaa agcttatatc gtaa 2004 



<210> 213 
<211> 609 
<212> DNA 
<213> B. fragilis 



<400> 213 



ttaaatgtgg cacggcagat gatgactgct 
gacatcatag aaactgcaat tgttaccggt 
aaagccatgt tcgaagagtt tggtatgaaa 
caggctgagt gtatcttatt aataggtacg 
cattgtggat atgctacatg ttccggacgt 
attgatgtag gcattgcaat tggttcggca 
acccgtgtca tgttctcagc cggattggct 
cgtcaggtaa tggctatccc ggttagtgct 
cctaagtaa 



aatatgtgcg 


ggatttcttt 


tatctttgtg 


60 


aacgaacgag 


acagtcgcca 


cgaacatgta 


120 


gcccgtacgg 


cccctaaggg 


aaaaggaatt 


180 


gaagaaatac 


agcaactctc 


ggatacgttg 


240 


ttctttttgc 


gggatgcaga 


taatattctt 


300 


cgtgagcaag 


ctcaaggatt 


gaattgcggt 


360 


tctgaaggtg 


tcccctgtgc 


gttgaatagc 


420 


tgtgctacag 


cggctgattt 


gcgcgtagat 


480 


gcccaacgtc 


ttgagtggtt 


gaaaggatgt 


540 


tcttccaaga 


atcctttttt 


cgatcgtaaa 


600 
609 



<210> 214 
<211> 1815 
<212> DNA 
<213> B. fragilis 
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<400> 214 

aacaataacg 

atggctgaaa 

aaaatagtag 

gagatgcaaa 

gagcctgaag 

ggtcatatag 

tatttaaaat 

catgtagtaa 

cctgagcagg 

accaacgtca 

tcttttctca 

cttgaaatat 

acagcaaacg 

attatcctaa 

aagtcagatg 

ataacccaac 

gcttatggag 

tgttgcaatg 

tgcataaatc 

ccaagtctgt 

catacaacgg 

agcaataaaa 

ttctctttac 

agctatgtaa 

aagacgcagg 

gaaatattac 

ttcgataata 

caaacgcttc 

attatccagc 

gccatggatg 

aataaagatg 



ccatgaaata 
gcactaaaga 
aaccattcgt 
agccccattg 
atttggagct 
cttctcctag 
catatattaa 
ttccggcttg 
aagaaaccat 
acggtaaaat 
caagagatgc 
cagcatccga 
aaaaaaacaa 
acgacaacaa 
gtgattcaat 
ttagtaaagc 
gaaagccaat 
gtaataatgt 
aacacttaca 
catttggcat 
actatttatt 
atatacttaa 
aaaaacatag 
agtttaatat 
aatcagaaaa 
aaatcattct 
actttaatga 
tttgccttag 
aaaacacgat 
ctatccatac 
aatag 



cattgcaatc 
gctttgggca 
gaaaaagaat 
tggtgccggt 
gcttaaacaa 
cttgccgggc 
aatatatttt 
tgaaaaatac 
gatatcccat 
atatcgaaaa 
atttggtgat 
attaaacata 
aggtgaaaaa 
agcacaactt 
gggggaaact 
tttactaagc 
attcattggt 
attcaatctt 
gcaatacatc 
ttcaattact 
agagatggtg 
tgaaaacatg 
tggacaaatc 
gttacttcaa 
gttcctctca 
tcagaatgag 
gtcttgtcac 
atatcaagaa 
tctgacatca 
gattttcaca 



actttaggac 
gccagttatt 
cgtacctttc 
ttgtttcctg 
cactcagatc 
acagctaaag 
attgaacgca 
cttaatatta 
caaaaaagcg 
gacaaaaatt 
atgaacggag 
aacatccaac 
tatagcgatc 
agaccatatc 
ataaaaagca 
ttcaacatcg 
ggagatgatt 
gtcgaaaaac 
aacgcttgtt 
tatcataaat 
gccaaagata 
aagcgtttca 
tatcataccg 
aagtacatac 
tctgtcatcc 
gacaagcgaa 
ctaggttaca 
aacattcaag 
gatgaaaaag 
gctttacaat 



caattacccg 
tcttctctta 
agctacctct 
atcgatatat 
aggtattaat 
atgtatctca 
cattagagtc 
ttgagaatca 
atttcttaaa 
caattcctcg 
aaagattgtt 
aaaaagcatt 
agatctggga 
ataaatatat 
tgggagcgta 
aatctattaa 
tactatgttt 
tgagtacttg 
cagaagctca 
atcccatgtt 
acttgtttaa 
ttcttaaaaa 
caatgtctaa 
tgaaaaacaa 
aaatgatcag 
ctgaaatgtt 
ccggtttatt 
attatcaaaa 
aaatattaat 
ttattcactt 



tacaatcgaa 
tcttgctaaa 
tattaatgaa 
ctttaagtct 
agaaatagcc 
aatatatcac 
ggacgatcct 
agaaacattt 
atttttaata 
ttttactgga 
tgagtccatc 
agaagtaatt 
tgcggaagaa 
tgctattatt 
caacatccct 
cgaaatcgta 
tgcacctgta 
ttttgaccaa 
gaggccttta 
tgaagcattg 
atatacatta 
taaattagct 
gaaaggaaaa 
agatatgagc 
agcacatgct 
aaagaactat 
cgaagatatc 
tagaaatgaa 
agtttctcct 
cataaattat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1815 



<210> 215 
<211> 918 
<212> DNA 
<213> B.fragilis 



<400> 215 

actattatgg 

acagcttcgg 

ataggcggta 

cgcatggcag 

gaatatttct 

aatgtttccg 

gacaaaattc 

gcatttgggg 

aaaaagacac 

gcagccgagg 

atcgatgctg 

gcagtaaaac 

gtcataggac 

gcttcagcca 

atagatggta 

ggtgcacttg 



cagatttaag 
gtacatttgg 
tcattgtaaa 
agaccccttc 
caaatcacat 
gatcagccat 
ctgctatcga 
tgacaactaa 
ttatcgtcaa 
ccaacggcgc 
agcgcaaacg 
ccatcgcact 
taggcggtat 
tacagattgg 
taaacgatta 
aggtatag 



tgtaaacatt 
atatggtgag 
gggtactact 
cggtatgtta 
ttatccccgt 
tgaagactat 
attaaacatc 
aggagtatca 
gctatctccc 
cgatagtgta 
ccccatcctt 
aagaatggtg 
catgaattgg 
tacggcaaat 
cctggaaaga 



ggtaaactac 
gaatttgcgg 
cttcacaaac 
aacgctgtag 
atcaaagaca 
gtaaagactg 
tcttgtccta 
gaagttgtac 
aacgttacag 
tcattaatca 
tcaacagtga 
tggcaagttg 
aaagatgctg 
ttcatagatc 
cacggatgca 



aaatgaagaa 
attttattga 
gtgaaggtaa 
gactgcaaaa 
ttcagaccca 
cagagatcat 
atgtaaaaca 
aagcagtgcg 
atatagcaga 
atacattgct 
caggcggcat 
ctaaagcagt 
tcgagttcat 
cggctatcac 
agtctgttcc 



cccggtaatg 
tataacgcga 
cccgtatccc 
taagggtgta 
catgattgtg 
taatgaactt 
aggaggtatg 
ttctgcttac 
aatggcacgg 
gggaatggcc 
gtccggtgca 
aaatattccg 
gcttgcaggt 
catcaaagtt 
tgaaattata 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

918 



<210> 216 
<211> 1296 
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<212> DNA 

<213> B.fragilis 



<400> 216 

tatgacatgg 

tcaatcatgg 

agatgcagca 

tatttctgtg 

tatcatccta 

acacaggaaa 

gacaaactca 

gaggaatacg 

ccgacctaca 

gtctatatcg 

aagagattct 

tgcggttcct 

atccgtgcca 

acggaggaga 

ggcaagtgct 

tgggaaggcg 

gacattgttg 

aacggattcg 

ctgcttactg 

gcttttgggc 

cctgccaagt 

gcttatgcaa 



caaaaataca 
agaaatttga 
gtatcttcgg 
gcggctcatg 
cccttcgtac 
acatctccta 
acacattgct 
atgttgactt 
aaaagttcct 
agaacagcga 
tcgctcttct 
gctcgaagga 
accgatgcag 
ttaacggcat 
atcgtcttgt 
aatacactta 
aattctacaa 
gttggagcag 
cattgataca 
tcaagaaaac 
ggatcatgac 
aacccttcaa 



aattaaatct 
ctccatgctt 
atatcagttc 
cgtggaagat 
atgcagctct 
tacttccgac 
tataaacgct 
tgaccatcag 
cggctacagg 
tggtaacacg 
ggaatcccag 
aatcgtcagt 
ttcgctctac 
ccagttcgaa 
catccagaga 
ccgttgtatt 
tctgcgtggc 
gctccccaag 
caatttctac 
gagtcgcata 
tgcaaggcaa 
aacagaattc 



gagaaactca caccttttgg aggaattttt 

tcacccgtta tcgactcaac actgggtcag 

agcgagatag tccgttcgct gatgagcgtt 

gtaacgtcac aactgatgcg ccatctctcg 

gataccatcc tcagagccat caaggaactg 

caaggcaaga cctatgattt caatactgca 

ttggtttcta caggcgagtt gaaggaaatt 

ttccttgaaa cggagaagta tgatgcaaaa 

cctggcgtat atgttatcgg tgacaagata 

aatgtgcgtt ttcatcaggc agacacccat 

aacatccgtg taaatcgctt cagggcagac 

gagatagaga agcattgcaa acatttctac 

aatgacatct ttgctctgag aggatggaag 

ctcaattcca ttctcgttga gaaatgggaa 

caaagacgca acagtggcga ccttgacctg 

ctgaccaacg attacaagtc atcgacaagg 

ggcaaggaac gtatctttga cgacatgaac 

tcattcatgg cggagaatac tgtctttctt 

aagaccatca tgagcaggct tgacaccaag 

aagtcttttg tcttcagatt catctccgta 

tacgtgctga atatctacac agagaaccga 
ggataa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1296 



<210> 217 
<211> 2286 
<212> DNA 
<213> B.fragilis 

<400> 217 

atttataatt taaatatgcc cgattattat cattccatta ccaccctcca tgctctacag 60 

aatgcatgga gggctgtgcg agccaaaaat gcggcaggag ggattgatgg attcacttta 12 0 

tctcattttg agaagcgttt gaacgataat ttgattgaat tacaacatga acttatttcc 180 

caaacatgga atcccgaacc ttacctaaga atagaaatta ctaagaatga aacagaaaaa 240 

cgtaaattgg gattattgtg catcaaggac aaaatagtac aacaagccat taaaacagcc 300 

attgaacctc agttagagaa aaccttttta aatctcagtt acggttaccg ccccaacaaa 360 

ggtccggaac gagctatcaa acgggtcgta cacgatttaa agaagttaaa gagtggttat 42 0 

gtagccaaat tggatataga caactatttc gatacgatca atcatgaacg gcttttcact 480 

cgtcttgcca attggttaaa agatgatgaa acactcaggc tgatccgcct atgtatccaa 540 

acaggaatag ttactccgca actgcaatgg caagaaataa ataaaggagt acctcaagga 600 

gctatactat ctcctttatt ggcaaacttt tatcttcacc cttttgatca gtttgctgcc 660 

aataaagtcc ctatgtatat acgctacgca gacgattttc taatcgctac atccacagaa 72 0 

aaacaaataa aagaagctgt agaattagta aaagaagaat tggaaagcca attttattta 780 

caactcaata caccgataat acataatttc catgatggga tagaatttct tggaatcaca 840 

atctctgata caggtctatc catcacagaa aaaaagaaaa agacgttaca agagagaatc 900 

aattcaatca aatttataaa atcgtcattg tcctctcaaa gtaaagagac gcttcaaggt 9 60 

ataaaaaatt actatgccaa gttgcttcct gaaagtactt taaaggaatt ggattgcttc 102 0 

ttaatgaacc gcctcaatgc attgattatc cgaaaccaaa actctattaa taacaaaaaa 10 80 

gaattagttt cgaatcttca aaaaatagaa ttctattcag aaaatagtaa taaaaataaa 1140 

tctcaactga tacaacaatt atgtagtaca tatatcgtac actctacaaa atcaaagact 12 00 

cggttaacca gtacccatat tgataataca aagctaatca cacaaaaaaa gaaagaatat 12 60 

cagaaacgtg aaaatgaagg tgcagaatta gtgataagta ttccaggtag ctatataggg 13 2 0 

gccacttata aaggaattac ggtaaaatta caaggtaaga ttattaataa accttctcct 13 80 

gctttgaaac acattacggt agtaggtaag gggataagtc tctcaagcaa tgcaattacg 1440 

tattgcatga accacaaaat cccaattgac tttttcgatg gtagaggaaa acaatatggt 1500 

actgtactaa atcctgtatt tttggatgta actttgtgga ataaacaagt agaacttcct 1560 
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ttggaacaaa aaataaaact tgctactcaa attattatcg gtaaattaaa aaatcaatta 162 0 

aatctgatta agtattacca taaataccat aaagatattt taggaggaaa gttatctgaa 1680 

aaatatgtgg aagttgtatt aaagatagac aagctaatag agaaagctaa aaattattct 1740 

cagagaaatg aaaaatatac tgcagaatta atggccattg agtcacaggc tgctatagca 1800 

tattggtcgt acatacgagt tttaacagct gatgacggga ttgattttat ccgccgtgag 1860 

caccaaggtg ccaccgattt acttaattct ttattaaact atggctatgc tattctatat 1920 

gctcgtgtct ggaaaaatat tcttgcggcc aaactaaatc catccatcgg ggtgcttcat 19 80 

gcaaagcaag atggcaaacc tactttagta tttgatgttg tggagctatt tcgtgctcaa 2 040 

atggtagata gagtagtaat tagtcttatt caaaaaaaag tctctttaaa aatgcatgac 2100 

ggtctattaa atgaatcatc caaacgagtt ttgatccgat atatattaga gcgactcaat 2160 

cggtatgaaa aatatagagg agaagaaata accttctctc aaataatttt aagacaagcc 2220 

caagaaatag cactttttat ttctggagac aatttaatat ttaaacctta tgttgcgaaa 22 80 

tggtaa 2286 

<210> 218 
<211> 219 
<212> DNA 
<213> B. fragilis 

<400> 218 

tcccataatt tcccatttaa atggaaaata aaatatgtga taactatata tcataattct 60 

aatagtttta tctgttcgac tatgttaagt atttcatttg ttgttaggcg gttgcgtacg 120 

gttccgataa atccattaat accgcctccc caaaagatat tacttattgg ggtgtatcgc 180 

tttaatatct tctggataga atttatccct tttccataa 219 

<210> 219 
<211> 1038 
<212> DNA 
<213> B. fragilis 

<400> 219 

cgccacactt atatttatat ggccaaacaa gaactgactt gcgatgacat cctcaaagaa 60 

ctgagggcca agcaatatcg tcccatctac tatttgatgg gagaagaatc gtattatatc 120 

gacttaatag ccgattacat taccgacaac gtactgacgg atactgagaa agagtttaac 180 

ctgaccgtag tatatggtgc agatgtggat gtggcgactg tgattaatgc cgctaagcgc 240 

tacccgatga tgtcagaaca tcaggtagtg atagtaaaag aggcacaagc catccgcaat 300 

atagaagaac tatcttatta cctgcaaaaa ccgttaaact caacaatatt agtggtttgt 3 60 

cataaacatg gcgctctgga ccgcagaaag aagttagctg cagaaattga aaaaacaggt 42 0 

attcttttcg aatccaaaaa gataaaagaa gcacagttgc ctgcatttat cagttcatat 480 

atgaaacgta aagggataga catggagcct aaagctaccg caatgttagc tgattttgtg 540 

ggtacggatc ttagccgttt gacgggtgaa ctggaaaaac tgatcatcac attacccggc 600 

ggtcagaaac gcgtaactcc tgaacaaata gagaaaaaca tagggataag taaagactat 66 0 

aataattttg aattgcgtag tgcactggtc gaaaaggatg tactcaaggc caataaaata 72 0 

ataaaatact ttgaagaaaa tcctaaaaca aatccgatac aaatgacgct ttctttacta 780 

ttcaactttt actcaaacct aatgttggcc tactatgcac cggataaatc agaacaggga 840 

gtggctacca tgttagggct taaaaccccg tggcaggccc gcgattacct gacggcaatg 900 

cggaaataca ctggagtgaa gacaatgcaa attgtaggag aaatacgata tgcagacgca 9 60 

aaatcgaaag gtgtaggcaa tacctcgata agcgatggag atattcttcg tgaattagta 102 0 

ttcaagattc ttcattaa 103 8 

<210> 220 
<211> 2334 
<212> DNA 
<213> B. fragilis 

<400> 220 

tttaaccctc tatatagtgt tttgctggac cttatgaaga aaaatctttt attgttattt 60 

ctttttttac tgtttttgcc aatgcttgtc caggcacaga aagtcggatt ggtattgagt 12 0 

ggcggcggtg ctaaaggact gacgcatatt ggaattattc gtgctctaga agagaataat 180 
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atcccgatag 
atggggtatt 
tattccggtg 
gagtttttca 
ccgaccagtg 
gctacggctg 
tcggatgtgt 
cgggcttcta 
tacgacggtg 
gacatcatta 
atgagccaga 
ggtattttga 
gagctcgaga 
attcaccgta 
tatcccgaac 
tacataaaga 
cggggatatt 
ttcaatccgg 
tcggttcgtg 
gcataccaga 
atatataata 
tatcgtttta 
aaaaacgata 
cttcctttct 
cgatactttc 
ctcctgtttg 
attcaaggtg 
ccgggggtta 
tcatatatga 
gacgcagttt 
agtgagtttg 
caatatgttg 
ggagaatttt 
tattatggaa 
ttaccatttg 
aatgtagggc 



attatataac 
cgcctgacga 
aggtggaaga 
atatacgctt 
ttgttaatcc 
catgtgacgg 
ataacaagaa 
tgagttttcc 
gaatctataa 
tcggtagcgt 
tagaaaatat 
tgactttcaa 
aaataggata 
gggttaatgt 
tcagatttaa 
aagagtttca 
tccgtttact 
aagatgatac 
tgggaggtaa 
atctgaacta 
atgctcagtt 
tcgcttccat 
aaccggcttt 
tatccagcaa 
agaataatgt 
gtggttccgt 
caagagaggc 
attcggagaa 
aggagaaata 
atgcctctaa 
cccccactgc 
ctgccggaat 
atggtttttt 
aagcattttc 
gagctatctc 
tgacactcgg 



cggtacttct 

catggaaacc 

aaaatacatg 

ttcctttaag 

tatccagatg 

tgactttgat 

gcagctgatt 

ctttatgttc 

taattttcca 

tgtatctact 

ggttatgcag 

atataatgat 

tgaccgtaca 

ggataatatc 

gaacatctat 

tacctcggac 

ttcggataac 

gtacgatttg 

tgtgtctacg 

ttattcgaaa 

catggccaaa 

cagtactttc 

taatcagaag 

aagattggaa 

gattgatttt 

tagttttaat 

cttggttgct 

taaaaagccc 

tcataagatg 

gaacttctca 

acatagcaaa 

acgtcccatt 

gcctattttc 

ccgattcgaa 

tgcatatgta 

ctggcaactg 



atgggagcca 

ttactgaaat 

tactatttta 

gactcgttga 

aaccttgtct 

aaactttttg 

ctgaagcgtg 

aagcctatcg 

accgacgtga 

aatccgggaa 

aagacagact 

gtaagtctga 

atgagcctga 

cgtttaaggc 

atcgacggag 

gataaagaat 

atgatttcgg 

catctgaaaa 

accagctcca 

gagtttacgc 

gtcgattttg 

gattatttta 

gatgaacgat 

ttgggctttg 

gataaggaca 

ggcagtacac 

cagatattta 

gtaaaagaaa 

ggtgctaatt 

gagaactata 

ttgacgtata 

tatcgtttaa 

ccaattgaac 

tatttaggag 

aatcattata 

tttaattacc 



ttgtgggctc 

cagaagattt 

agaagaatct 

gcctgaagcc 

ttatcgatct 

taccatttcg 

gtgatctggg 

agatagacag 

tgcgtgagga 

agccgaaaga 

actctcttcc 

tggacttcca 

tggactccat 

ggttggtgta 

ctaatactca 

ttacgtatga 

agattattcc 

taaagatgga 

atcagattta 

ttgacggaca 

ccactactat 

aaaaagataa 

tcctgaaatt 

gaattgcgca 

aatatgataa 

tgaactccag 

cgggaaatga 

agcattcgtg 

ggatattggg 

cggccactat 

acgaagcttt 

accagatgtt 

ggaactccat 

aaatttcagt 

gctcaccaag 

ggttcatcga 



cctttatgcc 
caagcgatgg 
tcccacgccg 
gcagtttctg 
gtatgcgcgc 
ttgtatcgca 
tgacgccgta 
catgttggct 
ttttcatccg 
gaatgatctg 
tgattctgcc 
acgcatcgat 
caaaagccgt 
taaaagcaat 
ccaacaggtg 
ggatctgaaa 
ccatgccgtt 
gaatgaattt 
tctgggactt 
gttgggcaaa 
cccgacatcc 
gcttttctct 
gaaggtcgct 
gatagaggat 
gagcggatat 
gcaattcccg 
aagttttcgt 
gttacaattg 
atggtatctg 
gatgcaggct 
ccgtgccaat 
tcatgtccgt 
taataaggct 
ggtttgtcaa 
aagggagtgg 
ataa 



240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2334 



<210> 221 
<211> 225 
<212> DNA 
<213> B.fragilis 



<400> 221 

gtggcaatgt atgggaatgg tgtgacaatt ggtacactca agaatactct caaaacggta 

aatctgtcca tcccggatgg ccatttaatg gtacatctgc ctttttccgc cgggtcttgc 

gaggtggtag ttggggtggt actgcaaaag gctgccgagt gtcatatatt gactatgacg 

tgccaaacta tcgtgatgaa tatggaggtt ttaggcttgt tttag 



60 
120 
180 
225 



<210> 222 
<211> 300 
<212> DNA 
<213> B.fragilis 



<400> 222 

ccaaagaatc 

agtgaaatac 

attcctcaag 

agcggaagaa 

accgaccggg 



tcaaaagtag 
atatcccagg 
gaataagcgg 
aatttattta 
tagtcgatga 



ccgtactatg 
attttttatt 
ctttttaaaa 
tcaaaaatca 
gaaatatgca 



acatcaaccg 
accgtagact 
gaaaagtacg 
ggttggcgca 
atgaagaaca 



actccatttt 
tcctgcaaat 
ataaaatatc 
tggcatttac 
aaatgataaa 



acaattaata 
cggaaaggct 
tcatggtgcc 
attctatcct 
gaagcgataa 



60 

120 

180 

240 

300 
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<210> 223 
<211> 186 
<212> DNA 
<213> B.fragilis 



<400> 223 

ttaccggttc atcgaataat ttatttgcaa aaaaagttcc ccaaaagctt gcaaattcaa 
gaaaaggccg tatctttgca cccgttaaac aaaaacaatg gtcgcgtagc tcaactgaat 
agagtagctg actacggatc agccggttac aggtttgaat cctgtcgcga tcactttaag 
gtttaa 

<210> 224 
<211> 852 
<212> DNA 
<213> B. fragilis 



180 
240 



<400> 224 

aaaacaaatc atatgacaac aagaatgtat gtaattaata ccttgagcaa catgcacgta 60 
ggcagtggtg aggtcaatta tggagtgata gacaatctaa ttcagcgtga ctctgtcact 12 0 
aatctaccca atatcaactc ttccggtttg aaaggagcta tacgcgaata tttcaaggag 
aatgaaaatt tagtaagaga attattcggc agtgctccca aagacgaaaa aacactcccc 
ggaaaagtgc gtttctttga agccaacctc ctatcgatgc cagtaagaag tgacaaggtg 3 00 

360 
420 
480 
540 
600 
660 



ccctttctga tggctacctc agacgaagta cttcaagaat tgataaccaa aatgaagttc 

ttcaattgcg aagaagccac tcaatacata tcccatctgt ccacattgct tgataatata 

aaaacacaag cgcaaggtac tgattttgcg tacgtgtttg acccttcact gcaaggtgca 

atcattgaag aagtttctat acgggctact tgcccaagcc acattcctct tcaactgtct 

ctaaagaaac ttttaggcga tagactggtg attttatcac ataaatattt ctctatacta 

tccgatgaca atcatcttcc agtcctgtca cgcaataatc ttgaaaacgg gcagagcgcc 

aatttgtggt atgaacaggt tttaccgcgc tatagccgac tttattttat gttaatggac 72 0 

ggaaatgcac aaagtgagta tctgaaaaaa ttcagagata ccctatgtac cccttctacc 780 

attattcaaa taggagctaa cgccagtata gggtacggtt actgccaaat atcagaatta 840 



tcaccttttt aa 



852 



<210> 225 
<211> 540 
<212> DNA 
<213> B.fragilis 



<400> 225 

aatcattgtt ttctgccctt attcttctgc tgcaagaatc cggctatgtc agtttggatg 60 

tacatagcag tcaccgatcc cggttatggc aacgaacaaa acgatgagtt tatgaagaat 12 0 

atgggtatag aggcttttgt caagtacaat tattttcaca aagagcagaa gcggacctgg 180 

aacaaggatg cttttaccat acaaaaccta aagaaggcat ccattgctgg acaccttcct 2 40 

tctatattta ttcttaatcg ttatttttat aaaagatcaa gtgccttgaa acatttcact 300 

aacttaccga tagcaaaatc aatctgttct ttagagtgtg tagccatcaa cgagaaacga 3 60 

atcaacgtat cgttcggaga acatgcggga ggcacaactg gatttacaaa cacaccttcg 42 0 

tcaaataaca tcttagttac cataaatgtc ttctccatat cacgtacata tagaggaatg 480 

ataggagtgg aggtatgtcc gatctcaaaa ccaagttcac ggaaacactt taaagagtaa 540 



<210> 226 
<211> 798 
<212> DNA 
<213> B.fragilis 



<400> 226 

ggatctaata aaaaagatat gcagaagcaa 
ttgggaggac atgatttaga gatgcaaact 
atattcaagg accgttattt acaatgggat 



gcaaaagaga taaaaaaaca tcttttcctt 60 
atagtccaga tattgacaga tagaaacgtc 12 0 
aatgcgttat taagtcaata cgaagaagag 180 
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420 
480 
540 



atacaacaat atggtaataa agagccgttt attatctatg gtgtagaatt gaaagaggac 240 

atcacgcccc ccactaacta tattcgaatt gatcaccata atgaatatgc tacctatcct 300 

tcggcattgg aacaagtagc ttctattctt gatcaccctc tcaatcgata tcagacatta 360 
gttgcagcaa atgataaggc ttatattcca ggaatgttag aaataggagc cagtcatgaa 
gaaataaatc taatcagaca agaggatcga aaagcacagg gagtaataga ggatgacgaa 
aagttggctc aagaagcaat tacaaacgga acagaaaaaa ttggtagttt atatgttgtg 

ttcactacag cgaacaaatt ttcacccata tgtgacagat tatatcctta tgaaaaatta 600 

ctaatctaca ctccgaatga gttaatatat tatggaaaag ggataaattc tatccagaag 660 

atattaaagc gatacacccc aataagtaat atcttttggg gaggcggtat taatggattt 72 0 

atcggaaccg tacgcaaccg cctaacaaca aatgaaatac ttaacatagt cgaacagata 7 80 

aaactattag aattatga 798 

<210> 227 
<211> 747 
<212> DNA 
<213> B.fragilis 

<400> 227 

aaacgaaata aaatgaaaac aattttcaga atgttatcgg tattactgct aactacaggt 60 

ttattgagta gctgtataca aatcggtgaa ggtatccaac ccagcaagaa gctcatcaca 12 0 

agagactata aagtgaagga gttcaataag attgatgcgg ggactgtggg caacatctat 180 

tatacacaat ccacagacgg aaaaacggat ctgcaaatct acggaccgga taacatcgta 240 

gcactgatac aagtagccgt aaaggacaat acactatttt tgagtatcga taaatcaaaa 3 00 

aaggtacgca acttcaaaaa gatgaaaata accattacat ctcccacctt aaatggtatc 3 60 

tcctttaaag gagtgggcga tgtacatatc gaaaatggat taactacgga taatcttgat 42 0 

atagagagta aaggggtagg taatgtggac attcaatcgc tgacttgcca aaaattgaac 48 0 

gttcagtcga tgggtgtagg tgatgtaaag cttgaaggca cagctcagat agctgctctt 540 

cattccaaag gagtgggcaa catagaagcg ggaaatctac gagccaacgc agtggaagcc 600 

agctcacaag gcgtaggaga tataacctgt aatgcaacag agtccattga tgcagccgta 66 0 

cgaggagtgg gaagtattaa atataaaggt agccctacta taaaatcact cagtaaaaaa 72 0 

ggagtgggaa ctatcaagaa tatctaa 747 

<210> 228 
<211> 2355 
<212> DNA 
<213> B.fragilis 

<400> 228 

aaacagaata aaaaaggatc taataatatg atacgacatt atttgaaaat tgcatgtagg 60 

aatttgctga aatacaagac tcagagtatc atcagcatcc taggattagc tatcggattt 12 0 

acctgttttg cgcttgctgt cttatggata cgttacgaaa tgacatacga caccttccat 180 

gaaggttttg accgtattca tttggtgtat cagaaatcgg cattaagtga cacaggcgtt 240 

acaacaacaa ttccatatcc ggtatccact tctttagaaa agcagtttcc ggaagtggaa 3 00 

gatgcctgcg gttttctttt ttatgaacag gaagtgacag tagacgatgg cgctatccgg 3 60 

caactgtatg aaatcaatgc agactcttgc ttcatgcata tgttcgggat acaagtactc 42 0 

tccggcagcc ttgatttcct ggaatcggaa gagcggatag cactgacaga gcatgcggcc 480 

aaggaacttt tcggtacgga aaatccgatc gggaaggaaa tcaaactgta tggtgcccct 540 

aaaaccgtat gtgcgatcgt caacggatgg aaccgtcata ccaatttacc tttttctatt 600 

ttaacgggag gaatacgtca atggcataat gcatggtatc acggaggatt ccatgtattt 660 

ataaaattgc acaaagaagt aaatgccgaa acttttcaga aaaaactgga acaaacgaaa 72 0 

ctcgaagcag acagcaaggg cggcatacag aatctgatgg ttatgcccat cagtaagtgc 78 0 

cactatactg tactggccga ccaaaatgcc atccagttca gctatatcct attcttctcg 840 

attgtaggcg gattagttat tctttgttcg ctgatcaact acctgtcttt gtttgtcagc 900 

cgtttacgga tgcgaagcag agaattggca ttacgtaaag tatgcggttc atcagacctc 96 0 

catctgttca ccctgcttgt tacagaatat ctgctgatct tgttggctgc aggacttatg 102 0 

ggaatggccc tgatagaatt ggtattgtct ccgttcaaag aattgtcggg agtaaaagaa 10 8 0 

ggagatattt actgggaatc ttttttatac ttcgccctcg tcatcggatg ttctcttgcc 1140 

acattcctgc ctgtaacttt ctacttcaac aaacgaacgc tacaaagtaa catacagcaa 12 0 0 

aagaccgtaa acagatacgg gtatctggga cgcaagataa gcattgtttt tcaactttct 12 60 
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atcagtatct gttttatctt ctgcatcagt gtcatcatga agcagctcta ttatctcagc 1320 

actacagata ttggtataga acgcaagaat atagcgactc taagcatgta tccgcaaaac 13 80 

aatctattgc ctgccgctga taaaatagag cagtttcctt acgtcactca ggttctgaaa 1440 

ggacactttt cgctgctacc caaaactgca tccatggcta tgcacttcaa ggattgggac 1500 

ggaaaacaac ccggagatgc agagatcgat atggaagtac tgatggaaag tgaagagtta 1560 

gcacagtttt acggcatccg ccttttaaaa ggaaagatgc tgaaagaagg agagagggat 162 0 

gccggcacta ttgtaatcaa tgagacagcg gccaaagcac tcggatggaa tgatccgatt 1680 

gggaaaaaac taatcagacc caacggaaca gggacaaccg ttatcggttt ggtaaaagac 1740 

ttccatacga catcgcctac cactcccata aaacctatcg catttatagc caaaggcttt 1800 

tccggattcg acctgggcaa aggtgatgta ctgataaagt atagagaagg cgaatggccc 1860 

aaacttaaaa aagacattga acagctatgt cagaaagaat atccggaaaa taaaatcaga 1920 

ttatccaata tggaggagac ctatgacaac tatctcaaat cggagcagac ccttttgaaa 1980 

ctactgagtt gtgtagctgt tgtctgtatt ctgattgctg tattcggagt gttctcttta 2 040 

gtgacattgg catgcgagca acggcgtaaa gaaatcgcta tcaggaaagt aaatggtgca 2100 

acacttggca acatcctctc gatatttata aaagagtacc tgatactgtt actctgcgcc 2160 

tcgttccttg ctttcccggt cagctacatg atcatgaaag catggctgga gaattatgtc 222 0 

gaacaaatca gcataggcgt ttcgatgtat gtcactatct ttacgggaat cggtatcatc 22 80 

ataaccgcct gtatcggctg gagagtatgg aaagccgccc gtgaaaaccc ggcggaagta 2340 

gtaaaaacag aataa 2355 

<210> 229 
<211> 396 
<212> DNA 
<213> B. fragilis 

<400> 229 

caactggtca aaggtgatga tttcgtcaac acgatttata aattcgggtg caaacgattt 60 

attcagagcc ttctgaatca cgctgcgaga gaattcttta tcgtcaagac ggctttgagt 12 0 

ggcaaaaccg actccacgcc caaactcttt caactggcgg gttccgatat tcgatgtcat 180 

gataataaca gtattcttga agtcaaccat tctgccataa ctgtcagtca gccgaccttc 240 

gtccatcacc tggagaagca gattgaacac atcgggatgc gccttttcta tttcgtcaag 300 

caatacgata gaatagggtt tacggcgtac tttctctgtc aattgtccgc cttcctcgta 360 

tcctacgtat cccggaggcg ctccaaccaa gcgtga 396 

<210> 230 
<211> 1152 
<212> DNA 
<213> B. fragilis 

<400> 230 

attataataa agatgaatag acactattta ataacattga ctccgatgga ttggtttttc 60 

tttggaggag aaagaacatt ggatgatgga aagagtgcag attatatatc gcattcaaat 12 0 

aagttccctc aacaatccgc tcttttaggc atgatccgtt accaattgct gaaacagcac 180 

aatttactgt cccaatttcc ttacacagag aataaaccga cagaaaaaga gataatgaaa 240 

gcacttattg gagaacagag tttcaggatg accgaaagaa aggctaaatc acttggctta 3 00 

ggcgtcatca aacagatttc cccactcatg cttatagagt gcaaggatga tacctcgtca 360 

cgctctatct actttccatt gccattagac gatggataca aagtatcatt taatgaaaca 42 0 

agtaatgaag acaaagtttt ctataatgga attgaatgcc cgattcccaa tgtttacccg 48 0 

gcttccgaag agcaagattc cggtaatcaa aaaagaaaat ttttcgatca taaaacatac 540 

aataattatc ttttctggtg cacccaagga aataatcaga taaaaaaatt actatctgat 600 

gaaatatgga ttagtaaaat gcagatcggc attaccaaac atgtggaaga aggtgaggat 660 

aacgacaaaa gcttttataa acaggagttc cttcaattga aaaaatcatt tatatatgcc 72 0 

ttttatatca ccttatcggg agaatcagag ctatcttccg atattataca attaggaggt 780 

caacgttctg tattccgcat ggaagtagaa tcaatagaag agaatagcga tatacaagaa 840 

aaataccaaa cagctgctca gttcctgact caaagcgatc gtcttctaat attgagtcca 900 

acttatgtag ataacctaaa ggaactttct gctttatgta actttatgtg gagcgactcc 960 

attgtctttc gcaatattca aacgactaac gcaagtaact tttatggtaa acctatcaaa 102 0 

agcagtagta aataccactt cttaaagccg gggtcagtac tttattttaa gcaagggaaa 1080 

cgcaaagaag tcgagaaact attgatggat tacacttatc ttcgtttatc cggttataac 1140 



96 



atatatatat aa 



1152 



<210> 231 
<211> 183 
<212> DNA 
<213> B.fragilis 



<400> 231 

caagtcccaa aagcgatctc catgattcat ctcgcaagta tggcaaagtt catgcaaaag 
gacgtaatct atcagatgtt ttggcaacaa tacaaggaaa taagagagat ttatcttttt 
acgagaggaa cagcttcccc atcgcccacg gctcgaattt atctgcacgc tctcgtaggg 
taa 



60 
120 
180 
183 



<210> 232 
<211> 297 
<212> DNA 
<213> B.fragilis 



<400> 232 

ggtttaacaa 

tacaggtttg 

caactgaata 

cacaagaacc 

attggatcat 



atggccgcgt 
aatcctgtcg 
gagtagctga 
tccataatca 
acctaaattc 



agctcaactg 
cggtcactct 
ctacggatca 
aattatggag 
tgagggattc 



aatagagtag 
aagaaaataa 
gccggttaca 
gttttttgtt 
ctctttttaa 



ctgactacgg 
cgataaacgg 
ggtttgaatc 
ttccttgcat 
ttggcagtct 



atcagccggt 
tcgcgtagct 
ctgtcgcgat 
tatctctttt 
gtactga 



60 

120 

180 

240 

297 



<210> 233 
<211> 285 
<212> DNA 
<213> B.fragilis 



<400> 233 

attttaaaaa 

acaattgttg 

actctatctg 

ggagcccata 

gatccaaccc 



aaatatttat gaattataag aagaaaatta tttgtctttt ggtattattt 
ttgtaaatgt gcttaatgtt gttgtgaaat cggatgatgc tgagacatta 
gaatagaagc tgtagcagct acttatgaaa acagtccggg aaactatact 
atcaatattg tacaagtccc aaaaatgcta caggatgtgt ttcggatcct 
gcacttgttc atattcaatt ttttgtaaaa aataa 



60 

120 

180 

240 

285 



<210> 234 
<211> 1431 
<212> DNA 
<213> B.fragilis 



<400> 234 

ctaaatagaa 

atgatccatt 

ttattagata 

tatgctaaaa 

ttaagtatat 

cctctaccaa 

caagaaaagt 

gaccctcgta 

ggactggaat 

aaaatccaaa 

aaaggtttcg 

gaagatacct 

caaagtacat 

taccagattc 

tgttatttcg 

attaaagcaa 



aaaagaatat 
ttcaacataa 
aattcattct 
aaaataattg 
ctctacaaaa 
ctgagcgtcc 
gcgttggtat 
aaaaagaggc 
ggcaagactt 
catatttacc 
gttcttttac 
tgaaagaaaa 
tggattttat 
taaaatctgg 
ttagcaaata 
gaggttatga 



tatgaatcag 
tgaatcagga 
tacaaaactc 
gttaatagat 
aaaaagtaga 
ctctaatttc 
aaatacaaac 
tgagtttaaa 
tactataaaa 
agctttcttt 
tgttgaatat 
ttttgctttc 
ctatatctac 
atataatttc 
tcctaattat 
attgaaaggc 



ttaaccgcta 
gcaaccttac 
ggaaatggag 
aatgaaaaaa 
ctagaatatt 
tttacgatcc 
tctaccatta 
gaaaaaaatt 
atattctctc 
atatgccaca 
atcaataatc 
gtatataaga 
aatcagatat 
agaaacgaat 
agatgggaaa 
gatcattctc 



tactaaaaca 
gagcatcaga 
atattagaga 
attatgcatt 
taataacttc 
aaaatagtcc 
tcttaaaaaa 
ggagtcaaat 
tgaaaggtga 
attttggtac 
aaaaaaatat 
aaaaaatagc 
tttcaacaat 
atataaaatc 
aaagaaaaat 
ccatttctgg 



acacactcca 
agttaaacca 
aggacggctt 
aaattataag 
tagtacattt 
atattttgct 
aagcaatagt 
agacaaaaaa 
tttaataaat 
aagaaacaac 
atgcaatgta 
tttgtcatgc 
aaagaaggat 
acttctattt 
gaagcaatta 
aataagagag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 



97 



aacgacaact cctggaatga tcctaatcct aatggatata attatgctta tataagagca 102 0 

atcttaggat tagcagaaca atatgaattt caattagaaa caccatatca gaaagctatt 1080 

gtcaagataa aatcagcaaa caattgtatc agtcgttaca aatctccttt attatttaaa 1140 

ataataaaca acagcattta tttggttggt aatgaaataa atactgaaat actaaataaa 12 00 

ccatttcaat ataactatat agaacaaact aaaaataaaa atatgagaac aggaaagagt 12 60 

gaaataacag agcggacaat gcatataaat gagattgaaa tgaactataa taatagaatt 132 0 

aattatcatt atacgccaac ctccttttca ttaatcgatt ttatgcaata tgcaatgtct 13 80 

tataaaaaaa atgggaaaaa cattttaaat tatattcctt taaaacaata a 1431 

<210> 235 
<211> 888 
<212> DNA 
<213> B. fragilis 

<400> 235 

agtatgagaa aaataaaagt aggaatcatt caacaggcta acacatcaga tattaggata 6 0 

aacctgatga acctggctaa aagtattgaa gcatgtgccg ctaatggcgc tcaccttgtt 120 

gttctgcaag aacttcataa ttctttgtat ttctgtcaga cagagaatac ggatttattt 180 

gaactggcag aacccattcc tggcccttct accggattct attccgaact ggcggcagcc 240 

aatcggatag tgcttgttac ttctttgttt gagaaacgtg ctccgggact atatcataat 300 

acagctgttg tctttgaccg ggatggaagt attgccggaa aatatcgtaa gatgcatatt 3 60 

cctgatgatc cggcttatta cgagaaattc tattttactc cgggagatat tggctttgaa 42 0 

ccgattcaga cctctttagg caagttgggt gtgttggttt gctgggatca atggtatccg 480 

gaagctgctc gcctgatggc gttgaaagga gctgagattt tgatttatcc tactgctatc 540 

ggttgggaga gtacagatac agatgacgaa aagaaacgtc agctcaatgc ttggattatt 

tctcagcgtg cgcatgcggt agccaatggg cttccggtga tttcagtcaa tcgtgtcggt 

cacgaacctg atccgtcagg acagaccaac gggatattat tttggggaaa tagttttgtt 72 0 

gccggaccgc agggtgaata cctggctcag gcgggaaatg accgctctga aaatatgatt 780 

gttgaggtgg atcttgaacg ttcggagaat gtgcgtcgtt ggtggccatt tcttcgtgat 840 

cggaggatag atgaatatgg gaatttaaca aaacgtttta ttgattga 888 

<210> 236 

<211> 1839 

<212> DNA 

<213> B. fragilis 

<400> 236 

acctttggaa ataacacgga atccgaatta atatgtactt ttgcagacta ctttaacaaa 



600 
660 



60 
120 
180 



aatataaata atatattaaa tatgttcaga acgcacacgt gcggagagtt aagaatctcc 
gatgttaata aacaagtcaa gctgtcggga tgggtacagc gcagccgtaa aatgggaggt 

atgacttttg ttgaccttcg tgatcgctac ggtatcactc aattagtatt taatgaagaa 240 

atagacgctg agctttgcga acgtgccaat aaattgggtc gtgaattcgt catacagatt 300 

gtcggaaccg taaacgaacg tttcagcaaa aacagtcata tcccgaccgg tgacatcgaa 360 

atcatcgttt cggaactgaa tatcctgaac tcagccatta ctcctccttt tactatcgag 42 0 

gacaacaccg acggtggtga tgatatccgc atgaaatacc gttatctgga cttacgccgt 480 

agtgctgttc gttcaaattt ggaattacgt cacaaaatga cgatcgaggt tcgcagttat 540 

ctcgataaac tgggtttctt ggaagtggaa actccggtat tgatcggttc aactcctgaa 60 0 

ggagcacgtg actttgtagt accttcccgc atgaatccgg gacaattcta cgcattaccg 660 

caatctccgc agacactgaa acagctattg atggtttccg gtttcgatcg ttatttccag 720 

atagccaaat gtttccgtga cgaagacctg cgtgccgacc gccagcctga gttcactcag 780 

attgactgcg aaatgagttt cgtagagcag gaagatgtga ttactacatt tgaaggaatg 840 

gccaaacacc tgtttaaggt gatccgtaat atcgaactga ccgagccatt cccacgtatg 900 

ccttggagcg aagcaatgag attgtacggt agcgataaac cggacattcg cttcggtatg 960 

caattcgtcg aattaatgga tatcttaaaa gggcacagtt tctctgtatt cgataatgcc 102 0 

acatatattg gcggtatttg tgccgagggt gcagccagct atacccgtaa gcaactggat 1080 

gccttgaccg aatttgtgaa aaagccacaa atcggtgcaa aaggtatggt ctatgcccgt 1140 

atcgaagctg acggtactgt gaaatcaagc gttgacaagt tctatacaca agaagttttg 12 00 

caacaattga aggaagcatt cggtgccaaa cccggtgacc taatcttgat tttatcagga 1260 

gatgatgcca tgaaaactcg taagcagctt tgtgaattac gtctagaaat gggtaatcaa 1320 
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ttgggattac gggataaaaa cacatttgca tgtctgtggg ttgtggactt ccctctattt 13 80 

gaatggagcg aagaagaagg cagattaatg gctatgcacc atccgtttac ctcacccaaa 1440 

ccggaagata tccatctgct ggatacaaat cctgctgctg tgcgcgctaa tgcttacgat 1500 

atggtaatca atggtgtaga agtaggaggg ggatcaatcc gtatccacga tagccagttg 1560 

cagaacaaaa tgttcgaatt actcggattt accccggagc gtgcgcaaga gcagttcggc 1620 

ttcttgatga atgccttcaa gtttggtgcg cctcctcatg gcggactggc ttacggatta 1680 

gatcgttggg tatctctttt tgccggactg gactcaatcc gtgactgcat tgcattcccg 1740 

aaaaataact ccggtcgtga cgttatgttg gatgctcccg cagcactcga tccgtcacaa 1800 

ctggaagaac tgaacctgat tgtagatatt aaggagtaa 183 9 



<210> 237 
<211> 1245 
<212> DNA 
<213> B.fragilis 



<400> 237 

tatagtatga aaaagtatcc aaaaatcggg attcgtccca ccatcgatgg acgtcagggc 60 

ggcgttcgcg aaagccttga agaaaaaaca atgaatctgg caaaagctgt tgccgagttg 12 0 

atcacttcta atttgaagaa tggagacgga acccctgtgg agtgtgtgat tgcagatgga 180 

accatcggac gtgtggctga aagtgctgct tgtgcggaga agtttgaacg tgagggggta 240 

ggagccacta tcactgtcac ttcatgctgg tgttacggtg ccgaaacaat ggatatgaat 3 00 

ccgtattatc cgaaagctgt ttggggattc aatgggacag agcgtccggg agctgtatat 3 60 

ctggctgctg tgctggcagg acatgcacag aaaggacttc cggcatttgg catttatggt 42 0 

cgcgatgtac aagacttgaa tgacaattct attccggcag atgtagctga aaaaattctg 480 

cgttttgcac gtgcggctca ggctgtagcc acaatgcgtg gcaaatctta tctgtctatg 540 

ggcagtgttt ctatgggtat tgccggttcg attgtaaacc cggacttttt ccaggaatat 600 

ctgggcatgc gtaatgaatc gattgatttg acagagatta ttcgtcgtat ggccgaagga 660 

atctatgata aggaagagta tgccaaggca atggcttgga ctgaaaaata ctgcaaaaag 72 0 

aatgagggca atgactttaa tatacctgaa aaaacgaaga cccgtgcaca aaaagatgag 780 

gactgggagt tcattgtgaa aatgacaatc atcatgcgcg atctgatgca gggaaaccct 840 

aaattgaaag aactcggatt taaggaagag gctttggggc ataatgctat tgcggcaggt 900 

ttccaagggc agcgtcagtg gaccgatttc tatccgaatg gcgacttctc tgaagcatta 960 

ctcaatactt cgttcgattg gaatggtatt cgtgaggctt ttgtcgttgc aacagaaaac 102 0 

gatgcttgta acggtgtggc tatgctgttt ggtcatctgc tgacgaatcg tgcacagatt 1080 

ttttcagatg tacgcacata ttggagtccg gaagcagtga aacgtgtgac cggtaaagag 1140 

ttgacaggaa tggctgctaa cggtattatt cacttgatta attcgggggc aactactctt 12 00 

gacggaaccg gacaacagac gaatgctaac ggtcttaacc acggg 1245 



<210> 238 
<211> 411 
<212> DNA 
<213> B.fragilis 



<400> 238 

ttaaaaacga 

atacgcaatc 

ggtatgactt 

aacatgccca 

ggaggatggg 

ccgggtatta 

aatagggagc 



ataaatcaga 
tgctgaagta 
gtttcagcat 
atttcgaaca 
ggtggagtct 
agcaaatctg 
agggaagaaa 



gaaaatgaac 
taagacacac 
catccacttt 
aaggatttca 
caattcttct 
cttccactct 
agccttacat 



tatatgatac 
agcattattt 
tttatcaatg 
atccggatga 
gagatccgaa 
ttccaaagag 
catctcatat 



agcattatct 
ctgccatttg 
aaatagatgg 
tcaattccaa 
ccctgacaga 
aagacgaagt 
atggatactg 



caaaacagca 
tctttccgtt 
agcatcacgt 
ccacgaagta 
acatcccata 
tgtattcatc 
a 



60 

120 

180 

240 

300 

360 

411 



<210> 239 
<211> 495 
<212> DNA 
<213> B. fragilis 



<400> 239 

aagaaaaaga aaactatgaa ttggaaatta gtagaatgtg aaattgcact aatcgtatct 



60 
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ttgacagtaa 
actgtgtttt 
catctctcat 
gatgaaaaga 
gaagagctaa 
caagaaaatg 
gatctgaaat 
aagataacta 



ttgagtgtgt 
tttgcattat 
gttttcagaa 
tgaaaacatg 
cgaataaggt 
aattaaaaaa 
cgattgagga 
aatag 



gaatatggga 
gattgttctg 
tcgccagaaa 
gctactcgcc 
aaatggacta 
attttatctt 
gaacttcaaa 



cagaattccc 
ttgccactta 
gaaaaagagt 
cgtgaagcaa 
caacaaaaat 
tctattcttt 
aagatgaagg 



ctaaagacat 
ttggtgtatt 
atcaggctaa 
ttatcaagga 
gtgattcctt 
ctattattgg 
atttctttga 



tacatgtctt 
gcaacaatgg 
acaagaaact 
taaagaaaaa 
gatagaaaac 
cactaaagac 
agaatataaa 



<210> 240 
<211> 186 
<212> DNA 
<213> B.fragilis 

<400> 240 

ggagcaggga agaaaagcct tacatcatct catatatgga tactgatcct aatttctttt 
cacattataa tgcatccttc ttatatggcc aatgcttttc ccacaactcc gaaagaagtt 
gtgttgtccg aaagttgtgc ccgtaaagta tacgggaaag agcaacccgg tagggcacat 
tactga 

<210> 241 
<211> 318 
<212> DNA 
<213> B.fragilis 



120 
180 
240 
300 
360 
420 
480 
495 



60 
120 
180 
186 



<400> 241 

agagttaacc 

aaccatcttg 

tcggcttata 

ttttatgctt 

aatattcaca 

ttccgtggca 



tccagccaaa 
ttcgtgttgt 
atggaggagg 
atctgaacaa 
tcatgtggtt 
aggtttaa 



atgttctttt 
aagttacatt 
caccaacagc 
tatctattcc 
gtccggtaat 



ccggttcgtt 
gtagatgctt 
tatcatcccc 
tgccgcaaaa 
agtacatcca 



tatcggaaaa 
tggatattag 
gtatgatact 
cccaaaaggc 
atttccgcac 



aatatctccc 
ttacctgctc 
caaggtcctg 
cttgcagaag 
tatcaatgat 



<210> 242 
<211> 186 
<212> DNA 
<213> B.fragilis 

<400> 242 

tttaaaagcc tctacttcat ttacacattg cagcagaaaa tgtacagagt acaaaaagaa 
agactttttc tcatccttgc tttgatggta ggtattcgta aaatcgacca aagtagtgat 
cccaaagaat gggtcgtcaa aagcagaaaa gaatttaaaa tgttcttttc tattccaatt 
ttctaa 



60 

120 

180 

240 

300 

318 



60 
120 
180 
186 



<210> 243 
<211> 768 
<212> DNA 
<213> B.fragilis 



<400> 243 

tgcaatgaag 

tgttgtttaa 

aatgaaagac 

ggagtggtcc 

tttacttatt 

tttttctcag 

ctttgtgtga 

gaggttacta 

tt gaaagatg 



attcgcaaaa 
cacatccttt 
agaaatttcc 
gctcttcttt 
atgtaggcta 
tatttgtaaa 
ttcttacaat 
cttatgttca 
atcttaaact 



atatcgtgtc 
gagtgtagat 
tattcgtaca 
tagtccggat 
tagatgtgaa 
tatagtttgg 
ctatatatat 
gacagttgct 
ggatgttggt 



ttcccctgga 
acgttgaata 
ggattaaaat 
agcctttctt 
attgaaatct 
actcttattg 
aagttgtctg 
gttaaaaagg 
aaaggtgtat 



gagcaaggac 
tactatggag 
taactgtttc 
gtttatctta 
tagggtttgt 
gagttgttgt 
ttcaccctcc 
ggactctgcc 
tgatctgcga 



tgcatactgt 
gactatgttg 
tgataataat 
ttcttcgata 
atctatatct 
tgcttttgtg 
taaaataaaa 
tatatacgat 
aaacatggaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 
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gtatctctta ctccccagca gcgtgttctt ttagttttgt ttattaaggc tgagaatcat 600 

actctgtcta tgtctcaaat tatggcagat gtttggccgg gaaaatctat ttctcccgat 660 

tgtttccata aagcaataga acgtttgcgt gatttgttaa ggcagcttcc tatgaccata 72 0 

caaattgaat atttggggga ggaaatttat cagatgcaaa ttttataa 768 

<210> 244 
<211> 204 
<212> DNA 
<213> B.fragilis 

<400> 244 

tttattctgg ataagcaaga taaaatggta tgttatacaa catcaaaggc agagaacaaa 60 

gcaattattt atagtaatca tttactctac aaccaacaga gctactctta cttaaatata 12 0 

gaaaagcatc ccttgtgtta caagaaatct aaatctattg actttactaa tttaaagtac 180 
aagtccaagt ctatattttt atga 2 04 

<210> 245 
<211> 1827 
<212> DNA 
<213> B. fragilis 



60 



<400> 245 

ggcacattac tgaaaatagt caagttgaag gaaagcgaaa aagataaaag cacatactat 

aaagttgtca atgttatcag gaatcttccc aaaacactaa atgttgaaac agatatctat 12 0 

ttctctcatt tgagagaaga gaacagacaa caaggataca tcacagaagg tacactggaa 180 

acagcagacg gattgaataa agccaatgaa agcctaaagg ggataacaac tcttcataat 240 

aatgaaatgg catatttcat tgcgaacaaa gaagctgact cgtatcacga tccgcagcga 3 00 

atgataggca tagcattcat taccttctta tcctctctga ttctgttatc gggtatgata 3 60 

aactttctaa agttcatcat acagtcattc tacaaccgta atcgtgaact ggccctgcgg 420 

aaaagtctgg gtgccagccc caaaagttta tttgcccttc tattcaccga agctttctgg 480 

atgctgacat tttctttatt gttctcgctg gtcttatccg aatgtacctg tttactactg 540 

acaacctata ttccgcctaa agaaatgatt ccaatagata tccaaactct atatggcatt 600 

caagttaaac tttatatagg gctgttactg atctgcaccc ttgtaatgct atatcccatc 660 

cggcgtttgc aacggtccgg tcttgccggg cacatgaaaa ctaacagcca ccggcacctc 720 

tttcgcaaca tcatgatgtg cgtacagcta tgtgtatgta tcttttttct gggtatgagt 780 

atcgctatac atttattcaa tagtgtagga agcgttttgt accttcctct atcagacaaa 840 

gaaacaaatt ccacgctttg ctttgaaatg aatagtgtta ctctcggcaa aaataaggat 900 

gcgattttat cacaaataaa gatgttgcca ggagtggaaa atatcagttc agctttgatg 960 

agcggcaact ataattcgtt tctgaccagt gactatgagt ctgccgatca ccgtactctc 1020 

actgtcagag tcagacaagg agatcccagc tacttccagt tctttcggat tcctttccgg 1080 

ggagagatcg tcgaacctca tacaagtaac gtggtttaca tcagcgaagc ctttcaaaag 1140 

cagttggaaa atgattctgt aagcggaaat gtaaaattag gtaaagagaa ctatcggata 1200 

gcaggtacat acaaggcttg ttacggggag aacatctcag aacacaatca atacaatatt 12 60 

tctgttttct tcccgactga agaagcatcc gtaatttata tccgttttcg tgacgatatc 132 0 

agttttggta aagccaaatc agaaatagaa agggtatgcc gtaattacgt cccagagtca 1380 

ttgccactcg atatacaacg actggatata agaagaagta caacacaagg tatcagagac 1440 

ctgatgggcg atgcctcact gctattaggc atcataagtg ctcttctggt tatactgagc 1500 

atatactcag ctatctctat ggacacagtc agccggcaga aagaagttgc tattcgcaaa 1560 

ataaacggcg caactccgaa aataattgct ttgatgttcg gaaaagcata tataatccaa 162 0 

ttcatactgg cctataccat cacttatcca ttattaaggt tacttgtgat agacataacc 1680 

aaggatagcc cgatcagcag tattaccgga tttgcatggg ggatttacct cttcattctg 1740 

ataggtttac ttatctttgt aacaacagcc tataaaatct acagaatcat gcatctcaat 1800 

ccggcagaaa taataaaaaa cgaataa 182 7 

<210> 246 
<211> 894 
<212> DNA 
<213> B.fragilis 



101 



60 



<400> 246 

atgaaaacac tcctaaatat taaactccat ttatctaaaa agaatatatt taccatctta 

gtttttattc ttgttttaag tggtactacc ggttgcattc aacacaaatc cgaccagaaa 12 0 

cgactgcctg ctctttcttt tactgtaaat ggagagagct ttgaaatgat tccggtagaa 180 

ggtggaacct ttattatggg aggcacaagt gagcaaggta atgattgcga aaacaatgaa 240 

aaaccaacgc atgaggaaac tctaccgttc ttttatatcg gaaagtatga agttacccag 3 00 

aaactgtgga aagcagttat ggggactgat ttcgatcaat catacaattc aggatgtgaa 3 60 

gattgtccgg cagagtatat cagttggaat gacacgcaaa agtttataag caaattgaac 42 0 

acccttacaa acaaaacatt tcgcctgcct accgatattg aatgggaata tgccgcacgc 480 

ggtggcaagt atagtgaaaa atacaaatac agcggaagta atgatatcga tgaagttgcc 54 0 

tggtatattg aaaattatca aaaaagtaaa tatggagaca aagggactac acatccggta 600 

ggtatgaaaa agcctaatga attaggattg tacgacatga gtggcaatgt atgggaatgg 660 

tgtgacaatt ggtacactca agaatactct caaaacggta aatctgtcca tcccggatgg 72 0 

ccatttaatg gtacatctgc ctttttccgc cgggtcttgc gaggtggtag ttggggtggt 780 

actgcaaaag gctgccgagt gtcatatatt gactatgacg tgccaaacta tcgtgatgaa 840 

tatggaggtt ttaggcttgt tttagtaccg gactcagtac agactgccaa ttaa 894 



<210> 247 
<211> 840 
<212> DNA 
<213> B.fragilis 



60 



<400> 247 

atcactgttt gcacaatttc ccgtatcttt gcaggacgaa tacgaattta ctttcaacaa 

tacatgaaaa aatttatttt agacctgaca gtaactgaga atctcagatt gcataccaac 12 0 

tatgtgctgc tgaaattgac ctctcagacc gtcctcccgg atatgctacc gggacagttt 180 

gcggaaattc ggatagatgg ttcacccacc actttcctgc gtcgccccat ttctattaat 240 

tatgtagaca gacaacgcaa cgaagtatgg tttctgatcc aacttgtagg tgatggaaca 3 00 

aaacgtcttg cgcaagtaaa tcgaggagag attatcaatg tagtactccc actcggaaat 3 60 

agcttcacaa tgcccgaaaa gccttctgat aagctattat tagtgggcgg aggtgtaggg 42 0 

actgccccta tgctctactt gggtgaacaa cttgctaaaa acggcagtaa accaacattt 480 

cttttggggg cacgcagcaa caaagatctg ctccaattag aagattttgc cgcttacgga 540 

gaggtctata ctacaaccga agacggcagc catggagaaa agggatatgt gacccaacat 600 

tccatactga ataaaataaa attcgagcag atttatacat gtggcccgaa acccatgatg 660 

atggcagtag ccaaatatgc caaaggtaac gatatcaatt gcgaagtatc attggaaaat 72 0 

acaatggcat gtggcatagg agcctgcctc tgttgcgttg aaaacaccac agaggggcat 780 

ttgtgcgttt gtaaagaagg tcctgttttc aatataaata aactattatg gcagatttaa 84 0 



<210> 248 
<211> 306 
<212> DNA 
<213> B.fragilis 



<400> 248 

accttatgtt 

caagatgacc 

aactatagtg 

ttagaaagat 

tgctatacaa 

gtctaa 



gcgaaatggt 
gaagtcgtat 
tttttgaatg 
ggattaacag 
gaattatata 



aaaagctaag 
acaaatatca 
tatgtttaca 
gcgttatgat 
tcaacctata 



aaaattttct 
aaaatattag 
gacagacaat 
actgtggtat 
cgaaaaaaga 



gtgttgtagc 
aaaagtatgg 
ttcagaagat 
attatccgat 
taattaaaac 



atacgatatt 
aacacgtatc 
ccaaattaat 
gtgcatcaat 
cgtcgaaata 



60 

120 

180 

240 

300 

306 



<210> 249 
<211> 744 
<212> DNA 
<213> B. fragilis 



<400> 249 

gattgtttta ggcaaaaata cgggtttctc ttcaaatatt tgtacttttg cgaaaaatta 
aagattatgc gtatcgatat tataacggtt ttgcccgaaa tgattgaggg ctttttcaat 



60 
120 



102 



420 
480 



tgttctatta tgaaacgggc ccaggacaaa ggactcgctg aaattcatat tcacaattta 180 

cgtgattata ccgaagacaa gtatcgccgg gtggatgact atccatttgg aggttttgcc 240 

ggtatggtga tgaagataga acctattgaa cggtgtatta atgccctgaa agcagagcgt 3 00 

gattatgatg aagtgatatt taccacacct gatggagagc agtttgatca gaaaatggct 3 60 

aatagtttgt ctttatccgg taatcttatc attttatgcg ggcattttaa aggtatcgat 420 

tatcgcatcc gggagcattt gatcacaaaa gaaatcagta ttggagatta tgttttgacc 480 

ggcggtgaat tggctgccgc cgttatggcg gatgctatcg ttcgaattat tccgggtgtg 540 

atttctgatg aacagtccgc cctttctgat tctttccagg ataatttgtt ggcggcacct 600 

gtttatactc gtccggcaga atataagggg tggaaagtac ccgagattct gctttcaggg 660 

cacgaagcga agattaagga atgggaactc caacagtcgt tggaacgtac taggagactg 72 0 

cgtccggatc tgttggaaga ttaa 744 

<210> 250 
<211> 840 
<212> DNA 
<213> B. fragilis 

<400> 250 

atatttataa gtaaaatggc aagagaagct aaaaatgaac cgaaagagtt gacagtggaa 60 

cagaagttga aagctttgta ccagctgcaa acaacattat ctaagattga cgagattaaa 12 0 

acgttgagag gtgaacttcc attggaagta caagaccttg aagacgaaat tgccggtctg 180 

agtacacgta tcgacaaaat caagtcggaa gtagacgaac tcaaatcagc tatcgccggt 240 

aagagagtgg aaattgaagc agccaaagct tcggttgaga aatataaatc acagcaggac 3 00 

aatgttcgta ataaccgcga atatgacttc ctgacaaaag agatcgaatt ccagtctttg 360 
gaaatggaac tttgcgagaa gagaattaaa gaatttactg cagaagagca ggagaaatct 
gaagaaatag agaaaaatac caaagcgctg gaagaacgcc agaaagacct cgaccagaag 

aagaatgaac tggatgaaat catcgaagag accaaacagg aagaagagaa gttgagagac 540 

aaagcaaaag atctcgagac aaagatagaa ccccgcctgc ttcaatcttt caagcgcatc 600 

cgtaaaaatt cacgtaacgg tttaggtatt gtatacgtac agcgtgatgc atgtggtggt 660 

tgtttcaata aaattccgcc tcagagacaa ctggatatcc gttcacgtaa aaaaattatc 72 0 

gtttgcgaat actgtggacg tatcatgatt gacccggaac tggcaggtgt agaaatagaa 780 

cacaaagtag aagaagcacc ggttaccacc aaaagagcta tcagaagaaa agctgaataa 84 0 

<210> 251 
<211> 1359 
<212> DNA 
<213> B. fragilis 

<400> 251 

tatttatcaa gccaaaaaat ggctgtcgca cggggaaggc tatcggcctc ttcctgcaat 60 

tcatcttctt catcgatagg agatatagta gcttctttgt gttgctggca tgaaataata 12 0 

aagaaaaaag agagaagtaa cggtaaaaat gccttcattg gtttgagtct ttataaatta 18 0 

ggaggtaaat atattaaaaa atcaataaaa gcaaggcaag gtgctatctt tttcttatat 240 

ttgcaggcaa attttctact aaaaatgaaa attaaagaaa tagtaagcgc ccttgaacgg 3 00 

ttcgcgcctc tgccattgca agacggattt gataatgccg gcctgcaaat cggattgaca 360 

gatgcggaaa caacaggggc tttgttgtgt cttgacgtta ccgaagctgt gttggatgaa 42 0 

gccatcgcgt ccggatgtaa tctcattata tcccatcatc ctcttatttt taaaggttat 480 

aaatcaatca ccggtaaaga ttacgttgaa cgatgcattc tgaaagcaat caaaaacgac 540 

atcgttattt attcggccca taccaatctg gataatgttc cgggcggagt caatttcaag 600 

atagccgaga agataggatt aaaaaatgta cgtatactcg accctaaaga aagcagtctg 660 

ataaaattag tcacatttgt tccgtctgcc caggctgaag aagtccgtaa tgctttgttc 72 0 

acagcaggat gtggatgtat aggcaattat gattcgtgta gttataatac agaaggggag 780 

ggaactttcc gtgcacagga agggagccat cctttctgcg gaacagtggg agaacttcat 840 

cgcgaaacag aagtgcggat tgaaacgatc ctgcctgaat ataagaaagg agaagttatc 900 

cgtgcattgc tttccaaaca tccatatgaa gaaccggctt atgacttata tcctctccac 960 

aatagttggg cccaagtcgg atcaggaatt gtcggtgaat tggaagaacc ggaatccgaa 102 0 

ctcgaattcc tgaagcgaat aaagaaaata ttcgaagtcg gatgtttgaa gcacaacaaa 10 80 

cttacaggcc gcctgattca gaaagtatcc ctttgcggag gggcaggagc tttccttatt 1140 

ccgcaggcag tccgtagcgg agctgatgtc tttatcacgg gtgaaattaa atatcacgac 1200 



103 



gcgaaacgtg aacttttgct tattgtcacc ccctatcctg aggaggtgcg taagcagatc 
attggtacgg tgaatatgga taatgtacgt ttcctgaagt gcgacacgaa tgatacttgg 



180 
240 



tatttcggtc gtgaaactga cattttgctt gctgaaatag gacattacga aagcgaacaa 12 60 
tatacaaaag aaatttttta ttctataata cgggatttat ttcctaattt tgcactccaa 1320 
tttagtaagg taaacacgaa tcccattaaa tatttataa 13 59 

<210> 252 
<211> 192 
<212> DNA 
<213> B.fragilis 

<400> 252 

acatataatg gtgaagaaca atgtgaagaa ttgtgttttc atattcaatg caaatttgta 60 

aaaggtgata tggaagtcat ttgcatgaat atagcactca attatgcaaa tgtgtcaaga 12 0 

agtaaaatga atcaagaatt actcagattt atcgggtaca tctcttttaa ttgcgtatcc 180 

gtgttaatct ga 192 

<210> 253 
<211> 1191 
<212> DNA 
<213> B.fragilis 

<400> 253 

gtagcgagtg gcatttatta ctattggtgt agtgtccgtt cgctatttat gatttattct 60 

ttaaaaacta aaaaaatggg aataatggtt ggtcttccta cttcaggagg cacagagaaa 12 0 
gatttgcaat tgaactttgg tttaactgtt aatgatcaag tagagatgtt agcccccttc 
ttgcctgcag agtggtttct gcaaagtggt atacaactaa cttggccgca tgccgggaca 

gactgggcat atatgttggc tgaagttcag gaatgtttta taaatatagc ccgtgagatt 3 00 

360 
420 

gcgcgcgatc atggagcaat tactttgatg gatacaggtg gagcaagttt gctggatttt 480 

acttttaatg gttggggaga aaagtttgag gctcgtttag ataatcagat aactcgccgg 540 
gcagtagaag ccggtgcact gaaagggcaa tataaagatt gtctgaattt tgtactcgaa 
ggtggttcca tcgaaagtga cggagccggt actttgctca caacttccga atgtttattg 

tctccacatc gtaattcgcc gatgaatcgt gttgatatag aagaatatct ttgcagagta 72 0 

ttccatttac aacgggtttt atggctcgat catggatact tatcgggaga tgatacggat 7 80 

agccatatag atacattggc tcgcttttgt tctccggata ctattgcgta tgtaaagtgt 840 

accgactctg aagatgaaca ttacgaggcg ctatgcaaaa tggaagagca attgaaaacg 900 

ttccgcacta catcaggtgc tccttatcgt ttattggcat tgcctatggc agacaaaata 960 

gaagtagagg gagagagatt gcctgcaacg tatgccaatt ttttgataat gaatgatgtt 102 0 

gtactttatc cgacttataa tcaaccggaa aatgataaat tggcgaaaga agtgctgtgt 1080 

gaggcttttc cgacatacga agtggtaggc attgattgcc gtgcacttat taagcagcat 114 0 

ggatctttgc attgtgtgac gatgcaatat ccgacaggag tgattaaata a 1191 

<210> 254 
<211> 2448 
<212> DNA 
<213> B. fragilis 

<400> 254 

aaacccggcg gaagtagtaa aaacagaata aaaaaatcta ataatatgat acaacattat 60 

tttaaaattg catgtagaaa ccttttaaaa tataaggtac agaatatctt aagtattgta 12 0 

ggcttgtcta tcggttttac agctttcttg ttaggcgggt attggcatta ctgggaatat 180 

cattttgata gtttccaccc tcaaagttca aggacttatg ccttgactac caccggtata 240 

tttaaaacag ctgacggatc tgtaggagaa ttaaaccaga tacatcagat ggtggaaaaa 300 

gatctggtta ctttccctga aatagctaaa gtttgccatg tcagcaaagt aaaatacgaa 360 

tttgagaagg atacaaaaag ttggatcgga atgaaaatag actccacttt ctttgatatc 42 0 

tttcaatgta aactgatcga agggagctat tataaggttc catttaacgt aaatcatgtg 48 0 

attctgactc aaaaaatggc caacttctac tttggtgaca gtagttgcgt agggaaagag 54 0 

ttgaaaatca acgacaaatt atcatatacc attgcgggag taatggaaaa ttatccccaa 600 

aacagtgatt tcaaatttga atacctgatt ctggccactc catcccccaa tcaagtcaaa 660 



600 
660 



104 



agaaatacga 
atagcagcct 
tttcatttgc 
cttcaacata 
aatctgcttg 
tccactctcg 
cccctgttca 
aaggattaca 
atcacccgac 
tttctggttt 
tcgctggcac 
ctgacttctt 
gtcaccgatc 
acccctttta 
cagcctctcc 
gagggacgta 
ttcctttcct 
agagatatag 
cgtcttatac 
ggtattttac 
atgtatcaaa 
aaggtacatc 
tacagcaaaa 
ctgttcaatc 
atcctgattt 
aaagaaatag 
ctgaaagaat 
ttatttatca 
tttgtatgtg 
gtagtagctg 



cttatgtatg 
acagagtcaa 
gtccgttacc 
tccggatttt 
ttctattcat 
gtgcttccat 
tagcattttt 
ccagtttggt 
aagaagtact 
taagtacggt 
ttagaaacgg 
gcatgttcta 
acatctggca 
tagaagcact 
ttgttttaag 
ataatgtaga 
ttttcggaat 
ttatcaacga 
tcagtgatga 
gtgatttcta 
acaatgctga 
cggataatga 
aagaaatctc 
gtccggaaaa 
cttctttcgg 
ccatccgaaa 
atttatggct 
aaaggtggtt 
tattcctctt 
ccaaaatcaa 



gttacaccct 
agaacccgat 
cgaaattcat 
ggctacggcg 
tggccagcaa 
ctacagtttg 
gctttcaatg 
agcagagagc 
gaaagcatcc 
tcccatcgtc 
attgataatc 
cagtcagtac 
gatcgacctc 
aaaacaaaac 
aggagagtgg 
tgaagcaaca 
gaagatgaaa 
aaccggtgcc 
cgaagattcg 
ctactgtccg 
tgcagcaagg 
aaaacaagca 
cgaggatatg 
gacgatgttc 
cgttttcttc 
ggtaaacgga 
gacactggtc 
ggaaacctat 
cacctgcatc 
tccggcggag 



tcggccgatg 
accaaatgga 
acccgctgct 
ggaatattgg 
caacgaaaag 
atagggaaaa 
gcatttatcg 
agtagctatt 
tattggattt 
ggtctgctga 
ggacaaatct 
cgattcatga 
ggattcgatg 
tcagccatcg 
tattgcagtt 
gaagataatt 
gaaggagaat 
cgcgaactca 
gagaatcatg 
atgcagtacc 
gggtacaatg 
ctgcaatacg 
cagatcattc 
cggatcttcc 
ctggtatcgc 
gcacaatttt 
agtaacgcaa 
gcgtaccata 
atcgttattc 
tcggtgaaaa 



ctgcacatct 
gtaaatattc 
ctcctgagct 
catttgccag 
cacgctataa 
acttacttga 
aattcctgtt 
ataatggagt 
atccgttatg 
aacgaaacag 
tcatcggctc 
gccgtacaga 
caacctacaa 
atgacgtaac 
ttatcactca 
gcatcgttgt 
ggatacagga 
acattccttc 
cagtacccac 
cgctgtcgaa 
gattcagata 
ccaggagaat 
aactttccac 
tgttgttggc 
tctctaccga 
cagatatcct 
ttgctctgcc 
ctgatattca 
tctcggtcat 
gtgaataa 



aagtaaaaag 
tgaatggcgc 
gaaaggcagg 
tgcattaatg 
tgccactttt 
attgaccctc 
tccgttttat 
catccagagt 
ctgcctgata 
tcggggaact 
tttattcctc 
caaaggactg 
tactgattgt 
ggccctgaca 
atttcccata 
gcaaaagaat 
tcaggggaca 
actgacagga 
cagaatcagc 
ggtcttcttt 
tttctatata 
ctattctcaa 
cttaatggaa 
agtactctgt 
acagcgtaaa 
atacttattt 
tttaggatat 
cggatggctg 
gcgacaagtg 



720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2448 



<210> 255 
<211> 1191 
<212> DNA 
<213> B.fragilis 



<400> 255 

ctattcatgg 

gctaaaggcg 

atgagtggca 

ccgaaagtaa 

tcgcgttttc 

tttgttggta 

gtttcgtgtg 

attgttgaag 

gaatctcttg 

gatggagtat 

aaaaaatatg 

cacggacgcg 

acattcagta 

aattatttgc 

acagctgccg 

ttgtgggata 

catacctcca 

actaagatgt 

ccgaacgata 

tttgctatcg 



gattattaca 
tatatccata 
gaaaggtgtt 
ttgaagctgc 
tgaacggtac 
aagaagatgc 
tgacaggtcg 
gacgccgcct 
agaaagagtt 
tcagtatgga 
atgccaatat 
gtacttgtga 
agtcattggc 
gtcacaattc 
ctcgtgctgc 
taaccaatta 
ctcctatcat 
tatttgacga 
cgttgattcg 
gtaagttagt 



agagaagtta 
ctttcgttgt 
aatgtttggc 
tgttgaagct 
actcgacctc 
tatcatttat 
tgaagattat 
ttctttttct 
gcagaaatgt 
gggtgatatt 
catggtagat 
tcatttcgga 
cgctatcggt 
acgttcatat 
acttcagatt 
ctctttaaag 
tcctctatat 
aggtgtgttt 
tttctcgttg 
gaaatgtttc 



gctaaatacg 
atcgaaagtg 
tcaaactcat 
acccgcaaat 
catcttcaat 
tctaccggat 
gtgatctgtg 
accattctta 
cgtcctgatg 
gccaatttgc 
gaagcgcatg 
ttgactaaag 
ggctttattg 
atctttagtg 
atgaaaaacg 
tgtttccgtg 
gtacgtgata 
gtaaatccag 
atggctacac 
aaggcacttg 



acctccctca 
aacagaacac 
acttaggcct 
atggtacagg 
tggagaaaga 
ttcaggtaaa 
atgaacttga 
agttcaagca 
cagtgaaact 
ctgagatcgt 
gtctgggagt 
aggtggatct 
cagcagacga 
caagtaatac 
aaccggaacg 
aacttggttt 
tggagaagac 
ttgtgcctcc 
actctaaaga 
atcttttata 



gcagataaag 
agaggtgata 
gactaatcat 
ttgcgccgga 
attggccgaa 
tctgggtgtg 
ccacgcttct 
taacgatatg 
gattgtagta 
ccgtttgtct 
tttgggtaat 
tatcatgggc 
gtccatcatt 
gcctgctgct 
tattgagcat 
tgagatcgga 
atttatggta 
cgcatgttct 
acagattgat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1191 



<210> 256 



105 



<211> 570 
<212> DNA 
<213> B . fragilis 



<400> 256 

tttgttttta 

tttactaact 

gggcaagagt 

tcagggatta 

gtgtttccta 

ttatccagta 

atctataata 

atgcctcagg 

gttttatatc 

gtaaagtcaa 



tgagaaaaag 
gtctgtttat 
tggaacatca 
ttccacatgt 
atggtatttg 
cttcggattt 
ccgtttataa 
aggaatttaa 
ctttggcact 
tagatttaga 



taatgatata 
aggatactac 
gaaaaaacaa 
aatttctgat 
tgatgtttgt 
agtggtggtt 
acttaagttg 
agatatgaca 
tcatcataaa 
tttcttgtaa 



attttttatt 
tattaccaac 
aattatgaat 
aaaaaagaat 
aataaatggt 
gtccctgata 
tcgtctattt 
tatatattct 
aatatagact 



cgttgttagc 


attgtgtcta 


60 


aaaacaggga 


agtactgttg 


120 


taatagttaa 


tcaaatagaa 


180 


ttgcaggata 


ttttgtcctt 


240 


tgtttaaaca 


aatctctgaa 


300 


aattgaagaa 


gaatatggaa 


360 


tttgttcgga 


aaagtatgcc 


420 


attgctcaaa 


aactggaacg 


480 


tggacttgta 


ctttaaatta 


540 






570 



<210> 257 
<211> 786 
<212> DNA 
<213> B. fragilis 



<400> 257 

tggaatctta tgaaaacaaa gcaagaaatc gtagctaatt 
cgtaacctgg aagactttgg agagtatatc ctgttgacta 
atatttgcag agaaatttaa tgttcccatt ttgggtaaag 
agtgcagaag gaatcacaat catcaacttt ggcatgggaa 
atggatctgc tgagtgccat ctctccaaaa gcctgcctgt 
atcgataaaa aaaataaaat aggtgacctg attctgccaa 
ggtacctcaa acgactattt cccgccggag gttccgtccc 
cgtgccgtat catcggctat ccgtgactat gctcgcgatt 
acaaccaacc gccgtatttg ggagcatgat gacaccttta 
cgtgcaatgg cagttgatat ggaaacggca actctgttca 
atcccgaccg gagctttact actcgtatcc gaccaaccta 
actgataaaa gcgacaacat cgttaccaaa aactatgtag 
atcgcctcgc tacgaatgat cattgatgaa aagaaaactg 
tggtaa 



ggctaccccg 


ttacaccaaa 


60 


acttcaacaa 


gtatgtagaa 


120 


acgccaatat 


gatctctgcc 


180 


gtcccaatgc 


cgccataatt 


240 


ttctgggaaa 


atgtggcgga 


300 


ttgccgctat 


ccgtggtgaa 


360 


tgccggcatt 


tatgctgcag 


420 


attggacagg 


aacagtctat 


480 


aagagtatct 


gaaaagaact 


540 


gttgcggttt 


tgccaatcat 


600 


tgattccgga 


aggagtgaaa 


660 


aggagcatgt 


agagataggc 


720 


taaaacacct 


gaaattcgac 


780 






786 



<210> 258 
<211> 1395 
<212> DNA 
<213> B. fragilis 



<400> 258 

cagctactta tgaaaacagt ccgggaaact atactggagc ccataatcaa tattgtacaa 60 

gtcccaaaaa tgctacagga tgtgtttcgg atcctgatcc aacccgcact tgttcatatt 12 0 

caattttttg taaaaaataa tagttacttg ttcaaatgtt accggaagag gtgttttcct 180 

cttccggttt attctaataa tatgttagtt atgaagtact tgaatttgtt tatattcgtg 240 

ttgttgttgg caggatgtaa tcgacctgtt aaacactccg atattatcca agccgatact 300 

atggtaagta tcatacccca agaggatact atcacattat ctgctctctt ttctagatgt 3 60 

gaaattgtaa aattgaatga tattgtttta gcgtcaataa ataaagtatt taaatacgat 42 0 

tctttgtgga ttgtgcaagg aaagtctgat cagggtgggg tccatttgtt taataatgaa 480 

ggccgatatt taaaaaccgt tttgaaatgg gggcagggac ctgaagaagc atatgatatt 540 

tggagtatta aactattaga tggatctatc tatttattga ttaattctgg aacagaagtt 600 

gtggaatatt ctttgcagaa acaaaaaatg gtagagcgct ttcggctacc gtctgagata 660 

ctttcagcta cagattttgt tgttgataat ggtggaaatt atatattctt aaaatcgatc 72 0 

tcccgagaga aaaaaaagga agagtataaa ctttatgtgt ataataagaa agaggggaca 7 80 

atcgtaaata gaatattgaa tatggataaa aagtctagtg agtatatttc ttttgatcaa 840 

agtgattgtt tatatcgtgt tcaggatgaa atctattatt acgaggtttt tagaaatggt 900 

atttgtcggt tatctgctaa tgatatgact ggatacatcg cttttaaaca aaatgaatat 960 
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acttttccgg aaaaagaact ttataatgaa gatcatacat ttcagtcttt tatagatgtt 102 0 

tgtgaaaata gtccttttat ttgggcgcat cgtaatttat ttgaaggaga gcgctttgtg 1080 

agttctactt atatgtataa aaaagaactg ttttggaata ttatagataa atctgattat 1140 

agcgtacatt catataaatg ggtatatgat gacttgatat taaatgaggt tgtccctgtt 12 00 

gaagattatt tatatcgtgc taatgttcag gagaatatcc attattatac attgtctttt 12 60 

tacgattttg atagaattat gcagttgaaa aagaagtgta aaaaaagcgt aggagaaaag 132 0 

tggatggtaa aactagatga tatgttagat gaaaattcaa atgatataat agtttgtttt 13 8 0 

tatgagaaaa agtaa 13 9 5 

<210> 259 
<211> 1416 
<212> DNA 
<213> B.fragilis 

<400> 259 

cctaagctta tgaaactacg aataggaagt atcacgttct tgctgtttct ttcatccgtt 60 

gcctttccac aggccacgag ccgctatctg gacaaaccat taccacaagg atgggaagaa 12 0 

gatacacaaa tatttcagca agtattgcca gtggacgacc aatggtggaa agcatttcag 180 

gaccccgttc tcgactcact catctccgtt gcagtcaagc agaattattc ggtactgact 240 

gcgattgatc gtatcaatat ggcaaaagcc aacttaagaa tggaacgtgg aaattttttc 3 00 

ccaacaatcg ggttgaatgc cggatggacc cgccagcaaa gcagtggcaa caccagtgac 3 60 

ttgccacaat cgactcaaca ttattatgat gcctcgctca atatgagctg ggagttagac 42 0 

ctctttggaa gcatacgcaa tcgcgtaaaa gcccagaaag agaactttgc ggccagtaaa 480 

gaagaatata ccggcacaat gatatcactt tgtgcccagg tagcctcagc atacatcaac 540 

ctgcgggagt tgcaacaaga attggccgta gtgcaaaaga actgtgcatc ccaagaggcg 600 

gtattaaaaa ttacagaagt aagatacaac accggactcg tatctaaact ggatgtggca 660 

caggctaagt cggtgttctt cagtaccaaa gcatcgattc ctcaaatcga atcgggcatt 72 0 

aatcaataca ttacgaccct tgccatacta ttgggtactt atccccagga agtgcggcca 780 

gctctaaccg ctcccggaac attaccggac tatatggaac ctatcggagt ggggcttccg 840 

gccgatttgt tacttcgccg cccggacata cgcagtgccg aacgaagcgt caatgcacaa 900 

gccgctttag taggagcgtc taagtcggac tggttgcctc aggtctttct aaaaggatcg 960 

gttggttatg cagcaaagga cctgaaagac ctgacccatc ataaaagtat gacctatgaa 102 0 

attgctccgg cactgagttg gacgcttttt aaaggaactc aactagtgaa tgctaccaaa 1080 

ttggccaaag cacaattgga cgaagctatc aaccagttca atcagacagt attgaccgcc 114 0 

gtacaagaga cagacaacgc tatgaacgct taccggaatt ctatcaagca aatagtagct 12 0 0 

ttgcgcgaag tgcgcaatca gggacaagag accctgactc tctcgctgga actttacaaa 12 60 

caaggattga ccccattcca gaacgtactg gatgcccaac gctcactgct cagttatgaa 1320 

aaccagctgg ttcaagccag aggatattct ctgctgcaac tgatagctat gtaccaggca 13 8 0 

ttgggaggcg gatggtccgg aaacctgaat aattaa 1416 

<210> 260 
<211> 408 
<212> DNA 
<213> B.fragilis 

<400> 260 

aataacatta tggcacatcg tcttaacact aacaagcaat ttatggtagg aaacggcatt 60 

ttggcatttg ccgtcatatt tgtcgtggtc atctttgtat atatgagttt aagattacaa 120 

cgagaaaaag aagctaatcg tcattttagt gaaacatact ccattcagtt gacaaaaggc 180 

ttcgtgggtg attctatttc actgtttgtt aacgacagtc tgatcatgaa taaacagatc 240 

aaagaggaac ctactgccat cgaagtcgaa cgcttcgcag agcaaagtgc actgatgatt 300 

gtaaacaatc aaactgaaac agtagccgca tttgacctaa gtgaaaaagg aggtacttac 360 

cgttttgaaa aggatattga cggtatcaaa cagctgccac aaaaatga 408 

<210> 261 
<211> 192 
<212> DNA 
<213> B.fragilis 



107 



<400> 261 

aagttgatta tctggttgtc agtaatggga 
atactcaaag ttattcgttt ctgtcggata 
aatgttaatg actctttccg gcatctatta 
gcaagaagtt aa 



tgcaacacta ttgttgccgg atggattatg 60 
ttccggatta tgacgcttta taaaattaac 12 0 
tcctctttcc aggaagttaa tatcgtaaat 180 

192 



<210> ^262 
<211> 459 
<212> DNA 
<213> B.fragilis 



<400> 262 

tgttattcta 

aagaataacg 

aaaacaattc 

tatgcatatg 

gcttttcatg 

tgtgaagact 

gagtacggta 

tcccctggag 



ttctttttat 
atacctttgc 
tcggttgttt 
aacaagaagt 
tagaattgca 
ctttcgtctc 
cgtatagttt 
agcaaggact 



aatatccgtg 
tgaaattaac 
gattggaggg 
aaaggcattg 
aaaacgaggt 
ttcagtggat 
tcgggttgat 
gcatactgtt 



ataaagatag 
tgtgaaaaac 
tatgcccttc 
catgtatatg 
atggatcaag 
acagccttca 
gcaatgaaga 
gttgtttaa 



tgatttgttt 
gaatggtaat 
ttggtttgtt 
cggatagtgt 
tggaaagttg 
aaaaagttac 
ttcgcaaaaa 



tattcattac 
gtcttggggg 
aggggggaac 
ttttcatgaa 
gagatatgga 
tatacaggac 
tatcgtgtct 



60 

120 

180 

240 

300 

360 

420 

459 



<210> 263 
<211> 378 
<212> DNA 
<213> B.fragilis 



<400> 263 

gctatgaata tagaagaatt tagagaatat tgcctttcat ttaaaggtgt gcatgaccgg 6 0 

atgcccttta aaaaagcaac atctgaatat gatagagatt tactcgtctt ttatgtaatg 120 

gataaatggt tctgttttgt gaatatagac gcattcgatt tctgtaatat aaaatgtaat 180 

gccggacaga tagaggattt gctagacaaa tatgaaggag tacaaccggg ctatcacatg 240 

aataaaaagc attggattag tgtctatttt gataaagacg ttccggataa aatgattaag 3 00 

gacctggtaa agcaatcgta tgaaattgtt gtatcttctt tggcgagacg agagagggaa 3 60 

atattacaag ctatgtaa 37 8 



<210> 264 
<211> 744 
<212> DNA 
<213> B.fragilis 



<400> 264 

acatcgattt 

acaattacaa 

tctgatgatg 

gatatattat 

caatacaaga 

attgagaaat 

atgaagtttg 

aattatcata 

attagcattc 

tcatttacaa 

cttagttttg 

ttccaatgta 

tatggtacga 



tcttgctttt 
atgatatgaa 
aagttcaatt 
accacaatca 
atatcgaaca 
tcttttcaca 
cctctgtaac 
ttcataactg 
tttcagaaag 
aaggagtcaa 
ccaaactaat 
atctaaatct 
taactcgaaa 



ctgcttttta 
aaacacacat 
ctttagaagt 
tgtagagaaa 
acaagcaaca 
atgtgatttc 
tccttacaaa 
gttgccttta 
aattaatttt 
ttattttatt 
atctaataaa 
tccggattat 
ctaa 



ttaccgaata 
gtacttctta 
tcaattatac 
aataaatatc 
atcgtatgta 
aactttcaac 
ctattgattg 
aattcggata 
ctggaaaaga 
gactttccac 
aatattaaat 
attggaatag 



ttgctatcat 
taaaatttaa 
aaaaactggg 
gctactctta 
ttgatcaagg 
tgggaaatag 
aacgacaatc 
actacaaaaa 
ttttgatagg 
tacaatgtaa 
taatgtcgtt 
gcaaacatac 



tacaaccata 
aaataaaata 
tgaccagcca 
tcccttaata 
aacaaaagca 
aaaagtcaat 
aaaaatgata 
gtatcaaaat 
taatatatta 
actccttcaa 
tgatgcagat 
aagcattgga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

744 



<210> 265 
<211> 1152 
<212> DNA 
<213> B. fragilis 
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<400> 265 

agacttgtca 

ggatgtaaag 

gcatatccac 

gaacaaacag 

ccgggaactc 

gataatgtaa 

cgtaacaact 

gtattacaag 

actctgaata 

actgtcagcc 

actttggcta 

caatggcttt 

atcgtgcgcc 

cccaatgtcg 

ggtattctga 

gcagtgcttg 

gtaaacgatt 

acattgcgcc 

atgaaagtac 

tctaatcgtt 



taatgaaaaa 
ggaaaaaaga 
tcgtacaaaa 
taaatctggt 
gtgtgaagca 
ctcaagccga 
atagccgcat 
ctgaatcgaa 
ctgcacacac 
gctcgcttta 
cgatctataa 
caatgctact 
tgggtgagaa 
acttgaacac 
aaagcggact 
ttccggaagc 
cgaatatagt 
agataaagag 
gtgatggcat 
aa 



actaatgtat 
aaccgaaaga 
cattacccta 
agccagagtc 
ggggcaactc 
agcccaactg 
gaaagaggct 
tgtagccgaa 
caatctgggc 
cgatgtgggc 
agacgaccgc 
ctctcaaaac 
cggcacacaa 
aggaacactc 
atacgtcagt 
ttccatcggt 
acgctacagg 
cggactttca 
gaaagtcaag 



attttcctca 
ggagggatgc 
acaaaagatt 
aacggtgcct 
ttattcgtaa 
aaaaccgcac 
ctaaaaagtg 
gccactgcag 
tattgctata 
agctatatca 
atgtacacct 
ggtaaagaaa 
aactatccgg 
aacgtgcgcg 
atcacattgc 
acagatcagt 
catatcgaac 
cccaaagaac 
cccgtatcag 



tcctcccttt 
ctactccgga 
atccgggata 
tgcagtccgc 
tcgaaccaac 
ttgcacagtt 
atgcggtcag 
cagtcagcaa 
tccgtgcccc 
gcggagccgc 
attttaatgt 
aggaactccc 
ccacattgga 
ccaatctgga 
cttatgccga 
tggggaaata 
cgggacaact 
aatatgtcac 
tcaatcacga 



gataatgagc 
aatcagtgtg 
tctgactacc 
ctctttcaca 
gatctacaaa 
ggaatatgcc 
ccgtatacaa 
tgccgaagcc 
tttcaacgga 
acaacctgtc 
tgccgacaac 
caaaaatgtc 
ttatttatcg 
taatccgaaa 
ggcaaaacaa 
cctatacatc 
ggtcaatgac 
cacagcactg 
gtcaccaacc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1152 



<210> 266 
<211> 1239 
<212> DNA 
<213> B.fragilis 



<400> 266 

cacatttgca 

tttgcattga 

ctccaagcac 

gttatcagta 

gataatggaa 

atagtaagta 

gaaacggtat 

atcaaagagc 

caaagacaac 

tcacctaaag 

gactccctcg 

acctctattt 

tttcctatct 

caaataggaa 

atgccggata 

ggaaaatacc 

cctgagatgg 

attgatgatg 

ctaaatgatc 

ttttatgtac 

gatagcctct 



taattgagtg 
atatgaaaac 
aacaaccctg 
tgatgcgtca 
agtttaagtt 
aaggagaagg 
ccatcatcgg 
agcaagaaga 
gattacaagc 
agaaaataca 
agctgttact 
ggctagatca 
cagaagctca 
aaaagataga 
ccgagctgtc 
ttttacttga 
aaatcttatc 
agaaatcatg 
ccaaaggggc 
ttgtaacgcc 
ccgaaagact 



ctatattcat 
acaattcttc 
tattattgaa 
acagggaaca 
cattatacat 
ctttcccaat 
cagtgataaa 
aaatcaatac 
tttatcatca 
aatgacggac 
gtccaaagaa 
tttggaagca 
agtattgtat 
agcttgttta 
aaatatcgat 
tttttggagc 
cgatatgtgg 
gaaagaattc 
attcggattg 
tgatggaaaa 
gaaacaaggg 



gcaaatgact 
acattgttct 
ggaaacatca 
ggtatgaaac 
actcttaata 
acatggctgg 
ctactccgta 
acaaatgagg 
gatatgtgga 
agcatccaaa 
gaaatcaatt 
ttaagccgtc 
cagcaactaa 
acgcctacaa 
ggaaaccacc 
agaagttgtg 
aaagaaaaag 
tctcaaagaa 
tatatccgtt 
attactgata 
ataaaataa 



tccatatcac 
tcaccattat 
atgggattcc 
gaattgccaa 
atcagactga 
acgtatatgc 
catggaatat 
gtttccgcaa 
aaaagatagc 
atatactgta 
taatgaagaa 
aatctgtcta 
catcaacaca 
aagctaagat 
accgtctatc 
gacactgtat 
taacttttat 
agaatatcaa 
acaaagcaaa 
tttggtacgg 



cttttacaaa 
atgtttatca 
tgatgggacc 
cgatacaatc 
agctttgaga 
ctctccggga 
agtaagtaac 
tttgacagac 
tatatcggat 
tccccaacta 
tcttcctgtt 
tctaaaaggc 
acgcaactca 
aggtgatgac 
cgattataaa 
tgaatcactt 
cggtataaat 
atggatcgac 
tggcactccc 
atataataaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1239 



<210> 267 
<211> 636 
<212> DNA 
<213> B. fragilis 



<400> 267 

atatactgca tgaaacagct tattgattta gaaaattgga atagaaaaga acattttaaa 60 
ttcttttctg cttttgacga cccattcttt gggatcacta ctttggtcga ttttacgaat 12 0 
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acctaccatc 
ctgcaatgtg 
aaatatgatt 
ggttttttcg 
gaaagagtga 
atccgctatt 
ggcagaggtg 
tacctgcttc 
gaacttatcg 



aaagcaagga 
taaatgaagt 
ttatccattt 
aatacgatgc 
aaaacagtac 
cggctttgcc 
attctgtacc 
caatttcaat 
agaagttaga 



tgagaaaaag 
agaggctttt 
atcacctacc 
agaccttgaa 
tggcctgtct 
ttggttcgca 
gcgcatttcc 
ttccggtcat 
aacaacaaag 



tctttctttt 
aaattacgca 
ataggacgtg 
gtatttatac 
ttttccgaaa 
ttttcagaga 
actggaaaat 
cacgctctta 
aaataa 



tgtactctgt 
ttgaaggcga 
aagatggaac 
aaaatgctga 
atataggccg 
tgaaacatgc 
taataaaaga 
tggatgggcg 



acattttctg 
acaagtagtg 
attcggtttc 
aaaagaaata 
gttagatctt 
tgtttctttt 
gaatggtgta 
taatgtggca 



180 
240 
300 
360 
420 
480 
540 
600 
636 



<210> 268 
<211> 432 
<212> DNA 
<213> B.fragilis 



<400> 268 

accgtcttaa tccttattat actggaatac atctacatat tgatgttatg gagtgtcttg 60 

tgtcaaaggg gaagaagtct taatccttat tatactggaa tacatctaca taaagacgaa 12 0 

aacggatatt acatcaaaag tgttacacgt cttaatcctt attatactgg aatacatcta 180 

catctaaaca gggcggcgaa gaagttgaac tgcttttgtc ttaatcctta ttatactgga 240 

atacatctac atgaaaagga gaaggatgag gttggtgaaa tgacaagtct taatccttat 3 00 

tatactggaa tacatctaca ttgtttgtgt tccatccgtg aagaaggcta tcggcgtctt 3 60 

aatccttatt atactggaat acatctacat attacttgtt tgggcagtta tctgtcttgt 42 0 

atcgtgtctt aa 432 



<210> 269 
<211> 285 
<212> DNA 
<213> B.fragilis 



<400> 269 

caagtcttaa tccttattat actggaatac atctacattg tttgtgttcc atccgtgaag 60 

aaggctatcg gcgtcttaat ccttattata ctggaataca tctacatatt acttgtttgg 120 

gcagttatct gtcttgtatc gtgtcttaat ccttattata ctggaataca tctacatgtt 180 

acaatcaaca atcaggatat gggccttggt gtcttaatcc ttattatact ggaatacatc 240 

tacatgaaca gtgacatcct ttaccggacg ccacacgtgt cttaa 2 85 



<210> 270 
<211> 420 
<212> DNA 
<213> B.fragilis 



<400> 270 

aagctagaaa 

aacaatatta 

tttggagcag 

gataatgatg 

gctatgcgtc 

gcaaattatt 

gagataacag 



tgaaaatcag 
tcaccaacga 
ctgtcatcca 
ccaacgctga 
agcaatatac 
ctatggcaca 
aagcagctgt 



<210> 271 
<211> 2250 
<212> DNA 
<213> B.fragilis 



taagaagcaa 
caatcaatat 
atccggcctg 
cagacataaa 
tgtaaccgat 
gtatatcata 
tgccatgaaa 



attgagtacg 
cccaaggtct 
attccggcta 
atcattggag 
gcaaccatac 
gaacatggaa 
ttagcattaa 



ctattgaagc 
ttaagggata 
ttatattctt 
ttttaaaaga 
ttgtgtcaag 
acactgatca 
gaatgtacaa 



actcagagcg 
tatctcttcg 
tgaaaacgaa 
tatcatcaat 
tcagattcct 
actgctaaaa 
aagtgaatga 



60 

120 

180 

240 

300 

360 

420 



<400> 271 

aacgaagaaa tcaatatatc tatgatacta cattatttaa agattgtttt caggcagatg 6 0 
gctaaacgca aagtacaaac tgctatatct atattgggaa tcactgccgg tttgctctgt 12 0 
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ttcagtgtat 
tacgaaaatc 
gaggattttg 
tcttcatcta 
aatgctgatt 
ggtatttcgg 
gtcccgccgc 
gttatcaagc 
ccactacctg 
gccgaacaca 
ataccacaaa 
gctattttgg 
ggagcatttg 
tggggacttt 
attaccctgg 
atacaaagaa 
ggaggcttgt 
aagacaatag 
aacacgttac 
atccgtatgc 
aaaaaagaaa 
gaactgataa 
gattacagcc 
ttaatgaaca 
ttataccaaa 
tatccggtta 
cttttaccac 
gatgctcctc 
cagaacgaac 
tcggtgatgt 
gtttatggag 
attaatggtg 
tttttaatag 
cagcatcgag 
ctattaatag 
aatccggcag 



gcaactatta 
aagcagaaat 
aaaagaaaat 
caatcacatt 
atttcaaagt 
gtaacgaagc 
tgggaaaaac 
cctatccggc 
aaaatgcttc 
tttcacaatt 
tagtgctgga 
ggctattggt 
ccaaccgtta 
tcatattgct 
ccattactga 
acttatacat 
tgcttatcag 
cgcaaggatt 
tgagcgtcca 
aaatgaaaga 
ttatggtagt 
acttcctgcg 
aagagtatgg 
tcaaatgcca 
ctttgcaagc 
aaggtttagt 
tttctgccat 
gcaaggaagt 
ctttcgagtt 
ggctgttcgt 
ctatcagcat 
cacgtttacc 
cttcggtagt 
tcatattgtt 
gcataataac 
aagtaattaa 



taatcgtatt 
atgcataaaa 
aggtaaagac 
agatgaaacg 
atttcctaca 
tgtagtaacg 
gatccttaac 
agggatgaat 
atttgggata 
actgcctaaa 
tagccagaca 
cttattggta 
caaagagatc 
attccttgaa 
gagcttgcta 
agatatacat 
cttactgatc 
gagggggggg 
gctactcttc 
atacgatctc 
aaatataggc 
ctcacgccgt 
atttaccgaa 
tcacaaaccg 
agattctacc 
ccatataggt 
gaatgacgaa 
aaaagcagaa 
catcagttta 
tgtctgctct 
tgacactatc 
ggacatctat 
aggaggttta 
tgattatgcc 
agctacaatc 
aaacgaataa 



ttcagtacag 
gaaagatctt 
aaatttgaag 
atatattgta 
gaatgtatag 
actgagtttg 
cagagaggca 
aattatcaca 
cacaaactac 
ttgggacttt 
gaacataagg 
ggcatgatca 
agtctacgta 
caggctgtca 
ccttggttta 
cgattgtggg 
tcattcatct 
acaactaccg 
tcttttctat 
tcagccaatc 
agatacgatc 
tggaatgccg 
ctctgcttcg 
ggagaacctt 
tccgagtctt 
cccgattctc 
atcgggaaaa 
atgagcaagg 
tatgaggaac 
tccatttgct 
cgtaaacaaa 
tggttgtttg 
attagcctct 
gatccatggc 
agctggcaaa 



gtaacaagga 
accaagtaaa 
cagttgcttt 
aagtggacaa 
acggctcctt 
tcaaacaatt 
aaatacatac 
gcagctacga 
tgctcaaacg 
ttcccaacca 
caggcgctga 
actatttctc 
atacgctggg 
tcatcctgat 
taagtacatt 
tttatgaatg 
cttcctggca 
gacaacgtca 
tcattgtagg 
ccaatctgag 
gtattcgaga 
aaacagccta 
tttcggatga 
tttgttacgt 
ttcgttttca 
caagtgcaaa 
tatatatacg 
agatgaatca 
aaaccggatt 
tagtgattac 
aagaagttgc 
ctaaaaacta 
tcgtcatggt 
tttggatggg 
tatattatat 



tttggcaact 
tattccgata 
ttatgtaaat 
aacagaatgt 
aaaacagttt 
ctgtggaggc 
cattattgcc 
tgtttttctt 
cccggaagat 
tccggaatgg 
attatgggtg 
attcagtata 
ttctacttac 
ctgtggaata 
ttctaatgaa 
tcaatatata 
cattgcccat 
cataatccgt 
tacagtcggc 
cacagaggta 
gcatcaaccg 
cactaacagg 
ttattttaat 
aaacgaacaa 
aaaccaagta 
gcaattggct 
attggtccct 
atatctacca 
aggaactatc 
ggtacttggc 
catccgtaaa 
cctgatttta 
catcggcagc 
tcccctcatg 
tgcacggact 



180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2250 



<210> 272 
<211> 426 
<212> DNA 
<213> B.fragilis 



<400> 272 

tgttgcggaa 

atcgtttcta 

tcgtgcgaac 

agtatcctgc 

gcattcccgg 

tccaccccat 

agcatcttct 

gaataa 



ttaaggataa 
tcgtatgcgt 
aaagacgcaa 
agatgttcat 
caagctatgg 
tttggatata 
ggagagtgtg 



atatctgcgt 
cattatctct 
agagatcgct 
caaagaatat 
aatgatgaga 
tatcgtcctg 
gaatgcagcc 



tccgaaagtg 
attttcggca 
atccgcaaag 
ttcgtccttt 
gtgtggatag 
tttgcaggta 
aaacaaaatc 



cgttgatgac 
tcttttcgca 
tgaatggagc 
tgcttgtcgc 
aaagttacgt 
tcggtatcat 
cggcggaagt 



cctgttgggg 
agtaactttg 
caccatagga 
ggctcttata 
cagacaaact 
catcgtcatc 
agtaaaaaca 



60 

120 

180 

240 

300 

360 

420 

426 



<210> 273 
<211> 996 
<212> DNA 
<213> B.fragilis 



<400> 273 

cctttattga tacgtatgaa aataactttt ggacaacaaa cgaccaaagt aaagcaactg 



60 



Ill 



gctgataaga 
tctattaatc 
cttgatttaa 
gggagattga 
tacaatagtt 
tacaatgaac 
attgtgatga 
aaacttcttc 
gactttgatc 
caaaaactcg 
tttgagagat 
ctt caggttc 
att aagaaaa 
aat gatacac 
gaaaagatgg 
ttgcctacgg 



tcagttttga 
agctcagtca 
aagaaagagg 
agaatgtatt 
ttgtaaagcg 
gtttgttcaa 
acttcgataa 
ttcttgattt 
aaggattcta 
ttttcgtgtt 
tttgtgccga 
gcagaggaga 
gtagggttga 
ctgcttatga 
gagataaagc 
aggtcaggtt 



tatctcgatg 
ggcgtatgaa 
gattattgat 
gcttctgctt 
tttgtctatc 
taccatcatc 
cgaaaaactg 
cggtaaattt 
taatgcgcta 
agtagatgat 
tcaacatctt 
agtttatata 
gggattgcaa 
agtgatagat 
tgctgagttt 
aagggcttct 



ggagtatata 
gtttcccgtg 
tctactccgg 
gatgaatatt 
cgctataaag 
cgtgaatcgt 
tctcctaatc 
gagaaagagg 
tttcaactgg 
agtatgcatc 
ggttgtgaag 
gcaattcgcc 
tgtggggttg 
cagggaataa 
gttttgcagg 
ttataa 



aatcgggaga 
atactgtgtt 
gaaaagggta 
ctccttttaa 
tcgacctgct 
tggggcggta 
tttataaaat 
gattttctta 
cagatcgatt 
cccggagtag 
tggtgagtga 
aaatagatgt 
attttggctt 
ctgcactaag 
ggaaaaccat 



ctcattgcct 
taaagctttt 
ttacgttgtg 
atatgcactt 
gtttcatcaa 
taataaatat 
aaatccatct 
tgtttgtcag 
gagaaaatat 
ccgtgacttt 
tattgaggga 
agtaagtatt 
gatcggatat 
tgtggattgg 
acaagattac 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
996 



<210> 274 
<211> 687 
<212> DNA 
<213> B.fragilis 



<400> 274 

aaaattacga 

gaaacatggg 

atgggacctt 

cct acaggag 

cgcaccaatc 

gaactgaatg 

gaacgtaaac 

cattttccac 

gtagccaacc 

ggtaaagagg 

gtaactcact 

caagttgtaa 



ttatgattaa 
cattaaataa 
ccggttgtgg 
gagagtatta 
tccgcaaagg 
tatatgaaaa 
aacgagtgga 
aacagctttc 
ctaaactgat 
ttatgggact 
ctcagcatga 
cagaagttac 



gacaatcaat 
cgtcagcgta 
aaaatctact 
tctgaacgga 
agttattggc 
tattgaattg 
aaaagcaatg 
cggaggtcag 
tcttgccgat 
attgagcgaa 
tgcaggtttc 
tatttaa 



ttgcaaaaaa 
gaggtaaaag 
cttctcaata 
aaagaagtat 
tttgtattcc 
cccttactct 
gagcgcatgg 
caacaacgtg 
gaacctaccg 
ttgaataagg 
gcagaccggg 



tcttcaagac 
agggcgaatt 
ttctcggttt 
ccaaatatac 
aaagtttcaa 
acatgggtat 
ccattaccca 
ttgccattgc 
gtaatcttga 
aaggcactac 
taattaattt 



cgaagaagtt 
tgtcgccatc 
actggataat 
agaatcgcag 
tctgattgat 
tccggcctct 
tagaagcaag 
acgcgccgta 
ctctaaaaat 
catcgttatg 
attcgatggt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

687 



<210> 275 
<211> 630 
<212> DNA 
<213> B.fragilis 



<400> 275 

agatacacga 

tcggaatttt 

cgtggaaaaa 

atcaaagtag 

get tggattc 

cct tttgeat 

tttgatgaat 

ggtcagcgcc 

gtagatgaac 

cgtaatcagg 

tacggatgta 



tgttacaaat 
gtatgegact 
cctctctact 
gaggtatctt 
cacaggagtt 
tgaaagccaa 
taggactgga 
agegcattat 
cgacatctgc 
cagaaaaggg 
accagctaat 



agacaatgea 
aaataaaggc 
caatgeaate 
gcttgaacct 
agccctgccc 
tcgacacatc 
caaagagctc 
gatagcegtt 
actcgatgcc 
aacggctata 
cacactgtag 



tgeattgett 
gagacagctt 
atgggatttg 
actactatcg 
tccgaatggg 
tctttttcaa 
tatcagaaac 
gcagccatgc 
ggttccacag 
ctcgccgttt 



teggegagga 
gcatagcagg 
tcccattaag 
atgccatacg 
taaaagaaat 
aagaaaagct 
gggtaggega 
tggaaaaacc 
acaaagtttt 
ctcatgaccg 



tatactcttc 
tcaatcagga 
aaaaggcaaa 
cagacatata 
gatatcgett 
tttcacttgt 
aatatcgggt 
tttgattatt 
agctttcttc 
gaeattcget 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

630 



<210> 276 

<211> 513 

<212> DNA 

<213> B.fragilis 



112 



<400> 276 

acgctttact tttgtgaatc aactaataag tcatacattt gtaaatcttc aaaggatatg 60 

gaaataaaag ataggattaa aattatcatg gaaaaagaga atatggcttc cggtgctttc 12 0 

gccgaaagca taggtattca gcaatccact ctctctcata ttttgaatgg gcggaacaac 180 

cccagtttgg atgttattat gaaagtacat cagaaatata actatgtaaa attggaatgg 240 

ctgttgtatg ggcaaggcaa tatatccgaa gaaagcatcc aatcagcttc tgattttcaa 3 00 

ccttccttat ttgctgagaa tgccataatt ccgcccaacg ggacagttac tccggaaaat 3 60 

cgcagggaaa tgccgttaga aagttcccaa aacaccccga aagagattgt aaaacaagaa 42 0 

attagataca tagaaaagcc ttccagaaaa ataactgaaa taagaatttt cttcgatgat 480 

aatacgtatg agacattcag aggagaaaaa taa 513 

<210> 277 
<211> 189 
<212> DNA 
<213> B.fragilis 

<400> 277 

tttatttttg tgaaatatcc tccgtcaatg aatattccta ttgatgtgat agattcaatc 60 

atttttggtc ttttttgtat tagtttcgag ttatcgtacc atatccaatg cttgtatgtt 12 0 

tgcctattcc aatataatcc ggaagattta gattacattg gaaatctgca tcaaacgaca 180 

ttaatttaa 189 

<210> 278 
<211> 2061 
<212> DNA 
<213> B.fragilis 

<400> 278 

aaaatatacg ttattatgca aaaaggtaat attggggtta caacagagaa cattttccct 60 

atcatcaaaa agtttttgta cagtgaccat gaaatcttcc tgcgggaatt agtatccaat 12 0 

gccgttgatg ccactcagaa gttgaataca ttggcttcta tcagtgaatt taagggcgaa 180 

ctgggtgatt tgaccgttca cgtttcatta ggcaaagaca ccattaccat ctccgatcgt 240 

ggtatcggtt tgactgctga agagattgat aaatatatca accagattgc cttttcggga 3 00 

gctaacgatt tccttgaaaa atataaaaac gatgcgaatg ccatcattgg acacttcgga 3 60 

cttgggttct actctgcatt catggtttcc aagaaggttg aaattatcac caaatcatat 42 0 

aaagaaggtg cacaggccgt aaaatggact tgcgacggta gtccggagtt tacacttgaa 480 

gaggtggaga aagcggatcg tggtacagat atcgtattgt atattgatga tgattgcaag 540 

gagtttctcg aggagtcacg catctctgcc ctcctgaaga aatattgcag cttcctgccc 600 

gttcccatcg cttttggtaa aaagaaagag tggaaagacg gcaaacaagt agagacggcg 660 

gaagataatg tcatcaatga caccattcct ttgtggacaa agaaaccgag tgaattgtcg 72 0 

gacgaagatt ataaaaaatt ctatcgtgag ctttatccga tgtcagacga acctttgttc 780 

tggattcatt tgaatgtaga ctatccgttc catctgaccg gtatcctcta cttcccgaag 840 

gtaaagagca atattgattt gaataagaat aagattcagt tgtattgtaa tcaggtttat 900 

gttacggatt ctgtagaagg tattgttccg gatttcctta ctctgctcca tggtgtgctc 960 

gattcaccgg atattccttt gaatgtatcc cgttcttacc tgcaaagtga ttcgaacgtg 1020 

aagaagatct ctacctatat ttcgaaaaag gtatcagacc gtctgcaatc tatctttaag 1080 

aatgatcgcg ctcagttcga agagaagtgg aatgatttaa aaatctttat taattatgga 1140 

atgctcactc aagaggattt ctatgataaa gcacaaaaat tcgccctttt caccgatacg 12 00 

gatggcaaac attacacctt tgaggagtac cagactttga ttaaagataa tcagacagat 1260 

aaagataaaa acctgatcta tctgtatgcc aataataagg acgaacagtt tgcctatatc 132 0 

gaagctgcca aaaataaagg ttacaatgtg ctgttgatgg acgggcaact ggatgtggcc 13 80 

atggtaagta tgctcgaaca gaaactggag aaatctcgct tcacccgtgt agacagtgat 1440 

gttgtcgaca acctgattgt gaaagaagat aagaagagcg atgtgcttga ggcttcaaaa 15 00 

caagaagctc tgtcagcagc cttcaagagt cagttgccga aaatggaaaa ggttgaattt 15 60 

aatgtcatga ctcaggcttt aggcgaaaac ggctctcccg tgatgataac ccagagcgaa 1620 

tatatgcgcc gtatgaagga aatggccaat attcaggctg gcatgagttt ctatggtgaa 1680 

atgcccgata tgtttaatct ggtattgaat tcagaccata aattggtgaa agaagtattg 1740 

gctgatgaag aaaaagagtg cagtgctgcc attgctccta tacagacgga actggaagat 1800 

gtgacaaaac gtcgtgatgc actcaagaaa aaacaagaag gcaagaaaga cgaagatatc 1860 



113 



cctactgcgg agaaagatga actcaatgat ctggataaga aatgggatga gttgaagcag 1920 

cagaaagatt ctatttttgc cggatatgca ggcaaaaaca aagtggtacg tcagttgatc 1980 

gatctggcat tgttgcaaaa caatatgctg aaaggtgaag cattaaataa ctttgtaaaa 2 040 

agaagcattg agctgattta a 2061 



<210> 279 
<211> 402 
<212> DNA 
<213> B.fragilis 



<400> 279 

aatcactcag 

agcatgaaaa 

gcctgttcgc 

gtcggttcat 

cagggcgccc 

tctgtcagcg 

gggaaactcg 



taaaaaagga 
aatatatact 
aaggaaagca 
tcgaccaaat 
ccaccgttca 
gtcgaacatt 
agatcagagt 



gtgggaacta 
atcgagtctt 
aatcagtgga 
aaaatcgatg 
gatttatggt 
aacgattaaa 
atcttctcca 



tcaagaatat 
acaattactt 
agttccaact 
agtagttcag 
cccgacaata 
ttcaaaaaga 
tcattaacct 



ctaatataac 
ttttgttact 
acatcactaa 
atattgttta 
tagttgaatt 
atacctccat 
aa 



aaatcagaag 
cagcatcaca 
aaatataaaa 
tacacaaaaa 
gatggaaacc 
ccgtaatagt 



60 

120 

180 

240 

300 

360 

402 



<210> 280 
<211> 912 
<212> DNA 
<213> B.fragilis 



<400> 280 

agattattat taatactcat gatacagact agattgaaag gaatgggggt agcgctgatt 60 

actcctttca aagaggatga aagcgttgat tacgatgcgt taatgcgact ggtagactat 12 0 

ctgctgcaaa ataatgcaga ttttctgtgt gtgctgggaa ctacagccga aactccgacc 180 

ttgagtgaag aagaaaaaaa gaaaatcaaa aagatggtaa tcgaccgtgt caacggaaga 240 

atccccatcc tgctgggagt cggaagtaac aatacacgcg cagttgtaga gacactcaaa 3 00 

aacgacgatt tcaccggagt agatgctatc ttatccgttg tcccttacta caataaaccc 3 60 

tcacaagaag gaatttatca gcactataaa gcaattgcaa gcgctacaga gcttcccatc 420 

gtattatata atgttccggg acgtacagga gttaatatga ccgcagagac cactttgcgc 480 

attgctaagg actttcagaa tgttatagcc attaaagagg cttctggtaa tatcacccag 540 

atggatgata tcattaaaaa caaaccggct aactttgacg ttatttccgg agatgacggt 60 0 

attactttcc cgctgattac attgggagcc gtaggagtca tttcggttat tggaaacgcc 660 

tttccacgtg aattcagcag aatgacccgt ttggcgctgc agggcgactt tgccaatgca 72 0 

ctaaccatac accataaatt tacggaactg tttaacctct tatttgtaga cggaaaccca 780 

gccggagtaa aatccatgtt gaacgctatg ggaatgatcg agaataaact ccgtttacca 840 

ttagtaccga cacgcatcac cacatttgaa gcgattcgta aagtactcaa tgaactgaat 9 00 

ataaaatgtt aa 912 



<210> 281 
<211> 2236 
<212> DNA 
<213> B.fragilis 



<220> 

<221> unsure 
<222> (16) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 281 

ggatgcattc ttacancccc gtggtgaata cgtactgctg gccatcctta aagataagga 60 

taatctggca gcaacggttc ttgaagcgaa tcatgtgaat taccagcagg tattcgaaca 12 0 

attgtcctta cagccggata tcagtgccgg catgggattt acagaagatg atgatgacga 18 0 

agaagagatg aatcagtccc gttcgtccca tggatccggt gaacgtcagc aacaggcgca 240 

gactgcctcc aggaagccga ctaatgatac tccggtgctt gataattttg gtactgatat 3 00 

gactaaggcc gccgaggaag gccgtcttga ccctgtggtg ggacgtgagc gggaaatcga 3 60 



114 



gcgcctggca cagatattaa gtcgccgtaa gaagaataac cccattttga tcggtgaacc 42 0 

gggagtcgga aaatcggcca tagtggaagg tctggcactt cgtattatac agaaaaaggt 4 80 

gtcccgtatt ctgtttgaca agcgtgtggt tgcactcgat atgactgcgg ttgttgccgg 540 

taccaagtac cgtggacagt ttgaggaacg cattcgttcc atcttgaacg aattgcagaa 600 

gaatccgaat gtgattctgt tcattgacga gatacatacc attgtaggtg ccggatcggc 660 

agccggatcg atggatgctg ccaacatgtt gaagccggca ttggcgcgtg gagagattca 72 0 

gtgtatcggt gccactaccc ttgacgaata tcggaagaat atcgaaaaag acggggcgtt 780 

ggagcgtcgt ttccagaagg taatggtaga gcctactaca gctgacgaaa cgttgcagat 840 

tcttcgtaat attaaggata aatatgaaga tcatcacaac gtaaattata cggatgcggc 9 00 

attggaagct tgtgtcaagt tgacagaccg ttatataacc gaccgtaact tcccggataa 9 60 

agctattgat gcactcgatg aagccggttc gcgtgtacat cttaccaatg tgagtgtacc 102 0 

caaggaaata gaagatcagg agaagttgat cgaagaagct aaaaataaca agaacgaggc 1080 

tgtcaaatca cagaatttcg aacttgctgc cagttttcgc gataaggaaa aagaacttgc 1140 

tgtccagttg gatgtgatga agaaagactg ggaggaacgt ttgaaggata atcgtgagac 12 00 

ggtggatgag gaagaaatcg caaatgtcgt atcaatgatg tccggcattc cggtacagcg 1260 

tatggcacag gcggaaggca tcaagttggc aggcatgaaa gaagacctgc aatcaaaggt 132 0 

gatagctcag gacgatgcta tcaaaaagct ggtcaaggcc attctgcgca gccgtgtcgg 13 80 

actgaaagat ccgaataaac cgattggtac atttatgttc ctaggcccta ccggcgttgg 1440 

taaaactcat ttggccaagg aattggctaa atatatgttt ggttcttcgg atgcattgat 1500 

ccgtatcgat atgagtgagt ttatggagaa attcacagtc tcacgcttgg ttggagcgcc 15 60 

tccgggatac gtaggatacg aggaaggcgg acaattgaca gagaaagtac gccgtaaacc 162 0 

ctattctatc gtattgcttg acgaaataga aaaggcgcat cccgatgtgt tcaatctgct 1680 

tctccaggtg atggacgaag gtcggctgac tgacagttat ggcagaatgg ttgacttcaa 1740 

gaatactgtt attatcatga catcgaatat cggaacccgc cagttgaaag agtttgggcg 1800 

tggagtcggt tttgccactc aaagccgtct tgacgataaa gaattctctc gcagcgtgat 1860 

tcagaaggct ctgaataaat cgtttgcacc cgaatttata aatcgtgttg acgaaatcat 1920 

cacctttgac cagttgtcat tagaagctat aacgaagatt atcgatattg agttgaaagg 1980 

actgtataac agaatcgaat ctatcggcta taaactggtc attgaagaca aggctaaaca 2 040 

gtttgtcgct tcaaaaggct atgatgtcca gtacggtgca cgtccgctga agcgtgccat 2100 

ccagacctat ctggaagacg gcttatcgga acttatcatt tcggctgatc tgaatgaagg 2160 

agatacgatc actgtctctt tgaatgaaga aaagggtgag ttggaaatga agaatgaagc 22 2 0 

caaaacggct gaataa 223 6 



<210> 282 
<211> 717 
<212> DNA 
<213> B.fragilis 



<400> 282 

tctaaatctt ccggattata ttggaatagg caaacataca agcattggat atggtacgat 60 

aactcgaaac taatacaaaa aagaccaaaa atgattgaat ctatcacatc aataggaata 12 0 

ttcattgacg gaggatattt cacaaaaata aatcaggctc ttgaggaaaa attgtcactg 180 

aatatcgaca taaccttttt ctttaaattt ataaaagaga aaatagccta tgaatataat 240 

ttaaacactg aattctgtca aataacagaa agtcattatt tccgtggacg gtatcgtgtt 3 00 

aacgatgcta ataacaaaca tttgttattc agtgaacgta agtttgaaga ttcactaatt 3 60 

gaaaatgatg tcatttttca ttacaagcat ttacgtgaaa tacaaaagga aggtgaaatt 42 0 

aacgttatag agaaaggcat tgatgtatgg ttcgctcttg aagcatacga gttatcactc 480 

tttcgaaaat ttgattttgt tattctgatt acaggtgacg ccgatcacga aatgttaata 540 

aaaaaattaa aagctctcaa aatccataca attcttttaa catgggattt atctccagaa 600 

tctgcaactg cacggctgtt gcgggaagaa gcatgtaaac atatagaatt aagtgaaatc 660 

gctatagaag ataaggatct aataaaaaag atatgcagaa gcaagcaaaa gagataa 717 



<210> 283 
<211> 771 
<212> DNA 
<213> B.fragilis 



<400> 283 

aaaattatgt ctgaaaatat aagagtaagc gaagtatccg acattctgcg gcagcagctt 



60 



115 



gaagggatcg agaccaaagt gcagcttgac gaaataggta cggtgctaca ggtaagcgat 12 0 

ggtgtagtgc gtatttatgg tctacgcaat gccgaggcca acgaactact tgaatttgac 180 

aatggtatca aggccattgt gatgaacttg gaagaagata atgtaggtgc cgtgttgctg 240 

ggaccgacgg ataaaatcaa ggagggattt acggtgaaac gtaccaagcg aattgcttct 3 00 

atccgtgtgg gagaaagtat gttgggacgc gttatcgacc cgttgggtga accattggat 3 60 

ggaaaagggc tgataggagg tgaactttat gaaatgccgc tggagcgtaa agctcccggg 42 0 

gtcatctatc gtcagccggt gaatcaacct ttgcaaacgg gtctgaaggc tgttgatgca 480 

atgatcccta tcggtcgtgg acagcgtgag ttgataatcg gtgaccgaca gacgggtaag 540 

acatcgatag ccattgatac gatcatcaat cagcgaagta attatgaagc aggtgatcct 600 

gtatattgga tttatgtaac tatcggacaa aaaggttcca cggtagcttc tatcgtaaac 660 

accttacgcc aatatggggc gatggattat actattgtgg tggcggctac agctggagac 72 0 

ccggctgcat tgcaatattt tgctccgttt ggcgggggct gccatcggtg a 771 

<210> 284 
<211> 798 
<212> DNA 
<213> B.fragilis 

<400> 284 

aaaggagctt tatctatgga gttgcgtact gtcaatgtca ctcgttatat tatgcctctg 60 

cgtgaaggtg gttcactgcc tgcattggca gaagctgatg acagttttaa gtatgttgtc 12 0 

aagtttcggg gagcgggaca tggaaccaag gcattaattg cagaactgat tggcggtgag 180 

gttgcacgag tattaggctt tcgtgtaccg gagttagtgt ttttgaattt agatgaagct 240 

ttcggacgtt cggagggtga cgaagagata caggatttat tgcaaggaag ccgcggatta 3 00 

aatatgggac tacattttct ctcaggggct ctaccattcg atccggttgt cactgaagtt 3 60 

gatgaaaaac tggcatcaca ggtggtatgg ttagatgctt tattgactaa tgtagatcgt 42 0 

acagtgaaga ataccaatat gcttatgtgg cataaagagt tgtggttgat agatcatggt 480 

gcatctctat tttttcatca ttcatgggtc aattggcata aacatgcact tagttctttt 540 

acccaagtta aagaccatgc cttattgccg cttgccggta agttggacga agtggatgcc 600 

gaatttcgga aattactgac ttcggaaaaa atacgtgaaa tagtggatct gattcctgat 660 

agctggatag agtggcgtga taaagatgaa actcctcaag atattcgtga tatctattat 72 0 

cgatttttga aagaaaggat tgaacattct gaaatatttg taaaagaagc acaacatgcc 7 80 

agaaaagcat atttatga 798 

<210> 285 
<211> 441 
<212> DNA 
<213> B.fragilis 

<400> 285 

tctgttttta ctatgaatat gagcatcaca aaacgcaatt ttctgggtta tctcagcatc 60 

cttactcttg tagggggagg attgggagcc ttggtcttgc attatctgga acccggacat 12 0 

tatttcggag gttatccgtt gataccggtg tacttttata tattcggtgt attttatatt 180 

tatatgtttg atgcctgcag gcgtcatgca ccggagaaga tggtgatgct ctttttagtg 240 

gcaaaagtat tgaaaatgat tgtatcagtt ttcttactaa tcatttattg tgtggctgtg 3 00 

cccgattccg ctattgaatt tctattgaca ttcctggcgt tctatctggg ctatcttata 3 60 

tatgaaagct ggtttttctt cgttttcgag tggaatcaga aacttacaaa gaaatcaaaa 42 0 

aaatatgaaa cagttgcgta a 441 

<210> 286 
<211> 1386 
<212> DNA 
<213> B.fragilis 

<400> 286 

aaatatgtaa cgcttcacta tatggcacaa caaaccgatc cccgcatact gggtacagaa 60 

cctattggca aacttctgtt acaatattcc atcccggcca tcatcggaat gactattacg 12 0 

tcactttata atatcatcga cagtattttt atcgggcacg gtgtcggtcc catggctatc 180 

tccggactgg cgatcacctt cccgctaatg aatctggtcg tagcgttctg tgtactgatt 240 
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tcggcaggtg 
accgatgttc 
ttggcttatc 
ccctatgccc 
atagggttga 
ctggtgacag 
gggattcggg 
gtaaaccact 
aagaaacgca 
acggcatgta 
gctatcggtg 
gggttgacta 
cgtgtaaagc 
ttcattatct 
ttgattgaca 
gcacaaatcg 
ctgtcacttt 
tatggtgtaa 
gcggtagtga 
ctataa 



gagctaccat 
tgggaaatac 
tattcctgga 
gtgattttat 
acaacgtgat 
ttattgccaa 
gagctgctat 
tccgtaacaa 
tcatcggcag 
tcattgtcat 
cctatggtat 
tgggaatgca 
atacgctccg 
gtgaactttt 
tggcatcgtc 
ttatatcaaa 
cgcgccagtt 
agggggtatg 
ttttgatggt 



ctcatccata 
attgatgctt 
cccgatattg 
gcaagtgatt 
gcgtgctacg 
tgtcatcatc 
ggctacagtc 
agagagtttt 
tatcttctcc 
actcattaat 
cattaatcgc 
gcccatcgtc 
tctcggtatc 
tccgcacaca 
cgggctgcgc 
tttcttccag 
agtatacctg 
gatcagcatg 
gtatatcaag 



cgcttgggac 
tgcctgacga 
tttttcttcg 
ctcttgggaa 
ggatatccga 
gctcctgtct 
ctgtcacagt 
gtccatttca 
ataggaatgt 
aatagtttgc 
ctgctgatgc 
ggatacaatt 
attgtcggtg 
gtttcggcca 
atttgcacgt 
agtataggaa 
ctcccgggat 
cctgtctctg 
aaagtaaaag 



aaaaagacat 
atgcagtgct 
gcgccagtac 
ctcccatcac 
aaaaggccat 
tcattttcca 
ttatcggaat 
tgccgggttt 
ctccttttgc 
aaaaatacgg 
tgtatgtaat 
atggtgcaca 
tattgataac 
tctttaccga 
tgatgttccc 
tggctaagat 
tacttctgct 
acggattggc 
agaaaacatc 



caagggtgcg 
gttcggtgga 
cggtacactt 
ttataccatg 
gttgacatcc 
tttcggctgg 
gatatgggta 
ctggaaaatg 
catgaatgtt 
tggcgatatg 
ggtggtaatg 
gaagattgac 
gagtagcggt 
tagcgatgaa 
gtttgtaggt 
cagtattttt 
tccaccctta 
ttttgtaaca 
cggacagaag 



300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1386 



<210> 287 
<211> 993 
<212> DNA 
<213> B.fragilis 



<400> 287 

actgaggtaa 

ctatgtgctt 

gtatacgact 

ctgtatgtat 

tgtgcaccgg 

gtggcttggt 

tctacgttgg 

gtggtcgaca 

tataataatg 

atgcagatgc 

cccaatggga 

gaatatactg 

acgttgcgtc 

aatgtgattc 

gctgatatac 

tttaaaggta 

ttgatccgga 



ctatgaacag 
gctgcgtcag 
acaacctgca 
tcgatgaaaa 
attacctgat 

cggggttgta 
aagatctgga 
gagaactcca 
atatcaccac 
tcgacgacag 
ggtacaacca 
cttatcatac 
tgatgactga 
ttgacatccc 
ctttgcagga 
tggatggcaa 
agcaagaggt 



atttatcgga 
agacgggatg 
atacatagac 
aggtgtattt 
gactctaccc 
tgacaagtcg 
agtcagtgtc 
tttactgtgg 
tgtctctttg 
cagtattcac 
tgaaaatggg 
cgaagacgat 
taccgaaaac 
cttgaataaa 
atatctggac 
cggaaattat 
cgatggagtt 



tacatacagg 
gatgaagatt 
ttgattcata 
gtgacagaat 
ggagccatgg 
tatgacaaag 
aacaatctga 
catggcaaac 
ctgaagaata 
gtggatgatt 
cttttgggag 
cctgagaccg 
agattggtta 
tatcttaatg 
cgtgccgata 
ataagtgtgg 
taa 



ttgcatgttg 
gcaactgtta 
agcaggcaac 
cggaagaaga 
caggcagaag 
ttaccctgac 
aaacccggat 
agacggaagt 
cgaagaaatt 
atgattttcg 
acgaaacgga 
gagctatagc 
tcacgcataa 
cgctgaggct 
aacacggtat 
atgtacagat 



ctgccttctg 
tgtgcgcttt 
caagatgaac 
atccggtgct 
atatattttt 
tcccggggta 
tggaggaggg 
atctccacag 
ccgtatcatc 
aattatttcg 
cgagaaagtg 
caaactgaat 
gtcatcgggt 
tcagcaatat 
tattctattc 
caacggttgg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

993 



<210> 288 
<211> 2307 
<212> DNA 
<213> B.fragilis 



<400> 288 

aaagacaaaa 

agttgtgtca 

tctcgggtag 

attgtaaaga 

ttaataccga 

tgctcaagta 

tttattttat 

ccgaatgatt 



caatgaaaca 
ttaaggtgat 
ctttcgaact 
cggaatggat 
tagcatcgac 
tatcttttga 
cagactccct 
tgaccaatcc 



attccattat 
ctcattgtcg 
gagctatgat 
taaggatgga 
agttgctgag 
agctattttc 
ttatttccgt 
tgatgtgctc 



accattcaga 
ttgggattat 
aactgttttc 
gtgatcaaag 
gagtttccga 
aaaataggta 
actatgggaa 
tttttatcac 



cattaattcg 
tggtatctat 
aggatgtgga 
gaaatgcagg 
aggaggtgga 
atcggaagat 
ttgaggtcat 
agtcggttgc 



cgatcgtaga 
cattttattt 
caacctttac 
atcatataca 
aagtgcggtt 
gaataaatct 
tagtggtaat 
acgagaagca 



60 

120 

180 

240 

300 

360 

420 

480 
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ttcggtgaag 
gaaactttgg 
gaagcagtcc 
agtggtggta 
attaatactg 
ttacatatga 
acgattctca 
gtgcttattt 
aacggtgcat 
ggtgtttcct 
ttggcagagg 
gtagttacct 
cccgttactc 
ttgtttattg 
cagtgtcact 
aagcatgatt 
gagggggtag 
gaaggtggaa 
ttattgggac 
aatcagccat 
aatcgaggta 
aatgaacctt 
gaacctttta 
gaggatatag 
atttttcgtg 
ttgattggtt 
ataaatggtg 
gcaattctct 
agtcaatttg 
ttattggtct 
ccggtgaaca 



aaaatcctat 
taaaaggagt 
tgtcttttgc 
attacaatgc 
acattgataa 
ttgtggttcc 
tattatcttt 
ttgtttcctc 
ccgataaggc 
tggtgcttat 
tatccttatc 
ttctgtttgt 
aggtcttcca 
aatttgcagg 
acattataaa 
ttgctgaacc 
cttctattcg 
aagttctttt 
ttcatatcaa 
atgtagagaa 
cggtggttgg 
tggaaattga 
cagagaactt 
agtttaggtc 
acgccacttt 
atataaatga 
ccgaagccag 
cggttgccat 
aagatacaat 
ttatctttgt 
gcatcaagtc 



tggaaagact 
atttgccgat 
cagtcatagt 
ctttattcgt 
agtaattgca 
tttgcggact 
gttaggattc 
actttctcag 
tatattttct 
gatcatattt 
atctcttttt 
tattggaggc 
tccctatatt 
ggtggctttt 
tagggatatg 
tgacaacgca 
tggaagtatg 
cactcctcgt 
gaccgggcgt 
gatgggctgg 
cgtgcttgct 
atatggtact 
gcatagactg 
tttagaacag 
tttggctttt 
tgaagtccgt 
atctattctc 
cggtacatac 
ctgtgtttat 
tttcattata 
tgagtaa 



ttgcatatga 
cttccgtata 
aaatatggtt 
ttaaaagatg 
aaacatattc 
attcacctgg 
gccattcttt 
cgtgccaaag 
atgtttatat 
ctattccaat 
acctggcata 
atattgcccg 
aaagagaata 
atctttggat 
ggatatcagc 
cgtaataacc 
acctggtttg 
tgtgcggcat 
aattttacag 
aagggtagtg 
cctttttgtt 
aatttgagga 
aataatgaaa 
gatttagaac 
ataacgatac 
cggcgtagca 
tttttattat 
ggagcttact 
gcaggttggt 
gggagatcct 



tggtttgggg 
atgtatcgtt 
ggggacgtcc 
gagaaaggag 
cctcagacat 
aacattcaga 
ttgcggccac 
gaattggaat 
atgaaacggc 
ttcaggaaaa 
atttatgggc 
gaaaaatatt 
ggggatggaa 
tgatgtgtgt 
cgaagggtgt 
tgaaatcttt 
ggaatcggga 
tcgacaaaga 
gtgaaaggca 
gggttggcga 
gtggtgtgct 
atgtccatgt 
tgaaaaagat 
gatactatcg 
tgtttatcac 
aagagatagc 
ctaaagatat 
acatgagtct 
atgtagtgac 
ggcatatcgc 



cactcccgta 
ggaacgtcac 
tggctggacg 
tgctgatgtt 
gaatatgcac 
tgtgaaaagg 
catgaactat 
tcataagtgt 
tttgattatc 
gatcgaagaa 
tccattatcg 
ttctttgatt 
aaggatattg 
ggcatatctt 
tgcatcttgt 
accttatgtg 
ggtaactgat 
ctttgttcca 
gtttctggtt 
gatagttcct 
tccggctgat 
tcgtctcaag 
atatccgcaa 
ccccacaata 
tctgatggga 
tattcgtaaa 
cttctgggtt 
gttgtggata 
agctatttgt 
taatgaaaat 



540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2307 



<210> 289 
<211> 1215 
<212> DNA 
<213> B.fragilis 

<220> 

<221> unsure 

<222> (295) , (339) , (357) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 289 

agaactgcga 

tggacattgc 

ttgttgtacc 

acgggagtca 

tatttggtcg 

tgcagcaacc 

attgtgcagg 

atcaccaatt 

gggatgatta 

ttgctatatg 

gttccctttc 

cggggactta 

cctgtgcttg 

gctttggtcg 

gataataaaa 

atgctgttct 



cttcccgaac 
attttatgcg 
ctgtactccc 
tatttatatt 
atgtttacaa 
agccgggtta 
gattatcttt 
ccacttttcg 
tcggagcggc 
ttgctgtggc 
gtgcacctat 
ttccggcatt 
ccggtgcgcc 
gatgcggatt 
tgtggttgca 
ctcctgaaac 



gagaaagaac 
gatatgcctt 
ggtaatgatg 
ttttacgctt 
gcggaaatat 
tacgttggtg 
tggaatggct 
cagcgcaggt 
tttgggagta 
gttgggagcg 
cgggatgaaa 
caatttaata 
gagtgatgtg 
tctcttatca 
gatagttgtc 
aagctggaat 



aatattaatt 
gctaatttat 
gcttcacgtt 
gcaatgtttt 
atatgtatgc 
cagaacgcna 
gccacggcag 
aatgtggtct 
tatttgtttc 
ttgggaatat 
gtttgttcca 
ttgattgctt 
ccggtaggag 
gtattaattg 
gggctggtga 
gctccggcag 



gtcgtatggc 
tgttgttcat 
taggcgtgcc 
ttatcggccc 
tttcgtttgg 
cacatctgtt 
gcatcacatt 
tttcctgggc 
ggacacatgg 
tatttgtgtc 
tggaccgatt 
ttataccggg 
gcgagacagt 
tgaaattgtt 
cagtaatcgg 
ctgtattgat 



gactaaatta 
atccctgtat 
tgtcagtcag 
gttccatgcc 
gggtnttggt 
gatgctntgc 
ggccatcgat 
tgcacgcttg 
ctttgagact 
aagggtatat 
cctgcttctc 
attgatgctt 
accttttttt 
tttccgctat 
atcgatggct 
gggtttagga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 
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ctcggcttgg tcactccgga gtttcttatg atgtttgtca aattgtcgca acactgccag 102 0 

cgcggcacgg ctaataccac tcatctactt gcatgggaac ttggagtcgg attgggtatt 1080 

gcatctgcgt gtcatttaca tcttacggct aatgaacagg ccgtttatcg ggtcggattg 1140 

ttatcggcaa tcgtgtcttt ggcattcttc gttttactta cgtatcctta ttttaaaagg 1200 

aagaaggtaa gataa 1215 

<210> 290 
<211> 1401 
<212> DNA 
<213> B.fragilis 



<400> 290 

aatgcgcgga 

gaagagcaag 

cgttactctt 

gtagtgagag 

ggtcgtgaag 

ttgctggaga 

acaaaagagt 

aatcatcagg 

gaagaaaatc 

gtgcctgagg 

accaatatgt 

ccgaatgaaa 

aactttacaa 

tcagggactc 

atggacgctg 

acgccgaatg 

ttcaaaggca 

attcctcagg 

aacccgaatg 

tatcaaggag 

gtattgatgg 

aataagatta 

gaagacactt 

aatgtagagg 



cccttccagc 
tcgtgaatca 
gggacctgaa 
gagaagatgt 
tggtgaaaca 
ttaccgaaca 
cgctgataca 
accttatttt 
cggtaagggt 
tagtaccgca 
atacctactg 
tggaacaact 
gattcagttc 
cggagttgag 
ccgaccagcg 
gcttcggatc 
atgctatcag 
aattacgtga 
catttaattc 
tgtgctatta 
gatacggacg 
tcggtccggg 
cctggatctc 
aacttttata 



cccgtggtgg 
tgtacgcatg 
tgtatctact 
gccatcggct 
ggattatgag 
gggaaatccc 
gccttatggt 
tgttcctgag 
cgaggtggaa 
cggtgatcgg 
gatgcgtaag 
caatgccgga 
ctggaatgga 
caagaatttt 
gcatgacgta 
ggttgcgact 
agtggaagcc 
tgcaggtttg 
acctacggtt 
tacggttctg 
ctatggcgtg 
acagccggtg 
ggctgacgtc 
a 



aagaccgacc 
gtgctgtatg 
gacggaatga 
acgccgactg 
ttattgatat 
agaagttatt 
attgctgccg 
gtggagttga 
cgtgcggtag 
attgacaatc 
atgactttta 
taccgcgagg 
ggaaaccctg 
gatgactatg 
actacccgtg 
cgtaacggag 
atgcgggata 
gaacaggcta 
tctttttcgg 
attcgtcatt 
gtacggaata 
atcaatcctc 
aacattatgc 



cgacaacttc 
aaaccaagaa 
atgaatttac 
tgagtcgttt 
tgatcaatcc 
tatcgcgtgc 
ataataactt 
gggataatca 
ctaaagtcgt 
tgaaatgggg 
tagctaactc 
aacgctatgc 
tcggacagtt 
attatacgct 
tagtgattag 
ggggcatcag 
tggtaaatga 
tagagaacgt 
agggtggcat 
tctccaataa 
atgtatatca 
ccggaacaga 
ggtggtatat 



gggacgtact 
taataccgtg 
cggaggagat 
tgttacagta 
tcccggtgaa 
agccaatatg 
ttatatgact 
acggatggcc 
tgtcagcgga 
actggatgtg 
gggtggtgtt 
cgaagatcct 
tgagtatttg 
ggaaaatacg 
cggaacttac 
tttctactat 
cagggggcag 
actggcctgg 
tcatttctac 
catggtacct 
gcttagtatc 
tccggatgac 
ccgtaatcag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1401 



<210> 291 
<211> 1395 
<212> DNA 
<213> B.fragilis 



<400> 291 

ggaaatgttg 

aaaatgggaa 

ctattgaaaa 

gtgcttagag 

aataatggta 

ccggtagtac 

ggagcatctg 

aatgccgctt 

gtttctgcca 

aaagtagcaa 

atgctcgcac 

gatatgggtg 

tcttttactg 

ctgtttcttg 

attcagcaga 

ctaatctgcg 



ctaactttgt 
ctatcatcat 
atcatttttc 
aagaaaaccc 
acgaaggtct 
tattcacagc 
attttgtagt 
cacaagcaaa 
tgtattgggg 
cgaccaatgc 
gtgagattca 
ctatcaccga 
acgcccatgc 
acgaaatagg 
gaagcatagt 
caactaaccg 



tgccgacaaa 
tgttgacgat 
gaaagtcatt 
ggaagttgtc 
gttttggtta 
ttatgctgat 
gaaaccatgg 
agacggaaag 
tgaaagcagt 
gaatatactg 
tgctttatca 
atccttgttt 
cgaccgtaca 
aaaccttccc 
tcgtgtaggt 
aaacttgcag 



ttattattcg 
aataaaggag 
accttatcct 
ttattggaca 
catgaaatca 
attgatcttg 
gataatcaaa 
aaaaagaatc 
gctatgcagc 
ataacaggtg 
ccacgctctg 
gagagtgaac 
ggtaagtttg 
tttcatttgc 
agtaaccaat 
gagatggtag 



tatttaatat 


tctattgacc 


60 


tgctgacagc 


cgtacaatta 


120 


ctcctgtcag 


tctgtccaca 


180 


tgaatttcac 


ttccggaatc 


240 


aacggcaata 


cagagacctt 


300 


ccgtacgggg 


gataaaagaa 


360 


agctgttgga 


aactctttta 


420 


gcaaaaaaga 


atcatctccg 


480 


agctccgcac 


gttgattgag 


540 


aaaacggaac 


gggaaaggaa 


600 


ccgagagcat 


gatatccgtt 


660 


tgttcggaca 


tgtgaaaggt 


720 


aagcagcaga 


ccgaagttcc 


780 


aggctaagct 


actgacagcc 


840 


ctatccccgt 


agacattcgc 


900 


acaaaggctt 


attccgtgaa 


960 
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gatttattat 
gaagatattg 
gcctctatca 
aatatccgcg 
atccccgctg 
actcttgaag 
ctttcggctg 
aagtttggtt 



accgtatcaa 
ttccgctagc 
gcctgagtcc 
aattggaaca 
aaatgttcca 
atatggagaa 
tagccgctca 
tatga 



caccattcat 
tgagcgtttt 
ggccgcttgc 
ttccattgag 
attagtgcag 
agccatgatt 
attaggcatt 



gtagaaattc 
atagcccgct 
gagaaactga 
aaagcagtca 
aaaacggaga 
cgcaaggctc 
acccgccaaa 



ctccacttcg 
tttgcaaaca 
ccgcacatgc 
ttattagcga 
acccggaaac 
tcgacaaatg 
cattatataa 



caaacgtaaa 
gtatgacaaa 
ctggtatggc 
tggtgaaacc 
agagacctct 
tggaggcaac 
taagatgaaa 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1395 



<210> 292 
<211> 1230 
<212> DNA 
<213> B.fragilis 



<400> 292 

aaatgttatt 

tctcctttac 

gtacttgtcc 

ggcacacttc 

tttgcattta 

tcgggaatgg 

ggtatcaggt 

atagggcttt 

ttgctttcgt 

atacctattt 

gatatgtcgg 

ttttcgtggg 

attggggatg 

ttgcatcgca 

cctcctgtaa 

cctcaagtaa 

agcgctttat 

ggtacgatcg 

acacgtaata 

gcatggggcg 

aatctggtac 



tttgcagctt 
gcaaaggagt 
ccttattggt 
tttttcatct 
ttgctcctat 
taggcgttgc 
tgatcgagcg 
cacttgccgg 
tgtttactgc 
tttgtggaat 
gagtcagaaa 
aacctatttt 
tgtatgtggt 
cactattagg 
ctacctattc 
taagaatagc 
tgaagtctat 
cttgcgcggg 
tcattattgt 
aattctcact 
tgccaaaaga 



aaataatatc 
agtaggagta 
cgggctcgat 
ggtaacaaaa 
tattaaagca 
aatggtctac 
cctgtttcca 
gactggagtg 
cgtgattgta 
tattgtggga 
cgctgcgtgg 
gtttatgatg 
aaacactgtg 
tgatggcttg 
ggaagttaca 
ggcgattacg 
tccttcggct 
aattgctaac 
ctcactgact 
gtcgggaatc 
agagagatga 



attctattct 
cagttccttt 
ccttctacag 
gggaaggt cc 
accgaactgt 
tttgttatga 
ccggtagtta 
aatatggcaa 
tctattcggg 
tatattgctg 
ttgggttttc 
ccggtggcta 
acgggaaaag 
gcatgtcttt 
ggagccatgt 
gccattctgt 
gtattaggag 
cttgtcaata 
ctgactattg 
ggtcttgccg 



atatggattc 
ttgtggcttt 
ctttgtttac 
ctattttttt 
atggacttgc 
gtgctttagt 
ttggtccggt 
aggaaaactg 
cgaagggact 
cgttgatctt 
cacagtttgt 
ttgctccggt 
actatgtaaa 
gtgccggttt 
cgcttactaa 
tttccgtaat 
gaatcatgtt 
attgtattga 
gcatcggtgg 
cattggtagg 



aaatcatctt 
tggagctact 
tgccggtatc 
aggtagtagt 
tggcacactt 
taaatggcag 
aattatattg 
gacattggcg 
attgaagtta 
ttatgatgtt 
gtttccacag 
gatagaacac 
agatcccggg 
attgggagga 
agtgacgaat 
cggtaaagtc 
actcttattc 
cttgagccgg 
tgccgtattg 
agtaggcttg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1230 



<210> 293 
<211> 933 
<212> DNA 
<213> B.fragilis 



<400> 293 

acaccttacg 

acccggctgc 

cgtgataccg 

taccgtgaag 

attttctatt 

gtggcccgtg 

tcgctgacag 

actaatgtga 

ggtaatcgtc 

attaaggcta 

ttggaagcat 

aaggggcaga 

aagcagattg 

aaggtagaag 

gtgctggatg 

acggcggcaa 



ccaatatggg 
attgcaatat 
gccgacatgc 
tgtctttgat 
tacactcccg 
agatgaacga 
cattgcctat 
tctctattac 
cggctattaa 
tgaagaaagt 
tctccaaatt 
aaaacgcccg 
ccattctcta 
attttgaagc 
tattgaaaac 
tggttgccaa 



gcgatggatt 
tttgctccgt 
actggttgtt 
tctccgtcgc 
tttgctggag 
tttgcccgaa 
tattgaaact 
agacggtcag 
tgtaggtata 
ggccggtaca 
tagtggagat 
tttgctggtt 
ttgcggtatc 
agcgttcctc 
cggagtgatc 
acagtatagt 



atactattgt 
ttggcggggg 
tatgatgatt 
ccctcgggac 
cgtgcagcca 
agcctgaaag 
caggccggag 
atattccttg 
tcggtttccc 
ttgaaaatcg 
atggatccgg 
cagccccaat 
cacggattat 
aatacactcg 
aatgacgagg 
taa 



ggtggcggct 
ctgccatcgg 
tgtcgaaaca 
gtgaagccta 
agattattaa 
gtaaagtgaa 
acgtttctgc 
atacggattt 
gtgtgggagg 
atcaggcaca 
ttaccgcact 
actctccaat 
tgcgaaatgt 
ctctcgatca 
taacgaaggc 



acagctggag 
tgagtatttt 
agcagtatct 
tccgggcgat 
tcaggaagaa 
aggtggaggt 
ctatattccg 
attcaatcaa 
taatgcgcag 
atatcgcgaa 
gaccattgac 
gccggtagag 
tccgttggat 
tcaggcggat 
cattgaagaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

933 



120 



<210> 294 
<211> 879 
<212> DNA 
<213> B.fragilis 



<400> 294 

aagattatgg cttcactaaa agaagtaaaa accagaataa attcggtaca aagtacccga 60 

aaaatcactt cagcaatgaa gatggtggct tctgccaagt tacacaaggc gcagggagcc 12 0 

attgagaata tgttgcctta tcagaggaag ttgaataaga ttctgactaa ctttctgagt 180 

gctgatcttc cggtagagtc tccgttctgt gtggaacgtc ccgttaagcg ggtcgctatt 240 

gtggcttttt cttccaacag ttctttatgc ggtgctttca atgcgaatgt actgaaaatg 3 00 

tttttgcaga cggtgggaga atatcgcgag ttgggacaag ataatatcct gatctatccg 3 60 

gtgggcaaaa aaatagagga ggctgtcaag aagttaggat tctttcctca aggcagttat 42 0 

cagaagttgg cagataaacc gtcgtatgat gaagccgctg cattggctaa attgttgatg 480 

gaactttttc tggaaaaaaa tatcgaccgt gtggagttga tttatcacca tttcaagtca 540 

atgggggtac aagaactgtt gcgtgaaaga tatttgccga ttgacttgtc tgccgttcaa 600 

aatgacgaag agagaggcgg agtagtgaat gactatatca tagaaccttc tgcagctcaa 660 

ttgatagcag acttgattcc gcaggtgttg agtcagaaga tatttacagc tgctctcgat 720 

tctaatgcat ccgaacatgc tgcacgtact ttggctatgc agatagcgac ggacaatgcc 780 

aacgaactga ttcaggagtt gacaaagcag tataataaaa cccgccagca ggccattaca 840 

aatgaattgc tcgatattgt aggtggcagt atggcatag 879 



<210> 295 
<211> 858 
<212> DNA 
<213> B.fragilis 



<400> 295 

agaaaactct atctttgcat tgagttttca tgcactaaaa taaaaattat gagacaaata 60 

aaaggaatta ccgcaatctt tctttgttgt ctgctagttg ccggatgtga cttgatagat 12 0 

tatcatccat atgacgtcga cataaaagga gaaagagaca ttaatgcgaa aaatattcaa 180 

aagatcgagg ccaaatgcct gggaaagtct actatacgct ttatcgccat gggtgactcg 2 40 

caacgctggt atgacgaaac cgttgacttt gtaaacgctg tcaacaaaag agacgacatc 3 00 

gactttgtag ttcatggagg cgacttcagt gacttcggac ttaccgatga atttctttgg 360 

caaagggata taatgaataa actaaaggtt ccttatgtag gacttatcgg aaaccatgat 42 0 

tgtttgggaa ccggagaaga tgcattccgg caaatattcg gcgatacaaa cttttcgttc 480 

atagccggag gtgtgaaatt tgtatgcctc aataccaacg caatggaata tgattattcg 540 

gaaccgatcc ctgattttga ctatattgaa agacaactca cagaacgtgc cgacgaattt 600 

aataaaaccg tattctgtat gcatgcccgt cccctttgtg atcagttcaa taacaatgtg 660 

gccaaagtgt ttcaaatgta tgttcgccaa tttcccggtt tgcaattttg cactgtagct 720 

cacgaacatc ggatcagtgc gtcagatgtg tttgacgatg gcgtgatgta ttatggaagc 780 

aattgtatga aaaatcgcag ttatttagta ttcacgataa aacctgatgg ttatgattat 840 

gaagtggttg aattttaa 858 



<210> 296 
<211> 981 
<212> DNA 
<213> B.fragilis 



<400> 296 

tcaataaccc aaacggcagt tatgaaaaat tatatcgtta acgaactcat tgcagcaatg 60 

aaagaacgga ttccccgtgg aataaatctg gccaactacc tgacagatgc cctatgtatg 120 

ggaaaagagg ctgtataccg aagattacga ggcgaagtgg ctttcacctt tgacgaaatt 180 

gccatgattt catgcaaact gggaatatca attgatcaga ttattggaaa tcaccagtcg 240 

aaccgtgtga ctttcgattt aaacctgctt cactcacccg atcctctgga aagttattat 3 00 

gagattatag aacgctatct gcgcatattc aactacgtaa aagatgatat cagcacgaag 3 60 

atatataccg cttcgaacgt aattcctttc accctctatt cttcgtacga atacttatca 420 

aagtttcgcc tgtgcagatg gatttatcaa aatggaaaaa tacgtacccc aaacagctta 480 



121 



tcgggaatgc 
aaagcgtgca 
gagatgaagt 
aacgaactgg 
aacggaaaca 
atagaaaaga 
tctcaaagcc 
tccacactga 
agtttcatcg 



acataccgga 
gaaagacctg 
attttgccgg 
agctgttgct 
aagtagccat 
aagatttcca 
cacgaatttg 
tttcagaaag 
acaccctgta 



caaagcggtc 
ttttatatgg 
cctcaatctg 
gcatgaactg 
ttacttatcc 
aatcagtctc 
cggcatacaa 
cggagagtcc 
a 



catgcccata 
gacagcaatg 
atttcggaaa 
gaacagatat 
aatatcgatt 
ctccgggtat 
aaagactgga 
caaagaatta 



aactgttgag 
tcttctactc 
cagacctgat 
ccgcaaaagg 
ttgaagcaac 
attctattaa 
tacaatcatt 
ctttcctgga 



tgaggctgtc 
gtttgtaaaa 
acatttaaaa 
tgaattcagt 
ctacagctat 
ctcaatggac 
gaaaagacac 
acagcagaag 



540 
600 
660 
720 
780 
840 
900 
960 
981 



<210> 297 
<211> 987 
<212> DNA 
<213> B.fragilis 



<400> 297 

acagagaaac 

gaagccgcaa 

ttatatatag 

gcagaagctg 

agtttcagca 

acctatcacg 

actacagaaa 

gtactttcaa 

cattttgatg 

atgacacagc 

gtgagagaca 

ataaaagacg 

tatgaaaccg 

agctatgtgg 

attacaacgc 

aaattctcaa 

cagcgtgaaa 



agtacaaact 
aagaaaagat 
gtaaggaggc 
ccgtcatctc 
acaatgcggt 
atattctcac 
tggcaacctc 
agtttcgtct 
aactggagat 
agatgaagac 
tacagttctt 
atttattgct 
gtaacgatgt 
caaccagcaa 
aggatgacgg 
ctcaaatatc 
taataaatac 



gaaaattatg 
gccgaccgga 
catctatcgc 
gagaaaattg 
attcgacctg 
gaaatatgtc 
ttcaaacata 
gtttaaatgg 
tccccacaaa 
gactgattat 
ttcggaaatc 
tctgacggat 
acgtatctat 
cagccatatc 
catgttccgc 
cgaaagcgga 
cttataa 



ataacaaacg 
accaacctgg 
cgactgcggg 
ggaatatcgc 
aacgtcgtac 
aatgcattcg 
ttgcctcaag 
atgtaccaga 
atatataaca 
atctgggaca 
cacctggttt 
gaattggaag 
atctcaaata 
agtatgatac 
agcctgaaag 
gagatgcaac 



aattaaatat 
caaacactct 
gagaagtacc 
tcgacaaaat 
accataccaa 
ataacatccg 
cattatatct 
atgaaaatat 
tccagaaaga 
ataccgtatt 
cggaagaaga 
agttggccgg 
tcaagttcga 
gcatatactc 
agtgggtaca 
gtatacgttt 



aggcttaata 
aatggacatt 
gtttactttg 
gatcggtgtg 
tacattcgaa 
ggaagatccc 
caaacatgat 
caaatgcaag 
ctttgtcaat 
cgaacatgta 
caaagagttg 
aaaaggtaag 
tgccacctat 
catcaatgcc 
atcactcaaa 
ctttaatgaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

987 



<210> 298 
<211> 1392 
<212> DNA 
<213> B.fragilis 



<400> 298 

caattcttcc 

cttataattg 

acaaggacta 

gaagcccaac 

tgtaatttat 

caggccggac 

aatggaaagt 

atacgccttg 

gccggttatc 

acagaagttt 

gagcatttgc 

atggcgagac 

aactatttgt 

acagagcctg 

gccgatttac 

actgcttcgc 

aaaggtagtt 

atgtcggtgc 

ctgaaggcgg 



ttgtcccgtt 
ctttcaaacg 
tgatcagaaa 
agccgtccgt 
ccctattggc 
tgttcgataa 
actttgactt 
ccggacaacg 
agttcgaaga 
tctatctctc 
tgacaggtat 
tcgagtctat 
cccggagagg 
taatagatga 
aagagagggt 
aagccgatct 
atgatcgcca 
ctatctttaa 
atcgggagca 



ttttctgctt 
gtttgcttca 
gttttttatt 
cggcttgact 
cgagcgctac 
tccggtaatt 
cggcaaaaaa 
gaataagcag 
agtgatgagg 
aaagtcattg 
aaaagagcag 
gctgctttct 
agaactgaat 
aggagatctt 
acacgggaga 
gaaattgcag 
aggtaatttt 
ccgtaaccag 
agagtattcc 



tctaataact 
tgggggagtt 
ctcttttttc 
ctgaaggagg 
aatgtagaca 
tcattcgaac 
ggcgagtcgg 
atacggctgg 
actttacgcc 
tctgtttatg 
catgcaaagg 
ctgaagaaag 
ctgttactga 
cgacaattaa 
cctgaccaga 
aaggcgttgg 
attaataact 
gggaatatta 
cgaaataaag 



ttctaatctt 
gtacttttgc 
ttggcttttt 
cagagcaacg 
tcgcacaagc 
aaaatgtgta 
tagtcgaaat 
aaaagataaa 
aggaacttgg 
ataaagaaat 
gaaatatctc 
acaagaacga 
atctgcctgc 
acatggaccg 
agttggcacg 
cttttccgga 
actttgcaat 
aaatggcccg 
ccgaggccga 



atctgagtat 
ttccaaaata 
cggatttgcc 
tttcctgaaa 
caggttgctt 
taaccgattg 
agagcaggtg 
caaggaaatt 
cgaggcattt 
caattcgctg 
tttaatggaa 
atgcgaaagc 
cgactttcgg 
gttgtcttat 
cagctgtgtc 
attcgcagta 
cggattcagt 
tttcaatctt 
actatacgca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 



122 



gcttatactg ctttagagaa agcatgtcag ttgtatcagt cgactgatat gggactggaa 12 00 

cagaattttg agaaactgat agccggagcc aacgagaact ttatcaaacg taacatcagt 12 60 

cttttagaat tcatcgactt ttatgatagc tacaaagaga cttgcatccg gctttacgaa 132 0 

atcaagaaaa acgtactgct cggtatagag aacctgaatg cggtggccgg acaacctatt 13 8 0 

tttaactact aa 13 92 



<210> 299 
<211> 678 
<212> DNA 
<213> B.fragilis 



<400> 299 

caaacacgta 

gatggtgtaa 

aagtatgtaa 

tatgataaat 

cgctttgaaa 

ctgcgtcggc 

aatgtttatc 

cgt tttaagc 

tccgattcaa 

tcatccggag 

aaggcagact 

acttcacgtt 



aaagaatgtt 
ttgtcgatac 
atgatgcaaa 
actttgcagg 
taaaaatgaa 
atggtgtgaa 
atgcccatcc 
gttctaagcc 
aagattcgta 
ccattgttgt 
atgtaataga 
atatctga 



tatggatgca 
agaagggcag 
ctttggttcc 
agaaccggaa 
ttatgactat 
aatagctttg 
cgagtttaaa 
tgatcctgaa 
tgtgttcgaa 
cggattggca 
cgatttcaga 



acaaagaaaa 
tataccgttt 
aaggttaagg 
aaacagcggg 
gttcccggaa 
gttaccagtt 
tccctttttg 
tgtttcttgt 
gattcatttc 
actacgaatt 
gggatgactt 



taacggcatt 


attcgattgt 


60 


tctggaatga 


aatgggccaa 


120 


gccagacact 


ggtacagatt 


180 


atataaccga 


ggcattgaac 


240 


tagttgagtt 


tatagcagat 


300 


ccaatacggc 


aaaaatggag 


360 


atgaaatatt 


gactgcagag 


420 


tgggaatgac 


aattttcggt 


480 


atggtttgca 


ggccggtaga 


540 


cacgcgaagc 


cattgccgac 


600 


acgaaaaact 


gctgactata 


660 






678 



<210> 300 
<211> 687 
<212> DNA 
<213> B.fragilis 



<400> 300 

tatcatttca atcttaaaat tatgacctac ctcgctacca accccctatt ccatggaatc 60 

tctccagaaa cgctttcccg tgattttgac ggaatcgtat ctcacctccg catgttccgt 120 

aaaggagaca ttcttgccag gcaaggtgat gtatgcaatc ggctgatgat attactgaaa 180 

ggcagtgtcc ggggagaaat gatcgattac tcgggcagat tgattaaagt ggaagatatt 240 

attgctcctc gtgcaattgc ccctcttttc ttatttggtg cagacaatcg ctatccggta 300 

gaagttacag caaacgaagc taccgaagtt ttcgaaattc cgaaagaaag cgtactgaaa 3 60 

ttatttcgac ggaatgagaa attcttagag aactacatga atctttctgc caattatgcc 420 

cgaacacttg ctgacaaact gttttttatg tcctttaaga cgattcggca gaaacttgct 480 

tcctatctgc tacggatgtt gaaacaacaa ggagacagtc cgatacaact tgaccgctcg 540 

caacaggaac tggctgatta tttcggagta tctcgtccct ctctggcacg cgagctggct 600 

catatgcagg atgacggcct gatcaaaacg gacaggaaat tagtgcatat cttgagaaaa 660 

gaagatatga tgcaactgat acaataa 687 



<210> 301 
<211> 213 
<212> DNA 
<213> B.fragilis 



<220> 

<221> unsure 

<222> (16) , (34) , (78) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 301 

tccctgcaca atgcanagca tcaacagatg tgtngcgttc tgcaccaacg tataacccgg 60 

ctggttgctg caaccaanac ccccaaacga aagcatacat atatatttcc gcttgtaaac 12 0 

atcgaccaaa taggcatgga acgggccgat aaaaaacatt gcaagcgtaa aaaatataaa 180 
tatgactccc gtctgactga caggcacgcc taa 213 



123 



<210> 302 
<211> 396 
<212> DNA 
<213> B.fragilis 



<400> 302 

aagaagcaca acatgccaga aaagcatatt tatgaatacg ccgttgttcg gatagttccg 60 

aaagtggagc gtgaggaatt tatcaacgtt ggggttatct tgttttctaa acaggctgcc 12 0 

tttatccgga tgcgttatga aattaataag aagaggttgg aggccttatc accggaacct 180 

gatatcgatt ctttccggaa atatttggag gctttcagta aagtgtgtgc aggctgtccg 240 

acgggaggag gcattgctaa actggaagtt ccggaacgtt ttcgttggtt gacagcccat 3 00 

cgtagttcct gcattcagac ctcaagacct catgttggct attctgacaa tttagaggaa 3 60 

acattggagc ggttgttcga ggaattggtt ctttga 3 96 



<210> 303 
<211> 207 
<212> DNA 
<213> B.fragilis 



<400> 303 

tttcgtggca aatatacagt ttttgtacga gaaaacggac acttcagcac agaaagagag 60 

tatgccgata ctttcttttc gattagaatt ttgcataaaa atgtgagatt cattcgtgaa 12 0 

atagagaaaa aagataaaaa tagatcattc ttattaggta atatgagcta ttttgttacc 180 
tttgcgcccc cttattccat agcgtag 207 

<210> 304 
<211> 279 
<212> DNA 
<213> B.fragilis 



<400> 304 

accattttaa aactgaaaat tatgttacta tcagtattat tgcaagctgc tgctgcagga 60 

gtaggattaa gtaaattggg agcagctctc ggagctggtt tagctgttat cggagcaggt 12 0 

atcggtattg gtaagatcgg tggctcggcc atggaaggta ttgcgcgtca accggaggca 180 

tcgggagata tccgtatgaa tatgattatt gccgctgcct tggttgaagg tgtagcgttg 240 

ttggcattag ttgtttgtct attggtactt ttcttataa 279 



<210> 305 
<211> 1140 
<212> DNA 
<213> B.fragilis 



<400> 305 

agctacataa agatgaatat ggaaataaat ccctcggaat ataaaattct cattgtagac 60 

gatgttatgt ccaatgtcct tttattgaag gtattgctga ccaatgagaa gtttaacata 12 0 

gtgacagcta gcaatgggaa tcaggcattg gaccaagtaa agaaagagaa tcccgacctg 180 

atattgctag atgtgatgat gccggatatg agtggttttg aagtttctca aaagttgaag 240 

gcggatcccg aagcggccca tattccgatc atctttttga ccgcattgaa tagtactgcc 3 00 

gatatagtca aaggatttca ggtaggcggc aacgatttta tctctaaacc ttttaataaa 360 

gaagaactga ttattcgggt cagtcatcag atttctttag tagcggccaa acgtattatt 42 0 

gaagccaaaa cggaggaact taaaaagacg attatcgggc gtgataagct ttattctgtg 480 

attgcccatg acctccgttc gcctatggga tctattaaga tggtgcttaa tatgctgatt 540 

cttagtttgc ccaaagaaaa aatcggcgaa gatatgtatg aactgctgac tatggccaat 600 

cagactaccg aagatgtgtt ttcgttgttg gataacttac tgaaatggac aaaaagccag 660 

ataggtaagc ttaaagtcgt atatcaggat atcgacatgg tggaagttgt agagggagta 72 0 

ggagaaatct tcgcaatggt tgccggcctg aagaatattc gtttgcgaat tgaatcgccg 780 

gaatgtcagg cggtacatgc cgatatcgat atgataaaga cggtgatacg caatttaata 840 

agcaatgcca ttaaattcag taatgaaggt tccgaagttc ttataaaagt tgaagagtcg 900 



124 



gatggaatgt cggtagttag tgttaaagat agcggatgcg gtattgacga agaaagccag 960 
aaaaaactgt tgcataccga tacacatttc agtacattcg gtactaataa tgaagaaggt 102 0 
tcgggactcg ggttactttt gtgccaggat tttgttgtga aaaatggagg aaagttgtgg 1080 
tttacttctg ttaaagatga aggttcaact ttctatttct cgattccact gaaaaaataa 1140 

<210> 306 
<211> 1599 
<212> DNA 
<213> B.fragilis 

<400> 306 

atttgcacaa aattcagaga ttacatgaaa actgtcaaaa cacatatcac ccaattgttg 60 

catgccatga acaagggaat tttcgaaaaa gaacatccca tcgcattatc actactctcc 120 

gcaatttcag gagaaagtat tttcttcctc gggcctccgg gagttgccaa gagccttatt 180 

gcaaggcgct tgaaactggc tttcgaccaa agtactgctt ttgaatatct gatgtctcgt 240 

tttagcaccc ctgatgaaat attcggtccg gtatctatct ctaaattgaa agatgaagac 300 

aaatacgaac gcattatcga aggttatctc ccctcggcaa caatcgtttt tttagatgaa 3 60 

atatggaaag caggaccaag catacaaaac tcactgctaa cagttatcaa cgaaaaggta 42 0 

taccgcaatg gacaatatac catacagtta ccattaaaag gattaattgc agcttccaat 480 

gaattgccgg ctcagggaga agggttagaa gccctgtggg atcgtttctt aatacgctat 540 

tttataggga atatcgaaca ggaattcgct ttcgatcaaa tgatagcctc tgtcaatgac 600 

atggaagcag aaattcccac cggactttct attacagaag aacaatatac agattggaga 660 

actcaaatca gccaaatcaa gatacattat actgtttttg aattaattca ttccatcaag 720 

cggcaaattg aaaaatataa catacagaaa gaagaggttc cacactcaac gctctatatt 780 

tccgaccgtc gatggaagaa aatcgtatcg ctgctcagaa cttctgcttt tctgaatgaa 840 

acagatacca tccgcttttc agattgtact ttattacttc attgcttatg gaatgaaata 900 

gaacaaatac caattatcga acaaatggtg tcatcggcac ttgacgaatg tatcagccat 960 

tatctttgtg gcgaacggac tttagaacaa aagctgagca gtattcggga agacatgaag 1020 

tcagaacaca gtttgcgtga aacaaaagat ccggccctgc aaattgtaga cactttctat 1080 

catcaaattg aaagatatcc tgtagcaggc aatctgttga tttttgcttc cgactaccaa 1140 

agtttacaaa aagatactca aaaattattt tacattcaaa gagataaata tcgtcctgtt 1200 

aactggatat taaaagtata tgaccatgtc cgcaaccgga acatatccca atcagccata 1260 

gtttcactca agaaaggcac acgttctgta ttcatcaaca atcaagagta tccactagct 1320 

tgcaacgcag gatacgacat agcttaccct caagaagcct ctttaccctt tgaatttcgt 1380 

tttcaggaag tgatcgattt atatcacaac agggagaacg aattaaagcg catgaccgac 144 0 

attgaactga cctattgtaa agaacatcta ttcatggatg acaaacaacg taatatggtc 1500 

aaacaaatat taaacagaca aaaagaaatg ttggaaattt accaaaatga aatcagagaa 15 60 

atagcttata cgcatggatt ggaaaataag gaatattag 1599 

<210> 307 
<211> 2991 
<212> DNA 
<213> B.fragilis 

<400> 307 

agcaaacaaa aaatacaaca aacaggtaaa aggatacatg cagctcctct cccgaatggg 60 

atacaaaaac atcacaggtt atctgtggta tgtagaagaa gaaataatcg aaaaagtatg 12 0 

aaaacatttc tccaattggt cgcgcaggac ctttactgta aaataggaaa tgatctgtca 180 

cgcacagcta tcatattccc taacaaacgt gccagtctgt tctttaatga acatttggca 240 

aatcagtccg atcaacctct ttggtcaccg gcatatttaa gtatcagcga attatttcag 3 00 

catttatcgg tgttaaaact aggagatccg attcggttag tatgcgagct ttataaaata 3 60 

ttccgtgaag aaacaaatag tgacgaatca ctggacgact tctacttctg gggtgagtta 42 0 

ttaatcagtg attttgatga tgtagacaaa aatctggttc atgcagataa gttatttact 480 

aatctgcagg acttaaagaa tgtcatggat gattatgaat ttctcgacca agagcaagaa 540 

caggctatcc agcagttttt ccaaaatttt tctatcgaaa agagaacact gttaaaagaa 600 

aagtttattt ctctttggga taaattgggt gatatttatc gccgatatca taaaaagctg 660 

gaagaattag gatttgccta cgaaggtatg ctctaccgaa acgttattga acaacttgaa 72 0 

ccggattcat tgaaatatga ctgctatgta tttgtcggct tcaacgtact gaacaaggtg 780 

gaaactcact tcttccaaca attgcagaat gcgggtaagg ctcttttcta ttgggattat 840 



125 



gacgtgtttt acactcagct tccttcccgg caaaaacaac gccatgaggc cggagaattt 900 
atcaaccgca atctcaaact cttccccaat gagcttcctg ccgaattgtt caatgaattg 960 
ataaaaccca aaaaagttcg ttttatctct tccccaaccg agaacgcaca agcccgttat 1020 
ctgcctcaat gggtacatga gaacctgagt aacgaagaaa aagaaaatgc cgttgtgctg 1080 
tgcaatgaag ctttacttct ccctgttttg cactctatac cggaggtagt aaggaatgtc 1140 
aatatcacca tgggatttcc gttagcacag actccggtat atagttttat caacgccata 1200 
ctcgaactgc aaaccagcgg ataccggaca gactcgggac gatatatcta tgatgccgta 12 60 
cagacggtat tgaaacatcc ctatacccgc cgcctctcag ataaagctga gccgctgcaa 1320 
cgggaactta caaagaccaa ccgtttttat ccttttccat ccgaattaaa gaaagacaaa 1380 
tttctggaca tattgtttac gccccgcaat ggtatccgtg aactctgtgt ctacatcacg 1440 
gaactgctga aagaagtatc tgttttatat cgtcaggaac aggaaagcga tgacattttc 1500 
aatcagctgt accgtgaatc tctcttcaaa agtttcacat tagtcaacag gttgctcaat 1560 

ctgatagaca acaacgaatt gcaggtacgc atagaaactc tgaaacgttt attaaataaa 162 0 

atactgaatg cagccaacat tccttttcat ggtgaaccgg ctataggaat gcagatcatg 1680 

ggagtattgg aaacacgtaa cctcgacttc cgtaacttgc tccttctatc gcttaatgaa 1740 

ggccaattgc ccaaatcggg aggtgagtca tcattcatcc cttataattt acgcaaagct 1800 

ttcggcatga ctactattga acataagaat gctgtttatg cctattattt ctatcgcctg 1860 

attcagcgtg cagaaaatat aaccttaatg tataacacct catcagatgg gttaaacaga 1920 

ggagaatggt cacgtttcat gctacagttt ctgattgaat ggccacatga aatcagccgt 1980 

gaatatcttg aagccggaca atcgccacaa aacagcaagg aaatccgcat tacaaaaacc 2040 

ccggaaatta ttgaccgttt ataccggact tatgacttct cacgcaaccc ggatgcccta 2100 

atactttcgc cttcagcatt aaacacctat cttgattgtc ggctgaagtt ttatttccgt 2160 

tatgtggcac gccttaaagc tcccgatgaa gtcagtgctg aaattgactc cgccctgttc 222 0 

ggaaccatct tccaccgttc tgcccaattg gtttatttag atctgacagc caacaaacga 22 80 

gatgtccata aagaagatct tgaacgccta ttgcgcgata atatccgtct tcaaaactac 2340 

gtggatatag cttttaaaga aatatttttt catgttccta tcgacgagaa gcccgagtat 2400 

aacggaatcc agctaataaa ctcaaaggtt atcacttcgt atctccgcca actgctgcgt 2460 

aatgacctgc aatatgcccc tttccgaatg atgggtatgg aacaggaagt agtggaggat 2 52 0 

atccggatag aagggcctgt gggaaagtta tcactaagaa tcggaggcac catcgaccgt 2580 

atggatagca aagaaggtac actccgaatt gtggactata agaccggagg cagccccaaa 2640 

gtaccgacaa atatagaaca attattcaca cctgccgaag gacgccccaa ctacatcttc 2700 

caaactttcc tatacgccgc cattatggca cggcaacagg cactaaaagt agctccctcg 2760 

ctactctata ttcatcgggc agcttccgag agttactctc ctgtaattga aataggagaa 2 82 0 

gctcgcaagc ctaaactgcc ggtcgatgat ttttcggttt atgaagatga attccgtgag 2880 

cgtctcctga aattgcttga agagatatac gatgacaaag aggaattcac tcaaactgag 2 940 

gatacaaaga aatgtgaata ttgcgatttt aaagcaatgt gcaaacgata a 2991 

<210> 308 
<211> 183 
<212> DNA 
<213> B.fragilis 

<400> 308 

tacagacaaa agaaaggccc ggtttcagcg aaaccgagcc tttctttcat ggaattgaaa 60 

cattactcag acttgatgct gttcaccgga ttttcattag cgatatgcca ggatctccct 12 0 

ataatgaaaa caaagataaa gaccaataaa caaatagctg tcactacata ccaacctgca 180 

taa 183 

<210> 309 
<211> 369 
<212> DNA 
<213> B. fragilis 

<400> 309 

aaaagtaaaa atatgaaagt cattgattta acaaaagaaa gcttcgtaga gaaagtggcc 60 

gaattccaag aatacccgaa taaatgggat tttaaaggtg ataaaccttg cctggtagat 12 0 

tttcatgctc cctggtgtgt atattgcaaa gccctgtcac ctatactcga ccaactggct 180 

gtagaatatg atgggaaaat agatatttat aaagtggatg tagatcagga accggaactg 2 40 

gaggctgctt ttgccattcg tacaatccct aacctgttgc tttgtccgat gggaggaaaa 300 
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ccaagtatga aattaggaac tatgaataaa acccagttaa aagcattgat agaagaagtt 3 60 

ttgttataa 369 

<210> 310 
<211> 1347 
<212> DNA 
<213> B.fragilis 

<400> 310 

gtcgaaaaaa agaagatgaa gaaaatatat gttttggctt tgttgagctg tctgttgatg 60 

ttatcagctt gtgacagtta tcttgatatc agacccgtgg ggagcgtgat tcctcaaacc 12 0 

gctgaggagt atcgtgcttt gctggcacgt gcttatctga atgtgcccaa tgacagaggg 180 

ctggcttgtc ttcgttctga tgaaatgttg gttaatgata atgaatatga ccgaaattcg 240 

tatggagaca ttgaacgttg gaacgatgtg tcaccatttc cgggaaccag ccagtttacc 300 

tggtctaatt tctataatgt actttttatt gctaatcaag taattgagag tcaaaaggag 360 

attacagaag gaactccgga ggtcgtgaat cagttggtgg gtgaagctca tttgcttcgg 42 0 

gcttatttgc attttgtatt agtgaacctg catggacagc catatacgaa gtccggtgct 480 

ttaaattcaa aatcaatacc tttaaaattg gacacggatc ttgaaaaaac gttgggacgt 540 

aatacggtag aagaagttta tacttctatt ttatcggata tagagcatgc ccgtgaatta 600 

ataaataagg aaaagtggga aactgtcttt tcatatcggt tcaatgtttt gtctgtagat 660 

gcgttacagt ctcgtgtcag cttatatatg ggagcatggc cgaagtgttt ggaatcggct 720 

gaagcagtat tggcaaagaa atctgttctt gtcgatatga atgaaactcc tttggctctt 780 

cccaatcatt ttgagtcggt tgaatcgata actgctttgg aacaggttat gggttcttct 840 

gtcaacaatg ctgtgtgggt acctgctact tttctggctc tttatcagga aggagataag 900 

agattggccg cttattttgc tgctccggat gaaaatggga accgaaaaag ttctaaagga 960 

ggaaaaagag agttttcttg tacttttcgt gtaggtgaac tttatcttaa cgcagccgaa 1020 

gctgcagcaa acatggataa actgccacat gcacgtatgc gtcttttaga attaatgcgg 1080 

aagcgttata ctcccgaagc atatgacaag aaagagaatg cagtgaatgt gatggataaa 1140 

aatgccttga ttagtgaaat actgaacgag cgtgctcgtg aattagcttt tgagggacat 1200 

cgttggtttg atcttcgtcg tactacacgt cctcggatgg tgaaagtact tcaaggtaaa 1260 

acttatatat tggaacagga cgatcctcgt tatacaattc ctattccgag agatgctatt 1320 

gctgccaatc cgggattagc taactaa 1347 

<210> 311 
<211> 1683 
<212> DNA 
<213> B.fragilis 

<400> 311 

atgtgcagaa attacaaaat ggaacaagaa caacggttca ttggttatat tgaacaaagc 60 

atcataaaca actgggatgc aaacgccctg acagactaca aaggaatcac ccttcaatat 120 

aaggatgtag cccgcaaaat agccaaattt cacattatat tagaaatggc cggtattcaa 180 

ccgggtgata agatcgctgt ctgtggccgc aatagtgccc attgggctgt aacctttttg 240 

gccaccgtga cttatggtgc tgtaattgtt cccattttac atgaatttaa ggctgacaat 3 00 

atccataata tcgtcaatca ctctgaggca aaactcctct ttgtaggtga tcaggtatgg 3 60 

gaaaacctta atgaagatcg gatgccttta cttgaaggta tatcttcttt gacagacttt 420 

actccacttg tgtcgcgcaa tgacaaactg acatatgcac atgaacaccg taatgagata 480 

tatggacagc gatatcctaa aaattttcgt ccggaacata tctcctaccg taaagatatg 540 

ccggaagagc tggctgttat aaattacaca tcaggaacaa caggttattc caaaggagtg 6 00 

atgttaccct atcgtaggct ttggtcaaac attgcttatt gtcacgagat gcttccggta 660 

aaacctggtg atcacatcgt ttcgatgctt cccatggggc acgtattcgg catggtctac 720 

gattttcttt acggattttc tgcgggtgca cacctctact tcttgacacg tatgccgtct 780 

cccaaaatca ttgcacaatc atttgccgaa atcaaaccga gagtaattgc ttgtgtaccg 840 

ttgattgtag aaaagattat taaaaaagat attctcccca aactggataa taaaataggt 9 00 

aagttgttgc tgagagtacc cattgttaac gataaaatta aagcagctgc ccggcaggca 960 

gcaatggaaa tttttggtgg aaattttgat gaaattatta tcggaggagc tccgttcaat 1020 

gcagaagtgg aagcttttct taaacaaata ggatttccat acaccattgc ctatggtatg 1080 

acagaatgtg gtcccatcat ttgttccagc cgctgggaaa ctctcaaaca ggcttcatgc 1140 

ggtaaagcta ccagccgaat ggaagtgaaa atagattctc ctgatccgga aaatattgca 12 00 
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ggagaaatta 
acttcacaaa 
tcggaaggat 
cagaatattt 
tcgttggttc 
gctttcgcac 
gaactcaatc 
gagtttgaga 
taa 



tttgtaaagg 
tcatcgatgt 
atgtaacagt 
atccggaaga 
tactccagaa 
acggactgtt 
aacaacttcc 
aaacagcgaa 



tacaaactta 
aaacggatgg 
acgaggtcgg 
aatagaaagc 
agataaactt 
gcaaagcgat 
ggcctactgt 
gaagagcatt 



atgttgggat 
ctccacactg 
agcaagaata 
aagtttaata 
gtagcactaa 
attgagaaga 
cagattacta 
aaacgattca 



actacaaaaa 


tacggaggcc 


1260 


gagatttggc 


taccatggat 


1320 


tgttacttac 


ttcaagcggg 


1380 


acatgcctta 


tgtgtccgaa 


1440 


tctacccgga 


ttttgacgat 


1500 


taatggaaac 


caaccgaata 


1560 


aaatcaaaat 


ccacttcgag 


1620 


tgtatcagga 


agcaaaagga 


1680 






1683 



<210> 312 
<211> 252 
<212> DNA 
<213> B.fragilis 

<400> 312 

ggaatcatga aagaactgca tttgaatatt gtatcgccgg aaaaagaggt ctttaatggt 60 

gaagtgaaga gtgttaccct tccgggcacc agtggagtct tttctattct gccgcagcat 12 0 

gcaccgattg tttcttccct gcaagaaggg acagtcagtt acacgacaac ggatggcgaa 180 

gagcatacgc tggatattca cagcggtttt gtggagctaa gcaatggtga agcttccgtt 240 

tgcgtatcct ga 252 



<210> 313 
<211> 567 
<212> DNA 
<213> B.fragilis 



<400> 313 

aaacggaaga 

aagttcttca 

catacaccga 

tgggcgggac 

atgaactcgg 

atcaatgtgt 

ttattgaatg 

cctacgggtg 

aaaacaattc 

attgtcagtt 



atactatgca 
tctttttttg 
tagatgcatt 
gcagtgcgga 
tacagaagaa 
tgtttgaaga 
acgcagatct 
agatttaccg 
aggattgggt 
tcgacgggga 



taagtttata 
tacgacgatt 
tcccgatgtg 
agaggtggag 
gacggacatc 
tcatgtggat 
tccggatggg 
ttatactttg 
gatcgaccgt 
agtattc 



gacaatattg 
gccgtcattg 
acgaatacga 
aagttcatta 
cgttcgacaa 
gattttgttg 
gtgactccgg 
cgaagtgaca 
aacttgcgtg 



tggcattttc 


gttgaaaaat 


60 


ccggtgtggt 


ttcgttcaag 


120 


aagtgaccat 


cattacccaa 


180 


cgattccggt 


ggagatagcc 


240 


ccctattcgg 


actgtcggtc 


300 


cccgtcagca 


ggtatacaat 


360 


aagtacaacc 


tctttacgga 


420 


aacggagtgt 


acgcgaactg 


480 


ccgtatcgga 


agtgacggat 


540 






567 



<210> 314 
<211> 231 
<212> DNA 
<213> B.fragilis 

<400> 314 

agctttagga agtgtaaatc agagatgaaa gttctcaatg cgaacattga ggaaatacat 60 

gtgagagtta aaccaataaa aacctcttat tgtttgatgt tgcaaagtaa ggcattaatt 12 0 

ccggataata caccatatcc gcttcttttt atccttttaa atattcttta ttgtgtaatt 180 

agaccaaaaa ttaacatttc tctttggcta tatgtgtctt atttgttata a 231 

<210> 315 
<211> 747 
<212> DNA 
<213> B.fragilis 

<400> 315 

atgataagga ataaagttat ggaacagagt tttatcgaat attcattagg aaaagatgct 60 

tcttcggctg tcctttgggt ttatccggtt cgcaagccaa gaggtaaagc cattattatg 120 

tgtcccggtg gcgggttcaa tcagatagct tcagatcatg aaggacgtga ttttgctgcg 180 
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tggtttaata 
gttgaagtga 
gagtggggaa 
actgccgcaa 
gttatcagca 
actatctccg 
cctcctacat 
gtctattata 
ggcggacata 
ctacaaaaat 



atcagggcat 
ttcgtgagga 
tccatcaact 
ctctttatac 
tgaccgacag 
aaggtttgaa 
tcatcgtttt 
cagcattatt 
gcttcggatt 
ggttgctgac 



cacatacgcc 
tattcgtgaa 
gggtgttatg 
cggaacagac 
gctgactcat 
agaaacacta 
ggccgaggat 
gaaacatgga 
tcgtgacagt 
cttttaa 



gtactgaatt 
gcgattcgcc 
ggagcttcta 
cggcccgact 
tggccttcac 
tcccttgaac 
gatcaggccg 
gtctctgccg 
ttcatatata 



atcgcatgcc 
tgatccgccg 
tcggaggcta 
ttcaagtatt 
gcgaacgtat 
ttcacgtcac 
tatctcctct 
gactgcatat 
aggagttatg 



taacggtgat 
tcagtcggca 
tatcgctgct 
gctttatccc 
gttgggagaa 
agccgatact 
caacagtatt 
ttatccggaa 
gactgatgaa 



240 
300 
360 
420 
480 
540 
600 
660 
720 
747 



<210> 316 
<211> 204 
<212> DNA 
<213> B.fragilis 

<400> 316 

agggagtctg ataaaataaa agatttattc atcttccgat tacctatttt gaaaatagct 60 

tcaaaagata tacttgagca aaccgcactt tccacctcct tcggaaactc ctcagcaact 120 

gtcgatgcta tcggtattaa tgtatatgat cctgcatttc ctttgatcac tccatcctta 180 

atccattccg tctttacaat gtaa 204 

<210> 317 
<211> 765 
<212> DNA 
<213> B.fragilis 



<400> 317 

gtattgagag 

tatgctatct 

gataaagaag 

ttaggtttta 

attggttcac 

tgggaattaa 

tataatttca 

actccggaga 

gagtattatt 

aaattaaatt 

actaaatctt 

tttggttcaa 

cctatcccta 



aagctattcc 
tcgcgataaa 
ggaaaaaagt 
cggttaattc 
aagatactcc 
cagccaattt 
ttaattttga 
ataccgatgg 
ggtatgatca 
atttccggtt 
taggaatggg 
gttataagaa 
aatcaattac 



tgcagcgcta 
aacagccggt 
aactttgaaa 
ggatgtaact 
ttatacgggt 
atcatttaat 
tagagggcag 
gcgtctgccg 
gaagagcgaa 
gcaaaatttg 
atcggcttct 
ttttcttgat 
gtttagcttg 



tctattccgg 
ctggatgaag 
gaactgtatc 
ccggcagaag 
ggcctgatca 
ctgggaggat 
aacgtaaata 
gcattaatca 
atttacaaga 
cgtttaggtt 
gtggctattg 
ccggagtcga 
aatttaaatt 



gacgtgaggg 
aaggttatcc 
gctggcagga 
agcgtagctt 
atacattcag 
atgtgcgtac 
gtgatatttt 
ccagcgagaa 
atttagatat 
accgtctacc 
aaggacgcaa 
tgtataatcc 
tttaa 



gtatccggta 
tttgttttat 
tccttttgga 
ttattcgtat 
ttataaaaat 
aacgccttcc 
agatcgttgg 
acgggctgac 
ttgggtaaag 
tgagaaaatg 
tttacttgtt 
gtatgcaccg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

765 



<210> 318 
<211> 1050 
<212> DNA 
<213> B.fragilis 



<400> 318 

catctcaggg 

aagcaaacga 

gtt tcctgcc 

tatacttgtt 

ctgtcggccg 

ggatttatcg 

att tttcgct 

ttggatgtta 

ataaataaga 

aaggagttac 

ctgatgaaac 



aagaatccct 
ttcgtcctgt 
tcaaatgtta 
ccgacaaaca 
gtttcgataa 
aagtaggtac 
tgcctcagtg 
ttaaacgccg 
atccctcttc 
atcctcatgt 
aggtgctaca 



tagctttgca 
tttatttctg 
ccggcatttg 
gttgatatgg 
aggagccgaa 
tgtaactccc 
tgaatcttta 
tcttgaacag 
ggaaggagag 
cggttatttc 
ggggttggca 



tggtatttaa 
atggaacccg 
ccatggtgca 
aatcatctta 
atttttgatg 
gattctcagg 
atttcccgta 
aaaagcggtt 
caggcggtgg 
acacttaatt 
gcttttcgtg 



aaagagagaa 
aaaaggtaca 
gatgctggat 
cttttcgaaa 
aattggccga 
atggaaatcc 
cgggcttcaa 
cgtatgtttt 
ctgacttttt 
ggggatctgt 
tggagcaaaa 



agccatgtat 
tgccttgttg 
acgtcacctt 
ccgtatcggt 
ttgtggtttc 
ccgtccgcgt 
taatcccgga 
aggtgtaaat 
gcgtctgtat 
ggatgttgct 
catacatgtt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 
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ccgttgttac tcaaacttcc tgccgatatc acagaagaag gaatggatga tgtgatcgac 720 

tgtactcgtc tgtaccgggt agatggagtg atagctaccg gacctaccat ggagcggagt 780 

tacttaaaag gttattcacc tgcacaatta cagaagatcg gctccggtgg aatcagtgga 840 

cgcgggatag gggagaggtc attaaaagcc gtcagctatt tgcgtgccca tgccggaaaa 900 

agccttctga tagtaggggc aggcggaatc attactcccg ctgatgcccg taggatgttg 960 

gatgcggggg ccaatctgat acaaatctat tcttcgttta tttatgaagg gccgggtata 1020 

gtgaaaaaaa tgattcagga aattaaatga 10 5 0 

<210> 319 
<211> 3174 
<212> DNA 
<213> B.fragilis 

<400> 319 

atgagcgaac tcattgtcta taaagcctcc gccgggtccg gaaaaacctt caccctggct 60 

gttgaatata tcaagttact aatccggaat ccgcgcgcct atcgccagat tttagcagta 12 0 

acctttacta ataaagctac tgctgaaatg aaggagcgta ttcttagtca attatatgga 180 

atacagatag gcgatccgga ttcggatgcc tatctaaaac gcataattgc cgagacaggg 2 40 

cattcagaag acgagatacg aacaacggca ggcatagctt tgggttatat gcttcatgac 300 

tacagtcggt tccgcgtcga aaccattgat tctttttttc aatcggtcat gcgtaatctg 3 60 

gctcgtgaac ttgaattgag tcccaatctg aatatcgaac cgaacaacgt agaagtattg 42 0 

agcgatgctg tggacagtat gattgaaaag ttgggaccca attcacctgt actggtatgg 480 

ctgctcgatt atatagatga acgtatcgct gacgataaac gctggaatgt ttcggatgag 540 

atcaaaagtt tcggacggaa tatttttgat gaaggataca tcgagaaagg tgatggtctc 60 0 

cgccgacgcc tccgtgaccc gaatgtaatc cataattatc gtaagacatt aaaagagatg 660 

gaaacagccg ctcttgaaca gatgaaagag ttcgctcaac agtttgaaaa tgtactttcc 72 0 

agtcaatcac tgaaaccaac tgatttaaag aacggagcca aaggaatagg aagctatttc 780 

aataaactaa aaaacggtat actcggagac gagatagtca atgctactgt aatcaaatgc 84 0 

ctcgatgacg agactaattg ggctgcaaaa acatcaaaac aatatacgga tattatattg 900 

ttggcttctt ccatcttaat gccacttcta caaaatgccg aacaatatcg ctcacgcaac 960 

aatcgaatag tgaacagctg ccgactgtcg acacagcacc tgagcaaagt ccggctttta 102 0 

accaatattg atgaagaagt acgtcaactg aatcgtgaaa acaatcgttt cctcctttcg 1080 

gataccaacg ctttgctcca ccaattagtg aaagacggtg attcctcttt tgttttcgaa 1140 

aaaatcggaa ctaacatccg caatgtgatg atagacgaat ttcaagacac cagtcgaatg 12 00 

caatgggata attttaaact cctgctactt gaaggattgt ctcaaggagc cgacagcttg 12 60 

attgtaggtg atgtcaagca atctatttat cgttggcgaa acggcgattg gggaattctg 132 0 

aacggattga ataagcaact tggatatttt tctatccgta cagaaacgtt aaaaaccaac 13 80 

cgccgaagtg aaaccaatat catacgattc aacaatagta tattctccgc ggctgtggac 1440 

tatctcaacg aaatgtataa taagcagttg ggaagtattt gtgagcctct gatcaatgca 15 00 

tatgccgacg tggaacagga atccccccga aacaaacaac aaggatacgt taaggtagag 15 60 

tttctcgaac cggacgaaga acacgattat acagaacaaa cccttatcag cctgggaatg 1620 

gaagtagaac atctgttaca atccggcgtc aaactgaacg atatagctat tctcgtcaga 1680 

aagaataaaa gtattccgcg tattgccgat tacttcgata aacaactaaa ttataagatt 1740 

gtgtctgacg aagccttccg tctggatgcc tcgctcgcca tctgtatgat gttggatgcc 1800 

ttgcgctatc tttccgatcc ggagaatcgg atcgtgaaag cacagttggc cactaattac 1860 

caattacaaa tacttcattc ggagtatgat ctcaattcgc tgctcctcca taaagccgaa 1920 

gaattatttc caccggcttt tctggaacga atggcagaac tacggttaat gcctctgtat 19 80 

gaactgttag aggaactttt cagtttattt gaactgcacc gtattgaaca acaggatgct 2 040 

tacctgtttg ctttctttga tgcagtaacc gattatctgc aaagccactc ttctgatccg 2100 

gacagcttta tacgatattg gaacgagacg ttatccggga aaacaattcc gagtggcgaa 2160 

gtggagggta tacgtatctt ttcaatacat aaatccaaag gactggaatt tcatacggta 222 0 

ctcctaccct tctgcgactg gaaactggaa aatgaaacaa acaaccaact cgtctggtgc 22 80 

gtacctcaag aagccccttt caacgagttg gatattgtgc cggtaaatta ctcttccgcc 2340 

atggcagaat ctgtataccg cacagattat ttacacgaac gcctgcaact atgggtagac 2400 

aatctgaatc tgctttacgt agcatttaca cgtgccggca aaaatttgat tatctggagt 2460 

cggaaaggac aaagaaatac aatggccgaa ttgctaaccg gagcactacc acaggctgcc 2520 

aacaaattag atcaggaatg ggatgaagaa caggtatatg aattaggtga cctttgccca 2580 

tccgaaaatg agaaaaaaat cgattcaggt aacaagctga ctcgcaaacc ggaaaaactt 2 640 

ccggtcaata tggaatctat gcatccggat atagaattcc gacagtccaa ccgatctgct 2700 
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gattttatca 
ttgttacata 
cacttgatat 
accgtaaagg 
ttcaacgagt 
cgtgtaatga 
aaaaaataca 
aacatcacag 



aggggctttc 
ctttattttc 
tcgagggaat 
ccttctccct 
gcgcaatcat 
tgaaaaacga 
acaaacaggt 
gttatctgtg 



tgaagaagaa 
tgccattgag 
cattggcagc 
gcctgaagta 
ttacaaagac 
gcaagtagtt 
aaaaggatac 
gtatgtagaa 



tccgatgacc 
accaaggatg 
aaagaagcgg 
caagagtggt 
aaaggtgtat 
gtggtagact 
atgcagctcc 
gaagaaataa 



gcttcataaa 
acatagaacc 
aagaacgaat 
actccggtga 
tacaaacccg 
tcaaatttgg 
tctcccgaat 
tcgaaaaagt 



<210> 320 
<211> 1095 
<212> DNA 
<213> B.fragilis 



tcacggacaa 
ggccattcat 
acgttctctg 
atggagattg 
tcgccctgat 
taaagcaaac 
gggatacaaa 
atga 



2760 
2820 
2880 
2940 
3000 
3060 
3120 
3174 



<400> 320 

tttatgaatt 

tcatcagaag 

aaaatagttt 

ggacgtgtta 

gtgacggagc 

ttgcgtagcg 

attattgccc 

gataaagatg 

atcaaagaaa 

cccgttagcg 

cagggtgagg 

tatgaaagcg 

tatccgggta 

agtaagacaa 

atgttcacca 

gcacatgcct 

cgcctgaaag 

tccggacttt 

ttaaatgcag 



ggacaaaata 
tgaagcaccc 
ccgtcgatac 
cttttaatca 
ttcgcgctga 
gtgaagtggc 
gcagaaatgt 
tattgcaggc 
ttttttccat 
gttttattgt 
agatatttac 
acatcagtaa 

aggtgttctc 
tgaacgtacg 
cggtcaatgt 
tgatatttga 
tgaaagaagt 
ccgagggtga 
actaa 



ccttccatgc 
cggagaaaat 
ggtgcatctg 
ggaacaggtg 
agttggggat 
cgattacgaa 
aaatgctacc 
acgtcaggaa 
aaataacttt 
ggaaaagagt 
tgtctccggt 
agtagcagaa 
cggaaatata 
ggtaaagctg 
tgagtgcaaa 
aggaggtaag 
cgatgtatac 
cagagtgctg 



ctattgattt 
caagatctgt 
catgatgtgg 
gcacacgtct 
tatgtgagaa 
agacagatga 
cgggatatgt 
ttgatcaatg 
agtggccggt 
gtgagcagaa 
ctggagcatg 
ggagcatcgg 
gataaagtat 
tgtaacgaag 
tcttccggga 
aattacgtcg 
aaacggcaga 
aatcagaatg 



tgggtatggg 
gtctgacaga 
cagatgaatt 
atccgatgtt 
aaggagacat 
aagaggcgga 
tcgattccgg 
ctgaagcgga 
cattctatga 
atatgcagct 
tatgggtgat 
tacatatcac 
accacatgtt 
actatctgct 
aacagatgcc 
taaccgtcac 
atcaggaatg 
tattattggt 



gagtggttgc 
cagtttactg 
gactttgaac 
tggcggaaca 
acttgccata 
gcagcaggtg 
gttggcatcc 
agagaatcgc 
agtcaaatct 
tcgtcccgat 
ggcagatgtt 
tacgctggca 
gaatactgaa 
gaagccgggt 
tcggatcaat 
ccccgacaac 
ctatgtccgt 
ctacaatagt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1095 



<210> 321 
<211> 627 
<212> DNA 
<213> B.fragilis 



<400> 321 

gtaccgagtt 

cgccagacgt 

cctcttcatc 

tattcgttga 

tttggcgtag 

ttgtcaaatg 

gtttcgtatg 

tatattgtag 

ccaaacaaga 

gtactcaatc 

tttcgaagtt 



gcgtaagcac 
tgactacaaa 
agaaaacgta 
tgaaccgtta 
acaagaaata 
aaccttttat 
gtattcaggg 
ataatatttt 
aacttcgttg 
aggcgcttaa 
caaatgattc 



ttggtatgaa 
accggttgtc 
taaagaaaat 
tacattcgga 
ccgttatttg 
gcagggaact 
aaatattgat 
accgggtggt 
ggagaaaact 
tctgagtgta 
cacttaa 



acacttttct 
tttccggatg 
gcctatgtct 
ggaagtatcc 
cctctgtact 
agaaaatgga 
aaaaatacat 
tcggaacata 
caatcagtaa 
gattactatt 



ctgcaggata 
aagaccgtgc 
ctttcttctc 
gttttgatgg 
ctgtaagtgg 
tggataacct 
ctccctttct 
tgattgatat 
atgttggact 
atcgtaaagg 



tggttttgat 
cagacagttt 
tacagcttcc 
ttctgactta 
attatggaga 
tgcattccgt 
gttgggtaaa 
aaattctgct 
tgatttttcg 
tacagacctt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

627 



<210> 322 
<211> 2574 
<212> DNA 
<213> B.fragilis 



<400> 322 
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acatacggct ggcagcaccc gaagcgggta caaacttcac aatcgttttg tctgaattgg 60 

tataggcatt ccatgctgca agataagctt tctgctcctc ggcatcagga gccaaaatgc 120 

ctttttctac agatgcggca gcatctaact tcaaataagg aaagcctttt tggaagcaag 180 

ccaactgctc agcgatttgt acttccgaaa tgcccttttt ggcaagcaat tctttgtctt 240 

caggtgttat catactatct aattttatta attcatgccc aaaaatacaa aaaaaggaag 3 00 

aggacaccct acaaaacccg gaaaaaaact atctttaccg acgataaatt tgagctgctt 3 60 

atggaaagag atgaattctt tacgaaagaa gagagagaat tattgttctc actatacaaa 42 0 

aaactactgc gtctcaccgg agaaacctta caaaaaggag attgcagaaa gctgaaaaag 480 

catcttatcg actccactca aaacaatacg atgcagaggg acagttttgg gctgaatcct 540 

gttatcaaag atatgcagac tgctgtaatc gtggctgaag aaatcggcat gaaacgggca 600 

tctattttag gcattatgct acacacgcct gtacgttgcc actcttatac aatagaatac 660 

attcaacagg agtatggtga agatgtggcc ggaattatcc ggggattaat caagatcaat 720 

gacctctatg ataagagtcc gaccatagaa tcggagaatt tccgcaatct gctactgtct 780 

tttgccgaag atatgcgggt aattctgatc atgattgccg accgtgtaaa cgtgatgcga 840 

caaataaaag atgccgaaaa tgacgaggcc cgcagacggg tggccaatga agcagcctat 9 00 

ctgtatgctc cgctagccca caaactggga ttgtataagc tgaaatcgga actggaagat 960 

ttgtcactaa aatataccga acatgacatc tattaccata tcaaggaaaa gctgaacgag 102 0 

acgaaaaagt cacgtgaccg ttatattgcc aacttcattg ctccgataca acagaaattg 1080 

gaggaagcag gactgcattt ccacatgaaa ggacgtacca agtccattca ttccatctat 1140 

cagaaaatga agaaacagaa atgccagttc gaaaacgtat atgacttgtt tgctatccgt ' 12 00 

atcatcctgg aatctcagtt tgaaaaagag aagcaggaat gttggcaggc atattccata 12 60 

gtgacggata tgtatcaacc taaccccaaa cgtctgcgtg actggctgtc ggttcccaaa 1320 

agtaacggtt acgagtcatt acacatcact gttatggggc ccgaaggcaa atgggttgaa 13 80 

gtacagattc gtacggagcg tatggacgat attgccgagc gcggattggc agcccattgg 1440 

agatataaag gcgtgaaggg tgaaagcgga ctggacgaat ggctgacttc aatacgtgaa 15 00 

gcactggaga atacggagaa cgacctggaa atgatggacc agttcaaact ggatctgtat 1560 

gaagacgaag tattcgtatt tacaccgaag ggagaccttt ttaaactggg caaaggggct 1620 

accgtacttg attttgcttt ccacatccac agcaaattgg gatgtaaatg tatcggagca 1680 

aaagtaaacg gtaaaaatgt acagttaaga caaaagctga acagcgggga tcaggtagag 1740 

attatgacat cgaacacaca gactccgaaa caagactggc tgaacattgt cactacttca 1800 

aaagcccgta ctaaggttcg tcaggccctc aaggagatgg tggcgcgtca gcatgatttt 1860 

gccaaagaga ccctggaacg caagttcaag aaccggaaga tggaatacga cgaagctgtg 192 0 

atgatgcgct taatcaaacg cttgggattc aagaacgtga cagagtttta tcagaagatt 1980 

gccgatgagg tactcgacgt aaacgatatt ctggataaat acatcgaaca acaaaagcgg 2 040 

gacagcgaac gtgatgaggt gacctatcgc agtgcagaag aatacaacct gcaaaaccag 210 0 

atagacgaaa caacagtcac taaagaagat gtactcgtta ttgaccaaaa cctgaaagga 2160 

ttggatttca aactcgccaa atgttgtaat cccatatacg gagacgatgt attcgggttt 222 0 

gtcacagtat ccggaggtat caagatacac cgaaatgact gccccaatgc aggacagatg 22 80 

cgcgaacgct tcggctatcg gattgtaaaa gcacgctggg ccggtaaatc ggaaggtact ' 2 3 40 

caatacccaa taacactccg cgttgtgggt catgatgata tcggtattgt aacaaatatc 2 400 

acttcgatca tctcaaaaga aaatggtatc tcgctacgtt ctatcggtat cgattcgaac 2 460 

gacggacttt tctcgggtac attgaccatt atggtaagtg ataccggacg tctggaagcg 2 52 0 

ctgatcaaga agttgcgcac agtaaaagga gtaaaacagg ttagcagaaa ttaa 2 574 

<210> 323 
<211> 1479 
<212> DNA 
<213> B.fragilis 

<400> 323 

tactcattgc ttatgatttt taccgctgaa aacattctac tcattggttc tattttacta 60 

tttgtcagca ttgttgtcgg aaaaaccgga tatcgcttcg gagtgccggc cttattatta 120 

ttccttcttg taggtatgct tttcggaagc gacggattgg gattacaatt tcataatgcc 180 

aagatagccc aatttatagg tatggttgcc cttagcgtca ttctgttctc cggaggtatg 2 40 

gatactaaat tcaaagaaat tcgtcctatt ctttctccgg gaatcgtact ttcaacagtg 300 

ggagtatttc tcacggcact ttttaccgga ttattcattt ggtatctttc gggaatgagt 360 

tggaccaata tccactttcc attgatcact tccctattac ttgcatctac catgtcgtca 420 

acggattctg cttcagtatt cgccatcctc cgttcgcaaa agatgaatct gaaacataac 480 

ctacgtccta tgcttgaact ggagagcgga agcaacgatc caatggccta tatgcttacc 540 
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atagtcctga tacaattcat tcaatcagat ggcatgggta caggcaacat aatcggttca 600 

ttcatcatcc aattcttggt aggtgctgct gccggatata tcttgggaaa actggcgata 660 

ttgatactca acaaaataaa tatcgataac caatcacttt atcccattct gttattgtct 72 0 

tttgtattct tcacttttgc catcaccgat ctgcttcgcg gtaatgggta tttggctgta 780 

tacattgccg gcatgatggt aggtaaccat aaaataactt tccgaaagga aattgcaaca 840 

ttcatggatg gtctgacctg gctgttccaa atcattatgt tccttatgtt aggactgctt 900 

gtcaatcctc acgaaatgat tgaagttgcc gttgtagcat tgcttatcgg agtattcatg 960 

atcgttatcg gacgaccatt aagcgtattc ctttgtcttt taccatttag gaagattact 102 0 

ttaaaatccc gtctgtttgt ctcgtgggta gggctacgag gagctgtacc catcattttc 1080 

gcaacttatc cggtagtggc aaacgtggaa ggatcgaata tgattttcaa tatcgtgttt 1140 

tttattacga ttgtttcatt gattgtacaa ggaacaagtg tttcgtttgt ggcacgcttg 12 00 

ttacacttgt ccactccact cgaaaagacc ggaaatgact tcggtgtaga acttccggaa 12 60 

gagatagata ctgatctttc ggatatgacc attactatgg aaatgctgaa tgaggcagac 132 0 

accctgaaag atatgaattt gccaaaaggt actttagtaa tgatcgtcaa acgtggtgat 13 80 

gaatttctta tccccaacgg cacactaaaa ttacatgtag gagacaaact actgctgatc 1440 

tcagagaaaa ataagcagga aacggttaag aatgaatag 1479 

<210> 324 
<211> 312 
<212> DNA 
<213> B.fragilis 

<400> 324 

ttactgatgc tgttcacgca acaggtcgtt cacagtcttt accggattga aagtggaaag 60 

tggaacctcc acaaataccg tactccaatc gctcatagca ccattccaca aaccgggaag 120 

ttcaagagct ttcagatcct taccactctt cgacttataa gagatgaagc cggtagcttt 18 0 

atccacatat ttcgccaaat caaatttatg acccttataa tcacgtacgg cgcaaaccag 240 

atcgaccggg ttgaagtgcg tacccttttc aaacatttcc tttgcttccg gattattcat 3 00 

atcgatttgt ga 312 

<210> 325 
<211> 1248 
<212> DNA 
<213> B.fragilis 

<220> 

<221> unsure 
<222> (473) 

<22 3> Identity of nucleotide sequences at the above locations are unknown. 
<400> 325 

cattcctggc gttctatctg ggctatctta tatatgaaag ctggtttttc ttcgttttcg 60 

agtggaatca gaaacttaca aagaaatcaa aaaaatatga aacagttgcg taacatagtt 12 0 

gccggaatgc ttgtcttgat aggaggaatg ttgcctgcta caacctttgc acaggagcct 18 0 

gtaccgggtg ataccaccgg tactttgcag catgagatta ttgtaggtaa agacacaatc 240 

aatcaagaag ctaatcaggt agatgtaaaa ggcattgtgt tcggccctat tggagattct 3 00 

tacgagtggc atattacgaa tataggaaaa acttcgattt gcattccgtt gcgattaatc 360 

gtgtatagcg aactttctgg ttggcatgct tttctgtctt cgcgcctaga agagaatggc 42 0 

ggcaaatacg agggatttta tatagctcct gccgggagca agtatgaggg ganagtagta 480 

gaacgtaatg cgacgggaga ggaagtacgt ccgtgggata tttccattac aaaggtaact 54 0 

ttgtctctct ttatcaatag cgctattttg ctggcgatca ttctgagtgt agcgcattgg 600 

tatcgcaaac gtgaacaggg tgcatatgct ccgggaggat ttatcggatt tatggagatg 660 

tttattatga tggttcatga tgatgtgatt aagagttgtg tgggacccaa ctataaaaag 72 0 

tttgctccct atctgctcac agcctttttc ttcattttca ttaacaatat tatgggactg 780 

atccccatct ttcccggagg agcgaatgta accggaaata ttgccataac attggtatta 840 

gctttattca cattcgttat tgttaatata ttcggaacaa aacactattg gaaagatatt 900 

ttctggcctg atgttccctg gtggctgaag gtacctatac ccatgatgcc gtttatcgaa 960 

tttttcggtg tatttaccaa accgtttgcc ttgatgatcc gtctgtttgc caacatgttg 1020 

tccggacaca tggccatgtt agtgcttacc tgcctgatat ttatatcggc aagcatggga 1080 
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ccggctatca atggttcgct tacggtggct tccgtattat tcaacatctt tatgaatttg 1140 
ctggaagtgt tggttgcctt tattcaggct tatgtgttca cgatgttatc tgctgtattc 1200 
atcggactgg cccaggaagg cggtaaaaaa gaagaagtaa aagaataa 12 48 

<210> 326 
<211> 2658 
<212> DNA 
<213> B.fragilis 



<400> 326 

gtgctaatta 

tcatttttgt 

ggatggtttt 

gagtatgaga 

caaaagaaga 

gtggtgaata 

actaattatg 

atccgtcaaa 

aaggataatt 

tctaccagca 

aacgtattta 

tccatcaagt 

gaaggtacta 

gaagtgccga 

actaatttca 

ctggagccca 

aaacctattg 

caaggaattg 

gctaaagata 

acttatgccg 

ggagaaatcc 

tggattacgg 

agtctgatgg 

ctgcgccata 

tttaccgacc 

gtggcacagc 

tttgctatag 

ctgttgcagg 

gatccgcggg 

cgttcttcac 

acaactaccg 

attaatcaat 

gaaaatacaa 

caggctgctt 

cctgaaatag 

gccccaacac 

aatcgcctga 

gttgaattga 

cccgatgtta 

gccgaaagcg 

tttcagacac 

atgggggctc 

atttcagtaa 

acatcggatg 

ttgggcattt 



gatttaacat 
gcgagcctgt 
tgaaaaagaa 
aattaacagg 
acgattatta 
aactgcaacg 
agaatcagat 
gtcgtccgtt 
atatttctcc 
tactgattaa 
ccaatattaa 
cctttgataa 
ctactatcta 
tgaccggacg 
gtgatggaca 
ggcctgaaga 
tcttttatat 
aagactggca 
ttaccgagga 
catctaccaa 
ttgaggctga 
tacaaacagg 
gagatgccat 
acatgatggg 
gaatgaattc 
cgggtgatgg 
aatatggtta 
attttttagc 
atgccgttga 
agtatggcat 
gagagaaagg 
ggaataatta 
cggtaggtga 
tgaggttttt 
ctcaatatac 
aagtgctgaa 
tgcgtatgct 
tggatggttt 
tgacacgtaa 
agggagtgaa 
cgatttgtag 
gccgtgaact 
aacgtggtga 
tagctaccaa 
cgaaataa 



gagactaaaa 
tgctgctatg 
aaaaaagagt 
aagcgatagt 
ctttgagatt 
ggtacctgca 
gatccgcttt 
acctatttca 
gctgatagcc 
agtgaacgat 
tcttggcaca 
taatgtggta 
tgtgacggta 
tttggataat 
acaacgggta 
tcgggcagcg 
agaaaattcg 
agtagccttt 
catggaggta 
agcaaatgca 
tatcatgtgg 
tgtagtgcgt 
gcgctttgtt 
atcatgggct 
gacttcatcg 
tataaaggca 
tcgttggtat 
gaaacacact 
ccctcgtgca 
tgccaatttg 
acagacgtac 
tctttatcat 
tggtgagaaa 
gcttgatgag 
ttatctgctt 
aaatgcacag 
tgagaatgaa 
gcataaaagt 
cttacagaaa 
agttaacaag 
ttgtgatgac 
gaatttttat 
attgcttcgc 
atatcactat 



acaatcctac 
tgtattgaac 
aatccccaag 
gtcgttcgtc 
ccttccaccc 
gaactgaatg 
gagttggata 
ccatctgaag 
gggtttaagg 
atatatgatg 
tcggccatca 
gcaacctccg 
gaggttagtt 
ccgcgtgtgg 
aataaaaaac 
tatttacgtg 
acaccttatc 
gaacgtgccg 
gatatggatg 
atgggacctt 
tggcataatg 
cctgaggctc 
gcctgtcatg 
tttcctacag 
tctatcatgg 
ctttctcccc 
ggcaagcaaa 
gatcggcttt 
cagaacgaag 
aaatgtattg 
gaagaggctt 
gtaatggcga 
acttatacgt 
gtgctatgct 
aaaaatactc 
gcttatgttt 
tcggtaaacg 
atttttgctg 
ggttttgtag 
aaattgattg 
catgcacatc 
gggtctcaga 
attaaagatt 
aaagatttga 



tgaccactat 
ctcccgcaac 
acagtattaa 
gtggtatgtt 
tgttggggcg 
aagccggagt 
aatcggctaa 
atgccattag 
tagaagcata 
gtacagagac 
agaatttatc 
aactgactac 
cctctatttt 
gatatttcac 
aatttataac 
gagaattggt 
gttggaggaa 
gatttaaaaa 
atgtgaatta 
ctattcttga 
tactttcaat 
gtggtgttgc 
aagtgggaca 
attctctccg 
attatgcccg 
acatcgggcc 
caccggaaga 
ataaatatag 
atcttggcga 
ttccccaaat 
ctcgtttgta 
acattggagg 
ttgtggaaaa 
atccgaaatg 
ctttgggagt 
tttgggattt 
ggaaaaaagc 
taacagagcg 
atgcattgat 
ataatcactt 
gttcggcaca 
taaaccgtat 
tgcttcaaag 
ttttacgcat 



ggccaccggt 
tcctgatatg 
ggttaagaat 
caatgtatac 
tgatatgctg 
gaatcgtgga 
taaattattg 
ccaatcggtg 
taataatgat 
aagcataaat 
aagaattcta 
tcgtgtgacc 
gttgcttcct 
taatcctctg 
tcgatggcgt 
agaaccccga 
gtatattaaa 
tgccattatc 
ttctgtgctg 
tccgcgttcg 
gcttcaggag 
tttaccggat 
ttcactcgga 
ttcgaaaaca 
ctttaactat 
gtatgatatg 
agaaaaagaa 
tgaggcacag 
tgatccgatc 
cattcaatgg 
ttatgccgtt 
tatttatatt 
ggagaagcag 
gttgttcgac 
agtagagaat 
actgtcgaat 
cttcacagct 
tggtggactg 
tactgctgct 
cttgttcgac 
tactgatcgc 
ttccgacgcc 
ccgtttgggc 
aaatactgcg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2658 



<210> 327 
<211> 933 
<212> DNA 
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<213> B.fragilis 
<400> 327 

aatacaaaca gggctgatat gagacaattg tactacactt tccgaactct tctccgtgga 60 

agaggtggaa atctgaccaa aataatttcg ttaactttag ggcttttggt cggcatcctg 12 0 

ttatttgcca gggttgcatt tgaattaaac tatgatagct attatcaaga accggaaaat 180 

ctttttctaa ctttacgtac agttgtttcg caaggtgaaa agaaagagcc tgtttgtagt 240 

aattacggaa aacttccagc agcaattcgt gaaaattttc ctgatgaagt ggaagatgca 3 00 

actttgattg acttatttag tcgcagttcg ctttaccatg aaggccagga aaagaaagat 360 

gcaatactgg ctacttcccg aagccatatt ttttccactt tgggcgttaa agtactttcc 42 0 

ggaaatgtgt ctgaattgga taatatggat gcactgttta tatcccgttc tcttgctcaa 480 

agtctttttg cagatgccga tcctattgga aagacagtaa tgattaatat tgattatcca 540 

ttgactgttc gaggtgtttt cgaagatatt ccggaaaatg ccgagtttcg gtttgatggg 600 

gtctattcat ttgtgactcg tgctaataga ttcagagatg aacgtggtgg atggcggggt 660 

gatatcagct atacatgtat ggttcgtttc cgccatccgg aagatgtaga gaaagtggcg 72 0 

gcacgtatgc ctgatatgct gaagaagtat atacagtata ataaagactg gtttgaagaa 780 

ttttcgttta taactccttc acagtttcat ttgcagaaaa aggaatcacg taaaattatc 840 

agtattctat cgattctcgg atttgccatc ttgctgattg ccggcatgaa caatgtactt 900 

gatttctatt tcatcattgg ctcaacgagc taa 933 

<210> 328 
<211> 399 
<212> DNA 
<213> B.fragilis 

<400> 328 

cagaaattaa gaatggaaaa attcagcacc agaaaaagaa tacggagctt cggatatgcc 60 

tggaaaggta tccgaagttt tgtaagcaaa gaacataatg cctggataca ttgcacggca 12 0 

attattatag taacagtggc cggattctgt ttcggcatca cccggaacga atggatggct 18 0 

atcatacttt gttttggagt agtactggca gcagagggat tcaacacggc tatagaaaga 240 

ttggtcaatc ttgtatctcc ggaacgtaat ccgatagcag gtgatgtgaa agatatcgca 3 00 

gcgggttccg ttctgatatg tgctatagtt gctgccattg taggaattat catcttcatg 360 

ccttatgtac ttgctgtttt actgtgtaat atgggataa 399 

<210> 329 
<211> 1536 
<212> DNA 
<213> B.fragilis 

<400> 329 

tatataaaaa gattgcttat gtcacagatt atcggacata tctctcaggt aattggccct 60 

gtggtcgatg tgtattttga aggtacagaa tcggacttga tattgccaag tatccacgac 12 0 

gcattagaga taaaaaggca caacggcaaa aagctgattg tagaagttca acaacacatt 180 

ggtgaaaata ccgtacgtac ggttgccatg gatagtaccg acggcttgca gcgcgggatg 240 

aaggtatttc cgacgggagg tcctatcaca atgccggtag gcgaacagat caaaggacgt 300 

ttgatgaacg tagtcggcga ctccatcgat ggaatgaaag aactcaatcg cgacggtgca 3 60 

tattctattc accgtgatcc tcccaaattt gaagatttaa ccactgtaca ggaagttctg 42 0 

tttaccggaa ttaaggtgat agacctgctt gagccttatt caaaaggagg taagatcggt 480 

ttgtttggcg gggccggagt gggtaaaact gtacttatta tggagctcat taataacatt 540 

gccaagaagc ataatggttt ttccgtattt gccggagtgg gagagcgtac ccgtgaaggt 60 0 

aatgatttgc ttcgtgagat gattgaatcg ggtgtaatcc gttatggaga agcattcaaa 660 

gaaagcatgg aaaaaggaca ttgggacctc tcgaaagtgg attataatga ggtagaaaag 72 0 

tcacaggcta cattggtgtt cggacagatg aacgaacctc ctggagcacg tgcttcagtt 7 80 

gctttgtcag gattgactgt cgctgaatct ttccgggata tgggggcaaa gtcgggagcg 840 

agagatatat tgttttttat cgataatatt ttccgtttca ctcaggcggg ttccgaggtt 900 

tcggctttgt tggggcgtat gccttctgcg gtaggttatc aacctacgtt ggctaccgaa 960 

atgggtgcta tgcaagaacg tatcacttcg acaaaaacgg gttctatcac ttcggtgcag 102 0 

gctgtttacg taccggctga tgacttgacc gaccctgctc cggcaacaac ttttacccac 10 80 

ttggatgcaa cgactgtgtt gagtcgtaaa attactgagc ttggtattta tccggcagtg 1140 
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gatccgttgg 
gatgtcgcac 
tctattttag 
agggtacagc 
ggagcaatgg 
gtagattatc 
aaaggtaaga 



agtctacttc 
aacgtgtgaa 
gtatggagga 
gtttcctgtc 
tggccattga 
tgcctgaacc 
aactgcttga 



tcgtattctc 
gcagattctt 
attatccgac 
tcaaccattt 
agatacaata 
ggcgtttctg 
acaggctaac 



gatccgcaca 
caacgcaata 
gccgaccgtc 
acagtggccg 
aaaggattca 
aatgtgggaa 
aaataa 



ttgtgggtca 
aagaattaca 
tggtggtaaa 
aacagtttac 
aaatgatttt 
ccattgaaga 



agagcattat 
ggatat catc 
ccgtgcgcgc 
cggagtaccg 
ggatggtgaa 
agctatcgaa 



1200 
1260 
1320 
1380 
1440 
1500 
1536 



<210> 330 
<211> 1809 
<212> DNA 
<213> B.fragilis 



<400> 330 

caaatgaaaa 

tttgctgcca 

atcggtgctt 

ataacaggag 

acacgtttat 

aaagatcaat 

actggttatc 

tcggatgaaa 

ggtctttccg 

ggtacttcgt 

gaaggtacgg 

tctattgccg 

gctacagcca 

ggaaaagtgg 

agtacaaacc 

cttcgttcta 

tacggattga 

acggatataa 

gcattcaatc 

acttctatcg 

aatattgtag 

tttgttaatc 

gtatactatt 

gtatatgatt 

gagcgtaaaa 

gagttacgtt 

gcatcgaaag 

agtaaatatt 

cataaagcgt 

gacagtttca 

ttggtatga 



gacacgtttt 
gtaggcaagt 
ctgtctatat 
taattaccga 
tctgcagtta 
atgaaatcac 
agacagtgga 
ccatcggtgc 
taacctctac 
cattgaacgg 
atgtacctca 
gactgaatcc 
tttatggagc 
gaaaaccggt 
gactcaatat 
attttgcgta 
ctgatgccta 
gtcggttgcg 
aggagtatag 
gatattatca 
cgaagacttc 
gccgtaacaa 
cgcgtaaggc 
tcgatgttca 
atacttcgaa 
ttaatgataa 
aacagattgc 
gggattctgc 
atgagaatac 
atgatatcca 



tattctttta 
acaaggagtg 
aaaagcagaa 
tatagatgga 
tgtaggacat 
gcttttccca 
gcgccgtaag 
tgtgaaaagt 
ttccggagcg 
aactcaggat 
gtcgaatgta 
tgcagatatt 
acgtgctgca 
gattaatttc 
gttgaattca 
cggtgacaat 
taaaaaaggg 
gaatacagaa 
tttaagtctt 
agaaaatggt 
atataaagtc 
taaaacctac 
taatccatac 
gaacaattct 
tgaggaaacg 
actgaagttt 
cgataaggag 
ctcccaaagc 
gaactcccag 
cgaactggaa 



ttgtcttttg 
gtgatctctt 
gacctgtcga 
aaattcaata 
gaagtacagg 
tcagctcaga 
ttgacagcag 
attgaccagg 
ccgggtgcac 
ccattgtggg 
ttgaatgatg 
gaaaatatta 
aatggtgtaa 
tcctcgaagt 
caagagaaag 
aaggggggag 
gggtggagtg 
actgattggg 
tcgggaggta 
aatgttaaag 
aaccggatgt 
ctgaccgata 
tatcaacctt 
gatacggatt 
attaatgcac 
acaactcaac 
agtttttcaa 
aataaatact 
attacctgga 
gtaatggtag 



ccggagtttt 
cagaagataa 
aagatggtaa 
tttcagtacc 
aactcaagct 
tgcttgatgc 
ctgtcgggaa 
cactggccgg 
ctgcaaaaat 
tattggatgg 
tttctaatat 
ctgtgctaaa 
ttgtgataac 
ttacttatat 
tagatttgga 
tttctaaaat 
cgctgactcc 
gcgatattct 
acgaacgggt 
gcgtcgggct 
tgaaattcgg 
cttatggatt 
tcgatgtaaa 
tagggtttaa 
tttcgtctat 
ttggtttgca 
tgcgtataat 
ttattccgga 
aagcaatggg 
gtaccgagtt 



gacttctgct 
tatgccgttg 
ttctccgaca 
ggagggggtg 
cgttcccgga 
tgtggtagtg 
actgaacatt 
gcaaattgcc 
acgtattcgt 
tattccgttg 
acagcaatcg 
ggatgcggca 
gactaaaaaa 
gcctacattg 
acttgaattg 
aatttccggt 
cgaagcccaa 
tttccgcgat 
gacttattat 
ggatcgtctg 
agtttcttta 
ggtgaatccg 
cggaaattat 
tatttttgaa 
ttttgatgca 
attggataaa 
tcgcaaaaac 
cggaggagtg 
agagtaccgg 
gcgtaagcac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1809 



<210> 331 
<211> 1593 
<212> DNA 
<213> B.fragilis 



<400> 331 

tttttttccg 

ataaaattag 

tcggaagtac 

ttagatgctg 

gcttatcttg 

cccgcttcgg 

tatactgaac 



ggttttgtag 
atagtatgat 
aaatcgctga 
ccgcatctgt 
cagcatggaa 
gtgctgccag 
ccactacaaa 



ggtgtcctct 
aacacctgaa 
gcagttggct 
agaaaaaggc 
tgcctatacc 
ccgtatgttc 
gttcgaacaa 



tccttttttt 
gacaaagaat 
tgcttccaaa 
attttggctc 
aattcagaca 
aagaatctat 
actttttttg 



gtatttttgg 
tgcttgccaa 
aaggctttcc 
ctgatgccga 
aaacgattgt 
ttgaattttt 
aatcgattga 



gcatgaatta 
aaagggcatt 
ttatttgaag 
ggagcagaaa 
gaagtttgta 
ggatgctgac 
aaaatttgct 



60 

120 

180 

240 

300 

360 

420 
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ttttatgatg acttgaatac ggcttgtgtg cggacagaag gcaaaggtat tcctaccctg 480 

attgcagaag gcaattataa agcggtcgtt tcaggattgc ttaatgtagc gggactgaat 540 

tacggtgcgc ttcccaaggg gttacttaag tttcataagt atgaagaagg ttcgcgtacc 600 

ccgcttgagg aacatttggc agaaggtgct atgtatgctg ccggaaaaag cggaaaagtc 660 

aatgtgcatt ttaccgtctc taccgagcac cgtgaactgt ttaaatcatt ggttaccgaa 720 

aaagtcgatg cttttgccaa acgttatggc gtggattata atatcacttt ctccgaacag 780 

aaaccaagca ctgatacaat tgccgctgat atggaaaacc agccattccg cgataacggt 840 

aagcttttgt tccgtcccgg cgggcatggt gcgctgattg agaatctgaa tgatctggat 900 

gcggatgtta tttttattaa gaacattgac aatgtggttc ctgataaatt gaaaggcgat 960 

acagtgcttt ataagaaact gattgccggt gtactggtat ctctccagaa gcaagctttt 102 0 

caatatctgg aattattgga tagtggtcgt tatacacatg agcaggtgat ggatatcctg 1080 

caatttgtac aaaagaaact tttctgtaag aatcctgaaa caaaagatct ggaagatgcc 114 0 

gagttggtca tttacctgaa gaataaattg aaccgtccga tgcgtgtttg tggtatggtg 12 0 0 

aagaacgtgg gcgagccggg cggtggtccg ttcctggcat ataatagtga tggaactatc 1260 

tccctgcaaa tccttgaaag ctcacaaatc gatatgaata atccggaagc aaaggaaatg 132 0 

tttgaaaagg gtacgcactt caacccggtc gatctggttt gcgccgtacg tgattataag 13 80 

ggtcataaat ttgatttggc gaaatatgtg gataaagcta ccggcttcat ctcttataag 1440 

tcgaagagtg gtaaggatct gaaagctctt gaacttcccg gtttgtggaa tggtgctatg 15 0 0 

agcgattgga gtacggtatt tgtggaggtt ccactttcca ctttcaatcc ggtaaagact 1560 

gtgaacgacc tgttgcgtga acagcatcag taa 1593 

<210> 332 
<211> 2595 
<212> DNA 
<213> B.fragilis 

<400> 332 

cttattatta tattgctttc ttctttcttt tgctttaact ttgtcgcaaa gtattattgc 60 

atggaagaga accacgaaat agaattagcc tggcaagtca ttgaaaatac cggtacgcat 12 0 

ctttttctga caggaaaggc cggaaccgga aaaacaactt ttctccgtcg gttaaaagaa 18 0 

cttaccccaa agcgtatggt agtggttgca cctacgggga ttgctgccat taatgccgga 240 

ggtgtcacta tacactcctt ttttcagttg aacttcgcac cttacattcc ggaaagcaca 300 

tttaattctg ctcagcaagg ttttcataaa ttcggaaaag aaaaaatcaa tattatccgt 3 60 

agtatggact tgttggtgat cgatgaaatc agtatggtac gtgccgatca actggatgca 42 0 

attgatgctg tactgcgtcg gtatcgtgat cgctcgaaac ctttcggcgg tgttcagctc 480 

ctgatgatag gcgacttgca gcaattggct cccgtggtga aagaagagga ctggagcctg 540 

ttgagctctt actatgatac agcatttttc tttggcagtc attcactgaa agagacggaa 600 

tatatcacga tagagttaaa gaaagtctat cggcaaagtg atacggaatt tgtcggatta 660 

ttgaataaaa tcagagagaa agaggcagac gacgctgttc tggaagaatt gaacaaacgt 72 0 

tatcttccgg gattccgtcc gagagaggaa gaaggatata tccgactgac tacacataac 780 

tatcaggctc agcaatataa cgaccgacaa ctgctttctc tttcaggaag agctttcagt 840 

ttccaggcga aggtggaagg cacttttccg gaatcggcat acccggctga tgaaatgctt 900 

accgttaagg aaggggctca gataatgttt attaaaaacg attcttccgg tgaacatcga 960 

tattataatg gaatgatcgg tttggttacg gctgtcagta aagatggcat ccgggtgaaa 102 0 

gggaacggag aatcacagga ttttctgctt gaaaccgaag aatggacaaa tagtaaatac 1080 

agcctgaatc cgcagacgaa agagattacc gaagaggtgg aaggtacttt ccggcaatat 1140 

cccattcgtc tggcatgggc aataaccatc cataaaagcc aggggttaac tttcgaacgt 12 00 

gcaatcattg atgcaaatgc ctcttttgcc catggccagg tttatgtcgc tctgagtcgt 12 60 

tgtaagtcgc ttcagggatt ggtgcttagt tctcctttaa ggcgagagtc cattatcagt 132 0 

gacgatacga ttgatgaatt tacccgtaat gccggagaga tgactcccga caagcataaa 13 80 

ttggctctat tgcgtcaaca ttacttctat gaattgttgt gcgaacagtt tgattttcat 1440 

ccgattgaac agcatttttt acgtttgctt cgcttgcttg acgagcattt atatcgtctc 1500 

tatccaaagt tgttggaacg atataagaca actgccgatc tgtataaaac gcagataatg 1560 

aaagtcgccg atacatttaa actgcaatat tctgccctat tgatggaggc tgaagattat 162 0 

accgccaacc cgaaattgaa tgaacgggtt atggccggtg cccactattt ccgtcaacat 1680 

ctggaagatt tattaactcc gctgattact tctacaaaag tagaaacgga taataaagaa 1740 

ttgaaaaaga aattctccga agcggcagat gcaatgaaga cagcattgca cgtaaagctg 1800 

ggaaccttgt gctataccga gaaggaaggt ttttctgttt ccgcatttct aaaacagaag 1860 

gctgttctta cgttatctgt ttcgggagga gaagctgcgt cctcttccgg aagatcggag 192 0 
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cgtaaatccc ggacagccga gaaaatagaa gtaccgactg atattcttca tccggaatta 1980 

tataagcaat tgattgcctg gagaaattct gaagctgcaa aagctggttt gcctgtatat 2 040 

accatcatac agcagaaagc aattctgggt attgtaaatc tcttgccgaa tgatgcggct 2100 

tcactgatac gtattccgta tttcggaaaa cgcggtgccg aaaaatacgg tgatgccttg 2160 

cttgaaatgg tgaaccgata cgtagaggag catggcatag aacgtccgca aatgccaaca 2220 

gcgacgttga ctgtcaataa tgggattaaa acgtcgaaag agcccaaacc tttaaaagag 22 80 

gctaaatcgg tgaaagaacc gaagccagat accaaagagg taacgtatcg tcttttcagg 23 40 

caggggaaga gtattgaaga aatagccagg gaacgcgagc tggtttccgg aaccatagcg 2 400 

ggacatctgg aacactatgt acgctctggt gaagttaaaa tagagcagtt ggtggcaaga 2460 

gagaaaatca cgaaaatcat ccgttacgta caggcccatg gaagtgataa aggactgacg 2 52 0 

gttattaaag cagctttggg ggatgatgtc tcatatgcag atataaggtt ggtacttgct 2 5 80 

gccggaataa aatag 2595 

<210> 333 
<211> 1587 
<212> DNA 
<213> B.fragilis 

<400> 333 

attaacccca tgaaaaacta tttaggactg attttcctct tgtttgcttt tacagcaacc 60 

gcacaaaaca atcgttcagc cctgttgcct atgcccaatc acatagagca agtgcaaggt 12 0 

aaacctttta gcttaacagg taagaacatc acgattcacc ccggacaacc ggaattaaag 18 0 

tttgcggcta ctactctgca aagtatactg aaagaccgca tgcaagtaga cattcccctt 240 

tccggctctc gccaatcccc catccggtta attattgatc cacaattgga aggaaaagaa 3 00 

cattatcaac tcaaagttga ccagaaaggg atgaccatta gcggagccag tgcagcagcc 3 60 

gttttctatg gtgtaatgac tgtcgatcaa gttctcttgg gagatgtatg ctccagcaat 420 

cggaaagaaa tgactcctat cagtatcgat gatgcgcctc gctttggcta ccgggcttta 480 

atgctagacc ctgcccggca ttttttacca atagaagatg taaagttcta catcgatcag 540 

atggtacgct acaaatataa tgtgcttcaa cttcacctga cagatgatca aggatggaga 600 

atcgaaatta gaaagcatcc gaaacttacc gcaggacaat ctttttatac tcaagaagag 660 

ttggccgacc tgattcgtta tgcagccgaa cgccatgttg aaatagtgcc ggaattggat 72 0 

attccgggac acactgtcgc tgtattagcc gcttatcctg aactgggatg tacacacacc 7 80 

gataccattg caaagaatgt aggtgagact gtaaacttaa tgctttgtgc caataatgaa 840 

aaagtgtatg aagtgtacaa tgatattatt gatgaagtaa gtgctctctt tccttcacgt 9 00 

tatattcacc tgggtggtga cgaagcagtt atagaaaaga actggaccaa atgtgaacgt 9 60 

tgccaaaaga tgatgaagga actgaaatac gaaaaggctt cccaattaat gattcctttt 1020 

ttcagccgta tgctcagttt cgtagaggct gatggaaaat accctattct ctggtgtgaa 1080 

ttagataaca ttcgcatgcc ggccaacgat tatctgttcc cttaccctaa aaatgtaaca 1140 

cttgtgagct ggagatacgg attgacgcca acttgccaga aactgaccca acagcatggt 12 00 

aaccctctga ttatggctcc gggagaattt gcatatctgg attatccgca gttcaaagga 12 60 

gatcttccgg aatttaataa ctggggaatg ccggtaacta cactcgaaac atgctatcag 1320 

tttgatccgg gatacggaaa acccgcagca gaacaggcac acattctggg agtaatggga 13 80 

acactttggg gagaagcaat aaaggacatt aaccgagtga catatatgac ctatccccgc 1440 

ggtctggcac tggcagaagc aggatggacc caaatggaac atcgcaattg ggattctttc 15 00 

aaagaacgtt tatatcccaa tctgaataac ttaatgaaaa aaggcgtttc aatacgtgta 1560 

ccattcgaaa tagtaaaaag aaaataa 1587 

<210> 334 
<211> 948 
<212> DNA 
<213> B.fragilis 

<400> 334 

aataattaca tgaaaagaat cttagttagc ggaggtgcgg gttttattgg ttcgcatctt 60 

tgtacccggc taatcaacga ggggcacgac gttatttgtc tggataattt ttttaccgga 120 

tcaaaagaaa atattatcca tttgatggat aaccaccatt tcgaagtggt acgtcatgat 180 

ataacatttc catatagtgc tgaagtagac gaaatataca accttgcctg cccggcgtct 240 

cccatacatt atcagtacga tgccattcaa accattaaaa catccgtaat gggagctatc 300 

aatatgttgg ggttagcccg taggctcaat gctaaaatat tgcaagcttc aaccagtgag 360 
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gtttatggag 
atcggcatcc 
tatcatcggc 
cggatgttgc 
gacgatatta 
ttggttgagg 
cttggcaatc 
ggatcgaagt 
cctgatatca 
ggtttggatc 



atccggaggt 
gttcttgtta 
agaataacgt 
cgaatgacgg 
ctatctatgg 
gtatgatccg 
cgaatgaatt 
cgaagattac 
gactggcaca 
gtatgattga 



tcatccgcaa 
tgatgaaggg 
acgcattaaa 
gagagtggtt 
aacaggtgag 
gatgatgaat 
ttccatgctt 
ttttaagccc 
ggagaaattg 
ctatttcaaa 



cctgaatctt 
aaacgttgtt 
attgttcgta 
tctaattttc 
caaacccgta 
acgggtgatg 
cagttagcgg 
ctgccgcacg 
ggttggcaac 
atgaagtata 



attggggaaa 
ctgaaactct 
ttttcaatac 
ttatccaggc 
gcttccagta 
attttatcgg 
agaagatcat 
acgatcccca 
cgactatttt 
agttataa 



tgtcaatccg 
ctttatggat 
atacggtcct 
actgaagaac 
tattgatgat 
accgataaat 
ccagaagacc 
acagcgtaag 
gctagatgaa 



420 
480 
540 
600 
660 
720 
780 
840 
900 
948 



<210> 335 
<211> 375 
<212> DNA 
<213> B.fragilis 



<400> 335 

ttttgtgaca 

gatttagaaa 

gagggagatt 

cagttcaatg 

ctgagtaatg 

gatgaaattg 

acgtccgggg 



taagccggaa 
aggtattaat 
gctggagtgc 
cgtatgatcg 
caatgattga 
aaatctgcat 
tataa 



agcttgtttt 
tagagagata 
gcacgataat 
catttatcag 
gaagtttatc 
tccaaaagaa 



ttatgtcaaa 
aacaatgata 
tccgcacggc 
gcgtatgaga 
gaacatacat 
aaaagagcag 



cccctaaagc 
gtcgtatttt 
atctttgctt 
ttgtattaaa 
tggtgagtac 
aatttgaaag 



gattacgatg 
tctgtataaa 
tttatattcg 
gtgcgttatg 
ggttcatgaa 
ctggcgtagt 



60 

120 

180 

240 

300 

360 

375 



<210> 336 
<211> 1380 
<212> DNA 
<213> B.fragilis 



<400> 336 

cttatacgca 

caagagaagc 

gtgaaacaag 

ctacaagagt 

cttacagaca 

tataaccgtg 

cgatcaaaag 

gcaaataatg 

cgcttacaga 

ctgttgagaa 

aatcacccgg 

aagaaattcc 

agtgatatta 

tgttatctgt 

ctgcaaatga 

gggaatgaaa 

tccatgagtg 

acagaacaac 

gaaatcgaac 

catgggggta 

tcctatcgaa 

gaactttcag 

gtacataaac 



tggattggaa 
tgaaaaacat 
aggaacttga 
tttattcaca 
gtgcattttt 
gtgatttaaa 
agtggaaaca 
aatataatta 
aaaatattgc 
cacaccaaga 
ctatcagaga 
gtatggttgc 
ccggagtatg 
cagatcctgc 
tggattatga 
ttgtagaaga 
gagaaagaga 
aagatcgaaa 
gtctgggaca 
cagatttaac 
atgccgattt 
aggaaataaa 
aaagtgaaaa 



aataaggaat 
tgcttataga 
gggagaaatt 
ctacgctaca 
acgctttctt 
tcttcaatat 
ccttcgtaat 
ccaaatagaa 
agatcaactt 
acttgccaaa 
actgacaaag 
aggtatccac 
tgaaggtaat 
tttgcaacca 
atcaaaagat 
acaatcaggt 
agagttcgtg 
atgttatctt 
aaatatacag 
tcccgcactt 
agtaatgatg 
aaaaataaag 
tacctattta 



attaggctac 
gtatacgagt 
atgtcttatt 
cagtgggaac 
gaaaattcgg 
tatattgacc 
cttttttttg 
cgtatcaaca 
ccccaacgag 
caactgtttc 
atactgggaa 
cgggaacaaa 
gatttaaaca 
cttttctttg 
caacatcgca 
ccatttatca 
aaatcggcca 
ataaatttct 
gaactggcaa 
ctacacgcaa 
tcagattttg 
caaaataaga 
aacgtctgca 



aagaactcag 
cacattttca 
atcagcatac 
atttttacga 
cttaccctct 
gatttcatac 
ataaatggta 
atctttgtga 
gcaatgcccg 
attatgatga 
agcagcatta 
tcatcactca 
gcctcctacc 
agcgattcaa 
ttaaagatat 
tttgcgtgga 
ttcttgccat 
ccaacgacat 
actttctttg 
tacacattct 
aaatgcctcc 
ctcatttata 
ataagttctg 



ggagatttat 
aaatggtatc 
acaaccttct 
gggacatgag 
acaaatgaaa 
actcaaaaaa 
tcacctgttg 
gagattctat 
tttaatgtgg 
aatagccaaa 
tggtaaagaa 
tgccactaaa 
catagagtat 
caagaagaag 
aaaaatacaa 
tacttcagga 
tgcagaactt 
agcttgtatc 
tcaaagtttt 
aaaaacaaag 
tttaaacgaa 
cgccctctct 
gtttgtttaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 



<210> 337 
<211> 1644 
<212> DNA 
<213> B.fragilis 
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<400> 337 

tacgtaaaaa 

tgtatcttaa 

ctttttgtag 

tctgcccggg 

tttgataaac 

agtaaccaag 

gagtttgaag 

tcattgaagg 

tataacttag 

ttgacccgcg 

catggtgtgc 

gaaatctccg 

gacttgaaag 

acacacagtg 

gtaggaaact 

cctattctct 

cgcatctata 

gacggaaagc 

gtggcaatag 

gccgataagg 

ggatgtgacg 

gcagatacgg 

ggggatatta 

ttggctataa 

ccgattgtat 

ttggcacttg 

aacgtgaagc 

gat attgcaa 



ttatgagtat 
gcggagtatg 
tacgcggaat 
cagataagtt 
actccattat 
tcacaataga 
agaaagccca 
agttggttca 
gatatgagaa 
aagatattac 
aggctatggc 
aagtaaacat 
atattgaaga 
aaatgctacc 
acggtaatgc 
tcactactaa 
cgacaggcgc 
agaaagattt 
aaagtggtaa 
tggttgaagc 
ggcgaatgaa 
taatcctgac 
atggcattcc 
ttgctatgaa 
ataacattgc 
gagtgaaaaa 
aggtactgat 
aatttttagc 



gttctgtttt 
cgggaaaact 
cgcagtctac 
tatctttgac 
cgagaaaata 
acatgcgccc 
aacggtgggg 
ctatggtatc 
tccggagata 
tgtggacgaa 
ccaactcgat 
tggtgttcga 
acttttgcaa 
ggctcattat 
ctggtggaaa 
ttgcattgtt 
aaccggcttg 
ctccgttatt 
gattgtaggt 
agtaaaaagc 
aagtcgcagt 
agcagggtgc 
tcgtgtactg 
gttgcaggaa 
gtggtacgaa 
gattcattta 
cgataatttt 
atga 



cagtgtcagg 
cccgaagtag 
aatcaggcat 
gcattgttca 
aagaaaggac 
gacgaatgta 
gtactgcgaa 
aaaggaatgg 
tttgcattca 
ctgataaccc 
actgccaata 
aacaatccgg 
cagactgaag 
tatcctcagt 
cagaaagagg 
ccaccacgcc 
gaaggtgcta 
attgagcatg 
ggatttgctc 
ggtgctatcc 
tactacacag 
gccaaatacc 
gatgcaggac 
gtcttcggac 
caaaaagccg 
gggccgacgc 
ggaattggcg 



aaaccgcaaa 
ccaatatgca 
tacgcaaaga 
ctaccattac 
tggagctaaa 
cttggtatgg 
cttctgacga 
ctgcttatgt 
tgcaatatgc 
tcacactcgc 
ccagccatta 
gtatccttgt 
gtaccggtat 
tgaagaaata 
aatttgaaag 
cgaatgcgac 
cctacatacc 
cacggcgttg 
atgcgcaagt 
gtaaattttt 
agtttgcaga 
gatataataa 
agtgtaatga 
taaaagacat 
ttattgttct 
ttcctgcatt 
gtatcagtac 



aggtacaggt 
agacttgctg 
tggacgttct 
aaatgccaac 
gaaagatcta 
tgacgaaact 
agacattcga 
ggagcatgcc 
tttggctgaa 
tacgggtaac 
cggaaatccg 
cagtggacat 
cgacatatac 
taaacacctg 
tttcaacggt 
ttataaagat 
cgaacgaaaa 
ccaaccacca 
aatcgcactg 
tgttatggcc 
aaagctaccg 
attacctctg 
cagttactca 
caatgatctt 
gctggctctg 
cttatctcct 
agcagacgaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1644 



<210> 338 
<211> 510 
<212> DNA 
<213> B.fragilis 



<400> 338 

acaaagaaga 

tct tttggaa 

gtggaaggcc 

cagttgtcac 

ggacgtatca 

caagctgaaa 

aaagacgaag 

gagaaggtca 

atgcttgatg 



ttatgtcatt 
ttgtattcgc 
gtaaaaccta 
gactgaaaga 
tgaaagaggc 
tagctgctca 
ccatacgtga 
ttcgcaaaaa 
aagtattaac 



gttactaccc 
agtattggca 
cattgacgaa 
ggagggcgaa 
tatgcaggaa 
gaaggaactg 
tatccgtcgg 
tctggatgat 
gaagaactaa 



gatagtggcc 
aaatacggct 
tcattggagg 
gctattgtgg 
cgtgaaaaga 
gatgaagtga 
caggtagctc 
aaacaagagc 



tgatattctg 
tccccgtcat 
tagccagaga 
ctgccgctaa 
ttatttacga 
aacgacagat 
tgctttctgt 
agatgggaat 



gatgctcctt 
tattaagatg 
agcaaacgcc 
caaggaacag 
ggcccgcaaa 
tcagattgaa 
ggatatagcc 
gattgaccgt 



60 

120 

180 

240 

300 

360 

420 

480 

510 



<210> 339 
<211> 570 
<212> DNA 
<213> B.fragilis 



<400> 339 

cagaagaaga 

gctgaagaga 

ttccgtactg 

aagtttaatc 

cgtttcattc 

atgtatctgg 



tggaagtcgg 
gaggagcgga 
tgaagggatt 
tgatctgtac 
ggttggtact 
atctttaccg 



aataatttca 
ggagaggctt 
ttgtgccgtg 
ggcagccgat 
caaggagaga 
gaagaagaaa 



atgcggtatg 
tatcatgagc 
cttgacaatc 

ggggaccata 

agggagacct 
cacatcggtg 



cgaaagctct 
ttgtcacact 
ctattgtttc 
aaccgagtga 
atctgcaatt 
tagggaaact 



gatggcttat 
ggcgcacagt 
tgttaacgaa 
agaatttatt 
tatgagtctg 
gatcactgct 



60 

120 

180 

240 

300 

360 
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gttccggtag acaaagctac agaagagcgt atccgacaaa ctgctgcaca tattttgcat 420 

gcatatatgg agttggaaac agtggttgac ccttctatag aaggcggctt cgttttcgat 480 

attaacgatt accgactgga tgccagtatc gctacacagt tgaagaaagt caaacaacaa 540 

ttcattgata agaatcgaag aattgtataa 570 

<210> 340 
<211> 1437 
<212> DNA 
<213> B.fragilis 



<400> 340 

acaatgtact 

aagtgtaatg 

ttgatattgc 

gaagatcttt 

attttggttt 

gctattcctg 

tcattgcttt 

ctt ttacagt 

gttgggtggt 

atagttgagg 

tacacagatg 

gtt ccggtta 

ttgattaatg 

ctggaagatg 

caaagtgcgt 

gtgcttaata 

aacacattga 

gagattgata 

att gctatcc 

cgt cgtagca 

cgt ttggttt 

ttt gcatata 

gggggacatt 

gggaggtcct 



tgatttctat 
gtgcttctga 
tttctttatt 
ccggagcttc 
ctttggtttt 
tgacacaagt 
ttatacagtt 
atcatcaggt 
cgccatatag 
agttttgtaa 
cacatgggaa 
tgggacttca 
aagagatggt 
gcaaaaataa 
acatgccaca 
aacggaatat 
tgaaagaggc 
agcaatatca 
tgttgattgc 
aagaaattgc 
ccgggaatat 
ttgtaagtaa 
ttcttgtcgt 
ggaatgtggc 



ttcatcattg 
tgggcatatc 
gttcgtgaca 
attaaaggct 
atttctggtc 
tttccatcgt 
tgcgggaatt 
gatgacccgt 
agaaattgat 
tgcaagtacg 
ggaatttatg 
gattataaaa 
tcgtcaaatt 
ctttggtacg 
ggctcctgta 
tattttgaaa 
ttttccgaca 
agaggtccgc 
tttgatggga 
cattcgtaaa 
tttctggacg 
taaatggctg 
gatcataatt 
taatgagaat 



gctcaacgag 
tatcgtatgt 
gtcctgttat 
ttatttacat 
atcggactat 
tttactgcac 
gcatttattt 
gacatgggat 
aaaatggacg 
attatttatg 
ggacgtatcg 
ggcaggaata 
ggttggacgg 
attgtcggag 
gcactgatga 
gaaccttttg 
gtagatattg 
cgtttccgta 
ctgttcggct 
gtgaacggtg 
gcactaagtg 
gaacagtttt 
atcttgttgt 
ccggtgaaca 



ctaagtctgt 
ttctgtatga 
tcacttttaa 
ggcaaactct 
ttccgggcaa 
atcgttttgt 
taggcctgtt 
ataaggtcga 
gtattctccg 
gtggctacat 
aatttgttga 
tccagcaaga 
atagtcccat 
ttgtgaaaga 
gtaatttgga 
gtgagaacct 
tattccgttc 
atgtcgtgat 
ttgtaaatga 
ccgaagtgcc 
ctgttctggt 
ctgaccgggt 
tgatcatagg 
gtatcaagaa 



tggcatacat 
atcagcacta 
gttggagata 
gtgggtaccg 
gttgtttgcc 
ttggaagcga 
gatggttatt 
taatttggca 
aggactgccc 
gggtcaacct 
tgaacattat 
taaagagatc 
cggtaagaat 
ctatgttgta 
atggatgaat 
ggctaagatt 
tgcccgtcag 
tatagcttct 
tgaaattcaa 
ggatattctt 
cggaatagta 
atcagtcaat 
aagtgtcatc 
cgaatag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1437 



<210> 341 
<211> 288 
<212> DNA 
<213> B.fragilis 



<400> 341 

agctttattg 

aat cttagct 

gtagaatcag 

gtcgaaatgc 

tatgccggtc 



ttattttatt aattttttat ttttgtatat ttatgaatat gtacgttgga 60 

ataatgttaa ggagtcagat ttgagacaag ttatggaaga gtatggagta 12 0 

taaaactgat cacagaccgc gaaacaagaa gatctaaagg gtttgcgttt 18 0 

cggaatcttc agaagcaagc aatgccatta aagaattgaa cggagcagaa 240 

gtccgatggt agtaaaagaa gctttgccaa gaaattga 288 



<210> 342 
<211> 921 
<212> DNA 
<213> B.fragilis 



<400> 342 

ccttttaatg 

atacctttgc 

gacacaagaa 

tct atgctta 

gtaattcttg 



tggatccgct 
gcccgatttt 
aagtgacgac 
catcgtatga 
taggcgattc 



aatttattca 
taatttaaaa 
tcatcgccta 
ctacacaatg 
cgcatcgaat 



cttttacttg 
agattggata 
atcgaaatga 
gcacagattg 
gtgatggcag 



tatggtatga 
tggctggtta 
agcaaagagg 
tcgacggtgc 
gtaatgtgac 



attaaaagtt 
tatatcagat 
cgaaaaaata 
cggtatcgat 
tacacttcct 



60 

120 

180 

240 

300 
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attaccctgg atcagatgat ctatcatgga aaatcggttg tacgtggtgt aaagcgtgca 3 60 

atggtagtag tggatatgcc ttttggctct tatcagggta atgaaatgga agggcttgct 42 0 

tcagctatcc gcataatgaa ggagagtcat gccgatgcat tgaaactgga aggtggtgaa 480 

gagattatag atactgtgaa acgtattctg agtgccggta tccccgtgat gggacatctt 540 

ggattgatgc cacaatctat caataaatat ggtacataca cggttcgtgc caaggatgat 600 

gccgaagcag agaaattgat tcgtgatgca catttactag aggaggccgg atgtttcgga 6 60 

cttgttcttg agaagattcc tgcagcattg gcatcacgtg tcgcaagcga actgaccatt 72 0 

ccggtgatcg gtatcggtgc cggtggagat gtagacggac aggtattggt aattcaggat 780 

atgttgggta tgaataacgg tttccgcccg cgcttcctcc gtcgttatgc cgatctttat 840 

acggtaatga ccgatgctat cagtcactat gtttcagatg taaagaactg cgacttcccg 9 00 

aacgagaaag aacaatatta a 921 

<210> 343 
<211> 1332 
<212> DNA 
<213> B.fragilis 

<400> 343 

gatgaaaaag tttggtttat gaaaatccat ttaaagttac tcacagagcg ctattggttt 60 

cgtctcggac taagcctctg tttcgccata actgcggctc tgtcttatgc cgacagagac 12 0 

ttcatttgga tgggattgag cctctgtttg ctactattca gcatttggtg gcaactttca 180 

ctttaccgta ttcataccaa acgagttctt ttcatgattg acgccctcga gaacaatgac 240 

agcgccattc acttcccgga agagcagata atgcctgaga cccgagaggt caaccgtgca 3 00 

ctcaaccggg tcggacgcat attatataat gtaaagtcgg aaacggtaca gcaagaaaag 3 60 

tattacgaac tgataatgga ctgtataaac accggtgtac tcgttctcaa tgaaaatgga 42 0 

gcggtttatc aaaaaaataa tgaagcgctt cgcctgctcg gattaaatgt gtttacccat 480 

atccgccaac tgaacaaagt ggatatacag ctgatgaaga aaatagaatt ctgccgtccg 540 

ggagataaaa tacaaactat tttcaacaat gaacggggta caatcaattt atccattcgt 600 

gtatcaggca tcactgttcg tgaagaacaa ttgcgcattc tcgcttttaa cgacatcaac 660 

agtgaattgg atgaaaaaga aatcgattcg tggatacgac tgacacgtgt attgactcat 720 

gaaatcatga attcggttac tcccatcacc tctcttagcg aaacactact atcgttggcc 780 

gatacccggg atgaagaaat acgccggggc ttacaaacaa tcagtactac gggaaaaggc 840 

ctgctctctt tcgtggaatc ctaccgccgt tttacccgta tcccgacccc ggaaccatcc 900 

ttattttatg taaaagcttt tattgaccga atggtagagc tggcacgcca tcaaaacaaa 960 

tgtgacaaca taacattcca tatagatatt gctcctgctg atctgattgt gtatgccgac 102 0 

gaaaatctga tttcgcaagt agtaattaat ctattgaaga atgccataca agctatcgat 1080 

gcacaggccg atggaaagat tgaaataaaa ggacgatgta atgctgctga agaaatattg 1140 

attgaaataa aaaataatgg ccctgccatt ccttcagata tagcagatca tatattcatt 12 00 

ccttttttta ccaccaaaga aggaggtagt ggtatcggat tgagcatttc acgtcagatc 12 60 

atgcgtctgt caggtggaag catcactctg ctgcaaggca aagaaactaa atttattctg 132 0 

aaatttaaat aa 1332 

<210> 344 
<211> 723 
<212> DNA 
<213> B.fragilis 

<400> 344 

aacctgatgg ttatgattat gaagtggttg aattttaatt ccattattgg catggcagta 60 

ctatcgctgc tgttttacac agaaaacgtc gctgcacaaa ccgacaaaaa cgataccaaa 12 0 

caaaagatag ataccatcca gacaacacag ccggaataca gcaaatatga caaacgtatt 180 

caccgttttc gtaaaggatg gaattcactt atacctacac acaacaaaat acaatatgcg 240 

ggtaacatgg gaatgttctc gttcggaacc ggttgggatt acggaaaaag agatcagtgg 3 00 

gaaacggatc tgttcttcgg cttcataccc aaacatgact cccatcgggc taagatgacc 3 60 

atgaccttaa aacaaaatta catgccttgg agcctggagc ttgggaaagg attttcaacc 42 0 

gaacctttgg catgtggtat ctattttaac actgttttcg gacacgaatt ctgggtacac 480 

gagcctagcc gttatccgga aggatactac ggattctcgt ccaagatacg cacacacatc 540 

tttctgggac aacggctgac atacgatata gatagagaac gccggttctt tgcaaaatct 600 

gtgactctct tttatgagct gagtacctgt gacctattat tgatcagccg cgtaaccaac 660 
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agttacctgc gggctcggga ttatctgagt ttatccttcg gacttaaatt ccaatggctt 72 0 
tag 723 

<210> 345 
<211> 255 
<212> DNA 
<213> B.fragilis 

<400> 345 

cttgatatat tcaacagcca gggtgaaggt ttttccggac ccggcggagg ctttatagac 60 

aatgagttcg ctcattcaca gtttgtattg gataatgatt attttctttt tactatttcg 120 

aatggtacac gtattgaaac gccttttttc attaagttat tcagattggg atataaacgt 180 

tctttgaaag aatcccaatt gcgatgttcc atttgggtcc atcctgcttc tgccagtgcc 240 

agaccgcggg gatag 2 55 

<210> 346 
<211> 1269 
<212> DNA 
<213> B.fragilis 

<400> 346 

ttgtcttttt ctctctttta tcgtaatcat ttagtagctc gttcgggatt tattggaaga 60 

aaagtagtat ttttgcgtca ttatatctta aaatattcaa tcatgaagaa aatactttta 12 0 

ctcggatcgg gcgaattggg caaggaattt gtaatttctg ctcaacgtaa aggtcaacac 18 0 

atcattgctt gtgattcata tgccggggca cctgccatgc aggttgctga tgaatgcgaa 240 

gtattcgata tgctgaacgg tgaagaactg gagcgtattg taaaaaagca tcggccggac 3 00 

attatcgtcc ccgagattga agccattcgt acggaacgtt tatacgattt cgaaaaagaa 3 60 

gggattcagg tagtgccgag tgcacgtgcc gttaattaca caatgaaccg aaaggctatc 42 0 

cgtgatttgg ccgctaagga actgggactg aaaactgcga aatactatta tgccaagtca 480 

ttggaagaac tgaaggaagc cgctgagaaa atcggtttcc cttgtgtcgt gaagccttta 540 

atgtcatcat cgggtaaagg gcagtcattg gtcaagagcg ctgccgagtt ggaacatgct 60 0 

tgggaatatg ggtgtaatgg cagccgtgga gatattcgtg agctaatcat tgaggaattt 660 

atcaaattcg atagtgagat aactttgctt acagtgacac agaagaatgg tccgactctg 72 0 

ttttgtccgc ctatcgggca tgtacaaaag ggtggggatt atcgggaaag tttccaacct 780 

gcacacattg atcctgcaca cttgaaggaa gcagaagata tggctgaaaa agtaactcgt 840 

gcattgaccg gtgcaggact gtggggagta gaatttttcc tgagccatga aaacggggtt 9 00 

tacttttcgg aactgtctcc acgtccacat gatacgggaa tggtgacatt ggccggaaca 9 60 

caaaatctga atgaatttga acttcaccta cgtgccgtat tggggttgcc cattccggga 1020 

ataaaacaag aaagaatagg agcgagtgcc gttattctgt cgccgattgc cagtcaggaa 1080 

cgtccgcagt atagaggtat ggaggaagtt accggagaag aggatactta tctgcgtata 1140 

tttggtaagc cgtatacacg tgtgaatcgg cgtatgggag tagtgctttg ctatgctcca 12 00 

aacggttcgg atctggatgc tttgcgtgat aaggcaaagc ggatagccga taaagtagaa 12 60 

gtatattaa , 12 69 

<210> 347 
<211> 645 
<212> DNA 
<213> B.fragilis 

<220> 

<221> unsure 
<222> (97) 

<223> Identity of nucleotide sequences at the above locations are unknown. 
<400> 347 

aataagagat ttatgagatc attaattgga aaacaagcgc cgaaattcga tgccacggct 60 
gtaatcaatg ggcatgaaat cgttcagaac tttcgtntgg atcagtataa agggaaaaaa 12 0 
tacgtagtat tcttcttcta cccgatggac ttcacttttg tatgtcccac cgaactgcac 180 
gcatttcagg aaaagctcga agagtttgaa aaacgtgatg tcgctgtggt aggttgttcg 240 



143 



gtcgactctg aatattctca cttctcttgg ttgcagatgc ccaagaacga aggaggtatc 300 

cagggcgtga agtatcctat tgtatccgac ttttctaagt caatctctga gagttatgga 3 60 

gtgctggccg gaagctatgc ccccgatgaa aatggcaatt gggtatgcga agggacaccg 42 0 

gtagctttcc gtggtctgtt cctgatcgac aaggagggcg tagtaagaca ttgcgtcatc 480 

aacgacttgc cgctgggacg taacgtggat gaagtattgc gcatggtaga cgctttacaa 540 

cattttgaag agtatggtga ggtttgtccg gccaattggt cgaaaggcaa agacgccatg 60 0 

aaagctaccg aagacggagt agccaactat ctgagtaagc attaa 645 

<210> 348 
<211> 234 
<212> DNA 
<213> B.fragilis 

<400> 348 

gatcatttgt tttgtgtgat tatttgcaac ttgatactta aaaacaacag tttattcttt 60 

atacggaata aaagacatta ttacgtatta aaaaagtctc ttaaatattt gccatttaaa 12 0 

aataaaggtt tacctttgca cccgaaatca agaacgcttg atggcaatgc actcttagct 180 

cagctggtag agcaattgac tcttaatcaa tgggtccagg gttcgagtcc ctga 234 

<210> 349 
<211> 900 
<212> DNA 
<213> B.fragilis 

<400> 349 

actttaactc caaagaaaag cgaatattca cctctatatc ataacaatat gcacaccatt 60 

cagataaatg atgattgtta ccgagttccg gaaagttggg atgaactcac cgaaaagcaa 120 

ttgagctacc tggttaatct tacacaaagc gatattccca tcgaagaact gaaggtacac 18 0 

atgatgctat attgtctcaa tgcacatgtt tgccggtatc gggatatcta tcgccatcaa 240 

gtaaagatca gcattgggac tcccggcaat aaaatccctt tccggacaca caagaagaaa 3 00 

tatttgcttc ttcctgaaga agtcaatcgg ctggccaaac tcttcaactt tctgttgatg 3 60 

tgcgaaaagg ataccgaaat gaaataccat gtacacccgg aactcaccgt caatccctat 42 0 

cgggcattct tttgccggtt ccgtaaattc cgtggtccgg aagatggcct gctcgatatt 480 

cgcttcgaac agttcatgca cctgcaacac tatcttgacg ccataaatca ggacccggaa 540 

caaattaacc atgctctggc ctgtttatgg cacacaagca aaacattcaa tatcaatcgt 600 

ctggagaaag atgcttccat tctcagccat cttccccaca gagtgaagat gattatgtac 66 0 

tggtacatta tagggagcct ggcctatctt gccaatggct ttccccgtat cttttccgga 72 0 

aatggaaaga gtaatggtcg tgtctttgat tcgcaaatgc gtcttttaga ctccctcgca 780 

cagtcagaca tgaccaaaaa gcctgaaata aaaaaaggat tcctgatcga tgccctgtat 840 

acaatggatg aatctctgag aaaacaacaa gagctgaatg aaaatatgca gaacaaataa 900 

<210> 350 
<211> 498 
<212> DNA 
<213> B.fragilis 

<400> 350 

cccctgactt ttatggagat atacaaccac tttgaatatg gcaaaacact tgccatccgc 60 

ttaaagccta ttgcccacac acccgaaaag cccagattct tcaccgcttt cggacttgag 12 0 

gacttatata attttaatga taaactatca tccgtatccg gcatgatcct gattgcagtt 180 

gatggttgtg aatctgaatc aaaacgaaac gaatccgatg cgcttaataa caatgatata 240 

ttctctttca ttgttgtaca gaacactgtt tctgatcgtc cggaaacagt caaccaggca 3 00 

gcaaaagaat gcaaagctat cgcaaaacaa attcggaacc atatcctgca agaccccgac 3 60 

atttcagaat tcattgacga taccattcaa tttaatggta ttgggccgat tggtgataat 420 

ttctatggcg tagtactgac attttctttg gttcaacctg aaacctattt cattgatcaa 480 

acatactggg aggattaa 498 



<210> 351 
<211> 204 
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<212> DNA 

<213> B.fragilis 

<400> 351 

atctatcccc caacttccaa cgtatactta aagatgcata tgtcggacaa atacgaaatg 60 

ctgtcgctca tacacaatac cattgtattc aaggaggaat cttatatgac aactactcac 12 0 

catcaagtaa atattctatc ctgcaaggtc tttcttatga agaatgggag aagaaatatg 18 0 
tctactcttt tttcatattt atag 204 

<210> 352 
<211> 714 
<212> DNA 
<213> B.fragilis 



<400> 352 

atgagccaac 

aaagtaggca 

attcttgata 

gcttccgtta 

acccgttcac 

aataagaata 

tccctgaagt 

gaagagttgc 

ataccgatgg 

tcccgccagc 

tttacattta 

cgt tacagtg 



ctttttttca 
cagacggtgt 
ttggtaccgg 
ttgctcttga 
cttggggaag 
atagcctgaa 
gtccggacag 
tgaagggagt 
atgcaagtga 
tcttggtcat 
taaaacaaga 
atgaatacat 



gtttaaacag 
cttgctcgga 
taccggattg 
aatagacgga 
caggatagaa 
atatgataca 
ccagcgcaac 
atcgaattta 
ttcttttaaa 
cactaaaccg 
ctgcaaagaa 
taaattaacc 



tttactgtct 
gcttggaccc 
gtggcactta 
acagccgcac 
gtcgtttgcc 
atcgtatcaa 
acagcccgac 
ctttcgccaa 
gacatcgcat 
ggagcacccc 
gagaaattat 
cgggagtttt 



ggcacgataa 
cggtagagtc 
tgctggccca 
aacaggctgc 
aggatttcag 
atcctccata 
ataacgataa 
atggtacttt 
cttcacaagg 
caaaacgtac 
taacagaagt 
atttgaaaat 



atgtgccatg 
ctcggcacgt 
acgctgttcc 
agagaatatc 
gttatacagc 
tttcacagac 
cctgtcttat 
tacagtagtc 
cctgtatcca 
cttgatctca 
ttctcgccac 
gtaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

714 



<210> 353 
<211> 480 
<212> DNA 
<213> B.fragilis 



<400> 353 

gaaccggagc 

aactcgtttt 

aacatgggac 

tctattgcat 

accgaatgga 

cacaaaggaa 

gacggccaga 

aagcgggatg 



ccaacctatc 
ctaacttaaa 
gtgatgccga 
gtacagagcg 
tacccgtcgt 
gcaaactgta 
aacgaaccgt 
ctccctcact 



ccgaaagaag 
cataaaaaca 
ggtccgtacc 
tgcttataca 
agcctggagg 
tattgaaggc 
ttctgaaatc 
ccctccggaa 



ataatagccg 
atgagtgtaa 
actgaaaccg 
aacaaagccg 
agattggcgg 
agattcacaa 
gtagccgaaa 
cccgagcaga 



gttaccacaa 
acaaatgtat 
gcatcaaagt 
gtcaaacgat 
aaaccattga 
cccggaagta 
gtattgaaat 
aattgagtta 



ccgacaggta 
ttttatcggc 
agcccaattt 
tccggagaga 
gaagtacacc 
tgaaacaaat 
gctcgatccc 
taatccataa 



60 

120 

180 

240 

300 

360 

420 

480 



<210> 354 
<211> 609 
<212> DNA 
<213> B.fragilis 



<400> 354 

agcagaaagt 

aatttggatt 

ccgccattaa 

agagtagtct 

gtacattctt 

cggagtgcaa 

agaaatggcc 

tggcacactc 

catctaatca 

ttattcaatg 



ttatgtacca 
atcacacaga 
atctatcgga 
atagtcgatg 
tacagattgt 
taaatgaact 
tgttgacaga 
ccaaacaccc 
gagaaaaaga 
caatgattga 



atttattgaa 
gcgaatgaac 
aagcctgcaa 
gatagaagag 
tcattcggat 
ttatatgcat 
tacatctata 
tctccttaaa 
aataaccgta 
ttttggtaag 



acgatacgta 
cagacgcgtg 
ccaataatga 
atcctataca 
aatatcgatt 
aaacgggaac 
gctaatattg 
ggtgtccaaa 
gaccaattgt 
ataaagattg 



tagagagagg 
ctgttttttg 
atgtagaaat 
caccttatca 
acacttacaa 
aagatgaaat 
ctctctttaa 
gagcggcatt 
ttaactattc 
atgtcaatag 



agtagtgtat 
gccagacgaa 
gataaagtgt 
aatacgccca 
aagtacagat 
attaattact 
cggaaaagag 
gattgacaaa 
ccagatttgt 
agagctaata 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 
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cggatataa 



609 



<210> 355 
<211> 1260 
<212> DNA 
<213> B.fragilis 



<400> 355 

acgatgaagt 

cgttttgagg 

gacttaatcg 

ttgaataaat 

cctacccgtg 

cctgtttcaa 

aaaggcttga 

ctgagtctgg 

cgtatgcttg 

gaacgccaga 

actatcctga 

gttcaggctg 

gcggaagagg 

gtggcaaagg 

caggtacagc 

gctacagaca 

gatgttccac 

aatgatggag 

gagaatttcc 

cctcaatata 

ggaaatagaa 



tttccgaatt 
aatgtactcc 
ctgtagcgca 
tgtcggaagg 
aactggccca 
gtgtggctgt 
tgcttggtgc 
gatacgttga 
atatggggtt 
cgattatgtt 
ataaccctgc 
cttatgtctg 
tgcccgaaag 
cgctgaaaat 
gtgaatttat 
ttgtatccag 
atgacagtga 
ttgccctcac 
tcgaaaaaga 
atcctcgttc 
aaaataataa 



acaattaaat 
tatacaagaa 
gacaggaacc 
tggacatccg 
acaaatagat 
ttacggagga 
cgatgtggtt 
tttatcccgg 
ctacgaggat 
ttccgctacc 
tgaagtcaaa 
ttacgaaaat 
agttatcatt 
gatgaagttg 
tatgcatgaa 
agggatagat 
agattatgtt 
gtttgtgaat 
aatttataag 
gtatactaac 
cggtggacgt 



gacaatgtac 
caagcgatcc 
ggtaagacgg 
gaagatgcca 
caacaaatgg 
aatgacggaa 
atagctacac 
gtttcttatt 
attatgcaga 
atgccagcta 
ctggccgtat 
cagaaattgg 
tttgcatctt 
aatgttggag 
ttcaaatcgg 
attgatgata 
caccgcatcg 
gagaaagaac 
ataccggtac 
gcaggaagag 
tctacggcac 



ttgaagcact 
cagtaatact 
ctgccttttt 
tcaactgtgt 
agggcttctc 
tactttttga 
cgggacgcct 
ttattcttga 
tcgtaaaata 
agattcagca 
cgaaacctgc 
gtattgtaag 
caaagataaa 
aaatgcattc 
gacgtatcaa 
tccggttggt 
gacgtactgc 
aaactaattt 
cggctgaatt 
gaggaagaaa 
ccagatcggg 



cgacgctatg 
cgaaggtaga 
gttgcctata 
gattatgtca 
ttattttatg 
gcagcagaaa 
gattgcacat 
tgaagcagac 
tttgccaaaa 
attggccaat 
tgaaaaaatt 
aagcttgttt 
agtaaaagaa 
tgatcttgag 
tattttggtc 
gattaatttc 
acgtgcgaac 
taagaatatc 
gggggaagct 
cttccggaac 
cagaagataa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 



<210> 356 
<211> 471 
<212> DNA 
<213> B.fragilis 



<400> 356 

atgataacta 

tgtaacctga 

ttcgaccttc 

tgtctgcctg 

tcccaggtaa 

ctggacgaac 

cgcagatatg 

agggcaaaag 



ccaaaatcga 
cctctgatcc 
tccagaagag 
aacgaagtat 
tcatttcccg 
agaagcaccg 
gaattgattc 
tgaggagaaa 



agttcccccg 
ggtccgtttc 
accgtcggaa 
aggcaaatcc 
gaaaatagaa 
gtacggaatc 
tcttacggaa 
agaaaaaagg 



catctatgtg 
cccgataacc 
gctccggttg 
ccagtgacct 
ttgatgatgt 
aaatacattg 
gaagcttttc 
agctataaaa 



agtatatccg 
tgaatatcta 
atcgtggtaa 
acaactattt 
gggcggagtt 
atggagtgca 
tcaaacacta 
agcgagaata 



cggcaaatac 
tcacgtgata 
tttagaaatc 
agggcttcgc 
gcatgaatac 
attcttcatg 
ccagcgttgg 



60 

120 

180 

240 

300 

360 

420 

471 



<210> 357 
<211> 312 
<212> DNA 
<213> B.fragilis 



<400> 357 

gaggatgata 

aatagcggtc 

cttattcagg 

ctatcggtga 

tcatggcatt 

aatctgtcat 



ctggcgataa 
tcgacagtga 
taataaaatc 
tgtgccttcg 
ctcctttctt 
ag 



tggatatact 
caaaacctat 
cgaattaact 
gataatagga 
ttctttctca 



tttaccagtt 
aataatacaa 
cacgaagctg 
tatcgacgag 
atttggttaa 



gcattcctac 
aagtctatca 
cattgctgca 
ggttaatgct 
taagattacg 



ttcttcttac 
tagttcttct 
cgttcttttt 
ttcttcttat 
tatagcgatt 



60 

120 

180 

240 

300 

312 



<210> 358 



146 



<211> 312 
<212> DNA 
<213> B.fragilis 



<400> 358 

gcaggatgta catcactctt ggctaataaa acgaaagccc agatccaggc aatcaacgat 60 

atagtgatag ctattgtttt taatatacgc tcttggccct tggtaaacgc tccttcagga 120 

ataccgttac caatcatttt tgagttggca ggaatcaatg aaaaacgggt agaagatgtt 180 

gctgcggttg ctatacaagt agttatcagt cccactccgg caactacatg tccagctacg 240 

aaagaggagg tagaggtaga tcgggtaaac atgcaaattc cgccaataat tgtgacaaca 3 00 

gcagccagat aa 312 



<210> 359 
<211> 1143 
<212> DNA 
<213> B.fragilis 



<400> 359 

ttacaatgca 

aatcccatca 

gaccgcacca 

ctttcaggta 

gattctactt 

actctttctc 

gatgaaaata 

accacccgta 

ttcctttatc 

ggaacagccg 

actaatcaaa 

attgtcatca 

tatggagcct 

tccgactcca 

cgacagtctg 

gtgcatttaa 

aacatcaggg 

gaaagcatta 

tcagaggacg 

tga 



ctatgagttt 
agctgaccat 
tcttttccgg 
ttctcagtcc 
cagctacaga 
tgaaagcagt 
gcaatatatt 
ccaacgggcg 
cggatggtgc 
gacaaccggt 
agttagcttc 
ctcccggaac 
atgaacgcat 
cttatcagat 
cccgtgacaa 
tggatatgct 
taaatgccgt 
aaatgactct 
aaataggaaa 



aacagcaaac 
aaactccagt 
aagtggtgaa 
caaacatctg 
tattgccatt 
tataggaggc 
cacttggaag 
tatcatcacc 
attaaaagtc 
agcccttaac 
tgttttcgat 
agtatcccgt 
tgaagtcacc 
ttacgatgaa 
gcttctggtc 
tgcttccgat 
agctgacaac 
tcatttcgtt 
tccccgtata 



atatatccgt 
tcagtagtca 
ggtgagttct 
ctcaatgaat 
agtgtccaaa 
atcagcaagc 
ctgttcaatt 
atccgcgaaa 
gttgcagccg 
ctatatcggc 
atctattccg 
gagcgttatt 
ggtatcggta 
agcattgatg 
gaatcaggat 
gacataaaga 
ctcacccatg 
gactccgatg 
cataccgaac 



ctacaatcgc 


tttagccgga 


60 


gctacactat 


tcgtcaggcc 


120 


ctgtttttct 


tcaggatatc 


180 


ccactgatat 


attactgctc 


240 


acacccaggg 


agagactaaa 


300 


ggctattacg 


gcgtctgtta 


360 


catcggtcaa 


tttcttcaag 


420 


ctgaactcct 


acctattcct 


480 


gcattgaaac 


ctctttatcc 


540 


tccggaaaaa 


actgtttcaa 


600 


gatcaaccaa 


aagttgtact 


660 


tacttgaatt 


tctcaactcc 


720 


acatcgagtc 


tgaaattgag 


780 


attatatcga 


ggcccgcgag 


840 


atcgcaatac 


cgaagaactt 


900 


tactcggact 


ttccggacga 


960 


ccatacgctc 


cactgtacca 


1020 


ttcgctacac 


cggatcactt 


1080 


agttcacacc 


tcaatttaat 


1140 
1143 



<210> 360 
<211> 969 
<212> DNA 
<213> B.fragilis 



<400> 360 

aaatacaggt 

acactcaaga 

attgatgatg 

gcactgaccg 

accctaatgc 

gaaaacaaac 

agcttctatt 

ccggaaactt 

atccgtgatg 

tcgttccgta 

ctcaaagaag 

ccaaaagaaa 

ctctatacat 

cccattatcc 



ttaacctcag 
aagtagtcaa 
ctatggatat 
gaactgataa 
ttgccactcc 
aaggtaccta 
ttcgtggcat 
accccgaata 
ccagagaatt 
tgatgctacc 
acctatacca 
aggtactgct 
cacagacctc 
ggcccatcta 



aaacagaaga 
aatcaatgct 
ctatctgacg 
aaggctgaat 
ggaacttggc 
ctcgccggcc 
gcaggccctt 
cgccgagcac 
tcaagatacc 
tactgtccgg 
acgtctgctt 
ggggcacata 
acgtgaacag 
ccaggatcag 



aaaatgaatg 
acactgcctg 
ccatacatcg 
gataaaattc 
atacgtatcg 
aatgaagcaa 
gatcggctgc 
tgcaaacaag 
ggtttagtca 
cagttgcaag 
gatgcccata 
ctccgttacc 
cgtaccatca 
gcagcaaccg 



ctatcatccc 


tgacatcgac 


60 


acgaagccat 


caatccgtat 


120 


gtattaaaac 


cgtagaaaag 


180 


tccgcaccct 


ggggcctctc 


240 


gagacagtgg 


aattacggtc 


300 


aaattgccgc 


cgctaaagaa 


360 


tcacttttct 


gaccgatcat 


420 


ccacagattc 


ttcatgcttc 


480 


atatcgagta 


ttctaccgta 


540 


aacgcaatgt 


gcgtgaaatg 


600 


ccgcagggaa 


agaactgaca 


660 


tcgctaacaa 


aaccgctgaa 


720 


acgacacacc 


ggagtttact 


780 


gtaatttctt 


cgctgatcaa 


840 
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gcgacctact acgccggaaa gatacaaaac ttcatttccg aaaatgctga ggagttagga 900 
gtcacaccaa ccgttaccgc tataaacttt aactccaaag aaaagcgaat attcacctct 960 
atatcataa 969 

<210> 361 
<211> 789 
<212> DNA 
<213> B.fragilis 



<400> 361 

aacatgataa 

acaggagcta 

atagcaggca 

cccccactcc 

gggttattga 

tccactgtct 

ccggctttgc 

gtattgggat 

ctggtgacag 

ctcgggtctt 

acctgggcgg 

aagtttcgca 

tgctggtatg 

aataattaa 



cctgggataa 
tatttgcact 
tcacctgctt 
gtactatggg 
cttatattcg 
ttgtaataat 
aaagtatttg 
gtgctttcat 
cagacaatct 
tgtgggctaa 
tagtaacctg 
aaaagatgct 
gagtgaatta 



cttctatttg 
tcgttcgtca 
gggaatattc 
cgagacccgt 
ttggaaatat 
caatctgctg 
gtttattccc 
catagccctc 
ggtctattcc 
ggaggcatgg 
gatgggatac 
gtacgtcata 
tttgccttcg 



ttcgcagtcg 
gtacgaagta 
attgccgggc 
ttatggtact 
cgatggatac 
aaaccggaga 
catgtcaccg 
tgcggtcttg 
ggagtggcgt 
ggtaattatt 
ctattatata 
ctgattttct 
gctcaacaaa 



cttcaatctg 
gaatggccgt 
tgtggatatc 
ctttctttat 
tttctttctc 
tacatgacca 
tctatatgtt 
tccatcacaa 
ttttgtcaat 
ggagttggga 
tacatctgcg 
cctttctggc 
gtgtacattt 



tctttggttg 
tgtgctcact 
tctgcaacgt 
gggtattgcc 
caccttattg 
gtcgcttatg 
ctcctattcc 
agaagagtac 
cggaatgctt 
ccccaaagaa 
cttgcgaagg 
attgcagatg 
atacaatcgt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

789 



<210> 362 
<211> 3750 
<212> DNA 
<213> B.fragilis 



<400> 362 

ataaacgaaa 

gatggctcag 

gataaactca 

aaaaaagaac 

acagaacggg 

tcccgcgttc 

cttgagcaga 

gaggtaggtg 

ggtctgatcg 

cgagaacagc 

tccaaggacg 

gaatccggca 

ggctctgcca 

acccttatat 

ctttctctca 

gccgccggct 

tccggggtag 

acactggccg 

cttactctac 

ttggataacc 

gaaggataca 

gaggcagtca 

gcagctaaac 

aagctcaatc 

attaaactat 

cttatagctt 

tggaataata 

tatggtataa 



cagtctctaa 
aagccactaa 
ggtcacttac 
tggatgcaaa 
ttctcaaaag 
ggaaagagct 
atcggcgtgt 
cacaaggtaa 
gtactgtcat 
gaaacaaacg 
atataaactg 
ttcgcattag 
agcccgaatt 
tagcttctgc 
atcaatacgg 
ctaaatatgg 
ctgctgcctc 
aaaaaggtat 
aaaccggagc 
ttcagaaaaa 
atgtggcctc 
ccggtacgtc 
tatcgcaagc 
ctgccctcat 
taaacttcat 
acacagctgc 
atattgcaaa 
tagccgtagt 



tatggcaaac 
taaaatagac 
cggaaaagaa 
aaaccgaact 
cctctccgga 
tcgtaatgca 
cactgaagcc 
tatctggtca 
agcagctatc 
cgaggaagcc 
gttggaacag 
acagtccgca 
tcttgataac 
atcaggtatg 
tgatggtgcc 
agcagcagcc 
tgccgaaatt 
caaagacgaa 
agatgataca 
gcaactctca 
cgtacttatc 
cgtagccatg 
caaaaaccgc 
atcagcagca 
caatgaaaac 
taagaactct 
gtctttaaaa 
cgcggccaca 



gacctaaacc 
ctggtaaaag 
gtagattatg 
cttcagaatt 
gcaacttaca 
gtgcccggaa 
ctttccagag 
cgtgcctccg 
accggagttt 
aaggccgatg 
caagctgtcc 
acggaaattc 
aaagaagctt 
accctgaagg 
gaccaagctt 
gtggagtccg 
cctatcgaac 
atagccggta 
aatcccaaaa 
gcagcccaga 
aatgaagccg 
gaacaggccg 
atgcaggaac 
aatggtgctg 
aaaagggcaa 
gatgtaataa 
gccattaaga 
gctatagcct 



gcagtattaa 
aaagtatttc 
caaaacgctc 
acgagaaaca 
acgaactcct 
cgaaacaata 
cacaggccaa 
gattcattaa 
caatgaaact 
ttgaagctct 
agttgtcaac 
ttgatgccta 
tggccgaggt 
atgcagtcga 
cacgctatgc 
tcaccacagc 
agcttgtagg 
ccggtttaaa 
tcgtcggttt 
ttaagaagca 
ataaggtaaa 
ccacaaaatc 
ttggtattga 
tcagttggac 
ttacgttatt 
tcagtaaagt 
aagagctgat 
acctcataaa 



actttatatt 
tcgtcttgaa 
ccaggatctc 
gttagccgaa 
tgctgtccag 
tactgctgct 
catgcgcgtc 
caaatatatt 
caaccaactc 
taccggactt 
gacaatgacc 
caaattggta 
gactaaacag 
tgccgtaacc 
aaacgtcatg 
cgtcaccaaa 
tactattgaa 
gaaattcttc 
agagaaagct 
atttggagaa 
atactacact 
agaaacagcg 
attattagaa 
tggaaaactc 
gaccattgcc 
cgttacattt 
gacaaacccc 
cttaaaaaag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 
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aaaaacgatg 
aaatcattta 
ggaattgcgc 
tacaatgccc 
gattatctgg 
aatctttatg 
tggaagattc 
atttccagac 
cgaaaaaatt 
tcagctttag 
acaactccca 
gaagctaagc 
gacgaaacct 
catcagcaac 
aatcgaatca 
gaagaaaaga 
aaggatgaaa 
atgcacctgg 
gaacaacaac 
aaacttgaag 
ctcactgaac 
caaatgatat 
ctattcgatg 
gtcggagctg 
tttggagcaa 
gccgctgcaa 
atcgataaca 
ggcagatacg 
ggt gattcac 
ctgatcatca 
gtgcaggcca 
gatccgatcc 
gaagcaaact 
ctcaaagcat 
aaggaaccat 



aattgaaaga 
ttcaacaaga 
ttggtgttcg 
aactaaccga 
tacaacttga 
cccagaaacg 
gccaaaccaa 
tttttggtac 
tatcctcaat 
ccatagagga 
taattgatga 
tctactccca 
tgcaaaccga 
gtatcattaa 
acgatatcaa 
cactttatga 
atctgaaaac 
aacgtgttct 
tactcgactt 
atgccgctca 
aggcacaaca 
caggtcaaga 
tactaagcca 
tagcccgttc 
ctggtgcagc 
aatcaacgct 
ataccgacag 
atgtcattgg 
cgaccggaat 
atgccgaaga 
ttcaggatgc 
gtaacagtat 
tggctcaact 
acgtcgtgct 
tcacccgcaa 



ttctgtatca 
atcgaagata 
tcgaaaggct 
tgaaggaaca 
aaagcaaatc 
tacactggaa 
taccttacaa 
agaaaaagaa 
ttctgagaaa 
ggtcaacaaa 
agagaaagcc 
acaccagtcg 
acagcagttt 
tatagccggt 
aattaaacag 
aaaccaacaa 
agagaaagag 
caaaattgct 
taaagtaaaa 
aaagaaaaaa 
gtaccggcaa 
aaatgccctg 
gatgattgat 
cgctgccgaa 
ccgtgctgca 
caaaggactg 
taccaaaact 
tgaagatgat 
cgtccgccgt 
tctttcccgt 
ccgcagtggc 
ttcccgtacc 
gatcaaagag 
tcgcgagctc 
aaaacaataa 



ggaataaaaa 
cgtgctttga 
ttaaatgatc 
ttaacgaaaa 
aagttaaagg 
aaagatgaag 
ggatataatc 
ggaaaagcat 
atagatgaaa 
gcgaatgaag 
aaagcccttc 
gaacttaaag 
aatgaccgga 
gcaaaaagta 
caaaaagagc 
aaggacctaa 
tacaatgaag 
aatctcgacg 
tgtcttcaag 
gatgaactgg 
tacggtgaac 
cagaactttg 
attgaaatag 
gcctatgcca 
gttctctccg 
attaaaaggg 
gcccaggtgc 
ggccggactt 
acctcattga 
cttcagcacc 
cgagttcccc 
tctcagacaa 
ttacatgcac 
aacgaagcac 



aagtaaatga 
ctgctgtcat 
taaaagaaat 
acaatacaga 
cagcacagca 
aaacccaaag 
ggaatagcct 
tagaaactct 
taaccaaaga 
aaactacaaa 
ttaaaaagaa 
aagcctatct 
tggaaaccct 
aagaaggtat 
agatgaaccg 
aacttctcta 
caatggagca 
ctgatcaacg 
atgaagaaaa 
ccaggaagga 
agatcggcga 
ctgatactat 
ccaaggccac 
tgcccgactc 
gactgatcat 
ggagttcttc 
aagtcaagca 
atcgggatgt 
tatccgaatc 
acattaatta 
agcgtgctga 
cttcttcacc 
tgattgagaa 
aagaattagc 



agagaccaat 
caatgataat 
cattccagac 
cgcaatcaaa 
agaacttgaa 
tgatcaatat 
tacagctaaa 
taatgaaacg 
gataggtgaa 
taacaaaata 
acttgaagaa 
caaacgccag 
cgaattagaa 
tgatgctcaa 
acagctcgct 
tgtttccggt 
cctcactatc 
gcgcaccatt 
agaacggaag 
gaaacaaagg 
taccctcgga 
gctcgatata 
gggggtagct 
tgttgcaacc 
gggagcattg 
cacttccgcg 
atgggcatcc 
tccctacata 
cggagcagaa 
ccccattgtc 
aggcaattac 
gactgataag 
acttaaatac 
agataaatca 



1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3750 



<210> 363 
<211> 462 
<212> DNA 
<213> B.fragilis 



<400> 363 

actatggaaa 

gtcaaaggta 

gacagtgagg 

gagtaccggg 

gagaaaatga 

agatggaaag 

tattgttatt 

gctatttttg 



cattaacggc 
cagaacagtg 
tccatgaaga 
atgcagccaa 
aaacccggtg 
ttggagagga 
gccatgagag 
attttcataa 



attacaatgg 
gactaactgc 
taaggacgct 
gaaacgggag 
gcaatggtta 
gctgaataag 
atatacccat 
gaacggaaat 



gctaaaaagg 
tattattctg 
gccaaagcca 
gagaaaagaa 
caggagggaa 
acattttgta 
gaacctaaaa 
agctgggtat 



gttttattcc 
ccaaagctgt 
tcctgtctgc 
aaaaaaatgc 
gaatacctaa 
catgtgctta 
atgatgaaga 
ga 



caatgagggg 
ctatttcaaa 
caaaaggaag 
tgcataccga 
cgataatgcc 
cggaagtaac 
gatgcaaaaa 



60 

120 

180 

240 

300 

360 

420 

462 



<210> 364 
<211> 1098 
<212> DNA 
<213> B.fragilis 



<400> 364 

cgacttatga caaagctaat cagaacattt catcctgtag gacatggagc tttttatacc 60 
gaaaagcatg ttttagaaga tcagactata aatattgtgt atgactgtgg ctctaaaact 120 
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ttggaaaaac aattaccaag tataattaat aatactttta agaaaggaga agagatcgaa 180 

tttttattta tctcccattt tgatgcagat catgtcaacg gaatagagta cttgaaaacg 240 

tactgtaaaa ttaagaaggt tgtcattcct ttaattgaag ataaagacgc tatcttaatt 3 00 

atcaaagcaa ttaatacatc taaaatagga agtaataaac tggatacttt aatagatagt 3 60 

cctgaagagt attttcccgg atctgagatt ataaaagtta aggctgtaaa cgaaggttat 420 

gatgatgatc gatattttgc taatgatttg aacaatgggg gaacaatacc aagtggaagt 480 

gaaattatat tacataaatc aagcgctgaa aataaatggt gctttatccc attcaactac 540 

aattacactg aaagagtaaa tctatttaaa gataaaatca aagaaaaagg attgattttc 60 0 

aataaattaa ataatattga ttatgttcaa atatcacaga aaaccataaa gtctatatat 660 

aaagcaataa aagggaaagc caatgggaac tctttagttg tattttctgg aggagattat 72 0 

gcaatttctg caacgcatta tttttctact gacaaaaaaa tagtattaga atatgataga 780 

tgcagctata taagctgcat ctatttagga gattcttttg ctaacaagtc ggacttttat 840 

agtcaattaa aaggaagatt ggataagttg accgaaagta ttggtataat acaaatagct 900 

catcacggag ccaagggaaa tttttctcct aatatcttaa agttaggaac taatcctttg 960 

gctataatat cctgtaaatc aacagacaag catcacccat ctgttaatgt cgtaaaacag 102 0 

atacaggaaa atggttcaat accattcata gttacagaaa aaccaactac agaagtagaa 10 80 

cagataggat actactaa 1098 

<210> 365 
<211> 351 
<212> DNA 
<213> B.fragilis 

<400> 365 

ttgttttatt ttactctatt ttctactgaa tcagttgaat ccagatctaa tctcaaactt 60 

ctgttgtttc taaatttatt tatcaatttt tatttttttt attttatgaa catgtacatt 12 0 

ggaaacctta gctatcgtgt taaggaagca gatttgagac aagtaatgga agagtacgga 18 0 

acagttgatt cagtaaaact gatcatcgat cgcgagactc gcaaatcaaa aggattcgca 240 

ttcgttgaaa tgccgaacga cgatgaagca aaaaatgtga tctctgagtt gaacggagct 3 00 

gaatacgaag gccgtcagat ggttgttaaa gaagctctgc ctcgtaacta a 3 51 

<210> 366 
<211> 1299 
<212> DNA 
<213> B.fragilis 

<400> 366 

aaaaacattg caaaggtaag ctttatcttt ataaccggac agctttcata taattttata 60 

cctttgcatt tcataacaac tatcaagagt atgcgtaaat ttattatttc tttctgctgc 12 0 

tatgtcttct ttatttttac tttggcagcg caggacaagg ctccgcatta caccgtaatt 180 

gtttcattgg atgccttccg ctgggattat ccggcaatgt atgatacccc taacctgaac 240 

cagatggccc gcgagggagt aaaggcgact atgcttccct cctatccggc gtctactttc 3 00 

cccaaccact atacattggc tacgggattg gtgcctgacc acaacggtat tatcaacaac 3 60 

actttctggg atgtaaaacg tcgtcgccaa tactctatgg gagatcccgc cacgcgaaac 42 0 

aatcccgact actatctggg tgaaccgatc tggattacgg cacaaaagca gggagtgaag 480 

acaggaaacg tttattgggt tggttcggat attgccatca aaggcggtta tcctacttac 540 

taccgggaat atgccgagaa gcctcgtctt acttttgaac agcgggtgga ttcgaccatc 6 00 

gctcttctgg aaaagccgga agcggaacgt ccccgtctcg ttatgcttta ctttgaagag 660 

ccggatggcg tgacccatca tcatggcccc cgcagtgtag aagctgctgc cattatacac 72 0 

cgtatggata gtttagtcgg aatgttgcgg cagggaattg catcgcttcc tttcggtaag 7 80 

gatgtcaacc tgattgtcac cgccgatcat ggaatgaccg agatcagtga cgaccgcgtg 840 

gtagacatga ataagtatct gcgtccggaa tggtgtgagg ctgtggatgg acggactccg 9 00 

acctctatct tcacaaaacc ggaatatcgc gactcggtat acaatgcctt gaaagatgta 9 60 

ccccatattc atgtgtggaa aaaggaggag attcctgtcg aattaaacta tggaagcagt 102 0 

gaccgtatcg gtgatattgt agtggctccc gagttaggat ggcaatttac cgatgtacca 1080 

cgtgccttga aaggtgctca tggatatttt ccgcaaagtc cggatatgca ggtaatgttt 1140 

cgtgcctgtg ggcccgactt taaggcaggg tatgaatcga agggatttgt caatgtggat 12 00 

atctacccgc tgttggccca tttattgaaa attactccgg agaagacaga tggacagttc 12 60 

gaaagaataa aagacattct gaaagatgtg tctttttga 12 99 
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<210> 367 
<211> 969 
<212> DNA 
<213> B.fragilis 



<400> 367 

tacatatatt ttatgacaag aaacgaacag ttagaaaaat ggttgtcaaa ccgtcagcgt 60 

aggtacgctg acggtatgga actctttaac gctttagcaa aggcaaacac caagagcagc 12 0 

tatgggaact atctttccca ggcaccggag aatcctcaca ttttcgatcc ccactttaca 180 

caattagtca atatactgac taaaatagcc agggaaataa aagatgctcc ttctgtttac 240 

ccggctgcat tcgaagagat cctgatcgtt caaacactga atgatgaaca acggactcaa 3 00 

gaaaccgata tccggacaga ggcaatcgac cgactccaag aggagatcga cggactgcat 3 60 

aaccgtatca gcgaacttga gagtgacacg gaaaatcatg ctgacgaact ctcagcttta 42 0 

aatgaagagt tcgaggagaa aatgaaagag ctctccgcta tccggggcga actggatgcc 480 

ttgaacactc cgggcgtcaa gatcgtaaca gaagaatccc tcactcctgc cttacgtaaa 540 

gcatacgccc gtatcaaaga gatcgctccc ctgtacgcca gtctccataa tgatattgcg 600 

aatccggata tcccggcaga ggaacgtcac cccctcgcag aagaactctg caagctggac 660 

gacgaacgtc gcaaactttg gaaacagatt gacgattacg cagaaggcaa acaggcaacc 72 0 

ttagagcttg atgctaaacg tcctgagtat agtgaaaatg cagtggtcag aggcttcgaa 780 

atagcccgtc agatcaaacg tctgaagcag aacattacga acagcaaaac agccgcagag 840 

agggccggga aagagggaaa acaggctgtt ctacagaacg cactcgaccg gattgccaaa 9 00 

tacgaaactg aattagccgc tttaacggca gaattatcgg cagaacaagg tgaaaaggtt 960 

tcaggataa 969 



<210> 368 
<211> 192 
<212> DNA 
<213> B.fragilis 



<400> 368 

tttgcgatag cacaagtcgg aacatttatc cggccgcggt acgcatcagc gtatcgcggc 60 

tttgtttttt ggggtcgttc cccctttacc cccttttgct tagaagaacg ttttgaacaa 12 0 

ctgtacctga gacgaagtaa agccggcaac aaggtgtcta tatattattt ttattttttc 180 
tttcttctgt aa 192 



<210> 369 
<211> 1725 
<212> DNA 
<213> B.fragilis 



<400> 369 

ggaatgacag cccaagcctc tcccatacca tcggcgtacg aactccggat gaaacaggcc 60 

aatgtgatac ggaagttctt caacaaaatg caacgccagg caatggctat tgccgcacat 12 0 

gacgaatata tcgttgcatc gcgtggtacc ggtaagtcag aaggtatcga cgcccgcttc 180 

attctcagaa acgtctggga aatgcccggt tcattgggtg gaatgatctc tcccagctat 240 

gctaaagcct gggggaatac ccttccggct atctgtaaag cactcgccga atggggatac 3 00 

attcaaaata tccattatgt cgttggccat aaagcaccac cttccatggg ctttgccaag '3 60 

cctgttcgtc cggtactcgg agacggatgg agtaatgctt tccatttctg gaatggcacg 42 0 

gtcatggtca ttctttcctt taatcaaggg atgtccgcaa actccatgtc gcttgactgg 480 

gtgataggtc cggaggcaaa gttcctttcc tatgacaaga taaagaacga ggtcaatccg 540 

gccaaccggg gaaaccggca atatttcggg cactgtcctc accatcacag cgtatgttac 600 

tcaacggaca tgcccggatc atccatggga cgttggattc tcgacaaaca ggaagagatg 660 

cagcccccac atatccaact cattcgcaac ctgtataaag aacttcagga ttacaaacgt 720 

aaaccgctga ccgaacacac catgcggatg atccgggaac ttcaacgtga tcttgacata 780 

gcccggaagt ttcagcctgc actcaaaccg aatgataaga aaaaacggga atacactgta 840 

ttttatggtg aatatgatgt ctttgataac cttgaggtct tgggagaaga cttcatttgg 900 

cagatgcagc gtgattctcc cccgttggta tggcgtaccg ccttcctgaa cgaacggctg 960 

atgaaagttc ccaacggctt ttatagtgcc ctggacgacc gcatacattt ctatcagccg 1020 
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gctgataacg gaaggctgaa gaatcttgga agtaattgga agcaactgag ttcctgcggc 1080 

tgcctgggag acggtgacct tgattttgac aaagaactgc atattgcatt cgactccaat 1140 

gcgtcaatct cgacagcggt agtggcacaa ttggacggga atacgatgaa aatcatcaaa 1200 

tcgttctatg tcaaaacccc atccaaactc ggagacctgg tacaacagat agctgactat 1260 

taccgtccca agctcaatca cgatgtagtc gtctactatg atcatacttt tacctgggag 1320 

tcgggctcca caacagaaac ctatgcggat atcattgagc gtgtattcaa agagaaccgg 1380 

tacactcctg caatggtata tgtcgggcaa gcacccaaac atgaatggaa acacctcaat 1440 

atcgatctcg cattgaaagg tgatccgcaa ttcctgtgga ttcgtttcaa tctctatcaa 1500 

aacgagttcc tcaagatcgc catggagcaa accggtatta agcaaggtaa aaacggtttt 15 60 

gagaaggaca aagctccgga aggtactgac gatactccgg acaatccgga tcaatacaaa 162 0 

acccatgtta cggatgcctt cgacacatta tggctcggca tgaatttcta cttcacacgt 1680 

ccgggaaccg gcaccggagg aatatttttc ctcaatcgga aataa 1725 

<210> 370 
<211> 3057 
<212> DNA 
<213> B.fragilis 

<400> 370 

actatgtatt ttaatgatga tgagataaga cgtatcaaag atgctgccac aggacatttg 60 

cttgatgttg cacaagactt ccatgaactc aaacgctccg gagtgaatta caattgcgat 120 

tgtccccggt gcaaagccgc aaagaaactc tcaattagtc cggccaaaca aatctttaaa 180 

tgctttggat gcaatgaatt gaaaggtgga gattcggttt ctttcttaat gtccgctgaa 240 

ggaatgactt tcaatgatgc tcttgaatac cttgccaaaa aattcaatgt cattctcgat 300 

caacgtccgg ccatcaagaa acaaccggca aaaaagatga aaaaaagcag caaggctgcc 360 

aaaggtatcg atgtcgacag ttattgtgcc aggatgttgg ctgaatcagg tcttaccttt 42 0 

gaggatgtca cagcaaaggt atataagaca ggagatacac aaagtatatt cgaacaacgt 480 

actttccgtc ctggtaccat tgatgaacga ggaatgttaa ccactaaggg agatgatgtc 540 

atcattgaat attatgatct ggaaggaatg ccggttgtct tcacccggaa agataataaa 600 

agaagggacg ttggtactcc tcaagaatat tatcgtatca gatggcagtt tccggatgcc 660 

caccttgata aagagggtaa accttacaaa tacaaatccc cgcgtggcag cggtactccg 72 0 

atctatattc cggagcgcat acgcagtctc tataaatcaa agacaaagat accccgtctc 7 80 

tatattcagg aaggtgaaaa gaaagcggag aaagcatgta agcacggcat tccctcaatc 840 

gcagtcagcg gtatacagaa tctcggtctt tacggtgccc ttccggaaga cctggtgaag 9 00 

atcatctcta cctgtgaggt acaggaggtt gcttttatct ttgattcgga ctgggacgat 9 60 

atcagctcca atatccggat caatgatcag gtcgaaaagc gtccccgctg ttttttctat 1020 

gcagcaaaaa atttcaaaga atatatgcgt tctctcaaga accggaacat cttcgttgaa 1080 

atattcgtcg gacacattaa taagaacgaa gcaggagaca aaggccttga tgatctgctt 1140 

gcaaattctc tgcgtggaaa agaagaagag ctggccgccg atatcgagtt tgcatgcaat 12 00 

gaaaagaaag gtttgggcaa atatattgag atgttcaagg taactacctg gacagatcat 12 60 

aaattgcaag aattatgggg actccactct catgaagtct ttgccgagcg tcatgccgac 13 20 

ctcctgcgta acctgccgga gttcctattc ggccgatatc gatggaaatt cgacgaacat 13 80 

ggaaaagtaa tcttggcaca accttttgac gatgatgaaa agttctggag agaagtcact 1440 

aaatatgatc gtagccaaaa tgaacgtatt gaatacgagt tctgctatgt caactcacaa 1500 

aacttcttgc aaaacagagg attcgggcgt ctgcggagaa ttgataagag ttatcagttc 1560 

attcaccttg aaccgcctgt tgttcgtgct atcgatgcct ctgatgcccg tgactacctg 1620 

tttcagtttg ccaagcataa ttgcaagact gaggtaaacg aaatgttgat taaaggcgtg 1680 

tctcaatatg tgggtccgga caagttatcc ctgcttgagt tcattcagcc caatttcgtt 1740 

aagcccaacc gggaatccca gtatttctat tttgataaaa attgctggct ggtcacaaaa 1800 

gattctgtaa gcgaactcgg ttacgagaat atcacacacc acatctggga agagcaacgt 1860 

aaaatgacac cggccaaata tctgggtaaa ccgttggtta cttttagccg gcaagacaac 192 0 

acatttactt acgaactttc agaggccggt aagaaatccc attacctcca gttcctgatc 1980 

aacaccagta actttacctg gagaaaatct gctgaagaaa tagagccgga agaagagaat 2 040 

gaaaatcgta tccatctcct tagtaaactg tgtgcaatcg gatatatggt tatggaagcg 2100 

aaagacaata atgtggccag agctgtcatc ggcatggatg gcaagcaatc tgaagtagga 2160 

gaaagtaacg gccgttccgg gaaatcactt gtaggggaat tgatgcgtaa tatcattcct 2220 

acagcctata ttcccggaaa acgctctgat ctttttaatg atcaatttgt atggaatgac 22 80 

attcaggaaa acactaaact cgtttttatt gacgacgtgt tacaaaactt caactttgaa 2340 

tttctgttcc ccaacattac cggggattgg tcagtaaatt ataaaggagg tagaaggatc 2400 
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actttaccat ttgcgcgatc acccaaaatg tatattgcca ccaaccatgc catccgtggc 2460 

agtggttcaa gttacacgga tcgccagtgg ctacttgcat tctccgattt ctataacgat 2520 

acccataagc cggttgacga cttcggggtt ctcttcttct cggagtggga ttttgaacaa 2580 

tggaatctta cctggaacct gttggccaat tgcgtccaat tgtatttgac ttatggcgtt 2 640 

:gtccaggctc ccggcgaaag gttagagcaa agaaagctgc gtcaagaaat gggtgaaacc 2700 

ctaatctcct gggctgatga atacttctcc ggagaagagc atctcaatgt ccgtttaccc 2760 

cggaaagatt tatatgacgc attttgccaa tacgacaatc agcaacgaaa gtttgtatca 2820 

ccaaccgcat ttaagaagaa atttataatg tattgttctt ggaaaggtta tgtattcaat 2 880 

cctcacaaat atgacagtat aaccgggaaa ccttttcaag tcgataagga cgggaaggcg 2 940 

gttgtagatg ataaatccgg aggtgtagag tactttacgg taggaaccgg agcccaacct 3 000 

atcccgaaag aagataatag ccggttacca caaccgacag gtaaactcgt tttctaa 3 057 

<210> 371 
<211> 840 
<212> DNA 
<213> B.fragilis 

<400> 371 

aagttgccgg caatttgcat taaaaagaag agaataatga agattagatt tataagcctg 

gccagtggca gtagtggaaa ctgctattat ctaggtaccg aaaaatacgg catactcatt 12 0 

gatgcgggta ttggaattcg taccattaaa aaatcactga aggacataaa tgtgactatg 18 0 

gactcaatac gtgcagtatt tattactcac gatcatgccg atcatattaa agctgtagga 

catttaggtg agaaattgaa tattccggta tatactacgg cacgtgtaca tgcaggaatc 

aataaaagct attgtatgac agaaaagttg catggttctg tacgctattt ggaaaaagaa 3 60 

gaaccgatgc aattggaaga ttttcgtatc gagtcttttg aagttccgca tgatggaaca 42 0 

gataatgtag gttattgtat agaaattgac ggaaaggttt tttcattcct tacagacttg 480 

ggagagatta ctccaaccgc tgcccgatat atttgcaaag cccactatct gatcattgag 540 

gctaattatg atgaagaaat gcttcgtatg ggaccttatc cgacatatct gaaagagcgt 

atctcaagta aaacaggtca tatgagtaat atagataccg ccaactttct tgcggaaaac 

ataatggagc atcttcgtta tatttggctt tgccatctga gtaaagataa taatcatccg 

gagttagcat ataagacagt tgagtggaaa ttgaagagta aaggtattat tgtcgggaaa 7 80 

gatgtgcaac tacttgcttt aaagagaaat acgccttcgg agctctatga gtttgaataa 840 

<210> 372 
<211> 342 
<212> DNA 
<213> B. fragilis 



60 



240 
300 



600 
660 
720 



60 



<400> 372 

tgtaaggagg catatcaatg taaaatcatt gaagccgggt tcgattcccg gaagcccaca 

ctaattttga tcattatgaa aacagtacat tcatcaccaa gcctgtctcc aagtggaaca 12 0 

aagagacaga aagctaatct atttacaaac gaaaatccgg aaactatcgc acaaatgcgc 180 

atgcagtctg cacaaaaaga gcagcataag gtcatggttc gtcttgataa ccgcacacat 240 

gtacttgttg ctccgcaaaa tgtaactcct gagtatatag aaatgctgcg aaaaaaatat 300 

caaattacct acaatgctcc agctcgagga ggaaggaggt aa 342 

<210> 373 
<211> 222 
<212> DNA 
<213> B.fragilis 



60 



<400> 373 

ataagacatt ttgtacatgt gcttacggaa gtaactattg ttattgccat gagagatata 

cccatgaacc taaaaatgat gaagagatgc aaaaagctat ttttgatttt cataagaacg 12 0 

gaaatagctg ggtatgacta taaacaagat cagaaaggat ctaatatgaa attaagaaaa 180 

aaggaactaa agtatggaat cagaaactat aaaaacaggt aa 222 



<210> 374 
<211> 1080 
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<212> DNA 

<213> B. fragilis 



<400> 374 



tcagttaatt 


ttgcacacat 


gattgaattg 


gcacagcata 


ttgaagtatt 


attattagag 


6 


0 


aatgattgtg 


tgatcgttcc 


cggttttggt 


ggatttatag 


ctcactacgc 


tcctgctatg 


120 


agagtggctg 


aagagaattt 


attcctccct 


cctacccgta 


ctattggttt 


caatcctcaa 


1 


80 


ttgacgttga 


atgatggtgt 


cttagtacaa 


tcttatatgg 


ctgtgtacga 


tactaacttc 


2 


40 


tcggatgcta 


cgaaaatggt 


agaaaaagag 


gtggcagaac 


ttatctccgc 


tctgcatgag 


3 


00 


gatggaaaaa 


ctgatcttcc 


taatattgga 


gaaatccgct 


ataccattca 


taatacttac 


3 


60 


gagtttgttc 


cttacgacaa 


taaaattacc 


actccttacc 


tctatgggct 


cgactcgttt 


420 


gagatgaagg 


agttgtcggc 


tttgcgacgt 


ccggagaaag 


aacaaattct 


tccgactgtt 


4 


80 


cttaagaaaa 


agacaagtta 


tgaattcaga 


gcgaactggg 


cctttttgag 


aaatgccgta 


5 


40 


gctatgattg 


cagcggttgc 


attgtttttc 


tttatgtcaa 


caccggtaga 


gaatacatat 


6 


00 


attgaaaaag 


gaaattatgc 


ccggctactt 


ccaactgatt 


tgtttgaaaa 


gatagagaaa 


6 


60 


caatcggtgg 


caatgactcc 


ggttatgcta 


aaatcagttg 


atgccatccc 


acaaaccaaa 


7 


20 


ccggcaactg 


cgaagaaaaa 


gtcgtccact 


gttcgtaaag 


catctgtagt 


aaagccggtt 


7 


80 


gcggtaaaag 


aagtgaaagt 


taaccaaccg 


gaaaagacaa 


tgaaagcaac 


cgaaactaag 


8 


40 


gttgtggaaa 


aaacttttcc 


atatcatata 


attattgcca 


gcgtagctaa 


tacgaaagat 


9 


00 


gcggaggcga 


tggcaggtga 


acttaaagcc 


aagggatata 


caggtgccag 


agtattgacg 


9 


60 


ggtgacggta 


agattcgcgt 


gagtattatg 


tcatgtgccg 


atcgtgaaga 


tgccaaccgt 


1 


020 


caattgttga 


aattaagaga 


gaacgaggct 


tataagaacg 


cctggatgtt 


agctaaataa 


1 


080 



<210> 375 
<211> 2025 
<212> DNA 
<213> B. fragilis 



<400> 375 

tacatccttt ttatggaaaa gactcttaat cttattaaaa atgatccttg gctggaacct 60 

tacaaagacg ctatcgttgg acgttttgaa catgccatgg ataagaaggc cgaattgacc 12 0 

aatgggggaa aatctacgtt atcggacttc gcttccggat atctttattt cggtttgcat 180 

cacactgata aaggatggat attccgtgag tgggcaccga atgcatcaca tatttatatg 240 

gtgggtacat tcagtaattg ggaagaaaag cctgcttata aactaaaacg cctgaaaaat 3 00 

ggtagttggg aaatcaaatt accgatcgac gctatacaac atggtgactt atataaatta 360 

cacgtctact gggaaggcgg acagggagaa cgaatccctg cttgggccaa tcgggtagta 42 0 

caagatgaca atacaaaaat attcagtgct caggtatggg caccggaaaa gccatttaaa 480 

tttaaaaaga aaacttttaa gcctagtaca gatccactgc taatctatga atgtcatatc 540 

ggtatggcac agcaggaaga aaaagtcgga acttataacg agtttcgtga aaaaattctg 600 

ccgcgtattg ccaaagaggg atataattgc attcagatta tggctataca ggagcaccca 660 

tattatggta gttttggcta tcatgtatcg agtttttttg ctgcatcgtc tcgttttgga 720 

actccagaag agcttaagca gctgattgat accgcacacg gattgggtat tgctgtcatt 7 80 

atggatatcg ttcactcaca tgcagtgaag aatgaagtag aagggttagg aaactttgca 840 

ggtgatccaa atcaatattt ctatccaggc ggacgaagag aacatccggc atgggattca 900 

ctttgttttg actatggtaa aaatgaagtg atgcattttt tactttccaa ttgtaaatat 960 

tggttggaag aatatcattt tgatggcttc cgttttgacg gggtgacatc catgctttat 1020 

tatagccacg gattgggaga agcattttgc aattacggcg actactttaa tggacatcaa 1080 

gatgataatg ccatctgcta tctgacattg gcaaacgaat tgattcatga agtaaatcct 1140 

aaggctatta ccattgcaga agaggtttcg ggtatgccag gacttgccgc caaggtggaa 12 00 

gatggaggat atggatttga ttatcgtatg gctatgaata tccccgatta ttggatcaag 12 60 

acaattaaag agaagataga tgaagattgg aaaccatcca gcatgttttg ggaagtaact 1320 

aaccgtcggc aagacgaaaa aacaatttcg tacgctgaaa gtcatgatca ggcattggta 13 80 

ggagataaaa cgattatttt ccgcttgatt gatgcagata tgtattggca tatgcagaaa 1440 

ggtgatgaaa attatatagt tcatcggggc gttgctcttc acaaaatgat tcgtttacta 1500 

actgcaagca ccattaacgg tggatatctg aactttatgg gaaatgaatt cggacatccg 1560 

gaatggatcg attttccgag ggaaggtaat ggatggtcat gtaagtatgc tcgccgccaa 1620 

tgggatttag tcgataataa aaacttgact tatcattatc tgggtgattt tgatgcagat 1680 

atgttgaaag taattaagag cgtaaaaaac atccagcaaa cccctgtaca agaaatatgg 174 0 

cacaacgatg gcgaccaagt gttagcgtac caacgtaaag atcttgtttt tgtattcaat 1800 
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tttaatccga gtcaatcatt caccgattat ggctttttag taacaccggg aacttatgag 1860 

gtggtactga atacagataa cataatttat ggaggaaacg gcttgtcaga tgatagtgtg 192 0 

aagcatttca cattgcccga tcctttgtat aagaaagaaa agaaagaatg gttgaaactc 1980 

tatattcctg cacgtacagc aatggtattg agaagaacca aataa 2025 

<210> 376 
<211> 1146 
<212> DNA 
<213> B. fragilis 



<400> 376 

cgaatgaatt 

atatcttcag 

ctggatggcg 

gatcttcgga 

tataccggca 

tctatacaag 

gacattcgct 

ggaatcactt 

aatgaagtca 

tatcgtccgg 

acccctaacg 

gtgagtacga 

ataaaatttg 

atacgtactc 

cttcagtgct 

gccggtcgtc 

aacgatgagg 

tattatatta 

agtataaaag 

ttttaa 



tatatttaaa 
cacagtctaa 
gtgtttatct 
taggtgcaaa 
ataaagcgac 
tcggacaatt 
ttaatcaatc 
atggttaccg 
acaacctaaa 
ttttagatag 
aatccttaaa 
ttgataaccg 
gcacagaatt 
atgtggaacg 
cttggctttt 
cggaaggaaa 
atgcctccat 
ataaatatat 
agatcagccg 



acacacctta 
tccggataaa 
caaaaatgat 
agtcgcatat 
aataaaagat 
ctacgaaccg 
tccaggagct 
aaacaaacgt 
aaaagcgtca 
caagaaactt 
tgaagaagat 
aaatattgca 
actggtttat 
agacaatgct 
atccggagaa 
aagtctggag 
ttggggagga 
aggtataaaa 
caaaaacttt 



ttttatctat 
ctacaatgta 
aataattttg 
caaaattggg 
gcttttgcaa 
tttagtttag 
gttcttgcat 
cattatatgt 
catggatatg 
attcatatag 
aaaaacatat 
atggctacta 
tatcaccgtt 
tttaagaact 
acttatctgt 
gtttgttcac 
gaacaaaaag 
ttaaactata 
agtgtttttc 



taggtatatc 
aagtgacagg 
gtaatggagt 
atatgaaact 
agtacacata 
aaatgatgtg 
taacaaacgg 
caggaggggc 
ctttggatgg 
gttttgcggc 
ttatctataa 
ttgatcatgt 
tctgcctgca 
atgtggcaca 
acgatgaatc 
gatttaacta 
atatctccat 
gctatctgat 
aaggacgatt 



ttacgctttg 
acgcatgctt 
tgaattcagt 
tgaaataggc 
caagaaccac 
cagcactttc 
caggcgaatg 
attcatggat 
cagggtagta 
caattaccgc 
atctcctggt 
agcgtatcag 
aagtgaatat 
aggagcctac 
tgttgcatgt 
tttaacactc 
cggactcaac 
gcccggagcc 
tcaattcatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1146 



<210> 377 
<211> 516 
<212> DNA 
<213> B. fragilis 



<400> 377 

cggcagaatt 

tgtcccggtt 

ccctctcttt 

gaggacagtt 

ctcctgctcg 

atcacaccac 

aaatatcagt 

ggattctatt 

tataatcaag 



atcggcagaa 
ctatcgaacc 
tatctgaaat 
tacgccctct 
atacgacggt 
gcatacggat 
tcggtattgc 
tcacttccgg 
caatcagtta 



caaggtgaaa 
gttcatgcac 
cggaccggcg 
cttcttcctg 
aaaacggcac 
tgactcctgt 
cggttccgcg 
aaagcatttc 
cgaaatatta 



aggtttcagg 
aaaggagact 
gatataagga 
gccgatgata 
aagcttgact 
catgcaaaag 
aacctaaacc 
aattacttct 
gaatag 



ataactttcc 
gggcaataca 
tcgctacatt 
aaaaaattac 
tgttactgtt 
tgttattggt 
agaatcaccg 
tggaaatgtt 



tttggctttg 
tgaagtgttg 
cagtatctca 
aggtctgacc 
tgcctccaac 
ggaaaatgac 
ctgggaaaat 
cgagcaggca 



60 

120 

180 

240 

300 

360 

420 

480 

516 



<210> 378 
<211> 582 
<212> DNA 
<213> B. fragilis 



<400> 378 

gaaattgatt 

atacaatttg 

aaatggatga 

caggcattcg 

gcatggtcca 



attattgggg 
aaattaaaga 
cccttattaa 
attcggaggc 
gatacactta 



gcaattaata 
aaaattacct 
ggaagatatt 
aactgtggag 
tcggctgttt 



agtcctggcc 
gatattatcg 
tcgggtagga 
atttactctc 
gttttaggag 



cacaggcgaa 
gggaaattct 
aattggttgt 
gcgaagtgac 
actgtgtatg 



tgatatgaga 
gaattccgaa 
aatccgtgat 
tattaagaca 
gtgtgagtat 



60 

120 

180 

240 

300 
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aatggtgctt atcgtggatt attagagcaa aaactgttgc catctatcac ccctaaagag 3 60 

agtctgttgg attcggaagt tctggacagc tcattgtatg ggcatgaaaa gaagaaactt 420 

cgggaatatg ctgaagataa tcttaaactg aagaaattca gacgtgagaa ttttaatgaa 480 

aatcgtacgg gggtagctcc ttttgatcat ccaaagaaag tatatgatga attcattaag 540 

gaagactaca ttgctccttc ctcgaaggag aataataaat ag 5 82 

<210> 379 
<211> 1227 
<212> DNA 
<213> B.fragilis 

<400> 379 

ggtatagatt atatggtaca aagtcagact cagccgattc gtagaattgc atttcctata 60 

ttaattgcat taagtgtatc tcactgttta aatgatcttt tgcaatctgt catttcggct 120 

gtatatcctc tttttaaaga agatctttcg ttaagtttcg ctcagattgg attgataacc 180 

ctagtttacc agatgtcagc ttctgtattt caaccactga ccggccttat ttttgataaa 240 

cgtcctatag cttggtcgct tcctatcgga atgagtttca ctttgatagg tatgctgaat 300 

ctggcttttg catccaatct gaattggctg cttgcttctg tctttatcat tggaataggt 360 

tcgtctgttc tccatccgga agcatcccgt atcacctttt tggcttcggg agggaaaagg 420 

ggattggcac aatcactttt tcaggtaggt ggaaatctgg ggggatcgtt aggcccttta 480 

ttagtcgcat tattagtggc tccttatggc aggcatcata ttgcactatt tgctatcctt 540 

gctttggcgg ctatttgtgt aatgtttcct atttgccgct ggtaccgttc ttatctgaac 600 

catcttaaaa aacgtccgat ccatgcaaaa gcatatatcg agcgcccgct tcctcctcaa 660 

aagactgtat ttgctatcac gatactgatg attcttatat tctctaaata tatttatatg 720 

gcaagtctga acagctatta tacattttat ctgatccata agtttaatgt aagcattcag 780 

cagtcgcaac tctttttatt tgtatttctg gtagccactg ccattggtac attgatggga 840 

ggccccattg gagacaagat aggccgtaaa tatgttattt gggggtcgat cctgggaaca 9 00 

gctcctttta gtttattgat gccacatgcc ggactcgtat ggactataat tcttagtttc 960 

tgtgtcggct taatgctttc gtctgctttt ccagctattc tgttatatgc acaagagctg 1020 

cttcccaaca agttaggact gatttccgga ctttttttcg gttttgcatt tggagtggca 1080 

ggcattgcat ctgctgttct tggcaatatg gccgataagt ttgggattga tgctgtatat 1140 

aatgtttgtg catttatgcc gttgttagga ttggtgacct ggtttttacc ggatctgaag 1200 

aaagtgagaa gtgaaaaaca agaataa 1227 

<210> 380 
<211> 195 
<212> DNA 
<213> B.fragilis 

<400> 380 

gataccattc tgtatggacg agatggcact agtaaaggag aactgctcat tgacataaaa 60 

ctgtgcaata tggtgaaaga gtttacacct gacatggcaa acagcatgca aaagattgtt 120 

cggaaatgtt tccctagaac cctgcagata gtgaacaaga ttcatgtatt cacactggtg 180 

tacgaagcga tgtga 195 

<210> 381 
<211> 2484 
<212> DNA 
<213> B.fragilis 

<400> 381 

actaatttat acgaaatgaa gaaagaaaga tatttaagag agatggatga ccagaatgat 60 

aacgcatttt cattaattgc cgattttgac ggaaacgaag atcaagtgtt tgacataaag 120 

gttggtgaaa ctcttccggt actccccctc cgtaatatgg tattgttccc cggagtattt 180 

atgcctgttt ctgttggcag aaaatcatct ttgagattgg tgagggaagc cgataagaaa 2 40 

aaatcttata ttgcagtagt ttgccagaaa atggcggaaa cggacgagcc ggcatttgag 300 

gacttgcacc cgatcggaac cataggtaag attgtgcgtg tactcgaaat gcccgaccag 360 

acaacaacag tcattatcca gggaatgaaa cgcctggagc tgaagaatat cacggagaca 42 0 

catccgtacc tgaagggtga agtgaacatt gttgaagaag aaatcccttc aaaagatgat 480 
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aaggagtttc aggcattggt ggaaacctgc aaggatttga caatacgata tattaaatca 540 

tcggacactt tacatcagga atcagcgttt gccatcaaaa acctgacgaa ccacatgttt 600 

ctggtggact ttatctgtac gaaccttccg ttgaagaagg acgagaaaat cgaactgttg 660 

cgcattgatt cgttgcgtga acgtacctat cggttgcttg aaatcctgaa tcgtgaagtg 72 0 

cagttggccg aaataaaggc atctatccag atgcgtgccc gtgaggatat tgaccaacag 780 

caacgtgagt atttcctgca gcagcagatt aaaacgatcc aggacgaact gggtggtagc 840 

ggccaggaac aggaaataga agagatgcgc caaaaggcag aacacatgaa gtggagtacc 900 

gaagtgcggg aaactttcct gaaagagctt gccaagctgg aacgtaccca tcctcaatcg 960 

ccggattaca gtgtgcagtt gaattatctg cagacaatgc tcaatctgcc atggggagtt 102 0 

tatactaccg acaatttaaa cctaaagaat gccgagaaaa cgctgaataa ggatcattat 1080 

ggtctggaga aagtaaagga acgcattctg gaacatttag ccgtacttaa attgaagggt 1140 

gacatgaagt ctcctattat ctgtttatac ggtcctccgg gagttggtaa gacttcactg 1200 

ggaaaatcga ttgccgcagc cttgaagcgg aaatatatcc gtatgtcatt gggaggtgtg 12 60 

cacgatgaag cggaaattcg cggacaccgt aaaacttata tcggtgcaat gccgggacgt 132 0 

attatcaaaa acctgattaa agcgggttct tcgaatccgg tatttattct ggatgaaata 1380 

gataaagtga gtgccgaccg tcagggagat ccttcatcgg ctttacttga agtgcttgat 1440 

ccggaacaaa atacggcttt ccatgataat ttcctggatg tggattacga cttgtccaaa 1500 

gtgatgttta ttgctacggc aaacaacttg aataccattc ccggaccatt actcgatcga 1560 

atggaactga ttgaagtgag tggttatatc acggaagaaa aagtggaaat agcacgaaag 162 0 

catttagtgc cgaaggagtt ggaagcaaac ggaatgaaga agaccgacat taaaattcca 168 0 

aaagatacgc tggaagctat tatcgaatcg tacacacgtg aaagcggtgt gcgtgagttg 17 4 0 

gaaaagaaaa tcggtaagat tcttcgtaaa tcggcccgcc aatatgcaac agatggtttc 1800 

ttcttaaaaa cagaaatcaa accgactgat ttgtatgact tcctaggtgc tccggaatat 1860 

actcgtgata aatatcaagg caatgattat gccggtgtgg tgacaggatt ggcatggaca 1920 

gccgttggag gtgaaatctt atttgttgag accagtctga gccgcggcaa gggcggacgt 1980 

ctcacattaa ccggaaattt gggtgaagtg atgaaagagt ctgctatgct ggcacttgag 2040 

tatatcaaag cacatgcttc actcttaaat ctggatgaag agatctttga taactggaac 2100 

atccatgtcc atgtccccga aggagctatt ccgaaagacg gtccgtcggc gggtatcaca 2160 

atggctactt cgttggcttc tgctttgaca caacgtaagg tgaaggctaa tctggctatg 2220 

accggggaaa tcacgttacg tggcaaggta cttccggtag gtggtattaa ggagaagatt 22 8 0 

ctggcagcta agcgtgccgg catcaaagaa attattatga gtgccgagaa caaaaagaat 2340 

attgacgaaa tacaggatat atatctgaaa ggactgactt tccattatgt gaatgatgta 2400 

aaagaggtct ttgccattgc actgactcaa gagaaggttg ccgatgccat tgatttatcc 2460 

gtaaagaaag ccagccagga atga 2 484 

<210> 382 
<211> 198 
<212> DNA 
<213> B.fragilis 

<400> 382 

cagattatat acataacact aaatatcgga tgccggttat ttttgttatc caaagatgaa 60 

aaacaatcgt taaatattga attatctcgg gaagaaatag aatatttctt taaaccttat 12 0 

cctgcagatg agacggaggc atacgagata tgcaatgatt ttataaagaa aatatcaaca 180 

gataaaagta ttctgtaa 19 8 

<210> 383 
<211> 213 
<212> DNA 
<213> B.fragilis 

<400> 383 

tatataatat attatcagcg ttttattttg ctttgggagc agggggtcgt gggttcgaat 60 

cccgctaccc cgacaggaaa taagagtaat cacacatgtc aatgtggtta ctcttatttc 12 0 

attttgtgct atggtggaat tccgataata cggaatgatg caactggaac agagcgacgc 180 

cttttacatt ttcaaataaa actcccgggt taa 213 



<210> 384 
<211> 696 
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<212> DNA 

<213> B. fragilis 



<400> 384 

aaatatgatt 

agccaaatag 

gtttatactc 

ttaccacaac 

cccaatttgg 

cttctgtcat 

agttcgtact 

aatttacctg 

aaagcagaca 

gcctcgacat 

ggctgcccaa 

gaatattata 



ttacaattaa 
atgccttccc 
taagtccgga 
taaaacaatt 
ttatagatgt 
acttatcgca 
caaaacggta 
acggttcttt 
tttataaagg 
ttgcatctgc 
ctgtatattt 
tctcactcaa 



aagaggaaaa 
tgttttaaaa 
taaacagact 
ttgtgactca 
caggaataac 
tgatgcttat 
caatgagcaa 
atttgctatt 
atcagttaca 
cattaaaaaa 
tggcaattac 
caaattttat 



aagatatgga 
gcacgacttg 
gctactctgc 
gtcttttcgg 
aaaggagggt 
acattatata 
aaacatccgg 
cgggattctt 
gtattggtaa 
tctcatgcag 
atgtcattca 
gaataa 



aagaatcatt 
gcaaaagtct 
aaataatgaa 
tgattaacag 
caagtgctgg 
tcaaaactga 
aaacctatga 
tcgtagaggg 
atgaatccac 
gaaaagttct 
cattacccaa 



ggatggtatc 
gccacaattt 
cttatatcaa 
agaacatgta 
agttgacatg 
tttaaaaatc 
agagatcaaa 
aaaccgggac 
ttattccgga 
tggcgaaacc 
ttcccgatta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

696 



<210> 385 
<211> 552 
<212> DNA 
<213> B. fragilis 



<400> 385 

ctaaataaat 

tttgatgaaa 

aagcaattca 

ttggatgaag 

ggtttcaagc 

ggtacagata 

attattaatg 

agcctgaccg 

gatgacggag 

gaagaaattt 



attttataat 
ctaaatatag 
gtattcgtct 
atgtatataa 
aagtcttacc 
ttaatactcc 
tgaagcgtaa 
gaactttctt 
ccattattac 
ga 



gaaaagaatt 
cgaaggccag 
ggggaagagt 
agaagagttt 
tctgcaagaa 
gattgaaacc 
cagacaagga 
gatgaacgag 
tattggagcc 



ctttgtccta 
tcattggttt 
aagatgaagg 
ggctgtatcg 
ggtgataata 
ggtgatatga 
aaattggttt 
atcttggggg 
actactctta 



aatgtgagaa 
ttgtatgtga 
ctcctcgtaa 
ttgtcattga 
tcattggccg 
gtatggacag 
atactcttcg 
ataaagaccg 
tccttcgtgc 



ctatctttct 
acactgtggt 
ggaagagaaa 
aaatgtcttc 
ccgttgtgta 
acgccactgc 
tgatgcccca 
tattcgcatt 
tgcaaaaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

552 



<210> 386 
<211> 210 
<212> DNA 
<213> B. fragilis 

<400> 386 

gaaaggaggt acgaaatggg acgaataaaa gaagaagcct gggtcgaaaa gtgtaccgta 60 

cttcatgaag gaaaggccac acccaatatc tattataacg tttttgccga tggtgagcag 120 

ctctgcgaaa tctcctatga cagattaatc gctatacgta atcttattaa ccaaattgag 180 

aaagaaaaga aaggagaatg ccatgaataa 210 

<210> 387 
<211> 513 
<212> DNA 
<213> B. fragilis 



<400> 387 

cctaatcatc 

agtgattata 

aat ccaaccg 

tggctgctta 

cgggctgagt 

gacgatgaaa 

ctcgaagctg 

ttttccgatc 



ttatggatat 
gatttgagaa 
cggatatatt 
gaggtgaagg 
cagcatctac 
ataaaaccct 
acaatgaatc 
taccattagt 



aatcgacaga 
aacattatcc 
aatgaagatg 
tgagatgttg 
agatgaaaac 
aatcaagcag 
attaagaagt 
agactacgaa 



attaagcaat 
ctatcaaaag 
tgtggtatat 
agggagaaaa 
tctttaatct 
aatgccgttt 
cagtcaggag 
gaagattatc 



atcttaatca 
ggtacataaa 
ataccgacat 
gagaagacct 
ataagatgta 
tagaggaacg 
ctgataggat 
cgcccgtaga 



taaaggaatt 
taaagctaaa 
atctactgaa 
tggccttcat 
taaagagaaa 
catccgccaa 
aaccgatact 
acgtccttca 



60 

120 

180 

240 

300 

360 

420 

480 
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agttccaaac atccgttagc aggaaaagcg tga 513 

<210> 388 
<211> 579 
<212> DNA 
<213> B.fragilis 

<400> 388 

atctcttttc atatgataga taataatatt ttctcttgtg gtcctttacc atcaaatgat 60 

ggatatacat ggacaattgt ttctcgttta ggcgatatgc ttaacgaagc tgaagcccta 12 0 

tttggtgaac gagataaaag atatacaata cttggtattg agttagctaa tataaaacaa 180 

ccacaaatat ggtatccaaa cgattgtaat catgtcataa tacaggtcac cgaagattgc 240 

agcaacaata tggaaagggc aatatttcag gtggctcatg aagcgataca ttgcttatgt 3 00 

cccaatccaa agaaaaagac tactatttta gaagaaggac tggctaccta tttttctatg 3 60 

tattatacac gtaaacgtaa aatttattac aatattgata atcttcagta tcaaaagcct 420 

tatgaatttt gttctaaatt actaaactat gattctgagt tgattaaaaa agcaagaata 48 0 

atagaacctg acatttcttt tatcaacaaa gagatattac taaatatatg tcctaagata 540 

gaccatactt tattagatga actaactaaa aaattttaa 579 

<210> 389 
<211> 333 
<212> DNA 
<213> B. fragilis 



<400> 389 

ttaatatttc atcttgttcc cgtttatgca tataaagttc atttattgca ctccgatctg 60 

tacttttgta agtgtaatcg atattatccg aatgaacaat ctgtaaagaa tgtactgggc 12 0 

gtatttgata aggtgtgtat aggatctctt ctatccatcg actatagact actctacact 180 

ttatcatttc tacattcatt attggttgca ggctttccga tagatttaat ggcggttcgt 240 

ctggccaaaa aacagcacgc gtctggttca ttcgctctgt gtgataatcc aaattataca 3 00 

ctactcctct ctctatacgt atcgtttcaa taa 333 



<210> 390 
<211> 246 
<212> DNA 
<213> B. fragilis 



<400> 390 

tcaaaacatc atatttgcaa catgaatgac gataaaacca tcacagcagc aattgagaca 60 

agcaatgtaa ctgcattgct tgccgcttac cggaaattta caagttcctc cggggctaca 12 0 

accgatgaat ttttccgttt catcaccacc cccactccgg aacgggaaga gttcctggca 180 

ttgtactgct cttcgacctc ttctgtgtcc ggtaccatta tacaaactaa ttacaatgca 240 

ctatga 246 



<210> 391 
<211> 321 
<212> DNA 
<213> B.fragilis 



<400> 391 

tcaagcaatc agttacgaaa tattagaata gaaatggagt tatcagatga aaccttgcaa 60 

caaatcagag agatggccgc agctctgctg cctccggcag aaatcgccat tctaatttcg 12 0 

ctgcctgccg gtgaacgcag ctacttctgt gatatttgca gaaatcatca tcattctcct 180 

atctacgaag cataccatca gggacgcctg caaacaaaat tcgaactccg aaaaactgtg 240 

atcaagttag ccaaggccgg aagtccggcg gccgagccac ttgctgataa atacatgaaa 3 00 

gaacaaatca tcaacgacta a 321 



<210> 392 
<211> 201 
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<212> DNA 

<213> B.fragilis 



<400> 392 

ttacatgatc agttctgttt cattgattcg gggtcaaatg aaaccattta caatggattg 60 

gcagaggact gtttatggga gatcgtactc ttcagtaacg gattcgttta tcacgctctg 12 0 

cagaagttcc ggatctttat agaggaaaag gactttaacc ataccgtcct ttacaactcc 180 

catttccatc tcaatttcta a 201 



<210> 393 
<211> 1125 
<212> DNA 
<213> B. fragilis 



<400> 393 

aaagagtgtc aacaagacta taaatttatg agaagcaatc ggtttattaa acgcctggac 60 

ttatatatca tcaagaaatt cttggggacg tatgtatttg ctattgcatt gattatctcc 120 

attgcagtag tattcgactt caacgagaag atggataagt ttatggaacg gagtgcgccg 180 

tggtcggcaa tcatcttcga ttactacatg aactttattc catatttcgc gaatctgttc 240 

agtccgttgt ttgtatttat tgctgtcata ttcttcacct ctaaactggc tgaaaactcc 3 00 

gagattattg caatgttttc taccggtatg agttttaaac gtatgttgcg tccttacatg 3 60 

atatcggccg gtatcattgc gatttctacc tttatattag gatcgtatgt gattccaaga 42 0 

ggcagtgtga ctcgtctgga ctttgaagat aaatacgtga aaaagaaaaa gaccacttat 480 

gtacacaata tacagttgga gatagacaca ggcgtgattg cttatattga taactatcag 540 

gattacaata agacaggaaa ccgtttttcg ctggataaat tcgtagataa gaaactggta 600 

tcccatttga ctgcccgtag cattacttat gatactactg cggttaataa atggaccatt 660 

aaggattata tgattcgtaa tctcgacgga ttaaaggaaa ctattgtccg tggagataag 72 0 

atggattcca ttataccgat ggaacctgcc gatttcatga ttatgcgtaa tcaacaggaa 780 

atgttgacca gccctcagct tagtgcatat atagataagc agaaacaaag gggtattgcc 840 

aatatcaaag agtttgaaat agagtatcat aaacgaatcg ccatgtcatt tgcatcattc 9 00 

atcctgactg tgatcggagt atctctttct tcaagaaaaa caaagggggg aatgggattg 960 

catttgggaa taggacttgg actgagcttt tcatatatcc tgttccagac cgtggcatct 1020 

acttttgcgg taaatggaaa tatgcctccg atgatcgcca tgtggattcc taatttactg 1080 

tatgcgctga ttgcatttta cctatataga aaggctccca aataa 1125 



<210> 394 
<211> 246 
<212> DNA 
<213> B.fragilis 



<400> 394 

ggtttaagtc atcttcattc ttttttatct tccattttcg gatttggtct tgccggcgtt 60 

ctgctgacca agtattgtcc ggatccaact ttgtttgaat ccagagaggc ctgggaagtt 12 0 

gccagtgtga atgcacatta catctggtat tactttgcgg caatcggttt ggttgcagca 180 

attgctttgc ttatttttgc aaaaatcact gatttcatcg ataaaaagaa gaaaactaac . 240 

gtctga 246 



<210> 395 
<211> 1521 
<212> DNA 
<213> B.fragilis 



<400> 395 

aacaacaacc aacgtatgat gaaccaagaa ttattaatga gtcccaaccg tttggtgact 60 

tttctgcaaa agcctgctgc tgagtttaca aaagcagaca tcattaacta tatccaacag 12 0 

aatgaaatcc gcatggtcaa ttttatgtat cctgctgcgg atggacggct aaaaactctg 180 

aattttgtga taaacaatgc ttcctatctg gatgccatcc tgacttgcgg tgaacgggta 240 

gatgggtcga gtctgtttcc tttcatagaa gccggaagta gcgatctgta tgtaatacca 3 00 

cgttttcgca ctgcattcgt cgatccgttt gcagaaatac ctacactcgt gatgctttgc 3 60 
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tccttcttta ataaagatgg ggaacctttg gaaagctctc ccgaatatac tttgcataag 42 0 

gcttgcaaag catttacaga tgtaacaggt atggaatttc aggctatggg agaattggaa 480 

tattatgtaa tttccgagga tgacggtcta tttccggcta ccgatcagcg tggatatcac 540 

gagtcgggac cttatgcaaa attcaatgat ttccgtacac aatgtatgtc ttatatagcc 600 

caaacaggtg gacaaataaa gtacggacac tcggaagtag gcaattttat gcttgacggc 660 

aaagtttatg agcaaaacga aatagaattt ttacccgtca atgccgaaaa tgcggccgat 720 

caattaatga ttgccaaatg ggttatccgt aatttagctt accaatatgg atatgatatt 780 

acttttgctc ccaaaattac agtaggtaaa gctgggtcag ggctacacat tcatatgcga 840 

atgatgaaag acggacaaaa ccagatgctg aaagatggtg ttctctcgga taccgctcgt 9 00 

aaagccattg ccggtatgat gcaacttgct ccttccatta cggctttcgg caataccaat 9 60 

cctacttcat acttccgtct tgtaccccat caggaagcac ctaccaatgt ttgttggggt 1020 

gaccgaaacc gttcagtatt ggtacgtgtt ccgttaggat ggtccgcaca aacggatatg 1080 

tgtgcactag ccaatccttt ggaatcggac agtaactatg atactactca gaaacagacc 1140 

gtagagatgc gttcaccgga tggctcagcc gatctttatc aattattggc aggtcttgca 1200 

gtagcttgtc ggcatgggtt tgagatagag aacgctttgg ctattgcaga gcaaacgtac 12 60 

gttaatgtaa atatccatca gaaagaaaat gcagacaagt tgaaagcttt agcccaactt 132 0 

cc.cgatagct gtgcagcatc tgcagattgt ttacagaaac agcgtactgt atttgaacag 13 80 

tacaatgtat tcagtcctgc tatgatcgat ggtattatca gtcgattacg aagctataat 1440 

gatgccactc tacgcaaaga tatacaggac aaaccggaag agatgctggc actggtgagc 1500 

aaattcttcc attgtggata a 1521 

<210> 396 
<211> 570 
<212> DNA 
<213> B. f ragilis 

<400> 396 

aatatggacc aattgcaatt aatacaaagc aaaatatatg agatacgtgg acagaaggtt 60 

atgctggatt ttgatttggc ggaaatgtac ggtactgaaa ctaaatattt aaaacgttca 12 0 

gtaaaaaata atattaaacg ttttccatca gattttatgt ttgagctaac gaaggaagaa 18 0 

ttcgacagtt tgaggtgcag ttttagcacc tcaaaaagag gcgggacccg atatatgcct 2 40 

tatgctttca ctgaacatgg agttgctcaa ctttcttcag ttcttaacag cgatttggca 3 00 

attgagatta atattcaaat cataagggca tttatagcag ttcgtcagtt aatctccaat 3 60 

cctccggttg atagagtcga taaactgaaa gaagaaatca aagcattaaa agattacatc 42 0 

gaagaagcat ttactgacta caacgatata aatgatgata cgcgcatgca attggaatta 480 

attaatcaaa ctttggcaga attgcaagcg aaaaagaaag cggaagaaaa acctcgtaac 540 

ccaatagggt ttatcaaacc taaacactaa 570 

<210> 397 
<211> 231 
<212> DNA 
<213> B.fragilis 

<400> 397 

caaaaacgcc actccggaat agaccagatt gtctgctgtc accaggtact cttctttgtg 60 

atggacaaga ccgcagaggg ctatgatgaa agcacatccc aatacggaat aggagaacat 12 0 

atagacggtg acatggggaa taaaccaaat actttgcaaa gccggcataa gcgactggtc 180 

atgtatctcc ggtttcagca gattgattat tacaaagaca gtggacaata a 231 

<210> 398 
<211> 1002 
<212> DNA 
<213> B.fragilis 

<400> 398 

atagtaaaac aaagaataat gcaattctat agtagaaatg aagctattaa ccgtataaac 60 

aaactggctg gagcaggaaa agcatttctg tttattatag attataaaca agaatgttct 12 0 

tttatagaga aagtggatga tattgattca tcggagctac tctacaatct gaacggtttt 18 0 

acaaactgca cgtctgttgt tacacctttc agatacccaa taatatggca gccccaacct 240 
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atttctttaa gccaatataa aagatcgttt gatattatac ggaaaaatat cttgagtgga 3 00 

aatagcttct taacgaatct cacttgcatg acccccgtca acactaatct agggttaaaa 360 

gatatatttt atcattctcg ggccttatac aaactttggt tgaaagagac ttttgtcgtt 42 0 

ttttctccag aaatatttat tcgtatagaa aatggaagaa tcagttctta tccaatgaaa 480 

ggaacaatag atgcaacttt accttctgcc acaagattac tgatggagga tgaaaaagaa 54 0 

gcagcagagc atgccacaat cgttgatctg atacgaaatg atttaagtat agtggcagat 600 

aatgtatctg taacccgcta tcgatatgta gatacactct ataccaatca tggtcccata 660 

ttgcagacca gctctgaaat aagtggagtt ttaccgaaaa actatgttga tcatctggga 72 0 

gaaattcttt tcagacttct tccagccggt tctattacag gagctcctaa gtacaagaca 780 

atagaaataa tagagcaagc agaagaatat gagagaggat tctatacagg catcaccgga 840 

tactttgacg ggagaaaact ggatagtgcc gttatgatcc gctttattga agagcagaat 900 

gggcaaatat tttttaaaag tgggggagga atcacctgca aaagtgattt ggaaaatgaa 960 

tataacgaaa tgaagcagaa agtttatgta ccaatttatt ga 1002 



<210> 399 
<211> 537 
<212> DNA 
<213> B. fragilis 



<400> 399 

acaatgaaac atcatgtaca ccttatcatt tattttgctt gcatttcagt tggtatactg 60 

ctgtgtgctt gtcgttcttc ttctctacat tctaatcaat tcaaagagaa tggaactttt 12 0 

cagcataatt acaatgaact caataccggt accgggacca ttgcctcaca agtcaaaacc 18 0 

actaaagacg aacacggttc atcctggaag atcacgtacc attttgacac ggcacaaaca 2 40 

cccgatccca caaccggcct acccccacta tcgggtatcg agattgaagg gagcgaaaaa 3 00 

cagagtaaaa ccgcgcagga aagtaatgac actgtacact cttcgaacag ctcttcaaag 3 60 

agagaggtat ccggtcaaac catacaaaga gaatccggga cagagaccaa gaaagatagc 42 0 

aaagtagcaa ccggtacgga tgatggcata agaaacggcc tcagtatcgg gatacctttg 480 

ctttttatca tcatagcact atcgtattat gccaagcgac agaatacatc aaagtaa 537 



<210> 400 
<211> 828 
<212> DNA 
<213> B. fragilis 



<400> 400 

gtaattttgc acaaaattaa acagatgata aagtacattg caacattgtt actgacggtc 60 

ttattcgtag catgcaataa tggcaaagga caacagccct ctgaagaaaa tgaagacccc 12 0 

aaggccaaag agattctcca aggcatttgg cttgatgatg aaactgaaac tcccttgatg 180 

cgcataatag gagatacgat ttattattcc gatgctcaaa gtgctccggt ttatttcaaa 240 

atcctaaaag atacgctcta tacctatggc aaagacgtaa cccactatca aattgacaag 300 

cagagtgaat attctttttg gttccattct ctggctgata atattatcaa gctccataag 360 

tcggaagatc ctaatgatac attggcattc tcctttaagt cggttgaaat cattccgacc 42 0 

tatacagaag tcactaagaa ggacagtgtg gtaatgttcg atggcgtccg ctacagagcc 48 0 

tatgtataca tcaacccctc acaaatgaaa gtagtgaaaa caacttattc ggaagatggt 540 

atcagtatgg acaatattta ctatgacaat gtaatgcata tatgtgtgta tgaaggcaaa 600 

aaaagtttat atgccaagga cattaccaag caaatgtttg tagatgtaat tccaacagat 660 

tttctgcaac aggccattct atctgatatg aattttacag gaattgaccg caaaggctat 72 0 

cattatcaag cactcgtctg tattccggaa agtccggtat gcaatcttgt gaatcttacc 780 

atcagtttcg atggaaaact aaatataacg gctgcaaaat ataaataa 82 8 



<210> 401 
<211> 381 
<212> DNA 
<213> B. fragilis 



<400> 401 

aatcatttta atactttaaa ttatttaatt atgggcttag atatagcaat tgcttcagct 
gtagttgaga ttattacact gatttttttc ttcgttttat gtcgaaatgt ttccaaaatc 



60 
120 
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aaaaaagaga ttgttagcaa tgacaattta 

ggagaaacgg acaaagcaaa aaaaatattg 

atcgcagcat tctgctacaa tggcaataat 

tataaaccat acttggaaat ccttggactt 

atccaagaaa gagaaaagtg a 



cctggtatgt ttgccatgta tatatccttg 180 
tataaggcga ttagtaaaga accggaattt 240 
tcagcacagc aatctacatt gaaaagaaaa 300 
gagttagatt ttgaattggt aaataagttc 360 

381 



<210> 402 
<211> 1413 
<212> DNA 
<213> B . fragilis 



<400> 402 

gtaatcatta 

agaaagggca 

atagggaaaa 

gatgccgaga 

tggtggcata 

ccatggaacg 

gagtttatgc 

gaagccgaca 

cagaaacagg 

gcgcgttaca 

gtccagataa 

tggggtggac 

catctggcac 

actttcctga 

gagacagtta 

gaagtgaatc 

gacaacggaa 

acggatcaat 

aacgacggat 

gacccggaag 

gaaagtgcag 

gcctcgttcg 

ctggttgctt 

tat gaagcac 



agaatagaat 
aagcccctaa 
ttaaatttga 
aaatgattaa 
cactttgtgc 
gtgaccctga 
agaaaatggg 
gcatcgaagc 
ccgaaacagg 
tgaacggtgc 
agaatgcgat 
gcgaagggta 
agatgctgac 
ttgaaccgaa 
tcggcttcct 
atgccaccct 
tgttggggtc 
ttccgattga 
tgggtaacgg 
atatctttat 
ccaatctgct 
atgctggcaa 
atgccaaagc 
tcgttaatat 



attaatcatt 
aattttaaat 
aggtaaagac 
cgggcgtagt 
cgaaggcggt 
tcccgtgcag 
catcgggtat 
ctatgaggcc 
catcaaactg 
agccaccaat 
tgatgccacc 
tatgtcgttg 
cattgcccgt 
acccatggaa 
gaaggctcac 
tgcaggacat 
tatcgacgca 
caacttcgaa 
cggtacaaac 
cgcacatatt 
caatgagtct 
gggcaaggag 
caacggagag 
ctactcatta 



ccggctttct 
aagatggcaa 
agtaagaacc 
atgaaagatt 
gaccagtttg 
gctgccaaga 
tattgcttcc 
aacctgaaag 
ttgtggggaa 
cccgattttg 
attgaactgg 
ctgaatactg 
gattatggac 
ccgacaaagc 
ggactggatc 
acgttcgaac 
aaccgcggcg 
ctgacacaag 
ttcgatgcca 
gccggaatgg 
ccttaccaga 
ttcgaagaag 
ccgaagcaga 
taa 



ttcttcttcc 
caaaagagta 
cgatggcatt 
ggttaaagtt 
gaggcggaac 
ataaaatgga 
atgatgtcga 
agctggttgc 
cggccaatgt 
atgttgtggc 
gaggtacgaa 
atcagaaacg 
gtgcacgtgg 
atcagtatga 
aagacttcaa 
atgaactggc 
attatcagaa 
ccatgatgca 
aaactcgccg 
atgctatggc 
aaatgttgtc 
ggaaactcag 
ccagcggcca 



caaaagaaga 
cttcccgggt 
tcgctattac 
tgcaatggca 
caaacagttc 
tgccggtttc 
tctggttacg 
ttatgccaag 
attcagccac 
ccgtgccgct 
ttatgtcttt 
cgaaaaagaa 
tttcaagggc 
tgtcgataca 
agtgaatatc 
ggtggctgta 
cggatgggat 
gattatccgt 
gaactctact 
acgtgccttg 
cgaccgttat 
cctggaggaa 
acaggaactg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1413 



<210> 403 
<211> 597 
<212> DNA 
<213> B. fragilis 



<400> 403 

atgatctata atattaacgt aataacaaat aataaattta gtatgaaaaa gaacagattg 60 

actttggttg ctgccatttt tctcagtggt actatcctat ttagttcatg tgtaggatca 120 

ttcggactgt ttaatcgtat ttcttcctgg aaccagtcta ttggtacgaa atttgtaaat 180 

gaacttgttt ttcttgcttt aaatatcgtt ccggtatatg gcgttgctta tctggcagac 240 

gctttggtta tcaattctat tgagttctgg agtggcacca acccgatggc taatgtaggt 3 00 

gatgttaaga aagtgaaagg tgagaatggt gattatttag taaagactct tgaaaatggc 3 60 

tattctatta ctaaagaagg tgaagattca gctatggagt tgatttataa taaagaagca 42 0 

aatacttgga atgttgttgc cgatggagtt agcacagagc tattgaaaat gaacaatgac 480 

ggtactgctg aaatgaactt accgaatgga gataaaatga atgtaactct tgatgcacaa 540 

ggtatgatgg ctgcacgtca ggctacaatg ggcggactgc tctttgctgc acgttaa 597 



<210> 404 
<211> 1533 
<212> DNA 
<213> B. fragilis 



163 



<400> 404 

actacaaata 

tcggtcaagg 

ccgaaaacag 

agttggtggg 

gccaaagaca 

aaacggcaac 

gggcagaggg 

cccggaaact 

gaacagatcg 

gtatgtacga 

ccggccgact 

cctactttcg 

gaaggcactc 

gtgttcaatc 

aacggcgaaa 

actgctgccg 

tcgtggatca 

gcttcatccg 

gaacgaatgc 

catgacaaat 

ggcatcgaca 

aatatgtttc 

gaactctatg 

atatataaag 

gatgccggca 

cagtccatgc 



aattaataaa 
cgagcctggt 
aggcgggcat 
agagcttgaa 
tcaaagccat 
gcacactgcg 
cattcgaggc 
ttactgcttc 
ataaaatcat 
caatagaggg 
tcctgatgca 
ccgaacaggg 
cgatcactta 
cgggtgagat 
tcaattatga 
atccccgcct 
ggcgcaatgt 
ttcccatcgg 
tcgataatcg 
cacatctgat 
ttatggaaga 
tgagttccgt 
acacagacgg 
accatgaaga 
aacaacagga 
agacagaaac 



cagtatcatg 
agatgcagaa 
cattgccata 
actctccacc 
cggcatctct 
tccggccatt 
aatcggtgag 
aaaactggca 
gttgccggga 
actctccgaa 
atattacgga 
gcgactgaca 
ccgtgcagga 
agcttccaca 
tccgcagtca 
cggcgtattg 
tgcccccgaa 
cagtgcaggt 
ggcaaccggt 
ccgggcggca 
gatgggcatc 
ttttcgcgag 
ttccgttggt 
agcattcgcc 
atacactgat 
agagaataag 



tttttattag 
acaggtaaat 
cgtcccggat 
cgatccattc 
tatcagatgc 
atctggtgcg 
aagttctgtc 
tgggtgaaag 
gattacatcg 
ggaatgtttt 
atcgatcctt 
ggtacggctg 
gaccagccca 
gccggaacat 
cgtgtcaaca 
ctctgtatta 
ggtatttcgt 
atcagcatcc 
tgcggtatac 
caggaaggaa 
cccgttaaaa 
acactggccg 
gcagccaaag 
actctcgata 
gcttacgcac 
taa 



gttatgatat 
gtgtcgcttc 
gggcagagca 
tatcggaatc 
acggtctggt 
attcacgtgc 
tggcacattt 
aaaacgaacc 
ccatgaaact 
gggacttccg 
cgctgatagc 
cccgggaact 
acaacgctct 
cgggagtcgt 
cctttgccca 
atggaacggg 
acgccgagat 
ttcctttcgg 
acggcgtaga 
tcgtcttttc 
agatccatgc 
gtacgacggg 
gagcaggaat 
agctgacagt 
ggtggaaaca 



aggcagctct 
tgcgttcttt 
agaaccggaa 
acgggtagat 
gtgtgtggac 
ggtctcctat 
gcttaattct 
ggatatctat 
gagcggtgaa 
aaacaaccgc 
cgacatccgg 
cgggctacaa 
ttccctgaat 
atacggagta 
tgtcaaccat 
gattctcaac 
gaaccgtttt 
caacggagca 
cttcaacagg 
atttaaatat 
cggacacgcc 
agccaccatc 
gggagcgggc 
cgtagaaccg 
atgtctgact 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1533 



<210> 405 
<211> 255 
<212> DNA 
<213> B.fragilis 



<400> 405 

gcgatgaata aacaaatgac aatagcgaaa aaacgctatt cttttaaaaa agcatatgaa 6 0 

agggtgccat tagggcagat tgaaagttta aaaaaagaac tgtatagtgt ctttagtatc 12 0 

aataatcgaa cctcttggta caataaactt aaaggtataa cttctcccag catagaagta 180 

gttgaagctg ttgagactgt atttctaaaa tatggtattg aaaattgttg ggaaattaca 240 

gagatcaaat tatga 2 55 



<210> 406 
<211> 237 
<212> DNA 
<213> B.fragilis 



<400> 406 

tcagaggata accaatccca tatctattta ctttgctttt ccagatctct gacagccgga 60 

atgcacatca tcaaaattga ggtaaataca attgctatcc cacaacagat aaaaatattg 12 0 

gctattccta aactatcagc aataaatccg gtaatgaaca acccaatgat agaaggcaat 180 

aagctcacgc tgtcaaaaag agaaaacaca cgtcctaaat atgccggttt aaactga 23 7 



<210> 407 
<211> 1158 
<212> DNA 
<213> B.fragilis 



<400> 407 

tttatccgta aagaaagcca gccaggaatg acattcgaat tacaatatac agacgcaaaa 
agtaatgccc gtgccggtct gattacaaca gaccacgggc aaatacaaac ccctatattt 



60 
120 



164 



atgccggtag gtacaatcgg cagtgtgaaa ggagtacatc agactgaatt gaaagaggat 180 

attcaggcac agatcattct gggaaataca tatcatcttt atttgcggcc gggactggat 2 40 

gtactcgaaa aagccggtgg attgcataag ttcaatggat tcgaccgtcc gatgctgacc 3 00 

gatagtggtg gttttcaggt gttttctttg tccggtatcc gtaaattgcg tgaagaaggg 3 60 

gccgaattcc gttcgcatat tgatggcagc aagcatatct ttactcctga aaaggttatg 420 

gatatcgaac gtatcatagg tgccgacatc atgatggcat ttgacgaatg cccaccgggg 480 

gattcggatt atgcatatgc caaaaagtca ttgggattga cacaccgctg gctcgacaga 540 

tgcattcaac gattcaatga gacggaacct aaatatggtt acagccagtc tctttttcct 600 

atcgtgcaag gatgtgtata tcccgacctg cgtaaacaat ctgcagaata catagcttcg 660 

aaagatgcag acggtaatgc tattggcgga cttgccgtag gcgaaccggt agataagatg 720 

tacgagatga ttgagttggt gaacgagata cttcccaagg acaaaccacg ttatctgatg 780 

ggagtcggca caccggttaa tattcttgag ggtattgaac gtggagtaga tatgttcgac 840 
tgtgtgatgc ctacccgtaa cggacgaaac ggaatgttgt ttacgaaaga cggtatcatg 
aacatgcgta ataaaaaatg ggaagcagac ttctctccta ttgaagctga cggtgcttcg 
tatgtagaca cattgtacag caaagcatac ttgcgccatt tattccatgc gcaggagttg 
ctggccatgc agattgcgtc tatccacaat ctggcgtttt atttgtggct ggtaggagaa 

gcacgcaagc acattatcgc aggagacttt tcaacctgga aacctatgat ggtgaaaaga 1140 

gtgtcaacaa gactataa 1158 



900 
960 
1020 
1080 



<210> 408 
<211> 1068 
<212> DNA 
<213> B.fragilis 



180 
240 
300 



<400> 408 

ctaaaaagta aaaaacaggt aatgaaaaag tttttccgat ttcaattatg ttgtatttgt 60 

cttttggtac tgattgtatc tgcttgtaag gtgaaaaggc cggacagcgt catatcagaa 120 

tcggagatgg aaaatttatt atacgattat cacattgcca aagcgatggg agagaacatg 

cctggtggtg agaactataa aaaggcattg tacgtcgaag cagtattcaa aaagtatggt 

acaacagaag aagttttcga ctcatcaatg gtatggtata cccgaaatac aaaaatatta 

tcggaaatct atgagaaagt gaacaaaaga ctgaaagcgc agcaaaatgc catcaaccat 3 60 

ctgattgcat tacgtgacaa taaacctaag atgtctgctc cgggtgacag catcgatgtt 42 0 

tgggcatggc agcgaattgc tcaattaaca gaggctccat taaacaataa atttacgttc 480 

actctacctt ctgatacgaa cttcaaaaaa cgcgatgtgt tgctttggaa aatgcaatat 540 

aacttcctga gtgaaattcc tgattcaaca atggctccaa taatggctat gcagattgtt 600 

tatgaaaacg acacagtgac ccatagttgt gtgaagcaca tttttaaatc tggcattcaa 660 

aatattcgtc tccaatcgga tacaatgaat atcaaggaga taaaaggatt tatcttttgt 720 

ccgctatctg aggaatcaat aacacttctg gtcagtgata tttcattgac ccgttatcat 7 80 

gcaaatgatt caataacaca gataggtaga gattctctaa aaactgattc aataaaagaa 840 

aaaagtaaag acgattctat tcagaagaaa actcccaaag acactattca agcatcatca 900 

ccccatcaac gtacgaatcc gaacgatctg aatcgtccta ataatgatgt ccggcctatt 960 

aaaccggaac aacgtgaaaa agagatgcag atagaaaaag agaaacagca attggaaaga 
caacaaagga ccaatccaag gaggccatta cgtcgtcaga ataattaa 



1020 
1068 



<210> 409 
<211> 183 
<212> DNA 
<213> B.fragilis 



<400> 409 

ctcttaatca atgggtccag ggttcgagtc cctgagggtg tacaaaagga gattataaat 60 

aatctccttt ttgtttttgg tggcattttg aaatattgtt ttatctttgc caccgcaaaa 120 

attaatttga tgaaaacaac ttatcagttt aacatactcg tcaatcattt ggagctggct 180 
tag 183 



<210> 410 
<211> 402 
<212> DNA 
<213> B. fragilis 



165 



<220> 

<221> unsure 
<222> (276) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 410 

tatattccca ataaacaaga tatatcactg ataattaaca ttaattttat tattatgaac 60 

tttgatttaa aagcgtttag aaaacgattt ggtttaaaac aggttgaagt ggctcattta 12 0 

ttcaattgtg ggcagagcaa tatttcagat attgagactg gaaaaagagg gcttgaagag 180 

tatcaaacaa gaattctctt cgataaatac ggagaagagg tagttaaaga gtacttaata 240 

cctgagagtg ccattcatca agggaatata aacggngata atataaacgg gcacaatgtc 3 00 

actgtaaata aagcagactt tgataaactt attagcttgt taaacaaaag ggatgaacaa 3 60 

atagatagat tattgcgtat tattgaaaat ttaaataaat ag 402 

<210> 411 
<211> 621 
<212> DNA 
<213> B.fragilis 



<400> 411 

gagaggaaca atattatgac attaaagcaa gctcagaaac tgtacgatga ttcagtgcag 60 

gcaaaaatga ctcatgccga ttattgcatg actcaatcgc aacttgaata tatcggtaga 120 

actatgtggg gattcacccc agacaaacaa gcaaaggtgt tattcaccaa agtaggaaag 180 

agggtgtcgg tagttattgc gtcacgagaa gcatttatta aagagatagg aaaacctgtt 240 

atctgcaaat gttcggtatg cgatatgtat tatttagctt atagaaagtc ggtcgatgct 3 00 

cacgatgaat taaatgccca atgtccaaaa tgtgattctc ttggttgtga ttcagatatt 3 60 

gtacattttg aaacaagccg caaattttgg ctaaacgaga agatcgttaa aatccttact 42 0 

cccaataaag accctgaacg agtggaagct atgtacgatt ccgctccgga agattttcct 480 

gcacaatatg agatgttgct tcccgatgga aagaggtgta cagattgtgt gcgatgtgcc 540 

acatgttgca gcgtatttgg tcaaaaagaa agtgccacta tttgccagtg gcatccttcg 600 

agatattcag cgggagaata a 621 



<210> 412 
<211> 690 
<212> DNA 
<213> B.fragilis 



<400> 412 

cattcactaa atagctttaa tcaccaatgt ctaattttaa aacggcaaaa catgaaaact 60 

ccatctttaa ttctgatgac aattatttta tgtaatctca gtatcccaat aaatgctcag 12 0 

atactaacct cccgccagca aaaggaagat tttgacacct tatatagctt actacatcag 180 

gtacatccgg acttatttgt gtatcaaaca caaaaagaat ttgaaaagaa acatgattca 240 

atatatagtt cgttgaataa agaacgaaac ctttctgatt tttactttat agtctctcca 3 00 

tttgttgcat ctgttaaaga tggtcatact aatttcacaa ttcctgctac tcaagacaga 3 60 

attacctatt tgaataatgg agggctgact ctgcctttac gcttaaaaat agtagagaat 42 0 

aagatattgg ttgattttcc tctaatatcc tgttcaatac aggaaaatga tgaaataata 480 

tgtatgaata atataaatag tcaaaccata ttaagccaat tgtatctctt actgggagct 540 

gaaaaaggaa acgctattaa ggaaaatcaa ttgaccagtt atctttctac tttgctatgg 600 

tataaatata attggggtga aaatatgatt ttacaattaa aagaggaaaa aagatatgga 6 60 

aagaatcatt ggatggtatc agccaaatag 690 



<210> 413 
<211> 477 
<212> DNA 
<213> B.fragilis 



<400> 413 

catattgtaa atcaacagaa tatgaataca aacaatatcg gaggagtcat tcaggcagat 



60 



166 



ttcctgttca cggatgaaat aagtttattt tcagtcatca atcactcagc cgttatcagc 120 

cttcaccggc ctaatacctg gagaaacctg cctatcacct atatgggagt ttctccagat 180 

gtggaagcgg acgacactca agccggtacg ctatataaac agacccttac catccgcctg 240 

aaacgcacag gactgacaga ttcagaactt cacatcctgc ggactatcaa tgtacgtggt 300 

tgcgtagtaa gatgcaagga tgcgaatggc aatatccgat tatatggaag caaagagtac 3 60 

ccgcttctgg gaaccgtgat agagaaaaca ggaaccaaag cctccgacct ctccggaatt 420 

gaagccactt tttccggaaa aggcgcctat cctccactac ctgttacaga gttataa 477 



<210> 414 
<211> 243 
<212> DNA 
<213> B.fragilis 



<400> 414 

aaaggatcaa aaatgggaac aggatttacg gaatacgaag aaagccttat acaagcaatt 60 

tgttcgttat attatataca gacgaggact tataaacagg gtgtgttcat aggtatgatt 12 0 

cctaaaaaca cacgtataac cttgaatggc atatatatga tgaagttatt aaataccggg 180 

aacgctgttt atattgaagt aaaaggggga attaatgtat taacaattat acatcaacaa 240 



<210> 415 
<211> 609 
<212> DNA 
<213> B.fragilis 



<400> 415 

attatgccaa 

ggtgaatccg 

tgtgttagta 

atgcacggat 

ggtatgatca 

atgatcgtag 

ggagctgccg 

gcattcgact 

cttgagggta 

tgcaaagata 

gaggaatga 



aaaaagacac 
ctctccagtt 
agaaaatgga 
gtggagggca 
accggttggt 
aaggaggaaa 
ccgctctcga 
tcagtcagct 
ttgaagtgat 
tgttgaccaa 



aacctacgac 
atcaccaaag 
aagcccatta 
agcggaacct 
cggtaacatc 
gaaggcattt 
caagataggt 
tattccccca 
agacaatctg 
acaggcgaca 



cgcatcgaac 
gagatggaaa 
attgaagacc 
gtttcccaat 
cagttggcgg 
caactcgcta 
aaatacaccc 
tcttttgaac 
gagcaacgcc 
gatattcaaa 



gctccctgtt 
ttaagaatcg 
aggagctcgt 
cacaggccta 
ccaaatcctg 
tcgacaacgg 
gttccgacaa 
cttctgacga 
gccaggaact 
ccattgaaga 



caaagatcgg 
gatgatgctt 
tacttttctc 
tcgcgatatc 
gtatcgctac 
agatgctaaa 
agacgatgac 
tgtgacgaca 
ccgcagcttg 
ggaggatatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

609 



<210> 416 
<211> 363 
<212> DNA 
<213> B.fragilis 



<400> 416 

cgtactgcaa agatggaaaa tatttttgat tctgcaaaaa caattcaaga aaaacgcaca 60 

atattaaaag gtttatcaaa gccgcttcaa attttggtga aagaggctgc tattcctacg 120 

gtaaacgatg gactgaaagc gatatacgca cagtctgggc acaccgaact taaaacgctg 180 

aaacagtgga ataaggaggg caggagtatt aaaaaaggtt cccatgcttt atgcctttgg 2 40 

ggtgcaccta agaaagtaga gacgacccaa gtggaagaag cacagggaga agataatgac 3 00 

ccaatgaatt tctatccgat ttgttttgta ttctcaaatt tgcaggtata tgaaaaacag 360 

tga 363 



<210> 417 
<211> 195 
<212> DNA 
<213> B.fragilis 



<400> 417 

attgcacaaa attgtatttt gtacgcacaa tgtatcaatc gtacaaaaac gtaccatgtt 



60 



167 



tcgtacggat ataaaccaat tataatcaac acattatgtg ataagactgc acaatttaca 12 0 

cagttgcaca aaaaaggagt accgtttttg caagggggat tagtttgttc cggtaagtct 180 

tgtttatgtc cgtaa 195 

<210> 418 
<211> 759 
<212> DNA 
<213> B.fragilis 

<400> 418 

acaattaaaa acacaataac catgaaaaaa attattttat tacttgcttt atgttttact 60 

gcaaataatt tctttgcaca aaccacagat ccgaatcagt tgaagaatga aggtaatgat 12 0 

gctttgaatg caaaaaatta tgccgttgct tttgaaaaat acagcgaata tctgaaattg 180 

actaataatc aggattctgt cacagcctat aattgtggtg tatgtgcaga taacataaag 2 40 

aaatataaag aagccgccga ttactttgat attgcgatta aaaaaaatta taatcttgca 3 00 

aatgcatata taggtaagtc tgctgcctat cgcgatatga aaaataatca agagtatatt 3 60 

gctacattga cggaaggtat caaagctgtt ccgggcaatg ctactattga aaaattatat 42 0 

gctatttatt atttgaaaga aggacagaaa ttccaacaag ccggcaatat cgagaaagca 480 

gaagagaact ataaacatgc cactgatgtg actagtaaga agtggaagac tgacgcttta 540 

tatagccttg gagtgttatt ctacaataat ggagccgatg ttctacggaa agcaactcct 600 

ttagctagtt cgaacaaaga aaaatatgct tctgaaaaag caaaggcgga tgcggctttc 660 

aagaaagctg ttgactattt gggagaggca gttactttat cacccaatag aactgaaatc 72 0 

aaacagatgc aagatcaggt aaaagcgatg attaagtaa 759 

<210> 419 
<211> 369 
<212> DNA 
<213> B.fragilis 

<400> 419 

aacatgaaaa agaaaaaaga aactccaatg catcccgttg tggaaaatat ccgtaaaata 60 

attatggata aaggaattac ccaagttgct gcatctgaac ttgtgggtac ttctgcatct 12 0 

caaatgagta aaattttgaa tggagaagta caaattagca tttggcagat ttcaaatttt 180 

gcaactaatc ttggaatgga gataatagac gtatttacat atcctaataa atatgtaaaa 240 

gcagaagaca ggaatgataa taaagaacct attgaggcag ttctccaaat taaactcaga 3 00 

aaagataaaa aagatcaagt actaaagttg atatttgggg aacataattt agaaatatta 3 60 

aacaaataa 3 69 

<210> 420 
<211> 1077 
<212> DNA 
<213> B.fragilis 

<400> 420 

acaattagaa aggagggctg cttaatggcg gtatataatc gaatccctga ccggtttact 6 0 

aacctggata tccgcgatac cctgaacgct tatggtggaa gtgtgggcga taactcgctt 12 0 

aactatttct ctgctgctgc acacattaac atgtggagca aacgtaaacc ggtgaaaaga 180 

aatatcatgt ttaatacgga ggacccgaac tggttccgtg ccgattccgg aaactacggt 240 

atcaatgtcc cccgtgcagc ggatattgcg ctactgaccg gaacttacac ctatgatata 3 00 

cctgttcagg gatcgtacaa cctgcgtgtc ggtgattttg ccggatacaa tccggaagct 3 60 

accgtaccat tcactaccat gcttccctcc ggacttatcc ttgcttccgg cagtgccact 42 0 

gttgtgaagt tgatgctgaa atcacttgat tcaacataca atgttgtccc ggccgatata 480 

ttcccctcta attcatattt gggatgtgct gtcacatacg gaaaccggac gcttattaaa 540 

acgctttcgg ttacaatttt caatggaggg gtgacactga acatatccga ttgcgagctc 600 

ttgaaatcag ataagacggg agtcaggata aaggtattca tctgtacatc gcaggttcca 660 

tcctggcagg gtgaaacgac acaatcctat tacagcttga acgcagagga cggttttgat 72 0 

gagtcgaccg ttgatattgt caccccgcat gccgatgttt actcgttcgg catccttgga 780 

cttagcatta tcgaagcgag aaagatatct ttaatcggta cggcgattat caactccgga 840 

agtctctttc aagagggccg cttaataagc agactggaca ataattacta tttaaagtct 9 00 



168 



gtaaaagtcg ttgcgacccg tgcaagtgac 
ataacatctt ccactacacc gacacgctta 
aacttcagaa caccggtctc taggtcttca 



ggtgttactg ttgccgagaa agcacaaagc 960 
ggaaacgact ggatggcagg tgagtccgtc 102 0 
ccgggggcgg gaggcgagca cgtcgtc 1077 



<210> 421 
<211> 252 
<212> DNA 
<213> B.fragilis 



<400> 421 

accctcggaa ataccggatt tactgaaagg cagtttctaa acaattcttt ttttaattta 



60 



tcaaactaca aattaaaagt tatgagtaga cgtagacaat tagagcatga agtgtcttta 12 0 

gctcaggaaa gaataaaaaa agctcccaaa gatactccta aagaaatttt gaagacgtgg 

gaacaagagt tagtcgactt ggaattagaa ctcaataatc tggttgacga cgaagaagac 

aacaatgaat ga 252 



180 
240 



<210> 422 
<211> 996 
<212> DNA 
<213> B.fragilis 



<400> 422 

aatagactga aaaacatatt gtccattgca caatttcacc aaatctgtgc tatgatattg 60 

caaaaatata taactttgca cccgcaaatt aaattaaaaa acaaaaatat gaaagcattt 12 0 

gtattccccg gtcaaggtgc ccaatttgta ggtatgggta aggacctgta tgaaacttca 180 

gctttagcaa aagaattgtt tgaaaaagca aatgatat cc tgggatatcg cattacagat 240 

attatgttca acggtacgga cgaagatctt cgtcagacca aggttactca gcctgctgta 300 

ttcctccact ctgttatttc tgcactttgc atgggtgatg acttcaaacc tgaaatgact 3 60 

gccggacact cactgggtga gttttctgca ttggttgctg ccggcgctct gtcttttgaa 420 

gacggcttaa aattggttta tgcacgtgct atggctatgc agaaagcttg tgaggcaact 480 

ccttctacaa tggctgctat tatagcttta ccggatgaga aagtagaaga aatctgtgct 540 

tctgttaccg ctgaaggaga agtttgtgta cctgccaatt acaactgtcc gggacagatt 600 

gtaatttccg gatctgtacc gggtatcgaa aaagcttgtg aactgatgaa agcagccgga 660 

gctaagcgtg cgcttccgtt gaaagtaggc ggtgcattcc attctcctct gatggatcct 720 

gccaaagtag aattggaagc tgccattaac gcgactgagt tccacacacc gaaatgtcca 7 80 

gtttatcaga atgtagatgc cctgccccat acagacccgc aggaaatcaa gaagaatctg 840 

gttgctcagt tgactgcttc tgtacgttgg acacagaccg taaaaaatat ggttgccgat 900 

ggtgctaccg acttcacaga atgtggaccg ggtgccgtat tgcagggatt gatcaagaag 960 
atcgactcta cagtttcggc tcacggaata gcataa 



996 



<210> 423 
<211> 474 
<212> DNA 
<213> B.fragilis 



<400> 423 

tgcattatga agaatttaga aatcctccct ctctctgccg agagtaaaaa gcgtattgaa 60 

gagttcgcaa ggcagtatca gcgatatgcc catatcgcta ttgagattgt gtcctattca 120 

gaaggccggc tgattgtccg tgccgagcaa aaggacctgg ttaatgataa gttcctttca 180 

aagaaagaac tgacagaacg tgtccgggac atgttcaaag atgaaattcc ggaagactgg 2 40 

aaacttactg tttctgccgt aaacttcgac cgtaaggata ttgatgggat cactctcgac 300 

tggatcaaga aacggatgga acggcttgga ttaaagaata aacatttgag caactacacc 3 60 

ggaattgaca aatgtaccgt ttcttccatc ctttccggag acaaggagtt gaccaaatgg 42 0 

cacaaagtag ctctatacta ttttttcaag tattatgaag tagccaattt ttag 474 



<210> 424 
<211> 336 
<212> DNA 
<213> B.fragilis 
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<400> 424 

cataccatga acctatcttc ttttaaactg accaatatta acgaattgat atccgtatac 60 

aaagagaatc cggagcgctt taatcgcttt tataacgcag tgtacctgct gctggatggc 120 

attccggaat gcggaagtat tcgtgtaatg gatcactgtg aggcgtcctc ctatgacttg 180 

tttataaagt gtgcatgttg gattattcag gaagagacgg aacagaaaga gttgacggat 2 40 

gcattacttg agttttcgga tgattataca attattcgcc ggtgcgcgaa gttcgtaaaa 3 00 

tccaaatcct gggttcattt ctactcacga cgatag 336 



<210> 425 
<211> 1320 
<212> DNA 
<213> B. fragilis 



<400> 425 

aagaacatga aaattgcaat tgtcggaacc ggatacgtag gtttggtcac aggaacctgt 60 

tttgcggaaa ttggcgtgga tgttacttgt gttgacacca acagcgaaaa aatagaggcg 12 0 

cttaaaaagg ggattatccc catttatgaa aatggattgg aagaaatggt catccgcaat 180 

accaaagccg gtcgactaaa atttacgact tcactggaaa gttgcctgga tgatgtagaa 240 

gtagtgttct ctgctgtcgg aaccccacct gatgaagatg gaagtgctga tttgagttat 3 00 

gtactcgctg tggcacgtac cattggacaa aacatgaaga aatacaaact tgtagtcacc 3 60 
aaaagtaccg tacctgtagg tacagcatgc aaagttcgta atgctattca ggaagaatta 
gacaaacggg gtgccaaaat agaatttgat gtagcttcca atcctgagtt tctgaaagag 

ggtaatgcag tcaatgactt tatgagtcct gaccgtgttg taatcggtgt ggaatcggaa 540 
cgtgcagaaa aattaatgac taagctatat aagccattca tgctaaataa tttccgcgtg 
atattcatgg atattccctc tgccgaaatg accaaatatg ccgcaaactc aatgttggct 

actcgtatca gtttcatgaa cgacatcgct aatctgtgtg agttagtagg agctgatgta 72 0 

aatatggtgc gtagcggtat cggttcggat acccgtatcg gacgtaagtt cctttatcca 780 

ggcattggtt atggtgggtc atgtttcccc aaagacgtaa aagctttgat aaagacagca 84 0 

gaacagaatg gatatcagat gcgtgtgtta caggcagtag aagaagtgaa cgaaaatcag 900 

aaaagcctgt tattcgacaa actggtaaaa caatataatg gaaatctgga aggtaaaaca 960 

gttgcattgt ggggattggc attcaaaccg gaaacagatg atatgcgcga agcacctgca 102 0 

ttggtcctaa ttgacaaact gttgaaagcc ggctgcaaag tacgggccta tgatcccgca 1080 

gcagcaaatg aatgtaaaag acgaatcggc gaaaccatat actatgcacg cgacatgtat 1140 

gatgcggttt tggatgctga tgctttgatg ctggtaaccg aatggaaaga atttcgtctg 12 00 

ccttcgtggg ccgttgtgaa aaaaacaatg tcacaacagg tagtcatgga cggacgtaac 12 60 

atttatgata aaaaagaaat ggaagaacag ggttttattt accattgtat cggcaaataa 132 0 



420 
480 



600 
660 



<210> 426 
<211> 501 
<212> DNA 
<213> B. fragilis 



<400> 426 

aacaaaatga 

aatgtagcta 

gagaaagaag 

aagaaagaga 

accgataagg 

cggaaacagc 

gtagacgcaa 

tctgattgga 

gaattggaac 



aaaatgtatc 
ataagaagaa 
aaacgaaaga 
gttcttccgt 
cggagcgtgt 
ttgaaagctt 
aagggctttc 
tgttagattt 
ggctaaatta 



gagcgcaaaa 
tgaaacagcc 
acaggtttcg 
agtagccgca 
ttatctgctc 
tactatctca 
catttctaca 
aaataatcac 
a 



agcgcagagg 
cctctaattg 
gccaaagttg 
cccaataagc 
cgtcagaaat 
catgataaaa 
agtaatcccg 
ttggcgaaaa 



ctaaagccgt 
tgctgccatc 
aaactcccgt 
gtctaagtat 
atcaagaagt 
ataatgccca 
ttgcaattgg 
ccgaagaaga 



agtgttaagt 
ccttccaacc 
tcaaacttcc 
tgatgaactg 
gagagaaaag 
acttactttg 
taagttgtta 
aattcgttca 



60 

120 

180 

240 

300 

360 

420 

480 

501 



<210> 427 
<211> 249 
<212> DNA 
<213> B. fragilis 



170 



<400> 427 

aaagaaatga atagtgatgg taataaaatt ctggatgcta ttaagagaat ggcagcagat 

gacaataaag gtttaagaat gaccactacg atagtcgatg ttaaagatga tccgctcggc 

tcaatcgttg gctttgggac tgaaaaagtt tgcggagatg atgcatttgc ccaaacaatg 

ggtttaccag gtaagtatat ggcatgtgcc ttttttatag atagagaaga actaaagaaa 
tacctttaa 



60 

120 

180 

240 

249 



<210> 428 
<211> 525 
<212> DNA 
<213> B.fragilis 



<400> 428 

ctaaaaccga 

ttgacttctt 

cctgatttca 

aaaggtaaat 

aacgcaagtt 

tcatttgacg 

cccacctgtt 

aaccggggtt 

tctgctgcag 



attacatgag 
tcgtagaaaa 
caatcgaatc 
atgtgctgct 
tgagcaatgc 
aataccagtc 
tcgcggaaac 
tcactaacta 
aactttctgc 



gcatgtaaaa 
agacaaaccc 
tacgtcagat 
tagtttttgg 
gcttcgatca 
ggtatttcag 
taaaggcgaa 
tttattggat 
ttatgcaaac 



tggatttttg 
accggaggtt 
gcacagtaca 
gcaagttacg 
acttctcaag 
gaaaccattc 
agctccggct 
ggaaatggtg 
aaaatcaaag 



ttgtattact 
tgaatgtggg 
attttgattt 
atgcacagtc 
atgtggaaat 
gtaaggacca 
tgtttaagaa 
tgattatagc 
gttga 



aattagttcc 
tgacgtagcc 
gaccgactta 
ccggatgcaa 
ggtttccgtt 
aatagttacg 
ataccgttta 
caaaaacatc 



60 

120 

180 

240 

300 

360 

420 

480 

525 



<210> 429 
<211> 564 
<212> DNA 
<213> B. fragilis 



<400> 429 

tatcatcaga 

ccgaagattt 

gatgctacta 

acgttacgcg 

atcaaaggtg 

aagcatatca 

tttgcccatg 

atatatgccc 

tggcctattg 

cgggaagcag 



aacgaatgaa 
ttttcgatcc 
tcggacagat 
gactccatta 
aagtgctgga 
gcgtattgtt 
gatttttagt 
cccaatctga 
ccgattctca 
aatattttga 



ttatatacaa 
gcgcggatat 
aaattttata 
tcaaaaagga 
tgtggctgtc 
aagcgacgaa 
gaaaagcgaa 
ggcttctatc 
acttgtcatg 
atag 



acagaaatag 
ttcatggaag 
caagataatg 
gcctatagcc 
gatttaagaa 
aataaacgcc 
atagctatct 
ctatacaatg 
tcagagaaag 



atggtgtgtg 
cattcaagca 
aatctcaatc 
aagccaaact 
agtcatcacc 
agctttttat 
ttacttataa 
atccggcatt 
acaagcaggc 



gatcattgaa 
acaggaattt 
ttcattcggc 
agtgcgtgtc 
tacatttgga 
tccccgtggg 
ggtagataat 
ggctatcgac 
aggagccttt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

564 



<210> 430 
<211> 621 
<212> DNA 
<213> B.fragilis 



<400> 430 

ggagataaga 

gaccagacgt 

cttatgtcta 

ggcatagcag 

ggctcatccg 

acggctgtaa 

cttggagagg 

actatccttg 

gcgtcagact 

acccgccttt 

attaaaaaca 



atatggaatc 
ccgaatctag 
ttatcaaaaa 
aagattttaa 
gatatatgtt 
tctggaaaag 
tctctgcaat 
ccgagggaga 
atgctactgt 
ggcttcctgc 
aggaggaata 



aaagtttcat 
gctttatatg 
ggatatcgga 
ggacgtattt 
aatatgggac 
tgaacgagcc 
aacctgtgaa 
ttccgttgtt 
cgtaaatatg 
acgcggaagc 
a 



gaactgaaaa 
gatatacagc 
tatctggcaa 
ctatctgccg 
ggcactgcgg 
ttcatcaaag 
cgttcgatgg 
ggagtttcag 
aactgtccca 
tttgcagccc 



acaggctgct 
tggctcaaaa 
aggaaggtat 
gcataaaatg 
ttgatatttc 
gacgcgcatg 
tcattgcggc 
gctatcatgc 
acattgacct 
gaaaaaattg 



gaagaatatt 
ctgcgaaact 
cctttccccc 
taactccgga 
cggaactgcc 
cgctttcctt 
aggaagctcc 
ctctgtaaag 
tcgtgacaac 
tgatataatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

621 



<210> 431 
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<211> 225 
<212> DNA 
<213> B.fragilis 



<400> 431 

ccaaaaaaga aggaaggaaa acctatgttt aaagatataa tcgaattaga taaacaagtc 60 

gtagaccgga tcgtagataa ggtccacgaa aacaatttag aaattgagat ggaaatggga 12 

gttgtaaagg acggtatggt taaagtcctt ttcctctata aagatccgga acttctgcag 18 

agcgtgataa acgaatccgt tactgaagag tacgatctcc cataa 22 



<210> 432 
<211> 687 
<212> DNA 
<213> B.fragilis 



<400> 432 

acagaaaaga atgacacaat gagtaatata cctgttatct ttcgtttttt aaaggacctt 60 

actgcgaaca acaatcgcga gtggtttaat gaacatcggg aagaatatga aatagcccgt 12 0 

ttagaatttg aaaatttcct ttccacagta attgcccgta tttcactttt tgatgaaagt 180 

attcgtggta ttcaacctaa agaatgcact tatcgcattt accgggatac ccgcttttct 240 

tccgataaaa ctccctataa gaatcatttt gggggatata ttaacgcaaa agggaaaaaa 3 00 

tcctatcaca gtgggtacta tatacatata caacctgagg gttgcatgct ggctggagga 

agtttatgct tgccttctaa tattttgaaa gcacttcgcc agtctatcta tgataacatt 

gatgaatatc gttcgatagt ggaggatcct gaatttcagc aattcttccc cattgtaggt 

gaagatttcc tgaaaacagc tcccaaagga ttcccgaaag attttaaata cattgattat 540 

ttgaaaccta aagaattcac ttgtgcttat tccgtcccgg acagtttctt tttgactccg 

gatattctgg acaaaataga agaagtgttc cggcaattta aacgttttgc cgactttacg 
aatttcacta tcgatgattt tgagtaa 687 



360 
420 
480 



600 
660 



<210> 433 
<211> 342 
<212> DNA 
<213> B. fragilis 



<400> 433 

ttaagagaga atatgaagcg ttttgctgca cattatttat ttgttccggg aagtggattt 60 

ttaaagcaat atgcgataga aatagaagga ggatatattt gtcatatctt tcctttcagc 12 0 

gaagaaatag agtctgtaga atggtttccg ggtgtcatac tactgactcc acaagaagaa 180 

tcagatataa atactttgtt taactttact aatatagaaa aacaaagtat ttatattccg 240 

aaagttacca tagatatgaa atggcgggct tatttattat atcctttcaa ttttgttaca 300 

atgcagcctg tcgccgaaac tctgcacaga caattgcagt ag 342 



<210> 434 
<211> 1074 
<212> DNA 
<213> B.fragilis 



<400> 434 

agagaaagga aagatatgga aataataaaa accggattgg ctgcttttgg tatgtcggga 60 

caggtgtttc acgctccatt tatcagcacg aatcctcatt ttgaacttta caaaatagta 12 0 

gagcgtagta aggaactctc taaagaacga tatccgcaag catcaatagt acgtagtttt 180 

aaggagttga cagaagatcc tgaaatagat cttatagtcg ttaacactcc ggacaataca 240 

cattatgaat atgccggaat ggctcttgaa gccgggaaaa atgtagtagt tgagaaaccg 3 00 

tttacttcta ccaccaaaca gggtgaagaa ttaatagctt tggctaagaa aaaaggtttg 3 60 

atgctaagtg tatatcagaa tcgcagatgg gatgcagatt tcttaacggt acgtgatatt 42 0 

cttgccaaat ccttattagg acgtttggta gaatatgaat ctacatttgc tcgttatcgt 480 

aattttataa agcctaatac ttggaaagag accggagagt ccggtggtgg attaacctat 540 

aatttgggtt cacatctgat cgatcaggct attcagcttt ttgggatgcc tgaagctgtt ^ 600 

tttgcagatt tgggtatcct gcgtgaagga ggaaaagttg atgattattt tataattcat 660 



172 



ctgttacatc cttcgttggc accaaatgtg aaaatcacct tgaaagcaag ttacctgatg 720 

cgagaagccg aaccacgttt tgccttacat ggaacactag gttcgtatgt taaatatgga 7 80 

gtcgataaac aggaagctgc tctattagct ggtgaaatac ctgaacgtcc gaattgggga 840 

gaagaatcag agcaggaatg gggattatta catacagaaa taaatggaaa agaaatctgc 9 00 

cgaaaatatc cgggcatagc cggaaattat ggtggctttt atcagaatat ttatgaacat 960 

ttatgtttag gacaaccatt ggaaacacat gcacaagata ttttgaatgt gatacgaata 1020 

atcgaagcgg cttatcaaag ccatcgagat aataaaattg tcaatcttaa atag 1074 

<210> 435 
<211> 546 
<212> DNA 
<213> B.fragilis 



60 
120 
180 
240 



420 
480 
540 



<400> 435 

ataataaacc taaatatgac ggctaaattt attattatgg tattggtatt agcttatatc 

atggttataa tcgctatatc aatttattta ataaagataa tatgtactcg ttacaaccaa 

aactcagatc agatactccc tcctcccaat atgcactcta ttcaggagag tgcatccatg 

catttggtaa gaataggaca gttgcctcac ccaggacctg gatattgtta ttacgaatta 

ggaggaatga gatatcaagc gctaacagga tttgacattg gcgtacacga aggatatgca 3 00 

aaagcagagc ttaataatcg gtatgataaa tatgcggttg gagtctacag agaaggagat 3 60 

cacaaattaa tgggatacgt tcgaagagaa caaaatagag agctttatga atttatgtta 

aataataatt gtatagctaa agctaaattt cgaatatgga tacaccaagg agaaatctat 

ggagcagctt acataaaaga agaatggaaa tcttcattag gctttaagtc tgacattaaa 
atttag 546 

<210> 436 
<211> 525 
<212> DNA 
<213> B. fragilis 

<400> 436 

aaactaattg aaatgttgaa cgaaaaaaga actcaaagaa ttatgaaaag taaattcctg 60 

atatttttgt cggcagtagc catgctgtta ttatttagca attgtggaag caaaacaaca 12 0 

agtaatgatc aggccactac cgaagtgaaa gacactgtca cttcaaaaga agaagctgta 180 

ccggatagtg tatctatctt gggagaccag gtatatgata tagtgaacac agctcccgaa 240 

tttccgggag gaatgaaagc gtgtctcgag tttctctaca agaatattac ttatccggca 3 00 

caagctattg aaagtaagca ggaaggtcag gttgtgatac agtttgttgt taccaaaaat 3 60 

ggtaaaatta ttgatccgaa agttgtgaaa agtgtatctc catcacttga cgcagaggcc 42 0 

atacggatca taaatttaat gcctgactgg actccgggaa aacaaaaaaa tggtcaggaa 480 

gtgaattcac ggtttacact tccagtccgt tttacactta aatga 525 

<210> 437 

<211> 438 

<212> DNA 

<213> B.fragilis 

<400> 437 

accatgtatg atattgtagc gcagaggctt agactgtttt tagcaaagaa agatatcact 60 

tgtaaaaaat tgtcggctat gatttttatg tcagaggcga cgcttaaagg caaattgaat 12 0 

ggtacaagaa cgctagatct taatacaata atatccattg caatacggct tgaggatctt 180 

tctgttgaat ggcttcttcg tggcgaaggt gatatgttta aatctagttc tggtgtttct 2 40 

attttatctt catcagtacc tatatttaca ggggagacct cgtttatata cagtatgtat 3 00 

aaagaagaaa gagaagaggt taaaacttta ttaaagcaaa atggtatatt ggaagagcgt 360 

attcgtcagc tcgaggatga caatagatta ttaagagatc aagttgtaac agaattaaac 42 0 

ctaaatacta aactgtag 43 8 



<210> 438 
<211> 369 
<212> DNA 
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<213> B. fragilis 



<400> 438 

aagatgcatg atattgtaac gcaaaggctt aatcaatttt tagttgaaaa gaatattact 60 

tataaagaat tatctggtat gattcttatg tcggaaacgt cactttgtag aaagttgact 12 0 

ggttcaagga gtcttgattt gcatacatta atatctatag tagcatgctt gccagatgtt 180 

tcttccgagt ggcttcttag aggcaaaggt agggtgtgta attcttcttc gagcattagt 240 

tccgatgtct tagtagaaga acttaaaatg gaaaataacc tattaaaacg aaaaattcaa 3 00 

gttcttcaag aattgttgga gtttaagatg gaaaaaatca gagctgagaa tggtaacata 3 60 

aaaaaatga 3 69 



<210> 439 
<211> 912 
<212> DNA 
<213> B. fragilis 



<400> 439 

cagaaaagtg accattatgt ttcttctttg ttatatagtg ttttcattta tcatatggta 60 

agaaaaagtt caataaataa atacgagtta gacgtcagaa aggggttaca agaactcttt 12 0 

gacaaatgtc gacacaatat gaagcattct ggggatttat tattatgtca acaaaatggc 180 

ttcattgact acaaaggtcg cccatgtgtt ggattaggtg atgaagggct taattgtatg 240 

caacaagtca attttatttc gtttaatgga ataggaaata ttactgatga caatgattat 3 00 

tataaaaaag aaggaaataa ctttttttat ggtaattctg agtttgaagc tgatattatg 3 60 

agacaacata ttacctatat gaatatatgg gaaaattctt actttttacg ggtattcact 420 

caagtggtaa acgtgttaaa tggtttaaat tataattgga atttgacatt caagaatctt 480 

aagcccaatc aaaaaagcga acaaataaga gaaggtataa taaaattatt agatctatcc 540 

cccaacttcc aacgtatact taaagatgca tatgtcggac aaatacgaaa tgctgtcgct 600 

catacacaat accattgtat tcaaggagga atcttatatg acaactactc accatcaagt 660 

aaatattcta tcctgcaagg tctttcttat gaagaatggg agaagaaata tgtctactct 72 0 

tttttcatat ttataggtat attccaaatg ttaaaacaaa tcacaaacga attttatctt 780 

ccttgttccc aattaacctt tgcaaaggga gttccaattc aaataccact ttcggacaac 840 

aaaggatatg cagagactta tttatatccg aatcaaaaag gagatatttg gagatttaca 900 

agaataattt ga 912 



<210> 440 
<211> 213 
<212> DNA 
<213> B. fragilis 



60 



<400> 440 

gcatatcgcc taaacgagaa acaattgtcc atgtatatcc atcatttgat ggtaaaggac 

cacaagagaa aatattatta tctatcatat gaaaagagat tcatatttat aattaattta 12 0 

gtttctgcta aattacaaaa tagtaatgga ttaaaaaaga aaaagcagag taatagctct 180 

gctttaatat gtttctacag aaatatggcc taa 213 



<210> 441 
<211> 246 
<212> DNA 
<213> B. fragilis 



<400> 441 

cggtggcgag aaacttctgt taataatttc tcttctttgc agtcttgttt tataaatgta 60 

aatgagatca aggtacgttt tgggggtgct cccggtttag tgatgaccaa gagctggcgg 12 0 

gatggataca ggccttgtga agatgcgatg tctttaaaag aatcacttgc atccatcggt 180 

atgactactg taaaagtacc atttggcgaa agtaaattcg atactccctt cagcaactct 240 

tcataa 246 



<210> 442 
<211> 210 
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<212> ,DNA 

<213> B. fragilis 



<400> 442 

agtaagcttt ttgcccgttt gcttacttta attcatctaa aagtcggtag ctccacccct 60 

ttcagcttgc ttctgaaagg ggtgaaacta ccgactatgg tttgctacgc ttccggttta 12 0 

ccggattctc aagaagtgca tgttagccaa aaggatgtat atgctgcatt cggtcggtat 180 
ttacttcgat ttgctttcaa tgtggaatga 210 



<210> 443 
<211> 216 
<212> DNA 
<213> B. fragilis 



<400> 443 

tataaaatgg aattagagac aattggagaa aacgccggca aagtatggcg caccctgaat 60 

gaaatgaggg gagaaatatc tattcaggaa cttagtcgga aaattaacct cagcgccgaa 12 0 

gacgttgcac ttgcggtagg ttggttagcc agagaaaata atatttttat tcagagacac 



180 



aactacctgt tatacgtcag tcatgatgct ttctga 216 



<210> 444 
<211> 807 
<212> DNA 
<213> B. fragilis 



60 



<400> 444 

atgaaaatgg aaaatagtgt attaaccgga aaaccttata acatcggata tgccttgagc 

ggaggcttta ttaaaggctt tgcccatttg ggagttattc aggctttatt ggaacatgat 12 0 

attaaaccgg atattatctc aggagtcagt gccggggctt tggccggagt attttatgcc 180 

gatggcaacg aaccctatag ggttttggac tacttttccg gacataaatt tcaggacttg 240 

acaaaacttg taattcctaa agtaggctta tttgctttgg gagagtttat tgattttttg 300 

aagtcaaatc ttaaagctca gaagctggag gatttaaaac ttcctcttat cattactgcc 360 

actgatctgg atcatggtcg cagcatgcat tttcataaag ggaatatagc tgaacgggta 42 0 

gctgcttcat gctgtatgcc ggtgttattt acacctgtaa aaataggaaa tacacattat 48 0 

gtggacggag gacttctgat gaatttacct gtatctacca tccgaaatga atgtgaaaaa 540 

gtggtagcag tgaatgtcag cccgttgatg gcagaaaaat ataaaatgaa catcgttagc 600 

attgccatgc gttcttatca ttttatgttt cgtgccaata cgtttccgga gcgagacaat 660 

tgcgatttac taattgaacc ctacaaccta gagggttata gcaatactga acttgaaaag 72 0 

gccgaagaga tttttgaaca aggctataac actgcttctg aggttctgga ccaactaatt 780 
gaagagaaag gaaagatatg gaaataa 



807 



<210> 445 
<211> 1221 
<212> DNA 
<213> B. fragilis 



<400> 445 

agggacgcgg acagtttaca acttttttgc caacactttt gttataattg gataataaac 60 

ctattttatc caatgaaagt acacgaatat caggcaaagg agattttctc cacttacgga 12 0 

atacctgtcg agaggcatgc tttatgccat acggcagatg gggctgtggc tgcttatcac 180 

cgaatggggg taaaccgggt agccataaaa gcccaagtgc tgaccggcgg gcggggaaaa 240 

gccggcggag taaagttggc caataatgat agagatgtct accaatacgc tcaaactatt 3 00 

ttggagatga ctataaaagg ttatcccgtc accaagattc ttcttagtga ggctgtcaac 3 60 

attgcagccg aatattacat cagttttacg atagaccgta atacgcgctc tgtcacgctg 42 0 

attatgagtg cggccggtgg tatggacatc gaggaagtag cccgccaatc tccggaaaag 480 

ataatacgtt gcagcattga tcctctaatc ggagttcccg attatctggc acataagttt 540 

gctttctctc tctttgaaca agctgagcaa gctaaccgga tggcaactat tattcaagat 600 

ctttacaaag catttattga aaaagatgct tcacttgctg aaattaatcc attggtactt 660 

acccctgttg ggacattatt ggctattgat gccaaaatgg tttttgatga taatgcactt 72 0 
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tatcgtcatc cggacttaca gaagttatca gagcccacag aagatgagaa gttggaagcg 780 

attgccaaag aaagaggatt cagctatgtg cgcatggacg gtgagatagg ctgtatggtt 840 

aatggagccg gtctggctat gacaactatg gatatgatca agctttatgg aggaaatccg 900 

gctaatttcc ttgatattgg cggtagttca aatcctgtca aggtgataga agctatgaga 960 

ttattgctgg atgacaaaaa agtcaaagta gtctttatca atattttcgg aggtatcacc 1020 

cgatgtgacg atgtagccat cggactctta caggcgtttg agcagataca aacggatatt 1080 

cctattattg tgcggcttac aggcactaat ggaaatatgg gacgtgaatt attgcgtaag 1140 

aataaccgtt ttcaagtggc ccagacaatg gaagaagcta ctaaaatggc tatagaatca 12 00 

ttaaagaaag aatcgatatg a 1221 

<210> 446 
<211> 1443 
<212> DNA 
<213> B. fragilis 

<400> 446 

catcaaagtt tatttctaat gaagaagaaa caacctgagc cccaattatt tcaaaaagga 60 

tatgaaactt atgcagtcac caaaggcgga aaaggaatca taaagttcag tgataatagc 12 0 

gatatcacaa ctgaccggga gacctctacc gttgaagtag ttcccaaagg gaaagaggct 180 

ccaattaagt ttgttcccag agggcggaat aacaacatga tgtatgacat tatgaagaag 240 

atcggagcaa acgtaactgt cggcagcaat gtggaattca aaaacaaggt agtatatgga 3 00 
gatagtgtcc tcgtatatcg taaataccgg gataaggaaa cccgaaaaat catcaaagaa 
gaagtcttgc ccgaagaata cccggatata ttcgatttta tagaaaacaa cgacatacca 
tttatccgga tggagatagc gaatgattta gtgatcttct acgatgcata cgtcgaatat 

atttttaatc aggacactca gcccagactg gtacaagtaa aggcaaagga agcaacctgt 540 

tcacgtatta gcgtaatcga tgagaggacc ggcaagagtg aatatcatgg ttactcagcc 600 

aaatggcatg aaggtatgcc ggatgatgta attgcgacgc cactactgga ccgccaggca 6 60 

cctttgcggg atttaaagac acgaatgggt ttgtttccca atgaaaaggg aataaaagag 720 
atcgtcaaag accgccgctt catccataac attcgtatag cgactcccgg acgattctat 
tacagtaaac catattggtg gagtgtattc gtttccggat ggtacgactt tgggaatgcc 
attcctatct ttaagaaggc tttgatcaag aatcaaatgg cattgcgcta tatcgtctac 
atcaaagagg atttctgggg aaaattatat gcagatgaaa aaattacgaa cgaagcagac 

caggctgtac ggcgagagac cttccttcag gacatgaatg actttcttgc cggagaagag 102 0 

aatgcaggta aaggcttcgt gtcccatttt cgttatgacc gagtaaaagg atttgaggat 1080 

aaggatatca tcataaatac tttagattcc ttcttcaagg gtggcgaata cattgaagac 1140 

agcgaggaag taagcaacac catctgctac ggcatgaatg tacatccctc catcattggt 12 00 

gccgctcccg gcaaaggtaa gagtattaac ggtactgaag cccgtgagct gttcatcatc 12 60 

gaacaagcct taatgaaaat gtttcaggaa gccacgctca ctccccttta ttttgccaaa 1320 

gccgtaaacg gatggccgaa agatatctac ttttccgtca ccaactgtca gcttaccacg 1380 

cttgacaaag ggacaggagc tactaaaaat acaggtttaa cctcagaaac agaagaaaaa 1440 

tga 1443 

<210> 447 
<211> 645 
<212> DNA 
<213> B. fragilis 

<400> 447 

tcaaacatac tgggaggatt aacgatggga tattataaaa gattaagtac ctatcgtgct 6 0 

gaagtcaaac gctataacgc ctcccgccga aaagccacac agttgactaa tgccccggca 12 0 

tccggactga tccgccttga aaccgtctca gaaaccgaac gcttttcaat ggctcaggat 180 

gctgatagac tgactgcata taacaaggcc gttgaaaagt ggcaagatag tgtggcccga 240 

caattacgag ccggaatagc cggccgcagt atgcgaatag cccgtgaact tgagccacgg 300 

gcctacaccg acaaatacgg tattatcaac cgtcttggtt tctccttccc tcgacatgga 360 

atctacatcc acaagggcgc cggcgaaggt cagggtggct tcatcggttc caaatggaat 42 0 

tacctcaaaa aaattaatgg agttgagata gataccggta ttgtacgtca tacaaatctc 480 

aaatcactcg gacgacagaa tgaaggcaac cgccgggcct acgaatggtt tgaccctgta 540 

attcgtaacc ggatcaatga attagctgat atcgtcaccg attatttcga cactatgctg 600 

attgatgcta ctcgaatata catagataaa cgaaacagtc tctaa 645 



360 
420 
480 



780 
840 
900 
960 
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<210> 448 
<211> 2202 
<212> DNA 
<213> B.fragilis 



<400> 448 

cacatgtcac tcaagataaa aaatcaatta ggaatattcg atcttcaaaa cgatttcagc 60 

atcgagatcg aagacacctc ccctatttac aacgaacgtg gctcacaatc cgtaccggcc 120 

acgcttcctg cctcccgaaa caacctttca ctgatcaccc atgtccatcg tccggatagt 180 

acctactccc ctgccccgga tgcccgtgtc accgtctccg atggtgtcta caaccgaata 240 

ggtaagatga acatcacaca ggcctccaaa tccggaggaa tcgtatccaa tataggtttt 3 00 

gacgagtctg aactctactc ggaatggaat gctgtttcac tccgttccct ctctgctccg 3 60 

gttattcgtc ccgaaggagg aacaaccggc gtcatcagcc tgctcaattc tattatgaat 42 0 

gaaacaatcg tagacgatgc tctttccatt tttcccattt gtgtatccat tccatcacat 480 

gcaacgaccg tggacgatac ggaaaccacc acctactacc ccgaatacat caacaagata 540 

actaaattag agaatggtac ctactccctt cagggagctg ccagacagga aacattcctt 600 

atcaataacg aacccgtcct tacttccgtt cccgaaggtt atgccatcag cccattttta 660 

aaagtatctt ggatactcaa ttttatattc gtccggtacg gttatacggt ccttgaaaat 72 0 

ccattctcaa cccaccgtca actctcccgt ctggtagttc tgaacaacat ggccgacagc 780 

atagtcaagg gcttcattga ttactctgac cttctacccg attgcacgat taacgagttc 840 

ctacaagccc tctactgtcg ctttggtatg gtgtattttg ttgatggtaa aaataaaacc 900 

gttaatctca aatttatcaa agatatcatc tcaactccgg cctcactgaa ctggtccctg 9 60 

ctcaagtcgg cccggcctgc tatcaactat gccgctgcac agcaactcaa actttccgca 1020 

tccaccaata tctccggtcc ttataccaat ttagtagcta ctcctactgc cgactcactc 1080 

gacaaatttc ttaaaacctt tggccatgtc ttgtcaagta acacagcaaa aggatatctc 1140 

acctattctt tatgggatgg attttattat gtccggaaca atctgaccgg agttcgtgaa 12 00 

gcccgcagct ctgacttctt cccctgggat aaaggggcaa acatcagtta tatggagata 12 60 

tcatctattg atgaatgcct gccgatgaaa ggttcttacc ccgatgacca accggtttgt 13 2 0 

cctgcctatc tcctgggaaa agtacacaaa tataccaata tttccagcgc cagcgtagaa 13 80 

ctatcagagg agcaaaacac ccaaactcct ctatgctttt gcttttccat gccccgtgca 1440 

tccactccct acccctatgg atcgccaaga tgttacacac ccggtggtga agctattgcc 1500 

atcaacggac acacatttga tatctccatg acctttactg gtgataatgg cctgttctcc 1560 

cgtttttgga agggatttga cgctattctc cgacattcca atcatacggt tgaagttccc 162 0 

gtacacttga atccaattca attactcaat attgatttca gtcaaacgat caatatagat 1680 

ggtcaacgat tactccttga tacagtgcgc tatacattac ccaaacttct ttcacgcccg 1740 

gctactatcc gtcttcgtac ccttcgtctc ctaatccctg taggggaaac tgatttggac 1800 

ttggatgcag agcaaggaat acaaacgatt gagcaactct acaaatgggc gtttcacaat 1860 

aatcgtgaaa acatagtaga actcaagata cgggcacaag tcgaggagtg gaagaaggct 192 0 

attaccccac cggcgcaatg gctcggagtg ctacgtaaaa acgaggtaag tgatcaggtt 1980 

tcggatattg agataccgtt tactgtaccg actcaagaag attatgaagc caacaaagag 2040 

ttcttcatca aagaaatcaa ttacagtttc gacctttact ataaggtccg ggttcccaat 2100 

gggcagacat ctcaaggtga tatcatctgg aaagataaag aatacggagg cgtacactat 2160 

gccattactt acgggctttc cgttaaagcg gaactgcttt ag 2202 



<210> 449 
<211> 222 
<212> DNA 
<213> B.fragilis 



<400> 449 

cagtttccac tactgccact ggccaggctt ataaatctaa tcttcattat tctcttcttt 60 

ttaatgcaaa ttgccggcaa cttttatggc aaattgtcgg caaatataca taaaaaagga 12 0 

aacaatcaga cgttagtttt cttcttttta tcgatgaaat cagtgatttt tgcaaaaata 18 0 

agcaaagcaa ttgctgcaac caaaccgatt gccgcaaagt aa 222 



<210> 450 
<211> 450 
<212> DNA 



177 



<213> B. fragilis 
<400> 450 

aaatataaac ttatgattta caaattttta tttcctctaa aaccagacag tgccggagca 60 

tctctattct tgttgattct aagaatttcg tttggcctgt tgttgatgaa tcatggtata 12 0 

caaaagtgga gcaattttca ggaactttcc atatcctttc ctgatccatt agggctgggt 180 

tctcccctct ctttaggatt ggctgttttt gcagagttag catgttcaat ggcctttatt 240 

ataggcttct tatatagatt ggctatgatt ccgatgattt ttactatggt aattgcattc 3 00 

tttgttattc atgccaatga tgtttttgca atgaaagagc tggcgttagt atatctgatt 3 60 

atttttgttt tgatgtatat tagtggtccc gggaaatatt cagttgatta tgtgatagga 420 

cgacaactca aaaacaaacg aaaattgtaa 450 

<210> 451 
<211> 240 
<212> DNA 
<213> B. fragilis 

<400> 451 

tttgagattt gtatgacgta caataccggt atctatctca actccattaa tttttttgag 60 

gtaattccat ttggaaccga tgaagccacc ctgaccttcg ccggcgccct tgtggatgta 12 0 

gattccatgt cgagggaagg agaaaccaag acggttgata ataccgtatt tgtcggtgta 180 

ggcccgtggc tcaagttcac gggctattcg catactgcgg ccggctattc cggctcgtaa 240 

<210> 452 
<211> 666 
<212> DNA 
<213> B. fragilis 

<400> 452 

ctcgtaactt gtataactga tagcttaaca aagatgaaaa aactactcac gaaaggacaa 60 

atcgctatac tcgtcatttt ttctgtcttg attattgatc aggtcataaa gatttggatc 12 0 

aaaactcata tgtattggca tgaaagtatt cgcattacgg actggtttta tatctatttc 180 

actgaaaata atggtatggc gtttggaatg gagctttttg ggaaactctt tttaactaca 240 

ttccgaatcg ttgcagtagg attaatagga tggtatctat acaaaatcgt aaaaagagga 3 00 

ttaaagaccg gatatattat ctgtgtatca ttaattctaa ccggtgcatt gggtaatatc 3 60 

atcgacagtg tattctatgg agtcatcttc aacgaaagta cacattcaca aatagccagt 420 

ttcatgcctg atggcggcgg ttattctact tggttctatg gtaaagttgt cgatatgttc 480 

tatttcccga tcattgatac caactggccg acatggatgc cttttgtcgg aggagaacat 540 

tttattttct tcagtccgat ctttaatttt gcagatgccg ccattagttg cggaattatt 600 

gccttattac tattctacag caaatacctg aatgattcat atcatcattc tgtgactaaa 660 

aagtaa 666 

<210> 453 
<211> 1005 
<212> DNA 
<213> B. fragilis 

<400> 453 

ccgtatcatt gcaacaaaat aagtgcaatg agccaaaaac gcatcatctt atcagattca 60 

tcactcaacc ggtacggcta ccgggttctt actgctggac ttcttcttga agctttcatt 12 0 

gacaacccgg tgatgctgta tgggcatttc cgtgatgaag gatcacccct atggtgtgat 180 

tacaaagcaa tcggatattg ggacgatatc aagatagagg acgacgtgct ttctgctatt 2 40 

cctgttttcg acaaggtaga cgatttatcg aagaccattg ccgcaaaata cgaagcaggg 300 

accttacggg ccgcaagtat tggtatacgt atcctggcca catcctccga aaaagaatat 360 

ctgcttccgg gacaaacacg cgaaactgtt accaaagcag aagtcatgga ggcttccatc 42 0 

gtggatatcc cggccaactc ccatgccgtg cgcttatacg accgttcctc ctccgtttta 480 

ctggcagcgg gtatggacac gaatattgtg ccagcattaa caatcccaaa agaaaaggca 540 

atgaattaca aaccatcatg gaccggcttc ctctctttcc tgggaatttc aaaagataaa 600 

gcggaaacca ccgaactgtc tgctgaaaac ctggactcta tccatgctga aatggaacga 660 
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ttaaagacag 
tctgccaacg 
agtactctca 
gtgaagaatc 
cctgaaggta 
tatcaagcca 



agaacgctac 
cgaagattac 
agaactctat 
tgaagaacgg 
gcggaaccca 
tcaccgagaa 



tcttgtacag 
agagctgaac 
cactgagaag 
tcctacaccg 
ggaagagtta 
attaaaagct 



gctaagaccg 
ggttctacat 
gattctaaaa 
gggcatgccg 
tctgcttttt 
gagggcctgt 



atattgaaga 
ccggcaagga 
tcacccaact 
gtctgactcc 
gtgaccagaa 
attaa 



gaaacttaac 
taacgagatt 
tgaagagcaa 
tgaacaagag 
cgcaggaaac 



720 
780 
840 
900 
960 
1005 



<210> 454 
<211> 1407 
<212> DNA 
<213> B. f ragilis 



<400> 454 

cttaaagaat 

ggggcactgg 

gatgctcaga 

ccggaccttc 

acgcaaggtt 

cctgtcggaa 

aaacatgctt 

gctatggcat 

gagtttgcca 

gtcatcttgg 

tgt gcttcgt 

gtaggcggac 

gtccagttct 

aat ccggtcg 

gcaaacttta 

acgaatgcga 

tatctgaaag 

aagaaatacg 

ctgatgggcg 

aacggacgtt 

ttgagaaaca 

gaggctgtag 

aataaaatcg 

aacgataatc 



gcacaatgaa 
ctatcactta 
tgcaggaaat 
ctttctatgc 
atcgtgcttt 
aagtagcact 
attacatggt 
gggttaaaca 
atgaaccgat 
gagatatgct 
gtcacggatt 
agttcggagg 
gggatggacg 
agatggcctg 
ccaaagcctt 
tcgaggagtt 
gtgaaaagac 
attgtgctac 
taaagagaga 
tcaaacaaac 
ttgccctgac 
attacatggc 
ttgcttttct 
agaccaaagc 



gaaaagtaca 
ccgcgttgta 
cattacttcg 
caactggccc 
cgacatgacc 
tgctaaagtc 
acattgggga 
acatcgcttg 
acgtcccatc 
ctatcatgac 
gaataccggt 
cgtaaatgct 
tgccggaaca 
tcagtcattt 
tttggctgta 
tgaaaagaca 
tgccattaac 
atgtcatgtc 
ctacttcgca 
ccggaacgaa 
ggctccttac 
caaatatcag 
ggagacactg 
attataa 



aaatttatca 
aaccaagcac 
gggggatgtt 
gttgcaagcg 
gaaatggctg 
gaaaaagtaa 
tcgagcgtaa 
gcacattatg 
gcagattcta 
actcgccttt 
ggtgtggaca 
ccgacagtgt 
cttgccgagc 
gatgaaatca 
tatcccgatg 
ctgcttactc 
gatatagagc 
ggtgagacac 
gatcgcggca 
cgtgacaaac 
ttccatgacg 
atggatctga 
accggggagt 



tcgcattgct 
cgtcgaaaga 
tacaatgtca 
gaatggtaca 
aagctctgaa 
tcatggacgg 
cagatgccaa 
ctaacggact 
ttcctgtgga 
cggccgataa 
ataagcaata 
ataacgccgc 
aggctgccgg 
ttgcgaaact 
gttattccga 
cgaattcccg 
tggccggata 
tgggcggcca 
ttgaattgac 
atcgctttaa 
gcagtatgaa 
atcttccgga 
acaaaggtaa 



tgtgacagtc 
tcttgctgct 
ttcgggcagt 
gaaagacgta 
agccggtaag 
aacgatgccc 
gaaggaaatg 
ggctgccgcc 
tatgcgtaaa 
caccgtttca 
ttcggaaggt 
ttacaatttc 
acctcctttg 
ggagcaggat 
acagaatatc 
tttcgacctt 
cgaattgttt 
gtcttacgaa 
agaagaagat 
agtgcccggt 
aacaatgaaa 
agatgaactg 
gcccctgacc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1407 



<210> 455 
<211> 192 
<212> DNA 
<213> B.fragilis 

<400> 455 

gtgacgacgt gctcgcctcc cgcccccggt gaagacctag agaccggtgt tctgaagttg 6 0 

acggactcac ctgccatcca gtcgtttcct aagcgtgtcg gtgtagtgga agatgttatg 12 0 

ctttgtgctt tctcggcaac agtaacaccg tcacttgcac gggtcgcaac gacttttaca 180 
gactttaaat ag 192 

<210> 456 
<211> 789 
<212> DNA 
<213> B.fragilis 

<400> 456 

atttctatcc gatttgtttt gtattctcaa atttgcaggt atatgaaaaa cagtgattta 60 

actacttatg gggagtattt ggaaaagcta tccccaaaac acggacggga aaaggtattt 12 0 

aatgactttc tgcaaatagt cgtttgttgc ctctcaatgg gacgtaagga agaactttat 18 0 

ttcaaaacga taaagcccta tgacaaaaca gaactggatt tgttttcaca ggcttttgcc 240 
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gcacttgtta 
gagtttttaa 
atgaaccaat 
gtattagacc 
gcattgactt 
ttgtgtctga 
tggcatcgtt 
gaagccggaa 
acagggatca 
aaatgttag 



tgcagatgga 
gcaacgccca 
tgattacagc 
ctgcatgcgg 
ttgtcgggat 
acagcttaaa 
ggttgattat 
taataaacca 
tacagccggt 



caggcaacca 
aaacgggcag 
tcctaaagta 
tagcggaaga 
tgatatctca 
cggagaagta 
cgttgatagt 
accgcctgca 
aaagaacatg 



ctggtagacc 
ttttttacac 
aatgatcagc 
ctccttttat 
tatacctgct 
ttacacatga 
gtaaccaaga 
tgtgcggatg 
attcccgcca 



cgttcggaga 
cgtttggggt 
ctaaacaggg 
cagcagccca 
gtctcatgac 
atgccttgac 
taccgaccgt 
atttaaagcc 
attttgtacg 



ctattttcaa 
atgtgaatta 
agatcggagg 
aaaggataga 
tatcattaat 
ggatcaatgt 
ttatgaagtg 
tttaccggtg 
ttatacccct 



300 
360 
420 
480 
540 
600 
660 
720 
780 
789 



<210> 457 
<211> 366 
<212> DNA 
<213> B.fragilis 



<400> 457 

aggaacttac 

gctcctacta 

attgagtttg 

aaattattta 

tccacaccga 

agaaactcag 

tcctga 



gtccgatacg 
actcacacag 
cggcatattt 
gcatgaatgg 
ttacaacacg 
gattggaagc 



ggtatccgaa 
attagcgatg 
ggtcatttcg 
cttatatagc 
gtcaggactc 
tacatcaaat 



ccgataccgc 
tcgttcatga 
gcagagggaa 
ttagtcatta 
ataaagtcat 
tctattttgg 



tacgcaccat 
aactgatacg 
tatccatgaa 
atttttctgc 
tgactgcatt 
caccccgttt 



atttacatca 
agtagccaac 
tatcacgcgg 
acgttccgat 
accctctttc 
gtctaattct 



60 

120 

180 

240 

300 

360 

366 



<210> 458 
<211> 903 
<212> DNA 
<213> B.fragilis 



<400> 458 

aatggctata 

cgacttttgg 

gaatacggaa 

gatacctttc 

tctgttattt 

ggtatccgac 

taccggtttg 

tctcccggag 

ggagtaatca 

ggtatgggac 

cgtgatcttt 

gagattgggg 

cctgtggtgg 

ggagctatta 

gcaggaatca 

taa 



gaatcattaa 
tgcagggtat 
caaatgtggt 
cggtattcaa 
tcgtaccggc 
tgataatttg 
tcgaactgaa 
aaagtctggt 
gccgtagcgg 
agtccactgc 
tgggaatgtt 
gtaacgcaga 
catttattgc 
tatccggaag 
gagttgccgg 



agaaagaatc 
cacaggcagg 
gggcggaaca 
caccatgcat 
acgctttgct 
tatcacagaa 
gggagctaaa 
aggtattctt 
tacgttgacc 
cattggtatg 
acaaaatgat 
agagttggcc 
agggagatct 
ttccggttct 
tgaaccctcg 



gatatgagca 
gatggacttt 
tcacccggca 
gaggctgttc 
gccgatgcta 
ggaattccta 
ctcattgggc 
ccggggcaag 
tatgaaatcg 
ggaggtgatc 
ccgcaaactg 
gcaacataca 
gccccgccgg 
gcaaccgaaa 
gaaataccgg 



tacttattga 
ttcatgccaa 
aaggaggaac 
gccgaacgca 
ttatggaggc 
cattggatgt 
cgaattgccc 
tgttcactcc 
tatcacatct 
cggttgtcgg 
atgctattgt 
ttcgtgaaca 
gtaaacagat 
agatttcagc 
atttactgaa 



taaatctacc 
aaagatggct 
aatgatagac 
agccaataca 
tgccgatgcc 
aataaaggcc 
cggcctgatc 
gggcaatata 
gactgctaaa 
actttatttt 
gatgattggt 
tgtgactaaa 
gggacatgcc 
attggaagct 
aggcagtttc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

903 



<210> 459 
<211> 1761 
<212> DNA 
<213> B.fragilis 



<400> 459 

tcgaagcggc 

aaaatgaaga 

aataataatc 

gattttactg 

acgggtatta 



ttatcaaagc 
acgaaaacaa 
actgtatcta 
ctaaagcttt 
tcgaacatgc 



catcgagata 
aatgattatt 
caatggagat 
gggagaaata 
cagccaaact 



ataaaattgt 
tatcaagtgt 
atttcacaaa 
aaaaaactgg 
gattatagaa 



caatcttaaa 
ttacacgctt 
acggttgtgg 
gagcaacaca 
gatacaacat 



tagcatatat 
gtttggtaat 
aaaaatggct 
tatttggtac 
ccgcccggat 



60 

120 

180 

240 

300 



180 



780 
840 



catccggcta ttgtaaaagg taaagcggga tctccatacg ccattaaaga ttattatgat 360 

gtagatccgg atctggctac tgatgtccct ggaagaatga aagaattcga aaatctagta 42 0 

agccgtacac acagagcagg attaaaagta ataattgatt ttgttccgaa ccatgtagcg 480 

cgtcaatacc attcggatgc acaacctgac ggcaccactc agctgggagc caatgatgat 540 

cctaactact cttttagtcc gtacaataat ttctactata ttccgcaatc ggaattgcat 600 

ggacagtttg atatgacggg aaatgccttg gaaccctatc atgaatttcc tgccaaagca 660 

accggaaata accgttttga tgcttaccct aacattaatg actggtatga aaccgtaaag 72 0 
ttaaattatg gtgtggatta tcagaatggt ggaacttgcc atttctcccc tactccggat 
acttggacta aaatgttgga tatcctcctt ttttggtcct ctaaaaatat tgatggtttc 

cgttgtgaca tggccgaaat ggttccggta gaattttggg aatgggctat ccctcaagtg 900 

aaacaggagt atccgaatat tatatttatt gctgaagtat acaatccgca cgaatataag 960 

aattatttat ttcgtggtaa atttgatttt ctctacgata aagtaggact gtatgataca 1020 

ttgcgcaatg tagcttgtgg ttatgactct gcaactgcta ttactcgtag ttggcagtct 1080 

ttagggggga ttgaaaagcg gatgcttaac ttccttgaaa accatgacga acaacgtatt 1140 

gcatctgatt tttttgccgg agatccacgc aaaggtgttc ctgccttaat tgtatctgct 12 00 

tgcatgaata ctaaccccat gatgatctac tttggtcagg aattcggaga aatgggaatg 1260 

gatagtgaag gtttcagtgg acgtgatgga cgtactacta ttttcgatta ttggagtgta 132 0 

gatacaattc ggcgctggcg aaatgaagga aagtttgacg ggaagatgct aactgaagag 13 80 

caaaaacatt tatatgcaat ttatcagaga gttttgacgt tgtgcaatga agaacaggca 1440 

atatcaaacg gcgtattctt tgatttgatg tatgctaatg aaaatggatg gagatttaat 1500 

gagcacaagc aatatacatt tatgcgtaaa tacaaaaatg aattgctatt tattgtcgta 1560 

aactttgata atcagccagt aaatgttgcg attaatgtgc cttctcatgc ctttgacttt 1620 

ttacaaattc ctcaatttga ttcttataaa gcggttgatt tactaacaga taaagtagaa 1680 

gaaatcagtt tactgccata taaggcaaca gaaatcgctt taggagctta tacgggtaaa 1740 

atattgaaga ttaaatttta a 1761 

<210> 460 
<211> 195 
<212> DNA 
<213> B.fragilis 

<400> 460 

tgtaatataa gacgttttca tgacgacttt atgttaaata ctttgagtca atttataacc 60 

tcaaacaaaa gggatggaac aaggtatatg ccttttgctt ttattgagtt gggcgtagct 12 0 

atgctttcct ctattttaaa cagtgaagtt gtaattgaga ttaacaaaag attatatcga 180 

agaagtgttt actga 195 

<210> 461 
<211> 777 
<212> DNA 
<213> B.fragilis 

<400> 461 

tttcagtcaa gatccatgtg ttccgaaccc gatacctttg tgcaaacttt aaaaaatata 60 

aaaatgaaaa aagttattat tataggagct acttccggaa tcggtaaagg attagcagag 12 0 

cgctttctcc gggaaggaaa tacagttggt attacggggc gtagagaaga taaactacaa 180 

gagatctgtt ctcaaaataa aaattgtttt tatagtgttt ccgatgttac caaagatacc 240 

gatacagtcc gacaactgag caatcttgtg aacagagtgg gaggtatgga tatacttata 3 00 

ttctgttcgg gaatagggga gctaaatcct gaacttgatt atcttctaga gaaaccgact 3 60 

cttttaacca atgtaatagg atttaccaat gtagtggatt gggcttttca cttttttcag 42 0 

aagcaagaat gggggcattt gattgtaatt tcatctgtag gcggaatgcg cggtgaagga 480 

atagccccgg catacaatgc ctcgaaagcc tatcaaatca actataccga aggattaaga 540 

aaaaaaacag ccaagctacc ttatcctatt tatatcactg atgtacgtcc cggatttgtc 600 

gatacggcaa tggcaaaagg agaggggttg ttttggatta ctccgttgga taaagctgta 660 

caacagattt atcgcgccat ccttcgaaga agaaaagttg cgtatgtttc gaagagatgg 72 0 

aaatatgtag cattacttct gagaatgata cccgcctcga tttattgtaa aatgtga 7 77 



<210> 462 
<211> 1419 



181 



<212> DNA 

<213> B. fragilis 



<400> 462 

aagcacaaaa gaatgaaaaa ttttatggat aaaaatttcc tgcttcaaac cgaaacggct 60 

caggaattat atcataatca tgcggctaag atgcccatta ttgattatca ttgtcattta 120 

aatccccaaa tggtagcgga tgattatcgg tttaagtcct taactgaaat atggctaggc 180 

ggtgatcatt ataaatggcg tgccatgcgt tccaatggtg tggatgaatg cttttgtacc 240 

ggtaaagaaa cgtcagattg ggagaaattt gagaaatggg cagaaacggt cccatatact 3 00 

ttccgtaatc cactgtacca ctggacgcat ttggaactga aaactgcatt tggtattgat 3 60 

aaagtactaa atccgaaaac agcacgtgaa atttatgatg aatgtaatga gaaactttct 42 0 

tctcaggaat attctgcccg cggaatgatg cggcgttatc atgtggaaac cgtatgtaca 480 

acggatgatc ctatcgattc attggaatat catattagaa ctcgcgaaag tggatttgaa 540 

atcaagatgc ttcccacatg gcgtccggat aaagttatgg ctgtggaagt tccttcagat 600 

tttcgcactt atatagaaaa attgtcagaa ataagtgaga ttactatttc tgactataat 660 

gatatgatct tagctttacg taaacgtcac gactattttg cagagcaagg gtgtaagctg 72 0 

tctgatcacg ggatcgaaga attttatgct gaggactata cggaaggtga gattaaaact 7 80 

attttcaata aaatatacgg cggttcggaa ctgacaaaag aagaagtttt gaagtttaaa 840 

tcggcaatgt taattgtgct cggtgaaatg gactgggaaa aaggatggac acaacaattt 9 00 

cattacggtg ctattcggaa caataacagc cgaatgttca agctgttggg tcctgatacg 960 

ggatttgatt caataggtga gtttgctacg gctaaagcca tgagtaaatt cctggatcgg 102 0 

ctgaattcaa aaggtaagtt gactaaaaca attctgtata atctgaatcc ttgtgcaaac 1080 

gaagtaattg ccaccatgat aggtaatttt caagatggga gtatacctgg taagattcag 1140 

ttcgggtcgg gatggtggtt ccttgatcag aaggatggaa tggaaaggca attaaatgct 12 00 

ctttctcttc ttggattatt gagccgcttt gtgggaatgt tgacggattc tcgttcgttc 1260 

ctctcctatc ctcgtcatga atattttcgt cgtactttat gtaatttatt ggggtgtgat 1320 

gtggaaaacg gtgagatacc tttatcggaa atggagcgtg tctgtcagat ggttgaagat 13 80 

atcagttatt ttaatgctaa aaactttttt catttttaa 1419 



<210> 463 
<211> 774 
<212> DNA 
<213> B. fragilis 



<400> 463 

tccgggggaa gggactctta ccttccccta cataaactaa acaattacct gattatgaaa 60 

cgatttattc tttgcatttc atgcctgctt atctgctgcc tgttcttgct tccggaagta 12 0 

caagcggcca ttccggatac cggaaactgg atcagccatc atcttctgac atcagacggt 18 0 

ttaaccgttc tggctgccgg cccggcattt gccccgttaa aatggaatat cgggcaaaac 240 

aacatgggag gttataaagg acggctgctc tttattccgt atgacgctcc ttcaaccgta 3 00 

ccaatgattc cggcaaagcc tactacgaat gaggacctga ttaccgcttc aggatcattc 3 60 

acttttccaa gcggcggaac ctatactcag ccgatttact tgtattccac aaaagggaaa 42 0 

gtaggttata aagcggaaat tcaaggcgaa acggacggaa aatcttttaa gcagacttta 480 

gagtttttct tccccggcaa tactccggga atgcatgctt tcagtacact tgtcaagaac 540 

actccggggt acttcgtctt cgaagattcc gacggccaac aattcctgat gggtaaaccg 600 

ggcatgtatg ccgatgtatc accctccttt gatggtggta agctcgccgc cgatcagcgg 66 0 

ggaactgcct atacagccac ttgtgacgca aatgaatcgg ctgttgtttt agggacacca 72 0 

atcgacatgg aagtcattgc aggcctaaaa ccggctccaa gtcccggagg ttaa 77 4 



<210> 464 
<211> 393 
<212> DNA 
<213> B. fragilis 



<400> 464 

attattatga gtttaaatga tcgtttacga attgttgtaa atgaattttt tcatgggaat 60 

aaagcggctt ttgctcgagc cgcaaaaata tcggaccaaa gagcttatag ttgtttgtct 12 0 

gttcggagta atacagaacc tccggctaga gttttggaga atttagctaa gtatctaccg 18 0 

aatttaaacg cgacttggct tttaaccgga gagggagaaa tgattcaaga taaatccact 240 



182 



cctgagatgc cgataactct tgtttcggta aatgaatata aaagtcgatt gcagcaaatg 3 00 
gaggtaagat tggaagctct aagggctcag gtggtattaa aagataaact actagccgga 3 60 
ctactccgaa aagtagagaa caagactaaa tag 393 



<210> 465 
<211> 597 
<212> DNA 
<213> B. fragilis 



<400> 465 

aacaatatga 

gccctgcggg 

catgcttcta 

atgttcgtgt 

tctactgcat 

ggggttcttc 

gcttttacgg 
attgctacct 
tggctgagcg 
tttcttaccg 



gtccacgtaa 
ccgggttatt 
cttttgcata 
taaaacttac 
tttcttatat 
gtcccgacgg 
ctgccacact 
tcttgccggc 
acgttttggc 
ataggctctt 



attaatgttt 
gacaagtgct 
tcggtatgat 
aggagtaaag 
attaatgggg 
ttccgatttc 
actgtataaa 
agttgtcacc 
aggtgccatt 
aatgaaaacc 



tggttgtttg 
gatcaatata 
gattttttgc 
agccgtagta 
gccatagtgc 
ctttcattcc 
gaatatggtt 
ggattcacaa 
atcggtatca 
ggtgcacaga 



cctgtatctt 
tctatcattt 
cataccttcc 
actggaaacg 
ttacaatgaa 
cttcggggca 
tcaagacccc 
ggcagttgaa 
tgatggtaga 
cttgttccaa 



tttagtttgt 
acgtaatatg 
gattgttgct 
aatgctcgtc 
atcactggcg 
tacagctacg 
cctagcgggt 
taatcgccat 
gttagcctat 
atcataa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

597 



<210> 466 
<211> 1599 
<212> DNA 
<213> B. fragilis 



<400> 466 

aagctgtccg gttataaaga taaagcttac ctttgcaatg tttttttaat acaagttact 60 

atggattatc ctcataaaat caataaggta cagatccgta acctccagat tgaagattac 12 0 

gctcaattat cccaatcgtt tacacgtgta tattcggacg gaagcgatgt gttctggaca 180 

cacgagcaga ttgagaaact aattaaaatt ttccccgaag gacaaattgt tactgtggtc 2 40 

gacgaaaaga tagtcggctg tgcactctct atcattgtag aatatgataa agtgaaaaac 3 00 

gatcatacct atgcccaggt cacggggaag gagactttca atacccattc tccccaaggg 3 60 

aatatcttat atggcatcga ggtctttatc catcccgaat atcgcgggtt acgactagct 420 

cggcgcatgt acgaatatcg caaagaactt tgcgaaacgc tgaatctgaa agcgattatg 480 

tttggcggtc gcatcccgaa ttaccataag tatgccgaca agatgcgtcc caaagagtac 540 

atcgaccgtg ttcgtcagcg cgaaatctat gatccggtgc tcacttttca actctccaat 600 

gattttcacg tacgcaaggt gatgaccaat tatttgccga acgatgaaga atcaaaacac 660 

tacgcctgtc tcttgcagtg ggacaacatt tactatcagc cgcctacgca agaatatctg 72 0 

gcccccaaaa caacggttcg tgtaggattg gtgcagtggc agatgcgtag ttataagacc 780 

ttggacgatc tcttcgaaca ggtagaattt tttgtagatg cggtatccga ctataagagt 840 

gattttgtgc ttttccccga atactttaat gcaccgttga tgtcaaagta caatgacaaa 900 

ggcgaatcac aggccatccg cgggctggcc caatataccg aagaaatccg cgatcgcttc 960 

attaatctgg caatcagcta caacatcaat atcatcacag gaagtatgcc gttgatcaaa 1020 

gaagacggat tgctgtacaa tgccggattt ctttgccgac gcgacggaac ttacgaaatg 1080 

tacgaaaagc tccatgtcac cccggacgag ataaagagtt ggggactgag cggcggcaaa 1140 

cagcttaaaa cattcgatac ggactgtgca aagataggca tactgatctg ttatgatgtg 12 00 

gaatttccgg aactctcccg tctgatggcc gaccaaggaa tgcagattct gtttgtaccg 12 60 

tttctcaccg atacacaaaa tgcttattcg cgtgttcggg tctgcgcaca ggcacgtgcc 132 0 

attgagaacg aatgctttgt ggtaatagcc ggcagtgtag gcaatcttcc ccgtgtgcac 13 80 

aatatggata ttcaatatgc tcagtcggga gtattcacac cttgcgattt cgcttttccg 1440 

acagacggaa agcgtgccga agcaactccg aatacagaaa tgatcctggt ttcggatgta 1500 

gatctcgact tattgaacga actacacact tacggcagcg ttcgcaacct gaaggacagg 1560 

cgaaatgatg tatatgaagt gcgcttcaag aagccttaa 1599 



<210> 467 
<211> 420 
<212> DNA 
<213> B. fragilis 



183 



<400> 467 

aaattgttgg gaaattacag agatcaaatt atgaatcgta atgccattct aagtaaacgt 60 

caaaaacagt tcatcgaacg tattgcctgg ggagcttctt ataaagaagt agctgatttc 12 0 

ttccatgtga gttggagcac tgttgacaat actctccgaa atgcaaaaac aaaattaggt 180 

ttaagtaaag tgactgagtt gggggcatgg tggttctgca ctaattacgg aattagtttt 2 40 

gatctatctc ctattgccag gcaatgtaca gcaggagtta tcttactctt gttttccctt 3 00 

ggagaagtga caacagtaac aaatatatca tataccatgc aaagagtaag aagaccacgt 3 60 

acagagtatc gcatccgtcg acacgaaact tctatatatc aaccatatat tattaactaa 42 0 



<210> 468 
<211> 1293 
<212> DNA 
<213> B. fragilis 



<400> 468 

gtttttcagg 

tcattcttta 

gggcaactat 

agtcttaaaa 

caagctttat 

atgattggtg 

ttggatataa 

gcatttcata 

atgcgcattg 

cttggagcca 

ggagctataa 

aatacctcag 

aataaaggag 

atggtggcat 

agtttgatag 

tggaatccaa 

gcattggcat 

gtggcacaag 

cagtttaaac 

ccttctatca 

atttttatct 

gctgtcagag 



gaggtatcct 
tcaaacttat 
tctctatact 
cgggatctgc 
taggcccttt 
ctgacagctt 
tcgaattatg 
cgcctgccat 
cgggtatcaa 
tattgcttct 
tagcttgtac 
ctaaaaatgt 
tcagttgggt 
tgatgccgtt 
aaactctttt 
agatacgtaa 
tttgtggaat 
gtatagttgt 
cggcatattt 
ttgggttgtt 
gttgtgggat 
atctggaaaa 



attctttctt 
tatgaataat 
aagtagttct 
ggaggtattg 
tgccggcgtc 
tgtcgctctc 
gcatatatat 
gaaatcctct 
tcaagctatc 
tgcgtttgac 
agctttactt 
actttatgat 
aatggtaact 
gatgactttg 
cggtgctggc 
aaccttgctg 
acttccggct 
cccatttttt 
aggacgtgtg 
cattaccgga 
agcaattgta 
gcaaagtaaa 



cttctgtata 
tggaagaaaa 
attgcccagt 
tcatttgcaa 
tttgtcgatc 
tgttccggag 
cttttactta 
gtaccattgt 
cagtcgattt 
atgagtcttg 
tttgtgtata 
atgcgagatg 
gaagttctgg 
aaaaatttct 
atgttggccg 
attgctattt 
gatggttttg 
tccgggcctt 
ttttctcttt 
tttattgctg 
tttacctcaa 
tag 



gatgcaacag 
agtttatcat 
tttctattgt 
ctatcgctgc 
gttggaatcg 
tgatcgcttt 
tgttgcgttc 
tggcaccgga 
gtaacattgg 
tcatgcttct 
ttcccaatcc 
gatttaatgt 
ttacattttt 
cgggaacagc 
gtggtgcctt 
cctatttttt 
tactttttgc 
ttacttcatt 
ttgacagcgt 
atagtttagg 
ttttgatgat 



ggaaataatt 
tatatggaca 
tttgtggatt 
tttattacca 
caaatggact 
actgttctat 
ggttggcggt 
aaaagagctt 
tggtccggct 
agatgtattg 
taaacaagaa 
aattatgcgt 
tgttatgcca 
atatcaagta 
attgggtgta 
gcttggtgcg 
agcattaact 
actgcaaact 
gagcttattg 
aatagccaat 
gtgcattccg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1293 



<210> 469 
<211> 396 
<212> DNA 
<213> B. fragilis 



<400> 469 

gagtatatta tcccaattta ttaccctata aagatacgtc ttttatttga tatacacaac 60 

attataatga taaaaaaaga gaataaaata ttcgtagtca tatctcctga tcccgtcgag 12 0 

cgtgagcagt tgatcgcacg cctggccgtt cgtttaggtt ttgccaagat tccgtccgat 180 

gcactcaaga tcataagcaa ggacatttat tcctttgacc tggcaactgc atattttgtg 240 

ctttgcagta actatcattt ccggggttct atcgtcacaa cacaacggct gtatgagctt 3 00 

gcagcaagag gtatatgtgt ttgtgtaggt gtgaagtcac tgccccgtga gtacgagttg 360 

ttatctcagg tgttttatcc gaatgatttg cgatag 396 



<210> 470 
<211> 1296 
<212> DNA 
<213> B. fragilis 



<400> 470 



184 



actttaacta ttaaaaagtc tattcaaatg gctgcaaata aactaattga tgtctctaaa 60 

ctgaacgaag cactggtcat ttatgaccag gcacttcgtg cgctgccgtt tgccaccctc 120 

accgaagtgg caaacctact gaagctgaat gttatggacc tgcaaggcaa acacgcacgt 180 

atcaacgagc gtcgtcgtgc cggtggtacg caatcgtata aaatcggaaa gaacttcgga 2 40 

ctggtcgata aactcttagg ttacgaaccc tcagtcatcg agccgaaaga tgttgtctgc 3 00 

atcaccaaag aaaactccca gaagtacgat gataacgaac tgctgatcat cggtggcact 3 60 

ccggtaagca acactacaaa aaaacatccg atggaaacca aggttgcatt taccctggta 42 0 

cgttcgcatc tggaagatat cgtatatagc ctgttctctg ccgaacggga tgaagattcc 480 

aactcacccg gcggggcttt cgatggtatt tataccaaga tggacatgct gatcactcgt 540 

ggcgatgtaa atgcggcccg tggtaatttc tctatttccg gagagtttgc cgcgccaaca 600 

tcagatacag attatacagc ttacgagaat ctggtggaat ggatcggagg cgcaaacacc 660 

taccttcgtt cttcaatagg cggtgtacca cagcttttgt gtgctgaaac cgttttgaaa 72 0 

gctgcccgtt cagcattacg taataagtta cgcatgcagg aatatccttc catgcaacgc 780 

atgcttgaac tcttgcggga agacgccatg tgtccgaacc tgattgtctc ctcccacgaa 840 

gctttaggcc aaggttccag gctgaccctt cagaaagttg gtaacataga cgtggcgttc 9 00 

aatactcaag cggcttctaa attctgccag atacgtgata tttacgagga cccgaacgaa 960 

tggcagttct ggttgcaggc aggatacgat acacgtatca atgactggca tgagaaagtc 1020 

ttccgctgta acgagcagaa gaacgaatct ctcgacctgg ccggcgacta ttgtaaaacc 1080 

ggtggagtac aggtagccat caccggcacc gacaaaggcc aatggagtat ccagggaaaa 1140 

gttgccaaac gcggtaacgg ccaatgcatc attggacttc ctccgggaaa atacaccatc 1200 

gagttcactg atgccgatgg caagaccaaa ccggcaaata cacaggttac agttgttgcc 12 60 

ggtgaagtag ccaccgctac cggagcctat acttaa 1296 

<210> 471 
<211> 348 
<212> DNA 
<213> B. fragilis 

<400> 471 

gttctttata ttatggaaca attgttcgaa gctatcctcg cgatagcaaa gcagaacccc 60 

gatgggttca cggttgacct cacaacctta aaaaaggtca caaagggtat ttcagtcgcc 12 0 

tatctcgaga ctcaagacag tttcggagaa gaaggactga aaagagttct taaccatgct 18 0 

gagatgcacg aaaagaaggt cggtggatgg ctgaatgaag agaaccaaga gttctatttt 240 

gattccgtcc ggattttcac caaccttgaa gaagccaagc gattcgggtg tgaaaataaa 3 00 

cagatcgcta ttttcgacat ctctcatatg agactcatca aattgtga 348 

<210> 472 
<211> 768 
<212> DNA 
<213> B. fragilis 

<400> 472 

tatctctata tttgcagcaa tcaaagactc aatattatgc atgacatacc caaacaaatc 60 

ccattggcaa acaatcacat ctcagtggac tgtgtagtga tcggttttga cggagaacag 120 

ctcaaggttc tgctgattaa tcggatagga gaggaaaacg gaaaagttta tcgtgacatg 180 

aaacttcccg gaagtctgat ctatatggat gaagacctgg acgaagcagc ccagcgggtc 240 

ttattcgaac tgaccggtat ccgaaacgtc aacctgatgc aatttaaggc attcggttcc 3 00 

aaaaaccgga cgagcaatcc caaagatgta cattggttgg aacgggctat gcaatcgaaa 3 60 

gtggaacgca tagtcaccat agcttatctg tcgatggtaa agatagaccg ggcactggac 42 0 

aagaatctgg atgaatttca agcctgttgg gtagcgttga aagacataaa gacattggct 480 

ttcgaccata acttgataat cagagaggcg ctgacttata tccggcaatt cgtagagttt 540 

aatccttcga tgctattcga cttgttgccg cgtaagttca ccgcatctca gttacgaatc 600 

ctgttcgaac tggtatatga caaagcagtg gatgtgcgta acttccataa aaagatagct 660 

ctgatggact acgttgtacc gctggaagag aaacaaaccg gagtagccca tcgggcagcc 720 

cgttattata agttcgacag gaagatatat aataagacaa gacgataa 7 68 



<210> 473 
<211> 2322 
<212> DNA 
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<213> B.fragilis 



<400> 473 

aaaccgtata tgaagaaagg gattctttac acgattcttc tttatcttgc tttgtcactg 60 

gcttcgtgct ctgccactaa gtttgtaccg gatggctctt atctattgga tgaggtaaaa 120 

atacatactg acaacaaaga aataaaacct tcggacatgc gactttatgt tcgccagaat 180 

cctaattcta aatggttcag taccatcaaa acccagctat atgtatataa ttggtccgga 240 

cgggattcta caaaatggtt caatcgattc ctgcgtaaaa taggagatgc tccggtaata 3 00 

tacaatgaat ccgacgctat acgctcgcaa gaagaaattg ctaaagcagt gcaaaattta 3 60 

ggatatatgg gagctagcgt aaaaagaact acaaaaacga aaaagaaaaa gctaaaatta 42 0 

ttttatgaaa tcacttcagg caaaccttat attgtacgta cactgaaata tgatatttct 480 

gataagaaaa tagcagaata tcttcggaat gattctaccc aatcaatgtt aagagaagga 540 

atgcttttcg atgtaaatgt acttgatgcg gaacgacagc gcattacaga ctatctatta 600 

tgtaacggtt attataaatt taataaggat tacattactt atacagctga cactgcccgc 660 

aatacccatc aggtggatct cactttacac ttattacctt ataaaactta tgtcggagat 720 

actcctaaag agcattttca gtataagatt aacaaaatca acttcattac cgattatgat 780 

gttctgcaat cgtcagcttt gagtagcatt gagatcaacg attccttgca ttataacgga 840 

tttccgatct actataaaga caagctatat ttacgtccca aagtgttagt ggataacctg 900 

agatttgcat ccggagattt atatgacgaa cgtaatgtac agaagactta tacttacttt 9 60 

ggacgactat cggcgctcaa atatactaat attcgttttt tcgaaactca aaatggcgat 102 0 

agcacccaat taaattgtta tgtcatgcta actaaaagca agcataaatc tatctctttt 1080 

gaactggaag gcactaattc tgcaggagat ttgggagctg ccgcatctgt ttcttttcaa 1140 

catagaaatt tattccgcgg ctcggaaacc ttcatggtaa agtttcgtgg agcatacgag 12 00 

gctatttcgg gattacaacc gggttataaa aaccataact atactgaata cggagtcgaa 12 60 

acaagcatta atttccccaa tttcttgttt ccttttctca cgtctgactt taaacgacga 1320 

ataaaagcaa ctacagaatt cggcttgcaa tataattatc aattacgtcc ggagttctca 13 80 

cgtaccattg cctcggcttc gtggagctat aaatggatac aaaaacagaa aatacagcat 1440 

cgcatcgatt tgttggatat cagttatctc tatctgcctt ggatttcgtc tcaattccag 1500 

gaagattata ttaataagga taaagataat tatattctca aatataatta tgaaaatcgt 1560 

ttgattgtac gtatgggata caattatagc tataatagtg cgggtggaac tcttgtcaat 162 0 

aatacaatta caactaattc ttattctata cgggcaggct tcgaatcagc aggtaatatt 1680 

ctttacggaa tttcgaaaat gattaatatg agaaaaaata aagatggcga atatgctatt 1740 

ttaggtatac catatgcaca atatttaaaa ggagattttg attttgccaa aaatattatt 1800 

attgaccatc gtaattctct tgcatttcat gccggaatag gaatcgctgt tccttatgga 1860 

aatgccaaag tagttccttt tgaaaaaaga tatttttcag gaggagcaaa cagtgttaga 1920 

ggatggtccg tacgcaattt aggaccgggt tcctttgccg gagatgggaa tttcatgaat 1980 

caatccggag atattaaact ggatgcaagt atcgaatacc gtactcgtct attctggaag 2 040 

tttcgtggag cagcatttat cgatgcagga aatatatgga ctattcgcga atatgaaaat 2100 

cagccgggtg gtgtttttga atttgataag ttttataagc agattgccgt tgcttatgga 2160 

ttagggctca gacttgactt agactttttt gtacttcgct ttgatggggg gatgaaagct 2220 

ataaatccta aatataaaaa agcaaaagag cgctatccta ttattcatcc tagattcagc 2280 

cgcgatttcg cattccattt tgcagtaggc tatccattct ag 2322 



<210> 474 
<211> 267 
<212> DNA 
<213> B.fragilis 



<400> 474 

cactatcgta ttatgccaag cgacagaata catcaaagta aagtctggga acttatggag 60 

caacggaaag agggtaaacc cattgagttc tccattgaat tctgcaaaaa aagtaccggt 12 0 

gaactcatta cctacgagcg tgcggtactt agttcatttc atagtagcgg aagcactgtc 180 

aacatacttc aaataggtga gtatgctccc aggaaaatcc ggagatgtct aattacacga 240 

tttaataaca tcaaagttta tttctaa 2 67 



<210> 475 
<211> 1530 
<212> DNA 
<213> B.fragilis 



186 



<400> 475 

tatctactca ttataagtcc gtatttcacg gtctgcaggc tcttgttgcc tgcagaccgt 60 

aattcaaaaa aagctaatgc catgaatcat accaacgagg gtagcaagct ctacctgtat 120 

tccattacat cggtagccat cctgggcgga ttgctgttcg gctatgatac cgctgtaatt 180 

tcaggagctg agaaagggct cgaagccttc ttcctcacag ccacagattt tcaatacgac 240 

aaagtgatgc acggcatcac ctcctcgagt gcactgatcg gttgcgtctt gggtggtgca 300 

ttgtccggca tcttcgcttc acggctggga cgccgcaact cactacggct ggccgccgta 360 

cttttcttcc tttcggcact ggggtcttat tatcctgaat tcctgttctt tgaatatggg 42 0 

aaggctaata tgaacctgct tatcaccttt aatctctacc gcattctggg aggcatcgga 480 

gtgggactgg cgtccgctgt ttgtcccatg tatattgccg agatagctcc ctccaacatt 540 

cgcggtacac tggtgtcatg caatcagttt gccataatct tcggcatgct tgtggtttac 600 

tttgtaaact acctgatctt gggcgaccac cagaatcctg ttatcctgaa agatgctgcc 660 

ggcacacttt ctgtaagtag cgagtcggat atgtggaccg ttaccgaagg gtggcgctat 72 0 

atgttcggtt ccgaagcctt tccggcagct ttcttcggca tgttactctt cttcgtaccc 780 

aaaactcccc gttatctggt gatgattgat caggatcaga aggcttattc cattctcaaa 840 

aaagtgaatg gagccacaaa agcacaagag attcttgccg aaataaaagc cacttcgcag 9 00 

gaaaagacag agaagctgtt cacctacggt gcggcggtga ttgttatcgg tattctgctt 9 60 

tctgtcttcc agcaagccat cggcatcaac gccgtgctct attatgcgcc gcgaatattc 1020 

gaaaatgccg gtgccgaagg cggaggaatg atgcaaaccg tcatcatggg cattgtcaat 1080 

atcgtcttta cactgatagc tattttcacc gtcgaccgtt tcggacggaa accgttgctt 1140 

atcatcggtt ccgtcggtat ggctgtcgga gcctttgcag tcgccttgtg tgacagtatg 12 00 

ggtatcaagg ggatacttcc cgtactgtcg gtcattgtat atgcagcttt cttcatgatg 12 60 

tcatggggac ccatctgttg ggtactgata tccgaaatct ttcccaacac catccgtggc 1320 

aaagcggtgg ccattgctgt ggcatttcaa tggatattca actacattgt ttcatctacg 1380 

ttccccgcac tctatgattt cagtccgatg tttgcctaca gtctttacgg aatcatttgt 1440 

gtgattgccg ccctcttcgt atggcgttgg gtgccggaaa ccaaaggaaa aacattggag 1500 

gatatgagca aactttggaa gaggcgttaa 1530 

<210> 476 
<211> 591 
<212> DNA 
<213> B.fragilis 

<400> 476 

aattatgtca tggatgaaga agtgaaaggt tttaatagat atatgtcaaa ggttgacttt 60 

caacctgtca cagaatttat atttcaaaat ggacagttga cagattataa aaagggagag 12 0 

ttttttagcc gtcaaaatga atcttgcaaa atggtaggct acgtgacgga aggctccttt 180 

cgttattgct gtaccgacag ccgtggagga agtaagattg tcggttatac gtttgatcac 240 

tcttttgtgg gaaattatcc tgcttttcgt ctgggagaca attctaatgt cgatatacaa 3 00 

gctatttgta attgttcggt ttatgtaatc aataacaggc aattggagga gttttacagt 3 60 

cgaaatgaag caaatcaaaa gttaggtagg caaatagcag agatattgct ttgggaagta 420 

tatgagcgga tgatctcttt atatagtatg actcctgaag aacgctatac ggaaatctta 480 

aaacgctgtc ccgaattgtt gaacttgatc tcgctaaaag aattggcatc ctatctgatg 540 

atctgtccgg aaacgctgag tcgtcttcga agaaaattag tacaaaagta a 591 

<210> 477 
<211> 204 
<212> DNA 
<213> B.fragilis 

<400> 477 

gacctacgca tggcctaccg atggcaaata atgaaaaatg aaactgcttt ttcaatggct 60 

ggtatttatg atattggggt agataaggaa tcaggcaagc agcatgctac gttttccatc 12 0 

ataacaattg ttacggatcc actgacagat tatatacata acactaaata tcggatgccg 180 

gttatttttg ttatccaaag atga 2 04 



<210> 478 
<211> 960 
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<212> DNA 

<213> B.fragilis 



<400> 478 

tacaaattaa tagccctttt atacgtttat tggcatattt ttacttactt tgcaccatat 60 

aaaggcacaa ctatgcacaa aaaactatta gtgactgcat attttgtaat agcagcattg 12 0 

ttgcaaactc tggcaggcaa tttcccctta tctttttttg cattccctct gaatgtgatt 180 

gtagcagtta tatggattta ttcgttatgg cgtctctata aagaagggaa taagttgcca 240 

ttaacccgtt ttctgttatc ttcccgaaca agtgttctat cgatcctatt attaataggt 3 00 

ggcagtctgg ttatcggttt gtttcctcag ttatcggaag cagaagcaga ttccatgcca 3 60 

ggggttttag cttcgttggg atgctataat ttcatgactt cctggatatt cattgccatt 420 

cttttccttt tattgagtaa tttggcaatg gtgattattc atgcttttta tcattgtgtg 480 

ccggcaaaga agcgttttat cctgaaccat ttgggattat ggctcgcttt atttgccgga 540 

tttttcggta gcagtgatgt tcagacttta cgcatcccgc tttataccgg acaaccggga 600 

cgcgaagctt atagtatgga tgggaaagcc tattatctgg attatgaact ggaactctat 660 

tctttcaata cagaatatta tccgaacggg atgccttcgc gttttgctgc agatgtgcgt 72 0 

attggaaacc ggagaactac acttgaagta aatcaccccc actgttaccg tttgggagaa 780 

gacatttacc tgaccggata tgatacacgc aacatgggaa atacccggta ttgtatcctt 840 

caaatagttc gtcaaccttg gaaatatgtt atggtagttg gcattttgat gatgttgaca 

ggagctgttt tgttatttat taatggtcct aaaaagctga aacatgataa cctgggataa 



900 
960 



<210> 479 
<211> 360 
<212> DNA 
<213> B.fragilis 



<400> 479 

ttcaaatcaa taaaagaaat gaataaatca tattttgaaa caagaaagac ggaaattcaa 60 

tcagagattg atagctggaa acaagggtta agagacttgg aggatgaata tatttcctct 12 0 

aatcaaaaat ttcctattgg aagcaaagtt tgtattacta cccctgcaca tgaaggatgg 180 

gcattgagta cccgcgaaaa aataacgttc ccagaaagaa aaagatattc ttatgtaact 240 

gggtatgaaa tatgtcataa tgaggtagtg cccattctga tgaaagctaa gaaggatggt 3 00 

actatttcaa aaattcggga ttatataaca ttagaaagag tgatagttga actggcgtaa 3 60 



<210> 480 
<211> 216 
<212> DNA 
<213> B.fragilis 



<400> 480 

ttcccgccaa ttttgtacgt tataccccta aatgttagta atatggagaa agttttgcaa 
tgtgtcagac ttccgcaaaa tggtaaagga acaatcgggt ttaatttgaa aggagagtat 
ttaaaaaaat acggtttcca gttaggagat aaagtaaagg tagaaatcag caaaaataag 
attgttttat ttaagacggg taatgtgctg gaatga 



<210> 481 
<211> 3450 
<212> DNA 
<213> B.fragilis 



<400> 481 

tatagtaaaa ttgtaaatat agccatgagc aagaaatttg ccgaatattc gcagttcgac 60 

ctttcgaagg tgaataagga cgtgttgaaa aaatgggacg aaaaccaagt tttcgccaag 12 0 

agtatgacag aacgtgaagg ctgtccttcg tttgtatttt ttgaaggacc tccatcagct 180 

aacggtatgc cgggtattca ccacgtaatg gctcgttcta tcaaagatat tttctgtcgt 2 40 

tacaaaacga tgaagggcta tcaggtgaaa cgtaaagccg gttgggacac acacggactt 3 00 

cctgttgagt tgggggttga aaagtctttg ggaatcacaa aagaggatat aggaaaaaca 3 60 

atttcagtag ccgaatacaa tgctcactgt cgtcaggatg tgatgaagtt tacaaaggaa 42 0 

tgggaagacc tgactcacaa aatgggctat tgggtggata tgaagcatcc atacattaca 480 
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tatgataatc gttacatcga aaccttgtgg tggttgctaa aacaattgta taaaaaagga 540 

ttactgtata aaggatacac catccaaccc tattctccgg cagcaggaac cggattaagt 600 

tcacacgaac tgaatcaacc gggatgttat cgggacgtga aagatacaac agtagtggca 660 

caattcaaaa tgaagaaccc caaaccggaa atggcacaat ggggcactcc ttatttcctg 720 

gcatggacca ctactccatg gacattacct tcaaataccg cactctgtgt cggccctaaa 780 

attgattatg tagcagttca atcatataac gcatatacag gacaacccat cacggtggta 840 

ttggcaaagg cgttattgaa tgcacatttt aatccgaaag cagccgaact gaagctggaa 9 00 

gattataagg caggtgataa gttggttcct ttcaaagtga tagctgaata taaaggtcct 960 

gatttagtag gcatggaata cgagcaatta attccgtggg taaatccggg cgaaggtgct 1020 

ttcagagtaa ttttgggcga ttatgtaacg acggaagacg gtacaggtat cgtacatatt 1080 

gcacctacat ttggtgctga tgacgcccaa gtagcaaaag ctgccggcat acctccgcta 1140 

cagttggtta ataaaaaggg agaacttcgt ccgatggtag atttgaccgg taaattctat 12 00 

actttagatg aattggatga agactttata aaacagcgcg ttaacgtaga tttatataaa 12 60 

gagtatgctg gccgatttgt gaagaatgca tatgacccaa acctgtccga tcaggatgag 132 0 

tcattggatg taagtatctg tatgatgatg aaggttaata atcaagcttt caaaatagag 1380 

aagcatgtgc ataattatcc ccattgctgg cgtacagata aaccggtact atattatccg 1440 

ctggacagct ggtttattcg ttctacagct tgcaaagaac gaatgataga attgaataag 1500 

actattaact ggaaaccgga gtctaccgga accggtcgtt ttggaaaatg gctggaaaac 1560 

ctgaatgact ggaacttaag ccgttctcgt tattggggta ctccattacc gatttggcga 162 0 

acagaagata acagtgacga aaaatgtatc gagtcggttg aagagctata taatgaaata 1680 

gaaaaatcag tcgctgcagg atatatgcag tcgaatcctt ataaagataa aggtttcgta 1740 

ccgggtgaat ataatgaaga gaattataat aagatagatc ttcatcgccc ttatgtagac 1800 

gatattatcc ttgtctcaaa ggacgggaag ccgatgaaac gtgaagcaga cttgatcgac 1860 

gtatggtttg attcgggcgc aatgccgtat gcccagattc attatccatt tgaaaataaa 1920 

gaattgttgg atagtcatca ggtatacccg gccgatttta tagcggaagg agtagaccag 1980 

actcgcggat ggttctttac tttacatgcc attgcaacaa tggtattcga tagcgtctct 2040 

tataaggctg ttatttccaa tggattagta ttagataaaa atggcaacaa gatgtctaaa 2100 

cgtttaggta atgctgttga cccattctct actattgaac aatatggctc tgatccgtta 2160 

cgctggtata tgatcactaa ctcttctcca tgggacaacc tgaagtttga tgttgatggc 222 0 

attgaagagg tacgtcgtaa attcttcgga acgttataca atacttattc tttttttgcc 2280 

ctgtatgcca atgtagacgg ttttgaatac aaagaagccg atcttccgat gaatgagcgt 2340 

ccggaaattg accgatggat tctatccgtc ttgaatacat tagtaaaaga ggtagatact 2 400 

tgctacaatg aatatgaacc gactaaagcc ggacgtttaa tttcagattt tgtaaatgat 2460 

aatctgtcta actggtatgt tcgcctgaac cgtaaacgtt tctggggtgg tggattcact 2 52 0 

caagataagc tctctgcata tcagactctg tatacatgtt tggagactgt agccaaactg 2580 

atggcaccta tcgctccatt ctatgcagac cgcttgtaca gtgatttgat cggagtaaca 2 640 

ggtcgtgata acgttgtatc tgtccatctt gccaaattcc cggaatacaa tgagaaaatg 2700 

gttgataaag aactggaagc acaaatgcaa atggcacaag atgtcacttc catggtgctg 2760 

gcattacgcc gtaaagtaaa cattaaggtt cgccagccat tgcagtgtat tatgattccc 2 820 

gtggtagatg aagttcaaaa agcgcatatc gaagccgtga aagtattaat catgagcgaa 2 8 80 

gtgaacgtaa aagagatcaa gtttgtggat ggtgcggcag gtgttctggt gaaaaaagtg 2940 

aagtgtgact tcaaaaaact aggaccaaaa tttggaaaac aaatgaaagc tgtagcagct 3 000 

gctgtagcag aaatgtcaca agaagctatt gcagaacttg aaaagaatgg taagtacacc 3 060 

tttgatttgg gcggagcaga ggctgtgata gaatcggcgg atgtggaaat cttcagtgaa 312 0 

gatattccgg gatggttggt tgccaatgaa gggaaactga ctgttgcact tgaggttacc 3180 

gtgacagacg aactccgtcg tgaaggaatt gctcgtgaat tggtaaatcg cattcaaaat 3240 

atccgtaaat caagcggttt cgagattaca gacaaaataa aattaacatt atctaaaaat 3 3 00 

ccgcagactg atgatgcggt aaatgaatat aatagttata tttgtaacca agttttgggt 3360 

acatccctta ctttagcaga tgaagtaaaa gacggaacag aattgaattt cgacgacttc 3420 

tctttgtttg tgaatgtagt gaaagaataa 3450 

<210> 482 
<211> 546 
<212> DNA 
<213> B.fragilis 



<400> 482 

aggggaaaaa tttccccttt atccgtcgga agaccacgca ccgccctcgg aaaaagtttc 
gcctcaaact tttttttctc tcttatatgc tgcccccctc ctcaaaaatc atcgcacatg 



60 
120 
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<400> 484 

atgaatcact tctctttaat cacagaaaag cccgtacata aaaactgtac gggctttttc 



cgataccgct ccggcacatt gtgccgtttt tctgtccttt acgtaagcgc ttgtacacaa 180 

tacatttgca ataaaaaagc gatgaatgag attctaaatt atatcatggt ctttctcttc 240 

ggcggcggtt tagtcggaac cgccacagca tttgtcacta tcaaatacac caagaaacgt 3 00 

gcagaagctg atgcaatgaa agcgatgcag gatgtctacc aggaaatgat caccgatcaa 3 60 

agaagttaca tcaactcact caaacaggat aaagaagata gtgaggcacg ctgggaaaat 42 0 

aaagttgaaa cattatccaa acgtattgag actatggatt tgaaaatcaa cgaaaacaat 4 80 

cgtttgataa cagagctaaa aaccatgaaa tgtaccgatt taatttgcca aaaccgtaaa 540 

caatga 546 

<210> 483 
<211> 1275 
<212> DNA 
<213> B.fragilis 

<400> 483 

atactgggaa ttttccccta cttttgcagg atacttaatg aacattcagc tatgcaacca 60 

tcaaaaacag aattaattct gattcgtatt accggtgaag atcgtccggg acttacagcc 12 0 

tccgtaacag agatattggc aaaatacgat gccactatcc tggatattgg tcaggcagat 180 

attcataaca ccctttcact gggcattctc tgtatgactg aggaacaact ctcgggattt 240 

atgatgaaag agttgctttt taaggcatct tccttgggag taaccattcg tttctatccg 3 00 

attaccgaag aagagtatga aagctgggtg aatatgcaag gtaagaaccg ctatatcctt 3 60 

acattgctgg gacgtaaact tacggctcgc cagattgcag cggttacccg tattttagct 42 0 

gaacaggaca tgaatattga tgccatcaaa cgtctgaccg gacgtattcc attggatgag 480 

cgtaaaatgc atacccgagc ctgtattgaa ttctcggttc gcggtactcc ccgggataaa 540 

gaagctatgc agggacagtt gatgaaattg gccagtgaac ttgaaatgga cttctctttc 600 

cagttggata atatgtatcg ccgtatgcgt cgcctgatct gtttcgatat ggactctaca 660 

ttaatagaaa ccgaagtaat tgacgaactt gctatacgtg ccggtgtagg tgctgaagta 72 0 

aaagcaatta cggaacgtgc catgagggga gaaattgact ttacggaaag ttttcgtgag 7 80 

cgggttgcac tgctgaaagg actggacgaa tctgtaatgc aggaaattgc cgagagtctg 840 

ccaataactg aaggggtgga tcgcttgatg tatgttctga agaagtatgg ttataagatt 900 

gctatccttt cgggaggttt cacctacttt ggacagtatt tgcagaagaa atacggtgtt 9 60 

gattatgtat atgccaatga acttgagatt gtagacggca agctgacggg gcgttatttg 102 0 

ggagatgtgg tagatggaaa gcgtaaagcg gaactgctgc gattgatcgc tcaggtagaa 10 80 

aaagtagata tcgctcaaac cattgctgtt ggggatggag ctaatgatct tcctatgttg 1140 

ggagttgccg gtcttggcat tgcctttcat gcaaaaccca aagttgtggc caatgccaaa 12 0 0 

caatctatta atacgattgg gcttgatggc gtactttact tcctgggatt taaggactct 12 6 0 

tatttgaata tgtaa 1275 

<210> 484 
<211> 237 
<212> DNA 
<213> B. fragilis 



50 



aaaggaaata gaatattctc ccggctacct tcgggaatgg ctacgcataa agggtatacg 12 0 
gctccctcaa tttatcagtt gtgtaacatt tataataaga cggaattgga ggtactttgt 180 
tccgcttcgg ctcatgtttt aactgtttat tttcatattt acctaattat ccaatga 237 

<210> 485 
<211> 1062 
<212> DNA 
<213> B.fragilis 

<400> 485 

aacctatatg ttatgaatta cggtataagt gtattgttca gagcaattcc cttggcaatg 60 

gctctgtttt gttttggcta cggggcgttt atcagtgcat atggcgatga ctctaaccga 12 0 

ctagtagcag gtccggtagt tttttcatta ggtatgattt gcattgcatt atttgcaaca 180 

gccgctacta ttatccggca aatcatacat acatacgggc gaggttcttt atatgcattg 240 



190 



cctattatcg 
cgatctacct 
actacttgta 
tcaaaaatga 
ttaaaaacaa 
gccaagagtg 
tgcatttgca 
tatacggata 
cttctttggg 
tacatcatga 
gcgaaaattt 
acggcgttgg 
gattatttta 
tcaattgtca 



gttatctggc 
ctacctcctc 
tagcaaccgc 
ttggtaacgg 
tagctatcac 
atgtacatcc 
caagcctgat 
gagaacgtaa 
gactgtttgt 
ttgggcttgg 
ggggaagaga 
cctgtttgtt 
tccctgcacg 
gtattcttga 



tgctgttgtc 
tttcgtagct 
agcaacatct 
tattcctgaa 
tatatcgttg 
tgcttatttc 
tgctttggtt 
aagatggcca 
tattttttcc 
gctggtttgt 
atttgctttg 
tctggcttct 
cgtattagcc 
gagtggaaca 



acaattattg 
ggacatgtag 
tctacccgtt 
ggagcgttta 
attgcctgga 
gtagcagggc 
gctactattg 
aaactcgtat 
gattcaagca 
tacagcattt 
gccaaccgca 
ttcgtttttg 
ggtttgggtg 
tcttcaaaat 



gcggaatttg 
ttgccggagt 
tttcattgat 
ccaagggcca 
tctgggcttt 
atgtgatggt 
cccgtcaaat 
tgttgatggg 
ctactaacgg 
caagtaaagt 
tccctttgat 
aattgggtac 
ccatttgctt 
aa 



catgtttacc 
gggactgata 
tcctgccaac 
agagcgtata 
cgttttatta 
aggtttagct 
cagaaatgtg 
aaccgtttcg 
agtaatcgga 
aattctgctg 
tccggtattg 
gacgcatgat 
cactctcttt 



300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1062 



<210> 486 
<211> 4932 
<212> DNA 
<213> B.fragilis 



<400> 486 

gaaatccccg 

caacaaatta 

tctaaccggc 

actgatggtt 

cttcgtcttc 

aaaggcgaag 

accgaaagag 

tgggacaatg 

accctgtcaa 

ttggccatgt 

tccggtgctc 

tcctccgctc 

gccggtgccc 

ggcctgctcg 

tcaggtaata 

ctgttgcctt 

agtgccggtt 

caacagatcg 

gtacatacga 

aatgttacag 

acaatcaaca 

ctt gctactg 

ctt tccgggg 

aaggccggta 

gggttgatag 

cgtttgaata 

accggcggtc 

aaactggatg 

actgctccga 

ggcaaacttt 

ggcacactct 

cctaatggaa 

gcattagcgt 

agagttattg 

agtggcttat 

aaaggttggg 

actaataata 

ggtatagctt 



tatacatacc 

tagatgaact 

atgtggcaga 

taaaagatat 

tgggaggtgt 

ctttcctaaa 

tttggaaact 

gaacatacac 

aagagggcgg 

ggggcgtcct 

ttgccggata 

ttaccggcta 

aggagattac 

attacgatcc 

tcactttcgg 

atgatccgac 

caagcttcga 

acaagtcaca 

accttaatgc 

agatagccaa 

aatggaagga 

ctctatctgt 

gtggtgattt 

catacactaa 

cctctgatat 

ccttcgtcac 

tgaaggtaaa 

gtaacctgtt 

cccttctcga 

ccgttatcgg 

atgaagcggc 

ccgcatcaca 

atggtggtga 

atagaagaaa 

ttcatatgtc 

gagaaggtta 

gattatatta 

ttcttgaaga 



gaacagttca 
cattgactac 
agtgctatac 
ctttatcagt 
cgaatttagc 
aaagttaact 
gaacggtaat 
cgcaccaact 
ccgactgtct 
cagtaaagaa 
tgcgacagag 
cgctacggag 
cggtgaaaaa 
gaccgaaaga 
ctgggacaat 
taccctgtca 
cgaatcctcc 
tctggataca 
cctgaaagga 
taccctgctt 
acttgaagcg 
caaagcggat 
gtctgcggat 
gctcaccgtt 
ccccacttta 
ccttgccggt 
cggtggcctg 
gatcacaggt 
cctgctacct 
tggcggtgga 
taacggagtc 
atttctcaaa 
ccaaaacagt 
tgacacaata 
tggcatgcca 
tgctacctgg 
tagggatggc 
ttgtaataaa 



cacctcaatt 
attgacaaag 
tggttgaatg 
aaaaagcaga 
agcggggatg 
ttaaatggtg 
atgctgatct 
cttctcgacc 
gtgatcaatg 
ggtgttcagc 
aaatttgtca 
accttcgtca 
gatttcaccg 
gtttggaaac 
ggaacataca 
aaagagggtg 
atgtggaccg 
gctcttgccg 
accggtcttc 
acctttctca 
ttcctggccg 
aaaacccgta 
cgtaccttgt 
gacgcttatg 
gagattagca 
gctcaggaga 
ctcgattacg 
agcacaacct 
tatgacccgg 
agtggcggtg 
atcacactgc 
gctgatggta 
attcagtatt 
cttccaacct 
tcctctaact 
gaacttgttg 
aaaggttctt 
gttgcaactc 



taattgatat 
cagtactcaa 
aaggattaaa 
tagacgaaac 
atccctacag 
gtctgataga 
caggtaacat 
tgctccctta 
ccggttctga 
agatcgacaa 
cagataaagg 
gagagaactt 
gcggactgaa 
tgaacggtaa 
cagctccgac 
gccggctgtc 
cgctcttgaa 
gatatgcaac 
caactacgga 
ccggatcaga 
gattctccga 
gcattattac 
ccctttctcc 
gtcgtgcaac 
aaatcaatgg 
ttaccggtga 
atccgacaca 
ggaatgcggt 
ccactctgtc 
gtggaaacat 
ctgatttata 
gtctggattc 
caagtaaatc 
cttatgataa 
ggtggtcagg 
ggccttcttc 
cttgggctac 
cttactttga 



ggccgatcag 
gcacagtgtc 
gaaagtttct 
caatttttta 
aatcacgcaa 
atatgatccg 
tactttcggt 
cgacccgact 
ctttgatgaa 
gtcacatttg 
ctatatcact 
tgtaaccctt 
agtgaacggt 
tatgctgatc 
ccttcttgat 
agtaatcggc 
aagtggttct 
agagaacttt 
aggatatcgc 
taccgactcg 
aacggatacc 
cggcaccggt 
ttccggaata 
gtccgcatca 
tttacaggat 
aaagaatttc 
taaagtctgg 
gggcgattat 
gaaagaaggt 
aatgttgaat 
tcagaaaaca 
caacctttat 
taattactta 
ttataatatt 
tattcatgtg 
aacagataac 
tgattggaaa 
gggacaaaat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 
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atatattcag actatgggtg gtgggtagtt gctttatgta aacttagtcc tgctgattct 2340 

gaatataatt atgcaagcgg tactatgttt tacagaagag gaaatggcat ttatcccaat 2 400 

ggttctgtac aatttaatgt tatcaaaagg tacaatcaaa caaatgttaa ctttggcgta 2460 

ttatacaatg gctatggtat aaatgaaggt gaagatgctc caaagccttg tacatttact 2520 

tataatggag ttaaatatgc cggtcttaaa tgggcgtctg ctgcaagttt agatagtatc 2580 

aaaaccctta tatatgatat aagtactaca ggattaccgt tttatgttaa gtatttcaat 2 640 

tctcagagtg gagaggtatt taacaccgaa ataaaaaatt ctatcgttga acttggaagt 2700 

gatatatctg gaactggact tggtacaata ggaactatta gaatattaaa tagaaaggcc 2760 

attgatatta aaggagaaga ctggggatat attcaacaag caagtgctga tagaatgttc 2 82 0 

catgttgcag tagctaaaag cactcaatct ggccttggag gaggacttgg caactatgaa 2 880 

atcagatgta acggaaccag tgaggaaggt atatttgtga gatacagtgg ttcatcttat 2940 

ggcaaattag gggtcgttaa tagaaatggt caagaatcaa gtattagcta ttacaataat 3000 

aatacagctg taggtgcaga taaaccttta tggactgtag gtgccggtat cagaaatgct 3060 

tatagtttcg attggtggtt tggaactaat ggatatagaa tgactcttga ttcagatgga 3120 

aaactattta ttaatagaac aaataacaat gaaggtggtc tttctgttag tttggctatt 3180 

ggagacagcg atactggttt acattggcaa gctgatggaa ttatagaatt taggtctaat 3240 

gctaagcaag ttggttattg gggatatact aatggaagat tatttaattg ttactttaga 33 00 

gagccaagtg gagtcactta tgaaaaagca tctctaatga ttaatggcaa tggttctact 3360 

atatctcctt ctatcggatt tcatcaacca ggagttgtag gatgtcactt agaattagac 3 42 0 

aatgggggta attttagatt taaagatagt tctggatata gaaatgtcta tgcaggtaac 3480 

cttattgctg atgcgggttt cctttattcc aggtataatg gtattgaaat caaaataggc 3 540 

tctgaaaatg attcatatgt acatttcatt actaaacctg cgagaagttt atattttgct 3 600 

aatagcttgt ttgtgaatgg aagtgtatta ccttatagta gttctaccta tagcttggga 3 660 

gatgccgggc acttatggaa ctatgtgtat ggtaaccatt ttatgggtaa ttctgcatct 3720 

gccacattta tacttccgaa ctatgtcggc ggacaacagg cgaatcctca aacctatttc 3780 

aataatagta tgggggtcaa agtagctatg acaggcgtta atcctgattc atattggggc 3840 

gatactttat ggattaatgg atatggtggt actgatgttc cggatatgtg tgctttgcat 3900 

ttttcaaggg gtggtgctcc tcttatttat ataagcagtc aaaaatatca cgctacaagt 3960 

tatggcacaa tgtaccatat atggaccggt tataactcaa accattcttc tgctgcctgg 4020 

acttgcagta ccttaaatgc aaacggaagg attagcacta catcagacat atattctgcc 4080 

ggttgggtca gggccggtgg aagtaatgga ttctattgtg aatcctatgg cggtggtatc 4140 

cacatgacag attcgacctg ggtacgtgtc tataacggta agcagttcta tgtcagcagt 42 00 

acttcttccg atgccatcca taccgccgga ggtattaacg caagtggcag gatttatgcc 4260 

ggtggtcacc tgagtactaa tggcggtctt gctgtaagtg gtatctatgg cggctcaggc 432 0 

gcatcaggtt ttaatgtgta tgctgtattc cagggcaggt cagaccatgg aggaatagaa 43 80 

gtgagggctt ctgacaatac ctttggtatc ggtgtacact ccaatgatca catgtactgg 4440 

tggtggggaa catcaacctc aaccaattcc agttctggaa aatcctatat catggactat 4500 

ggcggcggta attggagttt taccggtaac cactatgtct ccggttattc aacctggggc 45 60 

tccgactcac gttataaaac ctatctgggt gaagtaaccc tgcaattgga tcagatcgca 4620 

gactcaccca ctatctacta ccgctggaac agtaagaaga gagatcgtga cgggcttctc 4680 

catgtgggtg gttatgctca gtacaccgag cagatccttc cggaactgac tcatgatacg 4740 
agtaacttta aaacgatgga ctatgctgtc tgcgcttatg tatacgcagt gcatgcagcc 
cggttcctcc gggatcatct cctttcagac tataaatgga agtcagacac ggagttgaga 
atgtatgctt tggaaaagga aaatatcaaa ttgagaaaca gaattgaaca attagaaagg 

agggctgctt aa 4932 

<210> 487 
<211> 393 
<212> DNA 
<213> B.fragilis 

<400> 487 

acagtaaaag ttatggcaga aaagacaaga tattcggatg cggaactgga agaattccgc 60 

gccatcatta atgaaaaact ggaattagca caacgtgact atgaacagtt aaaactaagt 12 0 

ctaatgggac tggacggaaa cgatacagat gacacgtctc ctacatataa ggttttggaa 180 

gaaggagcga atacgttgtc aaaagaagaa accacacgtt tggcacaacg ccagttgaag 240 

tttattcaag gcttgcaagc tgctttggta cgtatcgaga ataagacgta tggcatctgc 3 00 

cgcgaaactg gaaagttaat tcctgcagag cgtctgcgtg ctgtgcctca tgctacactg 3 60 

agcatcgaag caaagaacag tggaaagaaa taa 3 93 



4800 
4860 
4920 
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<210> 488 
<211> 762 
<212> DNA 
<213> B.fragilis 



<400> 488 

atgagcatat taagtaaaaa tagaataaaa tatattcgtt cgctggaatt aaagaaaatc 60 

aggaaagagg aaaaagtttt tttagcagaa ggtccgaagc ttgttggtga tgtattagga 12 0 

tatttccctt gtaaactatt gatagcaaca tctgattggc ttgaggaaca tcctgcagtt 180 

caagcagcag aagtcattga agtaacttca gaggagcttt cccgtaccag tctgttgaag 2 40 

acaccacaac aggtattagc attgtttgaa caacctgaat atgaaatcga tatggaagct 3 00 

atccgcaatt ctttgtgttt ggctttggac aatatacaag atcctggaaa tcttggtacg 3 60 

atcattcgtc ttgccgattg gttcggaatt gagcacattt tttgttcgcc caacacagtt 42 0 

gatgtgttca atcccaagac aatacaagcc acaatgggag gaattgccag agtaaaagtg 480 

tattatacag ctttaccgga cttgatgcat tcgttaggga atgtacctgt atatggtact 540 

cttttagatg gggaaaatat gtatgaacaa cccttgtcga agaatggaat tataataatg 600 

ggcaacgaag gaaatggtat cagccctgag atagagaagc tggtaaaccg taagttatat 660 

atccctaact atcctgcaga acgagagact tcagaatcac taaatgttgc tattgctact 720 

gcaattgtct gtgcagagtt tcggcgacag gctgcattgt aa 7 62 



<210> 489 
<211> 1032 
<212> DNA 
<213> B. fragilis 



<400> 489 

caatatgtat ttctaccctc ggtgaacgat agtaattgta gaaagtacag aaagccgggg 60 

agtctttata atcagttctt tcggtcgtcg cataccctca agcgatacga gggacttatt 12 0 

gtattttata gggtatgtca ttttgtatct tctttgtttt tatcctatct ttgccaccaa 180 

actattaaaa aaactaaaat gaaaacttat ccggtcgttc tttccattgc cggctccgac 240 

tgttcgggcg gggcaggcat tcaggcagat ataaaaacga tttctgcttt gggagcatat 3 00 

gcggcatcgg taattactgc tgttaccgtg cagaatacaa gaggagtaaa agctgtgcat 3 60 

acagttcccg ctgagatagt gcagggacag attgaagcgg ttatggaaga tttgcgtccg 420 

gatgccctga aaataggtat ggtgagtgaa ccggcgcttg tgaagattat tgccgggtgc 480 

ctgctaaagt atcctcattg tccgatagtt tatgatcccg tcatggtttc aaccagtgga 540 

cggaagttga tggcaaaaga tgcaatacag ttgatcaaag aagaactttt tccacttaca 600 

agcctgatca ctcccaatct ggatgaaacg gaggtactga ccggaaagaa aatcacaaca 660 

gcagaagaga tgaaagaagc tgcccggcaa ctttcagaag agtatcatac agcggtattg 72 0 

gtgaaaggag gacatctgga gggaaacgaa atgcaggatg tgctgtttac cgatgggaac 780 

gcctatatat ataaggagaa aaagatagag agccggaatt tgcatgggac gggatgcacc 840 

ctttcttctt cgatcgccac ctatctggca ttaggacttc ctatggacca ggcagttggc 900 

aaggcaaaga gttatgtaag caaagctatt gatgccggca aggaaataat tatcggacat 960 

ggcaacggac cgttatgtca cttctggggc cctgagaaag cccggatatg ggacgataat 1020 

aaggt agaa t ag 103 2 



<210> 490 
<211> 207 
<212> DNA 
<213> B.fragilis 



<400> 490 

aatatatctt ttaaccctag attagtgttg acgggggtca tgcaagtgag attcgttaag 60 

aagctatttc cactcaagat atttttccgt ataatatcaa acgatctttt atattggctt 12 0 

aaagaaatag gttggggctg ccatattatt gggtatctga aaggtgtaac aacagacgtg 180 

cagtttgtaa aaccgttcag attgtag 2 07 



<210> 491 
<211> 201 



193 



<212> DNA 

<213> B. fragilis 



<400> 491 

atattgactt caaagaaaat gcattgcttc aaaaaggcaa tgcatttttt atcggtattg 
caaaaaaaat ataatgtgag catcttttta atatctgatc tgttatatca acaatataaa 
aaaagtaact atggactatc tatttttcta tctattcttt ttgttcatta tatggacatt 
tatcgtaaaa agtatacatg a 



60 
120 
180 
201 



<210> 492 
<211> 1242 
<212> DNA 
<213> B. fragilis 



<400> 492 

atgaacaatt 

acagtcgaac 

ttatctacta 

ggtggattat 

aaatcaatgc 

cctacattgc 

ttgacagata 

ggctcattta 

cgtgcacgtg 

acagttatcg 

ttctccggat 

cacaccgtag 

accaattggc 

caactttatg 

ggctggattg 

tggatggcaa 

tctgctctat 

aacattacat 

tct ccccgtt 

ttagggttta 

cgt tctgctg 



cccctcagcc 
tgtttgaacg 
ttttaggttt 
acttgcttcc 
ttgttgcctt 
ttgaaagtac 
gcgttttccg 
ttaagtctgt 
gatattctat 
atcctctccg 
ttatgactct 
gagaaggaaa 
gtttattgat 
ctacgatgcc 
ctaatgtaaa 
aacgaagtgc 
tgatggcatg 
tgatgatgat 
atctggaata 
agtcatcttc 
accaagtatt 



tgctgccaaa 
catggcttac 
taatgatttc 
tatttttacc 
ttcgttatta 
cgggttggtg 
ttggttgata 
tatttcggct 
tttctacatg 
aaatatgatt 
cattgctttg 
aagtatgcgt 
tctaatactt 
aaagtatgta 
tccttttgtt 
cattacctca 
cggcaatctt 
tgtcggtatt 
tttttctctc 
attctttttt 
gtccggatcc 



gggttcacaa 
tatgctgtgt 
gaagcaagta 
ggagcatatg 
actgccggat 
agctatgggg 
gtgccggttt 
tctgttgcca 
atggttaaca 
ggtgatcagg 
ttggctgttt 
gagatcggac 
attatcacgg 
attcgtatgg 
gtagtatgtt 
atgaatattg 
ttggataatg 
gtagttcaag 
caggctccca 
atcttccatt 
aactttgttt 



gagcctttta 
ttatcgttct 
tgatttccgg 
ccgataagat 
actttggttt 
caagtaccca 
tatttataat 
aggaaacaac 
tcggtgcttt 
catatattta 
tctttttgta 
aaggtttctt 
gattttggat 
caggtgaaac 
gcgttagttt 
gtatgtttct 
aagttgtttc 
gtttggcaga 
aaggtgagga 
ttcggatttg 
ga 



tgtcagcaac 
cactatttac 
tcttttttcc 
tggttttcgt 
aggagtattg 
ttttagtgga 
catgataggc 
cgaggctacc 
taccggaaaa 
cataaactat 
taaatcaact 
gcgtattgtc 
ggtgcagcac 
agccaaacct 
tgtaactcgt 
gattccggtc 
tggaatgagc 
atgttttata 
aggtatgtat 
gtcttgccgg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1242 



<210> 493 
<211> 987 
<212> DNA 
<213> B. fragilis 



<400> 493 

aaaaaatact 

ctgaatgcct 

gcgttatttg 

ctattccaag 

ggaataggat 

gatttgtttg 

ccggataagc 

caaaagcaag 

tatttatcac 

gtattacaaa 

tcaaaatcat 

cttgttaggg 

gatgtaataa 

ttggaccaaa 

attcagcgta 

cgtcccaatt 



cttcaatgat 
gctctgtcaa 
aaactgctaa 
aatctttaat 
atgtccttat 
gagaccaacg 
tgttagtttc 
atgagagaat 
ttcagttttt 
tgtatgaggc 
tgatggatag 
gatattactt 
gagatcatat 
agataaattt 
tagaaatgga 
gtatacatgt 



tagtgataca 
ctcttccggt 
atgtttgcaa 
aagaaaaaca 
ctatctaata 
cgaagcaata 
atataaaatt 
atattcaatt 
cgattggaaa 
ttatttaaag 
ttacgttaca 
aggtagtatc 
cagatatgga 
aacgggaatc 
tttgtttgaa 
tggatatcaa 



accatacgaa 
ctttacaacg 
gatactgaaa 
aatgattatg 
acaaataaat 
atcaaacatt 
atttattttt 
attgagaaaa 
aatatttatt 
ttggtcgatt 
ttatatagtg 
attactaaaa 
caaaagaata 
atagaaaacg 
gaaagtttag 
tatggattgg 



aacttgttga 
gaaaatcagg 
ttgaagataa 
gctttgaaaa 
taattgatgc 
ttgaaaacat 
tatttgtctt 
tatttcaagg 
acataaatag 
tttgcaatta 
agggaagaat 
ataatatggt 
ttaatccggc 
ctgatgaaaa 
aaaggataaa 
cccgttatct 



ttatatctca 
aatctcttta 
agctttcagc 
tggtatgtcc 
cgattttgaa 
tgacaagcag 
ggataaatta 
actggaactt 
taaagattat 
taaatatttt 
tgcgagttca 
tggttttaat 
tattctcttt 
tcgtgtaaaa 
aagaatggtt 
tggcttttgc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 



194 



gcgaataaaa aatttccttt actttaa 



987 



<210> 494 
<211> 312 
<212> DNA 
<213> B. fragilis 



<400> 494 

atgatggcat 

ctggcacctt 

ttggggtgta 

acgttcggtt 

caattaaaaa 

ataatattat 



cggtttcccg 
tcacaagtcc 
ccagttcagc 
cgattgtaca 
gtgttgactt 
ag 



aatattggcc 
ggcgatatct 
cagtttattt 
gaaagggaag 
acctacattt 



aggaacttgt 
acgatttcta 
aaacgttcgt 
tttgccgcct 
ggaagtccga 



ttcccagtcc 
ctgttgtggg 
cgggcacggt 
gtgctttcgc 
caataccaca 



ttcacctttg 
gacgatgcgg 
aattacgcct 
attggacaga 
ttgcaatgcc 



60 

120 

180 

240 

300 

312 



<210> 495 
<211> 615 
<212> DNA 
<213> B. fragilis 



<400> 495 

agagttggat 

ttgagaatta 

gctgtcaacc 

actgccggta 

ctgtggaata 

atggagaaaa 

aaaatatgtg 

ccgatgatta 

cggaaattat 

caatggtata 

tggataaaag 



cgtattgtgg 
ttcttaaacg 
aacttagtga 
ataaagatgc 
aaccggttgt 
atgattattt 
gttctaagtc 
ccgataaagg 
atactgatgt 
ctacacatgg 
attaa 



ctcttctcac 
tatcatttat 
taatttcttt 
ttttaatacc 
ttatgttttc 
cactctttct 
gggacgtgag 
caacgttctg 
gttgcggaaa 
aggtttgcat 



gaaagaagga 
aaaaaagatt 
gaaaccatca 
atgactgcca 
attcgtccgg 
tttttagggg 
gtggataaga 
ttcgaacaag 
gaaaactttc 
catgtgtatg 



ctttcggcaa 
tggatatgat 
gtaaagaatg 
actggggtgg 
aaagatatac 
aggagaataa 
tcaaggagac 
gaaggttgag 
tggatccgtc 
tggcggagat 



ctgcttctta 
gaaaccaatt 
gatgttggta 
tatcggattt 
gttcggcttt 
aagtattcat 
cggattgaaa 
tctggaatgc 
tgtgtatgaa 
tacgagtgca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

615 



<210> 496 
<211> 195 
<212> DNA 
<213> B. fragilis 



<400> 496 

aatataaata caaagaaaaa caatacatgg agttttgtta ttgaattcat atttaatatt 
atttataaag attatcaata cttagtttcc tctggctcct cgtcctttcg tcaaaccatc 
tctcagccct ttagaatacc atatcatatc actattgaca cccccgattt cactaccggc 
agaaatgaca gctaa 



60 
120 
180 
195 



<210> 497 
<211> 951 
<212> DNA 
<213> B. fragilis 



<400> 497 

aaagaatggt 

ttggcttttg 

atatctgttg 

gtcttagagc 

cgcacgtctt 

cataatttta 

cgtatggatg 

caagaatatc 

caaacgaatg 



tcgtcccaat 
cgcgaataaa 
taatgcctgt 
aatcttttgt 
ctattattca 
tagagtcatt 
gagatgatat 
cggatgtaac 
gtttattgag 



tgtatacatg 
aaatttcctt 
ctataatgcc 
ggattttgag 
gtcatataat 
gaaccttgga 
aatgcatatc 
tgtttgtgga 
taccttgagt 



ttggatatca 
tactttaata 
gaaatgcata 
tttatcctca 
gataaaagag 
atagagaatt 
gatagactaa 
acttggatga 
gggttggttg 



atatggattg 
acttgataac 
taaaagatgc 
tagacgatgg 
tacgtcttat 
ctttaggaaa 
aaattcaata 
acagtattgg 
aacaaccact 



gcccgttatc 
aatgtgtgag 
gatagaaagc 
ttctactgac 
tcagaacagt 
gtatatggct 
tgcgattatg 
aacatattca 
gttaaaattt 



60 

120 

180 

240 

300 

360 

420 

480 

540 



195 



acaaagggaa 
aatgcattaa 
gcaaagtcag 
gacagtcagg 
aatgaggttt 
gcttatggtg 
actttatttc 



atttcttatt 
aatatgagaa 
gagggagatt 
tcagtagcca 
tggaatatct 
atttatgtaa 
aaactttatt 



tcatcccact 
ctgcccttat 
ttatattgac 
aaaaagtagt 
gatggaactc 
gttgtacgaa 
ttcaaagaat 



actatgataa 
gccgaagatt 
agtcaaccat 
gagcaaagag 
aataaaaatg 
aaacaattac 
gaaaagaagt 



ggatggattt 
ttaaattttg 
tactctatta 
caacaacaga 
aatatccgga 
ttactaaatg 
tgaacctata 



tttgaaaaag 
ggtggagata 
ccggatatca 
gtctataatt 
attggcagca 
tgaagtatta 



600 
660 
720 
780 
840 
900 
951 



<210> 498 
<211> 627, 
<212> DNA 
<213> B. fragilis 



<400> 498 

cggcatataa 

attaaaggca 

agaatgcgtt 

gctatcttta 

cttgaaggag 

cttcagaatt 

cattttgcag 

atctctattg 

gttgcatctg 

atcgaagtgg 

ttcgaaatca 



ccgacaaacg 
ctcagacaga 
acacttattt 
cagagactgc 
gtatggttga 
tgcaggctgc 
atgttgcaga 
cagaaaaagg 
tattcgctaa 
gtaaagaagc 
agaaagaaaa 



taattaccaa 
aaagaatttg 
tgcaagtgta 
cgatcaggaa 
aatcactgca 
agcggcaggc 
acaggaaggt 
gcatgaagaa 
agaaggcgaa 
tcctgaggta 
ttattaa 



tctaaaataa 
ctgacatcat 
gctaaaaaag 
aaagagcatg 
agctatcctg 
gaacatgaag 
ttcccaatga 
agataccttg 
gttgtatggc 
tgtccggcat 



atcagattat 
ttgctggaga 
aaggttacga 
caaaacgtat 
ccggtgttat 
aatggtcttt 
tcgctgctat 
ctttcgtaaa 
agtgccgtaa 
gtctgcatcc 



gactaaaagt 
atcacaggca 
acagattgca 
gtttaagttc 
cggtaatact 
ggattatccg 
gtatcgcaat 
aaatatagaa 
ctgtggttat 
acaagcatac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

627 



<210> 499 
<211> 2049 
<212> DNA 
<213> B. fragilis 



<400> 499 

aacacaataa 

gttaatatga 

gaagaggtga 

aatgaattgg 

aaagagcagt 

tacatttata 

gatcggcaga 

gagtctgtta 

gatgtgcctt 

aactcggtga 

aaagatagtg 

ttctatctac 

ttgaatttcg 

tcttcgtctt 

aattatttcc 

tattcacatc 

acaggtttac 

agcggaggaa 

tactgtttta 

ggt cgttctt 

caaggagaag 

cct ttacaag 

atgtttgctc 

gctgcttctg 

tttccctggt 

gcaaattacg 

gaacgtgatg 



ccactctcga 
cgttgataac 
tggaaaactt 
aggaattaac 
tacagcggtt 
ttcacggatc 
ctattcaata 
cgttaaagca 
tgtataagcg 
aatatggatt 
gcgaaccttt 
ttcttcatga 
gtcagggact 
tcactttcag 
gtgggagtgg 
gttcgttgga 
accgaagcga 
atatcagtta 
acaggtctta 
tttataatct 
cagctttggg 
atatccggtt 
attcatttag 
taaatccttt 
ggaggtaccg 
ttccgtcgaa 
tgaccggtac 



tatgaacgaa 
ttctacttgc 
gtctatatcc 
cgatcttgtt 
tccttttctt 
gatgcagact 
tctgcttcct 
gattttaaaa 
taagggatac 
tcactatcgg 
cggagcttta 
tataggaata 
ggtactcgga 
gagtacaggt 
catagctcta 
tggagtgata 
aaaggaagct 
taccggaaat 
tgaaccggaa 
gggtatggat 
gatcagtggt 
gatgttggtt 
tgaaggaagt 
caaccgttgg 
gataagtaag 
aacggttgat 
gcagggaaaa 



agcatatcta 
catgctcaga 
gatgaagaag 
aacaaccctg 
aatgatgttc 
gtctatgaac 
tttgtctgtg 
tatggaaaac 
gagaaaaatt 
gaaaaggttt 
cataataagc 
ctgaaaactg 
caaggttcta 
atacgcagac 
aaatggaaac 
aaaggcggtg 
gataaaatga 
agttatcaac 
cttaaagact 
tataaatatc 
atggctttta 
caccgttatt 
agtgttcaga 
acattttttg 
gcctcgaaag 
atgtatgtga 
gtgattctgc 



tattgagtat 
atcgttcgga 
gagatatacg 
ttaatataaa 
agattgagaa 
ttcagttagt 
tagagcctgt 
atgaagcggt 
acttgggtcc 
atgcaggtat 
agggatacga 
gaattgtagg 
tgtttggaaa 
atacttctac 
aatggacact 
aaattacctc 
atcaattgac 
tgggtataac 
attcaaaata 
gttttcatcg 
tgaatcaggt 
attcccatga 
atgaaaacgg 
tttctgccga 
gagtggatct 
attaccgtta 
ccacctatca 



attcctactt 
ttatccttgg 
taattgggag 
ctctgccacc 
tttacttgcg 
ggaggagttg 
tgataagaaa 
gactcgtatg 
tgctgtttat 
agttgccgag 
ttactactct 
aaactatcgg 
aacagcttat 
cgacgaatat 
ttccgtattt 
gatttataaa 
tatgcagatg 
cggtgtttac 
caacctacat 
tttttctata 
gctttattct 
ttattgggct 
atggtatctg 
tctgttttct 
gctctttcag 
taaacaaaaa 
tcaccggctg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 



196 



cgttatcgat tgaactattt acgatgctct tcgttatttc tccggacaac agtcgattat 1680 

aatcattttc attcatccgg aaaaacggct ggacaaggtt atcaactgac gcagacggct 1740 

ggatggaaac ttccctggtt gcctctgaca gcagaattgc aaggtagcta ttttcataca 1800 

gacgattatg attcacgcat ttatatctac gagaaagggt tgctgtattc tttttatact 1860 

ccatcttttc agggagaagg tattcgcttg gctatttatt ttcgctatga tatgaacaag 192 0 

cattggacag caattgccaa gctcggacaa accacatatt ttgatcgtga tgaaataggt 1980 

tccggcaacg acctgatcag gggaaataaa aaaacggatg tacaaatgca gctgcgtcta 2 040 

aagttttag 2049 



<210> 500 
<211> 477 
<212> DNA 
<213> B.fragilis 



<400> 500 

aattcaacaa caatgaaaaa actgacaaga aaaagtttaa atgaactggc gaaaacaatg 

ccgataattg aagagtcatt gcaaatgagc tatgtggggg gaggaaatgg aacatcagcc 12 0 

aatccttata cccaagagga atatgaaagc atggttagta gtggcatatg gaatggagga 180 

tatgtagaaa attggggata tacttttcct gagatggcag tttcgagtta tgatcccaat 240 

aacttgccta aaacgggggt ggatagctat gatctaatgt atcaaggcgg gtttgctata 3 00 

gggtataagg ccgggttatc gggatctaca ttggatgaca tagggattgg tgcatggagt 3 60 

gctttagctg tcatttctgc cggtagtgaa atcgggggtg tcaatagtga tatgatatgg 42 0 

tattctaaag ggctgagaga tggtttgacg aaaggacgag gagccagagg aaactaa 477 



60 



<210> 501 
<211> 360 
<212> DNA 
<213> B.fragilis 



<400> 501 

ctttggcgga 

atggctgctt 

gcccaaaaac 

cgaaaatacc 

aaaacagttt 

tcggctgaag 



ctgtagaaaa 
tcgcattacc 
ctttcgtacc 
aaccgttaca 
ctcagaaaat 
aagcccgaaa 



taaaagtact 
tcttatcaga 
cattccggat 
ctcccaatcc 
agaaacgact 
agccattatc 



atggatgata 
cagatcaaaa 
actgaagaac 
acttctcaga 
ccggccaacg 
tggtccgaaa 



ttgtaaaagt 
agagcaaaac 
cggaagtcct 
aagtggaagt 
acccggaatt 
ttctaaatag 



cctcgtcatt 
agaaagatct 
gaaagtcacg 
aaaaaagaac 
taccattcat 
aaaatattga 



60 

120 

180 

240 

300 

360 



<210> 502 
<211> 660 
<212> DNA 
<213> B.fragilis 



<400> 502 

gttaatgctg caaggaaaac cattaatata aatttaatcc tttatttttg caaaatgaat 60 

atgaggctta ccataggact tttgatgtta tctatcgcct tattgttttc ttccgaatca 12 0 

ctggcacagg aaaaaacaaa tctcggtgga tacctggtac ctatgtgtgt gtataatggt 180 

gatacaatcc cggctttcca gattccgacc attcatatat tcaagccttt aaaattcaga 240 

aacagaaaag agcagatgga atattataaa ttggtgagaa atgtgaagaa agtgtatcct 3 00 

attgccagag aaattaaccg caccatcatt gaaacttacg aatacttaca gaccctgccc 3 60 

aacgaaaaag cccgccaacg tcatatcaaa cgggtggaaa aaggattgaa ggagcaatat 42 0 

actccacgaa tgaaaaagct ctcttttgca caaggcaaac tgttgataaa gctgatagac 480 

cggcaaagcc atcaaagttc ttatgaactg gtaaaagcat ttatgggacc ttttaaagca 540 

ggattctatc aaacatttgc cgctcttttc ggagccagtt taaaaaaaca atatgacccc 600 

gaaggagaag ataagttaac cgaacgagtg atactgttgg tagaaagcgg acaattgtaa 660 



<210> 503 
<211> 927 
<212> DNA 
<213> B.fragilis 



197 



<400> 503 

cattgttatt 

cgagagttga 

acagcttttc 

atctattatg 

atgacatttg 

tttatgctca 

ccgcttcagt 

tgtggtctcg 

attgcagcca 

gatatcatta 

ggtttcgtta 

caatcggtac 

aaagaaaccc 

gtaaaagtat 

aaagatatcg 

ggattcgacc 



taatacctac 
gagattatat 
tgattcctta 
caacaggttt 
ctattaagat 
ctttcttctt 
tggtaggaga 
gattaggggt 
ttgttaataa 
tcatcagttc 
ccttgtttat 
agttctttat 
atcggggagt 
tagttgtact 
atccgaatgc 
gcctgaaaat 



acacaaaatg 
tttcatcact 
tcagatcact 
ccccattcaa 
actcggtccg 
atggttcttc 
agggcaggac 
ggtatttaat 
atataaggac 
atgctatttc 
catcggtttc 
cttttcgaaa 
gaccgtactt 
tgcctacaaa 
ttttatctcg 
aaaataa 



attagtaaac 
ctcggactga 
acaggtggaa 
tggtcctact 
aaattcagta 
caactgatca 
tttatggctt 
aataacggca 
gttactctcg 
atctttaacg 
gttctggact 
gattatgcaa 
gacggattgg 
cgtcagtcac 
cagagttcgg 



ccacaaaatc 
taagttatgc 
caaccggtat 
tcatcatcaa 
taaagacgac 
ttgtggacga 
gcatcatcgg 
gcacaggtgg 
gacgaatgat 
actggcgcag 
atgtagtcaa 
agattgccga 
gctggtacag 
tcgatatttt 
taatcggtgt 



cgatgtaatg 
attaggctgg 
cggtgccatc 
cgctgtcctg 
atacgccatc 
taaaggagct 
agccatcatg 
taccgatatc 
catgttctgt 
agtgatattc 
cagcgcccgt 
ccgcattacg 
ccagaataat 
ccgtttagtg 
atatggtgaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

927 



<210> 504 
<211> 228 
<212> DNA 
<213> B.fragilis 



<400> 504 

gagctgatcg cgctgtctat ttttgattgc caggaaaaat tagtgtttgc aacccagtcg 

tacgatcggg cggattcacg catggagggc gcaaccgaac aagattgcgc taaaaaaaga 

accgatgcca ataatataaa gtatttcata accggtgttt ttataaattt tctatccgct 

tctgtatctc ctttattttt tcctgtcttt gttttatttt cattataa 



60 
120 
180 
228 



<210> 505 
<211> 438 
<212> DNA 
<213> B. fragilis 



<400> 505 

ttcataaatt 

cgcattgtct 

ttatggtgta 

gaacagatca 

ggccgtttta 

cattctatga 

gggaaatatg 

acaagaaact 



tactaacttt 
ggtaccaata 
agaatattat 
tacatactgt 
tccgtaagaa 
aattcggaag 
cttctgtact 
taatttag 



ccgccaaaaa 
tttcctgctg 
attagctact 
ttataccgtt 
aaccattcct 
tttctctgta 
tccggtgaaa 



tctgacaaaa 
gttgtattgg 
ctgatgatgt 
acggcagacg 
attgcagaaa 
accaactatc 
gaaaaggaat 



tgaaccgtat 
gtgttaatgc 
tgtttctgat 
gtctgctgtt 
ttacttctat 
tattgataga 
ttatggaact 



ctttcatgct 
ttttggtttc 
tgttgttatc 
acttaatcat 
ccggaaagtc 
atatggaaaa 
gattgaaaaa 



60 

120 

180 

240 

300 

360 

42 0 

438 



<210> 506 
<211> 636 
<212> DNA 
<213> B. fragilis 



<400> 506 

ttttcaaaat 

acaagaaaag 

aaaataaccg 

ttagttgtag 

gcagatgctt 

cccgagatgc 

aacgcagttg 

caggcggaag 



atatgctgat 
aattactttt 
agctgaaccg 
tgttggacga 
tccggattga 
ataagacagc 
aaacggttga 
ggagtatcat 



tcttttgaca 
gcaacatact 
gataagtata 
tatacggagt 
atgtatttat 
tttgggagcc 
taacctccgg 
gttggatgag 



ggttttaaac 
aacagaaacg 
gaagagttta 
ttgcataata 
ctgtgtggaa 
gagtttacag 
agtgaaggat 
ttaacactgg 



cgttatcaac 
acatcatcat 
aagaagctga 
tcggttctgt 
ttacggctac 
tggattggaa 
atgtggtata 
accgttcgaa 



acctatgtta 
gcgaaaattg 
taaattgcct 
gtttcgtacg 
tcctccccat 
gtatgttaat 
ctctgtcgaa 
gaaatatgct 



60 

120 

180 

240 

300 

360 

420 

480 



198 



gtagttatgg gaaatgaagt aaaaggagtg cagcaggagg ttattgacca ttcggatggt 540 

tgtattgaga ttccccaata tggcacaaaa cattcattga atgtatcggt aacagcagga 600 

attgtgatct gggatttatt taaaaagttg aaatag 636 

<210> 507 
<211> 1347 
<212> DNA 
<213> B.fragilis 

<400> 507 

actatgaaat accaggttat tattatcggt ggagggcctg ccggctatac ggctgctgag 60 

gctgccggga aagcaggact gagtgtgttg ctttttgaga aacaaaattt aggaggtgtt 120 

tgtctgaacg aagggtgtat cccgacaaaa acgttactct attcggctaa aacctatgat 180 

ggtgctaaac atgcgtcaaa atatgctgta acggttccag aggtcttttt tgatcttcct 240 

aagatcattg ctcgtaaatc gaaggtggta cgtaaactgg ttttaggggt aaaatcgaag 3 00 

ttaacgtcca ataatgttac tattataagt ggagaggcaa ccattttgga caagaatacg 3 60 

gttcgttgcg gtgaagaaac ttatgaatgt gataacttaa ttctttgtac aggttccgaa 42 0 

acttttattc ctcctatttc cgggatcgat agtgtaaact attggacaca tcgtgaagca 480 

ttagacaata aggagttgcc ggcttcactt gccattgtgg gcggtggagt gatcggtatg 540 

gagtttgcgt cctttttcaa tagtc£gggt gtgaaggtga ccgttattga gatgatggat 600 

gaaattctgg gaggcatgga taaagagctt tccgccctgt tgcgtgccga ctatgcaaag 660 

cggggcattc agttcctgtt aagtaccaaa gtcgtttcat tggcacagac ggaagaaggt 72 0 

gctgtggttt cttatgaaaa cgccgaaggg gcaggaagcg tgattgccga aaagttatta 7 80 

atgagtgtgg gacgtcgtcc ggtgacaaaa ggctttgggc ttgagaacct taacctgcaa 840 

cggacagaac gtggaagtat tgtagtgaac gggcaaatgg aaagctcgtt acccggagtc 9 00 

tatgtttgcg gtgatttaac cggattctcc ttattggcac atactgctgt ccgtgaagca 960 

gaagtagccg tacatgcaat tcttggaaaa gaagatagga tgagttatgc cgctatcccc 102 0 

ggggtagttt atactaatcc tgaaatagcc ggtgtcgggc agacagaaga atctctgact 1080 

gcaaaaggca ttgcttaccg tgccgtaaaa cttcccatgg catattccgg acgttttgtt 1140 

gctgagaatg aaggagtgaa tggagtatgt aaggtgctgc ttggcgagga tgatactatt 12 00 

ttgggagcac atgttttggg taatcctgct tctgagatta tcacgttggc agggatggct 12 60 

gtagaaatga aactgaaggc agccgagtgg aagaagattg ttttcccaca tcctacggtt 132 0 

gccgagattt tccgtgaagc attataa 13 47 

<210> 508 
<211> 252 
<212> DNA 
<213> B.fragilis 

<400> 508 

ggtcaggggt atttccaaca agacgagcct gcattttccg tgttcgatcc tgttcttgat 6 0 

tatttcaaca cccatgtcat tcatcgttcc ggtgccatgc acgctctccc cgatggcgag 12 0 

tatcctttta ccggaaaagg gcagggactt aagatcgccg ccgttagcgg gcatcacatc 180 
cgattcccgg acggaagccg ccgtgccgtt attcaaggat gggagttcaa ccgcatattt 
tccatcaatt aa 



240 
252 



<210> 509 
<211> 249 
<212> DNA 
<213> B. fragilis 

<400> 509 

tattatagtc ttttttatat tatcaatcag ttgtgtattt ctatcaaatg gaagagtaat 

cctgtaaagt tgaaaaagag caattactct gataatattc cgttttgggg tgcaaagata 12 0 

atcaatttta ggctaatgca gtcatctgct ttctcttttt gtccaaaacg gggttgcctt 180 

ttattacctg gatcacagac aatgagagag aatttgtgtc actctgccgt cttgaaagaa 240 



60 



atttcgtag 



249 



<210> 510 



199 



<211> 957 
<212> DNA 
<213> B.fragilis 



<400> 510 

caagcaacac ataaaatatc aataaatatg gaaagtacca acagacttcg ttatcttatc 60 

gcaggaaccg gaggcgtagg cggaagtata gccggctttc tgtcacttgc cggaaaagac 12 0 

atcacttgca ttgcccgtgg agcacatctg caagcaatac aacaagacgg gctcaaattg 180 

aaatcagatt tgaaaggtga acatgctcta cggataaatg cctgcacggc agaagaatat 240 

aacggaaaag ctgatgtgat atttgtatgt gtcaaagggt attccgtaga ctctatcaca 3 00 

gagcttatca agcgggcagc ccacgaccga acgattgtaa ttcccatatt gaatgtatac 3 60 

ggcacaggac cgcgcatcca acgtctcgta ccgggagtca ccgtactgga cggatgtatc 42 0 

tacattgtag gctttgtttc cggaccgggc gaaatcactc agatgggaac catctttcgt 480 

ctggtatatg gtgcacaccg gggaatcctt gttccggcag ggctgatgga ggccgtacag 540 

agggacttgc aggaaagcgg catcaaagta gaaatctctc ccgacatcaa tcgggatacc 600 

ttcattaaat ggtcgtttat ttcagccatg gcagtcaccg gagcttattt cgatgtcccg 660 

atgggagaag ttcagaaacc cggcaaagtg cgcgatactt ttatcggact ctctaccgag 72 0 

agcgctgctc tgggaaagaa actcggaatt gaatttaaag aagacatagt cacatacaat 780 

ctgaaagtaa ttgataaact ggctcccgaa agcacagcat ccatgcaaaa agatatagca 840 

cgcggacacg aatcagaggt acaaggtctg ctctttgaca tgataacagc agccgaagag 9 00 

caaggtatcg atgtgcctac ttatcgggaa gttgctaaaa aattcatcaa acaataa 957 



<210> 511 
<211> 600 
<212> DNA 
<213> B.fragilis 



<400> 511 

aataaactgc atacaatgaa acgtaaactt gttttcgcaa ccaataatgc acataagctg 60 

gaagaagtat ctgctatctt aggggataaa gtcgagctac tcagcctgaa tgacatcaat 12 0 

tgtcatacag atattccgga gacagcagag actcttgaag gcaatgcata cctcaaatca 180 

tcttttattt accggaacta cggattaaac tgctttgccg atgacacggg actggaagtt 240 

gaatcactgg gaggtgctcc gggtatctat tcggcccgtt atgccggtgg agaaggacac 3 00 

aatgcggaag ccaacatgtt gaaactcctc cacgaactgg aaggaaaaga caaccgtaga 3 60 

gcacagttcc gtacagctat ttcgctcatt ctggatgaaa aggagtatct cttcgaagga 42 0 

atcataaaag gcgaaataat caaagaaaaa agaggtgatt ccggattcgg atacgatccg 480 

gtattcgttc ctgaaggata cgaccggacc tttgccgaat taggtaatga aattaaaaat 540 

caaatcagtc atcgtgcttt ggctgtgaac aaactatgtg aatttcttcg ttcgatctga 600 



<210> 512 
<211> 1482 
<212> DNA 
<213> B.fragilis 



<400> 512 

agatattcta aaacattgaa aacattatta tatatattgg tattttcttt gtgctatacg 

aatgcatatt gtcaaagtat accaagggaa gtgacattag atgaagtgat aaacagacta 12 0 

tctctggaat catcatcggc taaaatagaa ttacttaact tccaaaatga cttattgcga 180 

tacgagaatt ataagaaaag ctttctccct gcatttgtgc tgaattttaa tcctatcaat 

tttaacaggt cactgcgatt attgcaacaa ccgatcgatg gaagttattc ttatgtagag 

gacaattcaa ataatactaa ttttggtact actgtacgac agaaaataag cataacagga 360 

ggggaactga gtattggaag taatataaat tatttgaatg agttttcacg taaacaaaac 42 0 

agttttagta caaatccgtt ttttataagc tattcgcagc agttgtgggg aggaggaaaa 

ttacaaaggt tggaaaacaa aattgaacgt gccaaaaacg aagtggccgt gaaacaatat 

tgttcaaaca ttgcccagat ccagcaacaa gcattgacgc tttatttatc cgccatactg 

agtaagatgg atagtgaact tgctatagat atcaaacaga gcaacgacac tctgttacat 

attgcagaga taaaattgag gaatggaagt atcactgaat atgattacaa gcagatggaa 72 0 

ttgcagtcct taaacttgca atacatgtat gaaaatgcgg tcaaacacta tgcggaatca 780 

atacaaaaac tttttacttt tttaggaata gaaaataatg ccgaaattac aataccggat 840 



60 



240 
300 



480 
540 
600 
660 



200 



900 
960 



tttgacttac ccttaactat cgatgctcgg cttgtaatct actatgtgaa aaaaaataat 

ccaatttcaa atcagcaaga gattcaacag ttggaagagg agaaaaacct gttctctatc 

aaattgaaga ataggttcaa tggaaatata agtttaaact atggaataaa tcagtatgct 102 0 

gaaacattgg ccgatgctta ccggcatgga aatacaagac agtccgtgat cattgaattt 1080 

caaataccta tttttcagtg gggcatcaac aaaaataata tccggatcgc aaagaataat 1140 

tatgatgcaa gtcggttgcg aatagaaaaa aaaatgtttg aatttgagaa cgaagtaaaa 12 00 

gaaaagataa atgcttatga tcatagtgta aagctttggc tgacagcatc aagagcctat 12 60 

gcgttatcga aagaacagta taagatgttg acgaaaaagt tttcattggg aaaagtgtcg 132 0 

gtatatgaac ttgccaccgc acaaaaagag cggaatgatg ctatgcagcg ttactactct 13 80 

gccatcaaag attcttacga aagcttcttt acattacgta atttggcttt atatgatttt 1440 

aaaaaaaatg tcgaattaga aaaaatactc ttcaatgatt ag 1482 

<210> 513 
<211> 579 
<212> DNA 
<213> B.fragilis 

<400> 513 

tttatttata aaccattaat aataactgca atgaaaaaat taacaaagaa aaatttaagc 60 

gaactggcga aaacaatgcc ggtaattgaa gagtccttgc aaatgagcta tgttggggga 120 

ggaaatggaa catcagccaa cccttatacc aaagtggaat tcgatagtat gcttagtaat 180 

gacaactgga acggtggtta tgtagaggga atgggatatg tggctcccaa tacgtatatt 240 
tatgggaatt cagtatactg gggatcggta tcacaagatt attatacatt tccagattat 
gtcacttctc tttcttcgga tggactaaat caaatggcgg aatcattggc aggtgcaata 

ccaggagtgg gttcttatac cgcttattta tctcaagagt taggtgatat gagtagagag 42 0 

attcaatctg aactgttaaa aaaaggatat aatggttctt cttcattcac aattgttcgt 480 

acgtatatgg gaagttctgt taaattttct gtatataacg cgaataatgg agaacttata 540 

acttccaaaa cgattaatat gttcggattt tggcagtaa 57 9 

<210> 514 
<211> 1521 
<212> DNA 
<213> B.fragilis 

<400> 514 

cacgaaaata atactataat tatggcattc aaatccattt ctgcagcaga agctgccagc 60 
cttgtcaaac atggctacaa catcggcctc agcggtttca cacccgcagg aacggccaaa 12 0 
gcggtcactt ccgaaatagc aaaaatagcg gaagcggaac acgcaaaagg aaatcctttc 
caaatcggca tctttaccgg agcctctacc ggagattcat gtgacggtat attatcacgt 
gtaaaagcca tccgctatcg tgccccttac actaccaacc ccgatttccg taaagctgtg 



300 
360 



180 
240 
300 



aacaacggtg agattgccta taatgacatt cacctttcac aaatggcaca agaggtacgc 3 60 

tacggattca tgggaaaagt gaatgtagcc attatcgaag cctgcgaagt aactccggac 42 0 

ggaaaaattt atctgacggc tgccggcgga attgctccga ccgtctgccg cctggccgac 480 

cagatcattg tcgaactgaa cagtgcacac agcaaaaaca tgatgggaat gcatgacgta 540 

tacgaaccac tcgatccgcc ttatcgccgt gaaattccga tctataaacc aagtgaccgc 600 

atcggactac cttacataca ggtcgatccg aaaaaaattg taggtatagt agagacaaac 660 

tggcccgacg aagcccgctc atttgcagca gccgatccta tcaccgataa aatcggtcag 72 0 

aacgtagccg acttcctggc tgccgatatg aaacgcggta tcattccttc tacattcctt 780 

ccgttacaat cgggagtagg caacatcgcc aatgcagttt tgggtgcatt gggacgtgac 840 

caaacaattc ctgccttcga aatgtatact gaggttatcc agaactctgt gatcggtttg 900 

attcgcgaag gacgtgtaaa attcggcagt gcctgttcgc tgaccgtaac caacgattgt 960 

ctgcagggta tatatgacga tatggacttt ttccgtgata aactgatcct ccgtccgtca 102 0 

gaaatctcta acagccccga agtagttcgc cgtttaggca tcatctctat caatacagcc 1080 

attgaagcgg atatctatgg taatgtaaac tctacccaca ttggcggaac caaaatgatg 1140 

aacggtatcg gcggttcggg cgactttaca cgtaatgcgt acatctctat cttcacttgt 1200 

ccgtcagtgg ctaaggaagg taagatcagt tctatcgttc cgatggtttc tcacctggat 12 6 0 

catagcgaac actctgtcaa catcgttatt accgaacagg gagtagccga tctgcgcggt 132 0 

aagagtccga aagagagagc acaagcaatc atcgagaatt gtgcacaccc ggattacaaa 13 80 

cagattttat gggattacct gaaactggca ggtaataagt cacagactcc tcatgccatt 1440 



201 



60 

120 

180 

240 

300 



caagccgctt taggaatgca cgccgaactg gctaaaagcg gagacatgaa aaacgtgaac 150 0 
tgggcagaat atgaacgatg a 1521 

<210> 515 
<211> 447 
<212> DNA 
<213> B. fragilis 

<400> 515 

agttgttaca atgaggtttt atataaaagc attagaaaaa ctaactcaaa aaaaaataga 

tctcaatata tgaatattgc atttttaaca acattaaatc cggctgatat aaataattgg 

tcgggaacga catttcattt gtttcacgct ctaagcagaa agcatcatgt aaaagtgatt 

ggacagaata cccttcctca ggcagcgtat tttaccaaag ataattgtat taaaaaaaat 

ccattagaga actatgtttc tgttttcggg aaattatgta ccgaacaatt gacgaattat 

gatcttgtgt tttttggaga tttatattta gctccttttt tggatgtaaa tgttccggtc 3 60 

gtgcatctta gtgatgtgac ataccactca ttccaaagct acttaaaccc cctaaagaat 42 0 
gaagaacggt ataggaaatt ggaatga 447 

<210> 516 
<211> 1374 
<212> DNA 
<213> B. fragilis 

<400> 516 

agaacaggca aaaagatgaa ttcaagaata caaaagcagg aacaacctat atgttcacca 

aaaattattt tgcctaatcc taacaaaaaa tctgatgtga ttgccagatc tgaggaagta 

caagccatta ttgaccgtat gcccacctac tggacaaaat gggtgatact atgtgtaggg 

gtactgatgg gaatgatcat attacttggt tttttgatac agtatcccga tacggtagac 

ggacaaatat cagtaaccgc aaatgcagca ccggtacgtt tggttgctaa cagtaacgga 

cggattacgt tgtttcaacc caataaagca ttactgcata aaaatgatgt gattagttgt 



60 

120 

180 

240 

300 

360 



480 
540 
600 
660 



780 
840 
900 
960 



atcgaaagtg gtgcggatta caaacatatt ttatggattg attctttttt gaagacactt 42 0 
aatgacaaaa gcacaattcg tgttgcattg cccgatacgc tgttgcttgg ggaagtcagt 
tccgcataca attccttttt actttccttt ttacaatatg agcggttact tacttctgac 
atttattcaa ccatgcgaca aaaattgcaa caacaaatta tttctgacga agcagtcatt 
gccaatttta ataatgagct gcgattaaaa aaacaaatat tggataactc ccaaaaccaa 
ttgagtaagg acagtatatt gctgtcgatg aaaggaataa gcgaacaaga ataccagcaa 72 0 
aagttctcga cacatctttc tttaaaagaa tcacaattaa atttgcaaag taaccgacag 
atgaaacaat cggaaataag tcgtaatcaa ttggaaatac agcgtatctg tttggaagaa 
actgaggcta aagagaaagc ctattccgat tatatcactc ggaagaatga actttcaaat 
gccattaaac tctggaaaga gcattatttg caatatgcgc ctgtagaagg ggagttagaa 
tatcttggtt tctggcggaa caatcgtttt gtacagtccg ggcaggagct attctccatc 1020 
attcccgata aaactaacat cttgggtgaa gtagtgatac cttctttcgg tgcaggaaaa 1080 
gtagaagttg ggcaaacagt aaacgtaaaa atggacaact atccatatga tgaatatgga 1140 
ttactgaagg gagtggtgaa atctgtttca cgcattacca acaagataaa aactcaaaat 1200 
ggagacatgg atacttatct ggtaatcata tcttttcccg atggtacatt aactaacttt 1260 
gggaaaatat tacccctcga ttttgaaaca aaaggtacag ttgagattat caccaaacga 13 2 0 
aaacgtttga ttgaaagact atttgataat ttaaaatcaa aaggagaaaa ataa 1374 

<210> 517 
<211> 1824 
<212> DNA 
<213> B. fragilis 

<400> 517 

ataatattaa atatgaattc aataacaaaa ctccatgtat tgtttttctt tgtatttata 60 

ttctataccg tatcatgtac cgctaaatta gagaaacaga catatacgaa tgtatatgat 12 0 

ttacattttg ctatgcggtc tgattctgcg gtggtttatc cgtggcgtga aaatggagca 180 

tatagcaatt atactatccc tgcttatata caagattcaa atcgaaattt gttcgctaaa 240 

aaatatttta aaggatttcc tttttctaag cggttaagat cagagtacga acagagaatt 3 00 



202 



ttgcttccca ataataacat aaaagaagct gtaatcggat ttgaaggtaa aggtgataat 3 60 

ataaaacttg tctctatcat cttggatgcg ataggtaaac aggaaaacat tcttttttct 42 0 

gacaccttaa gattcaggcc tgacagtata ttaagcttgg ttacccaaaa cattaatttg 480 

actaatgcgg agatgttaaa cgtacggatt aatgtggaag gagaaattga taagaatgct 540 

tatattgctt tctctcgatt ggacatactg attgacggta aacctatcga cgaatttcct 600 

gttcgaaccc tttccccgtt gatagtagat aaaaaaatta actatacggg tataaatgtt 660 

gatagaaaaa taggattgga gcaaatcaat gaaatcaatg ataaaaagat tatcggctta 720 

ggggagtcag tccacgggaa tgacggtata aaaaatttag cgtatcaatt gattatccag 780 

gcagtggaaa ggttaaattg caaattagta ctgcaggaaa tgcccctcga acaatcattt 840 

gcctacaata ggtttataca agatgacaat tatgaacttg attcttcctt ggttatcaac 900 

catgctacaa ttaatttttt aaaaagattg cggagcttta attctggtaa aacgaaagat 9 60 

tctaaagtta aattatatgg catggattac aattcaatcc tttcttccac tcaaagttcc 1020 

gctatggata tttttgattt tattaccggg cttaacaaaa aatcgcagat tccggaagtt 1080 

gatcaattat ccctgctgtt aatgaaaaaa gatcgtaact gtgcgataaa ctttcttgat 1140 

attcatcgag ataagataaa aaaacttctt actgctgaag aaatagaatg tatcttgcat 1200 

attctgaggg tttcaaagca agccggagat ggcggaatag aaagattcat acggcgggat 12 60 

tccattatgt tcgtaaatgc aagattctta attgacaagt tcgccaaaga cgaaaacgta 1320 

aaaacggtaa tctacgggca tgcgggacat attaatccta tttcgagtta tcctgccgta 1380 

ccttgtattc ctttcgggag gtatatgcgc aaagcgtatg gtgaaagtta ctctcctttg 1440 

ctatttctga taggaagtgg agaagccatg gcatatgatg agcattataa caggaaagat 1500 

aattggttga gtagtcctcc tgaaaacagt atggaatatt ttttaagtct tattgatgac 1560 

aatgtttttt acaccccctt aactgtcgat tttaatgaat taacactgtc tcgacttcag 1620 

gggagtcacc atatcccaca agaattttat ccatttaact tatatcaaag atttaaaggt 1680 

gtgtttttca taaaaagtac ggattgtacc cataaggatg aaaaagaaat ctcttttgag 1740 

aaagcttctg ataggcttat aatgaaaata aaacaaagac aggaaaaaat aaaggagata 1800 

cagaagcgga tagaaaattt ataa 1824 



<210> 518 
<211> 255 
<212> DNA 
<213> B. fragilis 



<400> 518 

atgaagataa aaaaatattg ccgttacatt 
ctgatcagca tcatttgttt tacaggcgcc 
ataatgggat atgattccat ccgggaaagt 
tgggttaagg atgataaccg tccgccaggt 
tcatctttat cctga 



cacttatggc tttcactacc ggcagggatc 60 
atccttgtat tcaaagaaga gcttctgaca 12 0 
cctttgatga tcgtgatgaa gctccaccgg 180 
aaaatgattg taagtatttt tacctttttt 240 

255 



<210> 519 
<211> 315 
<212> DNA 
<213> B. fragilis 



<400> 519 

aacatcaaaa aaggcagaag aaggttaatg ttcgattacc actctgtact gggattatat 60 

gcagcactta tcttattagt atgtgcactc accggattga tgtggtcatt tcaatggtac 12 0 

agagacatcg taagttttat ctttgatgcg gaagtaaaac gcggagcacc tatctggaaa 180 

atagtacgtg ctttacattt tggcacctat gcgggaatgt tttcaaagat cgtcactttt 240 

atcgctgccc tgataggaac ttcattacct gtcacaggat attggatgta tctgaaaaga 3 00 

aaaaaattac tatag 315 



<210> 520 
<211> 1617 
<212> DNA 
<213> B. fragilis 



<400> 520 

tttatgaaaa acaattgtct gatatgttcc ttattgtttg cttcgggaat tcagaatgct 



60 



203 



240 
300 



960 

1020 

1080 



tggggaactc aaataacaga ccgtaaagcg aatcctgatc aagcgaaacc caatataatt 120 

ctgattatgt gtgatgatat ggggttttct gatttatcgt gttatggcgg agaagtacac 180 
acaccacata ttgattttct ggcggaaaac gggatacgtt tcagtcaatt taaaaatacg 
ggacgcagtt gccccagccg ggcggctttg ctgacaggta gatatcaaca cgaagtaggt 

atgggctgga tgactgctgt ggatgaacat cgtccgggat acagaggaca gatatcggac 3 60 

cggtatccta caatcgcaga ggtatttcgt gaaaatgggt accacactta tatgagcggt 420 

aaatggcatg ttaccgttga aggagcattt acccaaccta atggaagcta tccggttgaa 480 

cggggatttg agaaatatta cggttgcctt tcgggagggg gcaactatta tactcccaaa 540 

ccggtatttt cgggtttgca gcgcattacg gagtttccga aagactatta ttataccaca 600 

gccataaccg attctgccgt tagttttatc cgtcaacatc cggttgatga acctatgttt 660 

atgtatttgg ctcactatgc tcctcatctg ccccttcagg ctccaaaaga gagagtagag 720 

gcttgtcggg aaaagtataa agcgggatat gacgtattgc gtaaacaacg cttcgaacgc 7 80 

atccgtcgca atggcttaat cgacattgaa agagaacttc cggtatttga aaaagagttt 840 

ggaggaaaac gtcccgcatg gaatagtctt actccgcagc agcaggaacg atggattacg 900 
gaaatggcta cttatgctgc catgattgaa attatggacg atggtatcgg agaagtaata 
aaagccacta aggaaaaagg tatatttgat aataccatat ttttattctt aagtgataac 
ggtgctacca atgaaggcga tatgatcacg caattgcgtg cagatttgag taatactcca 

tttcgcagtt ataagcaatg gtgttttcag ggaggtacga gtgctcctct gattatcatg 1140 

tacggaggcg gacaacctga tggaaaaaag gaagcggttc gtcacgaatt tacacatatt 1200 

atcgatcttt ttcccacttg cctggatatg gcttctattg aatatccccg ggaatttcga 1260 

aatcatgcca ttgatgctcc tggaggcaga acgattcttc cggcgttgaa aggaaagaaa 13 2 0 

ttatcgaaaa gagatttgtt ttttgaacat caaacctcct gtggcattat atctggagac 13 80 

tggaagttgg ttcgggctaa tggtaagcag ccgtgggagc tgtttaacct gttacaagat 1440 

ccgtttgaac agaacgattt atctgcccgt tacccggata gagtgaaaac attggaaaaa 1500 

aagtggaatc aatgggcaga aaaacaacag gtatttcctt ttgaatacag accatggact 1560 

aagcgtatca attattataa atccctgtat cccgatcaat ccggaaagga tttatga 1617 

<210> 521 
<211> 1017 
<212> DNA 
<213> B. fragilis 

<400> 521 

aagaagagaa aaaataagaa tataatgaat cgagaagaat gggtgaataa gggattcgtt 60 
gacgagcccg tagacaaaag cattgatctg aaagcagcca tcaatgaact gaaaaaagaa 12 0 
aagaatgcag taatcttggg acactattac cagaaaggcg aaatacagga tattgccgac 
tacattgggg acagtctggc tttggctcaa attgcagcca aaaccgatgc ggatattctt 
gtgatgtgtg gcgttcattt tatgggagaa accgcaaagg tgctttgtcc ggacaagaag 3 00 
gtgctggtgc ccgacttgaa tgcaggatgt tcgttggcag acagctgtcc ggcagataag 3 60 
tttgctgagt ttgtgaaagc acatccggga tatacggtga tctcgtatgt gaatacaacg 
gcagctgtga aagcggtgac agatgtagta gtgacttcga ctaatgcaaa acagatcgtt 
gaaagtttcc cgaaagatga aaagattatt ttcggcccgg atcgtaacct gggaaattat 540 
atcaattcga ttacaggacg tgaaatgctg ttgtgggacg gagcttgcca tgtgcatgaa 
cagttttcgg tggagaagat tgtagaactg aaagcacaat atcccgatgc ggtagtattg 
gcgcatcccg aatgtaagag tgtggtatta aagttggccg atatggtggg atctacagcg 720 
gctttattaa aatatgcagt gaacagtgac aagcaacggt tcattgtggc cacggaggca 780 
ggtatcttac acgagatgca gaaaaaatgc cctcaaaaaa cattcattcc ggctcctcct 840 
aacgatagta cctgtggatg caatgaatgt aacttcatgc ggctgaacac gctggaaaag 
ctctataatt gccttaaata cgaattcccg gaagtaactg ttgacccgga agttgccaga 
gaggcggtaa agccgattaa acggatgctg gagatttcag ctaagttagg cttataa 1017 

<210> 522 
<211> 1425 
<212> DNA 
<213> B. fragilis 



180 
240 



420 
480 



600 
660 



900 
960 



<400> 522 

aacactatga agaacaaatt atttatttta tttgcatttt gtatttcagt ccatgtttat 
gctcaacagc cctccaggga gataccttta aaatatggag ctaccaatat tggcaaacgt 



60 
120 



204 



cgccttgcca atctgatgaa ttccgaaatc ataggtaagg taaccgatgt atcctgggca 

ttcgttttcg aacgggtaga catgcaacca cggcatccgg cacaacttta tgaagcaatc 

gcctatttta tcctcttcct ggtaatgatg ttcctctata agaactatag caaaaaacta 

catcgggggt tcttcttcgg actttgcctg acagctatct tcactttccg cttctttgta 

gaattcctga aagaaaatca ggtggatttc gaaaatagca tggcactgaa catgggtcaa 



480 
540 



caggatgatg ctatgaagcg gtttcgcaac aatcgcttgg gagagtttat tcattgggga 180 

ctgtatgcta ttcccggtgg cgaatggaaa ggtaaagtat ataatggggc tgccgaatgg 240 

ctgaaatcat gggctaaagt ccctgctgcc gattggctgg aattgatgaa acaatggaat 3 00 

cctgttaagt tcgatgccag acaatgggcc cggatggcca aagagatggg agtgaaatac 360 

gttaagatta cgacaaaaca tcatgaaggt ttctgtctct ggcccagtca atacagtcag 420 
tataccgtag cgcagacgcc ttatagaaaa gatatcttag gtgaattggt gaaagcctac 
aatgatgaag gtatcgatgt acatttctat ttttcggtga tggattggag tcatccggat 

tatcgttatg agattacatc gaaagaagac agcattgctt tcagccgttt tctgactttt 600 

accgaccatc agttgaagga actggctacc cgttatccga cagtcaaaga tttctggttt 660 

gacggaactt gggatgcaag tatcaagaag aacggttggt ggacagctca tgccgaacaa 720 

atgctgaaag aacttgtacc gggagttacc gttaatagcc ggcttcgtgc cgatgattat 780 

ggtaagaggc actttgacag taatggccgt cttatgggag attatgagtc gggatatgaa 840 

cggcgtcttc ccgatccggt aaaagactta caagtgacta agtgggactg ggaggcttgt 900 
atgactgttc ctgaaaatca gtggggatat cacaaagatt ggtcgttgag ctatgttaaa 
accccgatag aggtgatcga tcgcattgtc catgcggtgt cgatgggagg aaatatggta 
gtgaatttcg gtcctcagcc cgatggagat ttccgttcgg aagagaaaga gttggcgatg 

gcattggggt gctggatgaa gaggtatggt gaatgtatat atggatgcga ctatgccgga 1140 

tgggataagc aggactgggg atactatacc cgtaaggggc aagaggtata catggttgta 1200 

tttaatcgcc cctattcggg gcttcttaaa gtaaagatcc ccaaaggtac cgaaatagaa 1260 

agagccgttt tgccggatgg acaggtggta aaggtaactg aaactgcccg gaatgaatat 1320 

aatgtggcca tgccttcgca agatccgggt gagccgttta taatcaaact acaagttaag 13 80 

gaggcttccg gagcagcaga cggatatcgg gacgcattaa cgtaa 1425 

<210> 523 
<211> 915 
<212> DNA 
<213> B.fragilis 



960 

1020 

1080 



<400> 523 

cagcagccga agagcaaggt atcgatgtgc ctacttatcg ggaagttgct aaaaaattca 
tcaaacaata aagaaatcaa tatgaacaac cttcttttat ctatcaactg gaacccaaat 
ccggaattat ttaatctttt cggcatctca atccgttatt acggactatt gtgggctatc 180 
ggaatattct ttgcttacat agtggtacac tatcaatatc gtgataagaa gatagacgaa 2 40 
aagaagttcg aaccgctttt cttttactgt tttttcggca tcctgatcgg ggcacgactg 3 00 
ggacattgcc tgttctatga tccgggatat tacctaaatc atttttggga aatgatactt 3 60 
ccggttaaat ttcttccggg aggtggatgg aagttcacgg gttatgaagg actggccagt 
catggaggta ccctcgggct gatcatttct ctctggctct attgccgcaa aacgaaaatg 
aattatatgg atgtggtaga tatgattgcc gtagccactc ctattacggc atgtttcatt 540 

600 
660 
720 
780 
840 

tggttaagca tcccgttcgt aattatcggc atttacttta tgtttttcta cggaaagaaa 900 
aagagtgtaa aatga 915 



60 
120 



420 
480 



<210> 524 
<211> 735 
<212> DNA 
<213> B.fragilis 

<400> 524 

catggcactg aacatgggtc aatggttaag catcccgttc gtaattatcg gcatttactt 60 

tatgtttttc tacggaaaga aaaagagtgt aaaatgaaac atataattga tattaaaacc 12 0 

tgggaaagaa aagaaaatta tgaatttttc cttggtttcc agaatcccac tatctccatt 180 

acttcagaag tagaatgttc gggtgctaga acacgtgcca aaaccgccgg agaatccttc 240 

ttcctgcact acctttatgc cgtgttgcgt gctgtcaatg aaatcaaaga gttccgattc 300 

cgcattgatt ctgaaggacg ggtagtttat ttcgatacag tggatatgct gactcccatt 360 

aaagtggcag ataacggacg tttttttaca gtacgacttc cctggtatcc tgattttaag 42 0 



205 



600 
660 



actttctaca cagaagccaa agccatcatt agcggaatag atccggataa agatccttat 480 

gaagcagaaa agacaggagg tagtgattta ctggatgtag tgctcctcag cgctactccc 540 
gatttatatt tcacctcact gacttgtacg caggaacatc gtcacggtgg taattacccg 
ttaatgaatg cgggtaaagc cgttataaga ggtggtgtat tagtgatgcc catcgctatg 

accattcatc atggatttat agacggacat cacttatctc tgttttataa aaaggtggaa 72 0 

gagtttctta aataa 735 

<210> 525 
<211> 1884 
<212> DNA 
<213> B. fragilis 



60 
120 
180 
240 



780 
840 
900 



<400> 525 

gcttataatg aaaataaaac aaagacagga aaaaataaag gagatacaga agcggataga 
aaatttataa aaacaccggt tatgaaatac tttatattat tggcatcggt tcttttttta 
gcgcaatctt gttcggttgc gccctccatg cgtgaatccg cccgatcgta cgactgggtt 
gcaaacacta atttttcctg gcaatcaaaa atagacagcg cgatcagctc ttatccgctt 

ttattgcatc cgtcatatga agctaaaggg agcgtggggt tcacggtacc ggttttttat 3 00 

cgtatggata aaaagcgggt gggtgttgaa gtgaggataa agtataaaac ggaaaactgc 3 60 

aatgatctgt gtttgaagct gagcggcatt ggtgaatgcg ggaaggtcat ttccgcggac 42 0 

acgtttcgat tgtctgccgc cgaggcgtgg acggtagccc gccggagcgt ggatatggct 480 

tctcccctgt tgctgggggt ggctcttgaa gcccgcgggg agaagcccgg gaaaaaggat 540 

tttccggccg atcctttagg atgggagaat aattccttta agcccgggga atactctaaa 600 

atatggattg actccttgga tatcttaatt gatggaaaat atgcggttga actcccatcc 660 

ttgaataacg gcacggcggc ttccgtccgg gaatcggatg tgatgcccgc taacggcggc 72 0 
gatcttaagt ccctgccctt ttccggtaaa aggatactcg ccatcgggga gagcgtgcat 
ggcaccggaa cgatgaatga catgggtgtt gaaataatca agaacaggat cgaacacgga 
aaatgcaggc tcgtcttgtt ggaaataccc ctgaccttat ctttccatat caaccggtat 

ttggaggggg acgagcggtt caagccggac agcatcgctt cctattttga caaggtctta 960 

ttttcttctt catccttcgt gtctcttatg cggtgggtca aagaatacaa ccggcatttg 102 0 

gaagaaaagg tgagcttctt tggcattgac cggaatattt accgcttaca aagcagtatc 1080 

gacctgtttt acttctttta cacgctccgc agaggtaaag gcgacgaagg cttgaaagcg 1140 

atatgcgagt ctcttctgtt gtcggacgag aagttccctt ttaaaggggc ggactctgtg 12 00 

ttgcatgcca atcatggctt caagggcata cttacccggc gggaagcgga aataatgagc 12 60 

tactgcctga attcggagga ggaagcgacc gctgatgaac tgaatcgttt tcggggcagg 132 0 

gattccggca tgtacgagaa tgcgaagttc ttaatgaaaa caatgcttaa aaaagatgaa 13 8 0 

acgactaccg tatattgtca tttggggcat gcgaattata caagtatcgc tggatggctg 1440 

agaccggaca tgcgaccttt cggagaatac atgaagggtt catacggtga tgactactcc 1500 

gccgtgggac tgcttgccgg agggggaagt tatctgacat gggtatttcc cggtaaaatg 1560 

ggaataaggc gattgcagtc ttcgtcgtct gctggattag aatactgtat cgaacgttcg 162 0 

ggtatcagtc cgtgttattt gccgatggat aaactgtccg atgcggatgt tttgaaaatg 1680 

agatatatag gaaatacaga atcgaaaatt ggacaattcc agtgggtttt tccaaaatgt 1740 

atgatggacg gagtgctgtt cacaaaaaac gcgtccgcca caaataagag ggaagagttt 1800 

tttaaaatga acttagacta tcatgtccaa actttatttg ctcttatgta tttgtatgaa 1860 

aagaaaagaa aatggattcc atga 1884 

<210> 526 
<211> 1125 
<212> DNA 
<213> B. fragilis 

<400> 526 

tataaaaaag actataatat tatggcattg caatgtggta ttgtcggact tccaaatgta 60 

ggtaagtcaa cactttttaa ttgtctgtcc aatgcgaaag cacaggcggc aaacttccct 12 0 

ttctgtacaa tcgaaccgaa cgtaggcgta attaccgtgc ccgacgaacg tttaaataaa 180 

ctggctgaac tggtacaccc caaccgcatc gtccccacaa cagtagaaat cgtagatatc 240 

gccggacttg tgaaaggtgc cagcaaaggt gaaggactgg gaaacaagtt cctggccaat 3 00 

attcgggaaa ccgatgccat cattcacgta ctccgttgct tcgacgatga caatgtaacc 3 60 

catgtggacg gaagtgtaaa tccggttcgc gacaaggaaa tcatcgatta cgaattacag 42 0 



206 



ttaaaagacc 
ggaggagata 
gaacagggca 
aaagaattgt 
agtgcggtaa 
gccgaaatcc 
gaagaccgtc 
attaaatcgg 
gtacgtgcct 
accgactttg 
tatggctcgg 
gtagtacagg 



tggaaaccat 
aagccgccaa 
aatcggcgcg 
tcttactcac 
atggaaacaa 
tggtagtagc 
agatgtttct 
cctacaaact 
ggacctacga 
agaaaggttt 
aggctgctgt 
atggagatat 



cgagagccgt 
acaagcttat 
tacggtaacg 
cagtaaaccc 
atacgtagac 
cggaaaaaca 
tgccgaaatc 
gttgaacctg 
aaaaggatgg 
tatccgtgcc 
caaagaagcc 
catgcatttc 



atccagaaag 
gatgtacttg 
ttcgaaacaa 
gtaatgtatg 
atggtacgtg 
gaagctgaca 
ggcctggaag 
gagacttatt 
aaagctccac 
gaagttatca 
ggaaaattgg 
cgtttcaatg 



tacaaaaaca 
ttcaattcaa 
aagacgaaca 
tttgcaatgt 
aggcagtaaa 
tcgccgaact 
aatcgggtgt 
tcactgccgg 
aatgtgccgg 
aatacgaaga 
gtgttgaagg 
tataa 



agctcagacc 
ggatgcgttg 
gaaaatagcg 
ggacgaagca 
ggacgaagac 
ggaaacctac 
ggcacgtctc 
tgtacaggaa 
agtgatccat 
cttccttcaa 
aaaagaatac 



480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1125 



<210> 527 
<211> 2208 
<212> DNA 
<213> B.fragilis 



<400> 527 

aatcaaaagg 

gattcgcaag 

tactcattgc 

gatctaagta 

gatgatgtgg 

atcgtggtat 

ataaaataca 

gtattacttg 

aagagaaaca 

ttaatattta 

aaagcggtga 

atagggaaca 

ttattgcata 

atgaaactac 

caagatcatg 

acgcttacat 

atatttttat 

aaaaaactgg 

accgtttcga 

tgggaagaaa 

gctcaaaatc 

tgtgctatgg 

attattggta 

tatgccaaaa 

ttactttcta 

atacattttc 

ccggaaaata 

aaactattgg 

gtaagtgcca 

aaaatattca 

acgcgtttga 

aagggttttg 

cgtttgttga 

acaaactctt 

gaacagcgta 

att gtggtgt 

aaaaaggggc 



agaaaaataa 
actgcggacc 
agttcatgcg 
ccggggcaga 
tgaacagcat 
atcattctga 
cgcatgaaga 
ccgtggaacc 
gcttttcgag 
ttattatgct 
ttgatgtcgg 
tctgtatctt 
tcacggcgcg 
ctgttacttt 
aacgtatacg 
ttgccgtctt 
caggatcggt 
attgggaata 
ctatacagga 
ttcaggcacg 
tgggtgccca 
cggttatcaa 
tgctcaatgg 
tcagtttctt 
ttggcagtac 
aatacacgcc 
aaatcacggc 
ttcggcttta 
ttaatctacg 
gtgataccat 
gggaagtttg 
aaacgaccat 
ttgctcgtgc 
tggattcaat 
ctgttgttgt 
tggacaaagg 
attattttga 



gaatatgcta 
tgcatccctt 
tgaccgttgc 
aagcatcggg 
tccgtttcct 
taggaaatac 
atttcgaaag 
aactactgat 
cattcttaaa 
cgttgttact 
cattaaaact 
gttgagtgta 
agtaaatatt 
ctttgagaat 
cagtttcatt 
tagtattatt 
tctgtacgct 
ttttgaactt 
tatcaaaatc 
gctttatcat 
atttatagaa 
gggtgaaata 
tccgcttgtg 
acgcatcaac 
aaccatcctt 
taactctcct 
aattgtggga 
taagcccagc 
ccaatggaga 
cttgaataat 
tcgtatcgct 
tggagaaacc 
gctgtatcgg 
aaatgaacga 
tattgctcac 
ttttatcgtt 
gttggtttct 



ctccaccgtt 

aaaattattg 

ggcattacca 

ctgcgaacgc 

gcaattgtgt 

atatgggtct 

ggttggtatc 

tttaagaata 

tatttttttc 

gtcttacaag 

tcggacagga 

atgattttca 

gctttgattt 

aagctgctgg 

atgaataatt 

ttattgattt 

tgttgggtgt 

ttgtccaaaa 

tacaattatg 

gtcaataagc 

aatatcaaaa 

acatttggaa 

caatttatta 

gagattcgtc 

ccggaaagaa 

ttggttctgc 

ggaagtggta 

catggagaaa 

aacatgtgtg 

attgtattag 

cagattgagg 

ggacgcgggt 

gatccgaaat 

aaaattgtga 

aggcttagta 

gagaccggaa 

tcacagatac 



ttcccgtaga 
ctaagcattt 
aagaaggtgt 
ttgccataaa 
tttggaatga 
cggatccagc 
aaagggatga 
gtaaagctga 
catataaaaa 
gtatgttacc 
actttattaa 
atgtgttgag 
ctgactactt 
gcgatatttt 
ctttggcatt 
acaatactat 
tactgttttt 
accaaagcta 
acaagtaccg 
gtgttcttgc 
atatggctat 
taatgatttc 
attttgtggt 
agttggaaaa 
aaacgattct 
gtaatattta 
gtggtaagtc 
taaaaatgga 
gggtggtaat 
atgatgaaca 
atgagataaa 
tgagtggagg 
ttctctttat 
atgccttgaa 
ccattcgtaa 
ctcatgaaat 
aagattaa 



ataccaaatg 
tggtaagttt 
atcgttactt 
atgtaccatt 
cagtcatttc 
aaaaggacgc 
aagccaaggt 
acaagaacag 
gagcttcggg 
atttatctct 
tatggtactg 
ggattggatc 
gataaaattg 
gcaacgggca 
gatattttca 
aattttctat 
gagcatacgt 
ttgggtggaa 
gcggtggaaa 
cataaccaat 
cgtgtttttc 
tacacaattt 
atcagcgcaa 
tgaggatgaa 
attagagaat 
cttacaaata 
aactcttctg 
caagatgaat 
gcaagatgga 
aattaattat 
cgcgatgcct 
acaaaagcag 
ggacgaagcc 
caatgcattt 
tgctgatcaa 
attgatggag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2208 



<210> 528 
<211> 1194 



207 



<212> DNA 

<213> B.fragilis 

<220> 

<221> unsure 
<222> (130) 

<223> Identity of nucleotide sequences at the above locations are unknown 
<400> 528 

ctaccattac tgcgcttggc ccggccggac gcagatgaac cttttcgtac tgaagtgtgg 60 
tataaaggta caatagaaca tgatacactt cgaggagata tctatgtggt tggcggattc 12 0 
gatccggagn ttgatgatga aagaatgaat gcattggtag aagaggtgat tactttccct 180 
ttctcggtat tgaaagggaa tatctacgga gatatatcga tgaaagattc tctctattgg 
ggaagcggtt gggcatggga tgatactccc tcctcttttc aaccttatct atcaccatta 
atgtatcata aaggcatggt gaaagtgaca gctgttccgg gggcgacacg aggtgactcg 3 60 
gcacgtttaa gctttgagcc gtcatcgtct tattatacta tgaccaatga aactaaaaca 
cgtacatcct ctgccggtaa gttttctgtg tcaagaggtt ggttggaaaa taaaaataat 
cttattgtca gtggaaatgt agagaataga agaataggtg atgtaaatgt atattcttcg 540 
caggactttt tcatgcatac ttttgtcgaa cgtttacgta ataaaggtat agagatttcc 600 
aatcattatg ctttcgacag tttccggtct gacagtcttt ctatctgtat ggcacgttgg 660 
gagtgcccgg ttcaggatgt gatagaccag attatgaaag agagtgataa tttgagtgcg 72 0 
gaagcactgc tttgccgttt aggtgcccgg gccacaggta agaagcaggt ttcggctaag 780 
gacggaattg aggaaatata tcggttgatt caggatttgg gacatgatcc ggataactat 
aagatagctg atggttgtgg attgtccaac tacgactacc tctctcctgc cctactggtt 
gattttctga agtttgctta ttcgcggaca gatattttcc ggaaattata taaggccctt 
ccggttgcag gcatcgatgg aacattaaaa aatcggatga aacaaggggc ggcgtttaag 102 0 



240 
300 



420 
480 



840 
900 
960 



aatgtacatg ccaagaccgg ttcttatact gctatcaata cattagccgg ttatcttaag 1080 
atggctaatg gacaccaagt ggcttttgcc ataatgaacc agaatatact ttcagccgct 1140 
aaagcaagga attttcaaaa taaagtatgt gagatactgg caaaccatca atga 1194 

<210> 529 
<211> 1584 
<212> DNA 
<213> B.fragilis 

<400> 529 

acactaaaac ttatggttaa aaagttcgat ttcctcgtaa tcggttctgg tatcgccgga 60 

atgagttttg ccctaaaagt ggcacataaa ggaaaagttg cccttgtttg caaaagcggg 12 0 

ctggaagaag caaacactta ttttgcccaa ggaggagtgg cctcagtgac caatctgctg 180 

gtagacaatt ttgaaaaaca tattgaagat acaatgattg ccggtgactg gatcagtgac 

cgtaccgctg tagaaaaagt cgtacgtgaa gctcccgcac agatacaaga actgatcagt 



240 
300 



420 
480 



600 
660 



tggggtgtaa acttcgataa aaacgaaaaa ggagagttcg atcttcaccg ggaaggaggt 3 60 
cattcggagt ttcgtatcct gcaccataaa gataataccg gcgccgaaat tcaggatagc 
ctgattcgag ccgtacaaca acatccgaat ataacggtta ttgaaaatca ttttgccatc 

gaaatcctga cacaacacca tttaggagta accgtcaccc gtcagacacc ggacatcaaa 540 
tgttatggag cctacatact cgatccgaaa acagggaaag tggatactta tctggccaaa 
gtgacattaa tggcaacagg tggagtaggg gctgtctacc agactacaac caacccgctt 

gtagcaaccg gcgacggcat tgccatggta tatcgggcaa aaggaaccgt aaaagatatg 72 0 

gaattcgtac aattccaccc gacagcgctt taccatccgg gcgatcgtcc ttctttcctc 780 

attaccgagg cgatgagggg atacggtggt gtacttcgta ccatggacgg gaaagagttc 840 

atgcagaaat atgatccccg tttgtctctg gcaccgcgcg atatcgtagc gcgtgctatc 900 

gataatgaaa tgaaaaaccg tggagacgac cacgtctacc tcgacgtaac tcataaagat 960 

ccggaagaga ctaaaaaaca cttccccaat atatacgaga agtgcctgag cctgggaatc 102 

gatattacca gagaatatat ccctgtagca ccatcggctc attacctttg cggaggtatt 108 

aaagtggatt tgaatggcca atcttctatc gagcggctat acgctgccgg cgaatgttcg 114 

tgtacaggtt tgcatggtgg caaccggttg gcttcaaact cactgataga agcagtggtt 12 0 

tatgcagatg ctgctgccag acattgttta tcggttatcg accaatatac ttataacgaa 12 6 

gaaattccgg aatggaatga cgaaggtacc cgctcaccgg aagaaatggt acttattact 132 

caaagcatga aagaagtcaa tcagatcatg agtacctatg taggtatcgt ccgcagcgat 13 8 



208 



agtgtgctca gtaatgctac ttttattaac cgtccggctg tattcgatgc gttgaacagg 

gtggataata acattttaaa gttggacacg gtggatgaag agtatatccg gactgtagat 

cgtccgaacg gacgatacga tctgaatgga acagtcggac ttttaaaagc ttttaaaggt 

aattgcatcg tgcagactat gtttatgaaa ggaaaatata aggggaaaga tgtggataat 



ctccggttga aacgtgcatg ggatcgtctg gatatcttat atgaagagac cgaaagcctt 1440 

ttcaagcgta gcgtagcatc taaagaaata tgtgagctgc gtaatatgat caatgtaggt 1500 

tatctgatta tgcgtatggc catggaacgt aaagagagcc gcggtcttca ctacacggtc 1560 

gattatccgc atgccggtaa atag 1584 

<210> 530 
<211> 786 
<212> DNA 
<213> B. fragilis 

<400> 530 

acagaattaa gtatgactat tatttttcct tctcctatat tcggaccggt tcattcacgt 60 

cggttagggg tctcacttgg aataaatttg cttccttcgg acggaaaagt atgttctttc 12 0 

gattgcattt attgtgaatg tggttacaat ggtgaacatc gtcctaaatc ttcattaccg 180 

acccgtgaag aagtccgtat ggctctggaa gagaaattaa aagagatgaa aagcaacgga 2 40 

cctgctcccg acgtactgac tttcgccgga aacggtgagc cgactgctca tcctcacttt 300 

ccggagatta tcgaggatac acttgctttg cgtgatgctt actttccgga tgcaaaggtg 360 

420 
480 
540 
600 

acttctgaca agtatgtact tccctggttg aaagttgtaa aggatattgc cccaagacag 660 

gtgatgattt atacgatcga tcgggaaact cccgatcagg acttgcaaaa agctactcat 720 

gaagagttgg atcgtattgt ggctcttctc acgaaagaag gactttcggc aactgcttct 780 

tattga 786 

<210> 531 
<211> 2679 
<212> DNA 
<213> B. fragilis 

<400> 531 

gtagatttga gggaaaccgc tatctttgcc ttgttattta tgaacttaaa aagaagacta 60 

tccgtgagca atgatattga attaaccccg atgatgaaac agtttcttga cctgaaggct 120 

aagcatccgg atgcagtgat gctgttccgg tgcggagact tctatgaaac ttattctacc 180 

gatgcgatta ttgcagctga aatattagga attactctca caaaacgtgc caatggaaaa 240 

ggtaaaaccg ttgagatggc gggatttccg catcatgcgt tagatacata cctgccgaaa 300 

ttgatccgtg caggtaagcg ggtggccata tgtgaccagc ttgaagatcc taagacaacg 360 

aagaaattgg tgaagcgtgg cattacggag ttagttactc cgggtgtttc gatcaatgat 42 0 

aatgtcttaa attataagga aaataacttt ctggcagctg ttcattttgg aaaatcggct 480 

tgtggtattg catttctgga tatttctacc ggagagttcc tgacggctga aggacctttt 540 

600 
660 

tgggtattta ccgaatccag ttcccgggag aagttgctga agcattttga aacaaagaat 72 0 
ctgaaaggat tcggggttga gcatctcaag aatggtatta tagcttccgg agctatcctg 
caatatctgg atatgacgga acatacacag gtaggacata tcacttcgct ggcacgtatc 
gaggaagaca aatatgtgcg tcttgataaa tttacagtgc gtagcctgga gttgatcggt 



gactatgtag ataagctgct gaataatttt gctccgaaag agattctttt cgaacgtggg 
aaacgcggaa tgtttgaggg aaatttcgga agtaagttct ttacttttga actggatgat 



780 
840 
900 



agcatgaacg atggtggcag cagtttgctt catgttattg acaagactat cagtcctatg 9 60 

ggagcccgtc tgttgaagcg ttggatggta tttcctttaa aagatgagaa acccattaat 102 0 

gaccggctga atgtagtaga atacttcttt cgtaaaccgg atttcaggga gttgattgaa 1080 

gacgaactgc atcggatcgg agatttggaa cgtatcattt caaaagtagc cgtcgggcgt 1140 

gtttctcctc gtgaggtagt acagttaaaa gttgctttac aagcaattga acctattaaa 12 00 

gaggcttgtc aacaggccga taatccgagt ttgaaccgaa tcggtgagca gttgaatctt 12 60 

tgtatttcta ttcgtgaccg gattgaaaaa gagattaata atgatcctcc tctgttgata 1320 

aataaggggg gagtcataaa agatggtgta gatacggaat tggatgagct tcggcagatt 1380 

gcttattctg gcaaagatta tctgcttaag atacagcaac gtgaaagtga actgacagga 1440 

atacctagtt tgaagattgc ttataacagt gttttcggat actatattga agtgcggaat 1500 

gtgcataaag ataaagtgcc gcaagagtgg atacgtaagc agacgttggt aaatgcggag 1560 

cgttatatca ctcaggaact gaaagaatat gaagagaaaa ttctgggtgc cgaagacaag 162 0 



209 



atcctggtat tggagactcg cctgtataca gaacttgtac aggcattgag tgaatttata 1680 

cctgccatcc agatcaatgc taaccagata gcccgcattg actgcctgct ttcatttgcc 1740 

aatgtagcca aagagaacaa ttatatccgc ccggtgattg aagataatga tgtattggat 1800 

attcgtcagg ggcggcatcc ggtaattgaa aagcaactgc ctatcggaga aaaatatata 1860 

gctaacgatg tgttgttgga taacgctacc cagcaggtta tcattattac cggtccgaat 192 0 

atggccggta agtcggccct gttaaggcag actgcattga tcaccctgct tgcccagatt 1980 

ggttcgttcg ttccggccga aagcgctcat atcggattgg tagataagat ttttactcgg 2040 

gtcggtgcca gtgacaatat ctctgtagga gaatctactt ttatggtcga gatgaatgaa 2100 

gcgtctgata ttctgaataa tatttcttcc cgaagccttg tcctgttcga tgaattggga 2160 

cgcggaactt ctacttacga cggaatatcc atagcttggg ctattgtaga gtatatccac 2220 

gagcatccga aggcaaaggc acgtacactt tttgctactc actaccatga actgaacgaa 22 80 

atggagaaat cctttaagcg tattaagaat tataacgtat cggttaagga ggtggataat 2 3 40 

aaagttattt tcctccggaa acttgaaagg ggcggaagtg agcactcgtt cggtatccat 2400 

gtggccaaga tggcaggcat gcctaaaagt attgtgaaac gtgccaatga aattctgaag 24 60 

caactcgaat ctgacaaccg ccagcaggga atttcgggta agccgctggc agaagtcagt 2 52 0 

gagaatcgcg gaggtatgca gttgagtttc tttcagcttg atgacccgat cttgtgtcag 2580 

atccgggatg aaatacttca tctggatgtg aataatctta ctccgattga ggcattaaac 2 640 

aaactgaatg atatcaaaaa gatagtcagg ggaaaataa 2 679 

<210> 532 
<211> 1800 
<212> DNA 
<213> B.fragilis 

<400> 532 

gttgttaata ttgacaataa ccttgtttat tttatgaaaa caaaattgcc gttattacta 60 

ttatttttcg tattgttttt attcaaatgt gatttaaaag ctgatcccgg ccataaaagc 12 0 

cctttagaat ataggtgggt taatcatccg ctggatttct atctgaatgt gaccgtagac 180 

agtaccacta ctccccattc attgttattt gaaacaatgt atgaaaaaaa aggaattgca 240 

agctttttac tacctatcta tcaactggag aagaatagcc ttacttttga gattaagatc 300 

360 
420 
480 



780 
840 



agatataaaa cggaaaattg cgagaatcta ttcttggcaa ttacctctgt cggcgattgt 
gagaacataa attccattga taccattcaa ctgaacgcaa cccaagattg gaaggagtgt 
acgcggattt tgaaaacaaa gaaagcatat tttttaaata tatctgttgg ggctgtcggc 

tacggccaac gcaagggcaa gatatggatt tctgatttag aggtgctggg tgatggaaaa 540 

gcaatcgggg acaaccccca acaggaatat aaaaaagaag atattcattt gaaagcaacc 600 

gatctgattc attggaataa caaagagtat gacaaccttc ctttcttaaa taagaaaata 660 

cttgggcttg gagaaacggc gcatggcaca gaaacgatga acgacatcgg cattgaaatt 72 0 
tcaaaggaac ggattctgaa acaccaatgt cggtttattt tgctcgaaat tccgttggaa 
ttttcccttt acatcaatag atacgtgcaa aatgacaaaa attttaaatt tgaatatatt 

tcagaacgtt ttgaaccata cctgttttcc gactccatct tatcctttat ccggtggatt 900 

aaagaatata attcggcgca taatcaaaaa atctctattt tgggatttga tttaaatacc 960 

acaccactat tgagcagagc agatttattt aattttttct ataacctgaa gtcgggcggg 102 0 

catgtcgaag aaattgatac catttgtgaa tctttactgg acagcaaaac ctcttttgag 1080 

aaaattattt ctaagttcga caaaagcatc cgtttagcag attgtttgga taaaggcgaa 1140 

ttgaaattga tacaccgatg cctggagata acaggacgga gttcaagcag ctatttccga 12 00 

tttgttgaaa gggatagata tatgaatgat atcgtaacat tcattattga ccattttctc 12 60 

aataccaatg aaaccgtcac tctctttgga catttggggc atctcaacta caaaggcaat 132 0 

agagtagagc taatggatta tttttcctta ggatattacc tgaagagcag atatgcaaag 13 8 0 

aattattcat gtattggatt gatcactaac cgaggcactg caatgcttcc ggtatctgct 1440 

acaaacggtg gagtaacaaa gttggaacag gcaccgcagg gaagtttgga atttcaagta 1500 

aacaaattga aaatggactc ggtttatttg tcaatgagca agtttacttg ttcggatgta 1560 

ttcctattaa gagagttagg ttccggtttt tcccaaaata agaaaatcat tccgaatcaa 162 0 

ttccagtata tgatcccgaa gtcaagaatg gaaggcgtta tttttacaaa agaatcagtc 1680 

aatttcatga aagggaaaga atttttcaaa aaaaatatga acgttgaagt tgttacaatg 1740 

aggttttata taaaagcatt agaaaaacta actcaaaaaa aaatagatct caatatatga 1800 



<210> 533 
<211> 1413 
<212> DNA 



210 



<213> B. fragilis 



aaacgtattg tgcccgattg cggacttact accgatatat tttccggctt ccattccgaa 
acggaagaag accatcggga atcactttcg ttgatggaag cttgtggtta tgatgcagca 



60 



420 
480 
540 



840 
900 



<400> 533 

agcagtatct ttgcggacgc aaatttaaat tctgaattta tgaacgaatt gacgggagcg 

gactttaaat ccgcaactgc tgatgacaac aagaagttgt ttatcgagac ttatggctgc 120 

caaatgaatg tggcagatag tgaagtaatc gcctctgtga tgcaaatggc gggttattcg 180 

gttgccgaaa cgctggaaga ggctgatgcg gtgtttatga atacctgttc tatccgtgac 240 

aatgccgaac agaagatttt gaatcgtctg gagttctttc attcgatgaa gaagaaaaag 3 00 

aagcacctta ttgtaggtgt attggggtgt atggccgagc gggtaaaaga tgatctgata 3 60 
gaacaccatc atgtggacct tgtagtagga ccggatgctt atctgactct tcctgagttg 
attgcttcgg tagaggccgg tgagaaggca atgaatgtag aactttcgac tactgaaacc 
taccgggatg tgattccttc gcgtatctgt ggtaaccata tctccggatt tgtatccatc 

atgcgcggat gcaataactt ttgtacctat tgtattgtgc cttatacccg tggacgtgaa 600 

cgtagccggg atgtggagag tatattgaat gaagtggccg atttggtatc aaaaggttac 660 

aaagagatca ctctgctggg gcagaatgta aactcttatc gttttgagaa ggaggggggg 72 0 

gaagtagtta ctttcccaat gttacttcgt ctggtggctg aggctgcacc gggaatacgt 780 
gtccgtttca ccacttcgca tcccaaagat atgagtgatg aaaccttgga ggtgattgca 
caggttccta acgtatgcaa acacattcac cttcccgtac aaagcggaag ttcgcgtatc 

ctgaaattga tgaatcgcaa atatacgcgt gaatggtatc tggaccgggt agcggcgatc 960 

1020 
1080 

tttatgttta aatattcgga gcgtcccggt acttatgctt ccaagcatct ggaagacaac 1140 

gtttccgaag agataaaagt ccgtcggctg aatgaaatca ttgctttgca gaatcgtttg 1200 

tcggccgaat ccaataatcg ttgcatcggt aaaacgtacg aagtgttggt tgaaggtgtt 12 60 

tcgaagcgtt cacgcgacca gctgttcggc cggaccgaac agaatagggt agtggtattc 1320 

gaccgcggta cccatcggat aggtgatttc gtgaatgtga gaatcacgga ggccagttct 13 80 

gccacattga agggtgaaga agtcttcagc taa 1413 

<210> 534 
<211> 687 
<212> DNA 
<213> B . fragilis 

<400> 534 

aagaacatca taagaataat gggaacaaac aacagtgatt tttatctgcc tgtatatgtc 60 

attaacctta aagagcgcac ggaacggcgg cagcatatag aggaacagtt tcaagggaag 12 0 

gtagagtttg ctctccattg gatagaggca atcgaacatt ccattggagc agtaggatta 180 

tggcaaagca tgctaaaggc tgtacaaaca gctatcgaca aaagggatga tatcatgatc 240 

atttgcgaag acgaccatat atttaccccc gcatataaca aagattattt gtttgccaat 300 

ataataggag caaacgctca aggttccgag ttgctttcgg gaggtgtcgg aggatttggc 3 60 

acagcggtac cagtggacac aaatcgctat tggatggatt ggttttggtc tacgcaattt 42 0 

atcattattt ttaagccgct atttcaaaag atattagact atgacttcaa agacactgat 480 

acggcagatg gagttttatc tgtccttgct aaagataaga tgactatcta tccgttcatc 540 

tccgttcaaa aagattttgg ctattcggac gtaaccgtct acaatgggac tccggggatg 600 

ataagcaact atttttctca ggcaaactac cgcttgagaa tgatacatca tgttagtcat 660 

aaatttaaag aacaggcaaa aagatga 687 

<210> 535 
<211> 717 
<212> DNA 
<213> B. fragilis 

<400> 535 

aatactgcca tcaattatag ttccgaatgg gcaaagcaaa gcacaataaa ttattatgat 60 

atagagccgg gtaaaattca tgtagtggaa tttggggcaa atatccctac tccttcagac 12 0 

tataaaatag atatacagac agatatttgt aatttagtct ttatcggaaa aaattggcag 180 

aaaaaaggtg gagataaagt tttaggggca tatagaaagc ttaaatccga tggatttcga 240 

tgtacgctta cgattattgg ttctattatt cgggaacctt atgatgaaga tgagaattta 3 00 
gttataattc cttatttaga taaatcccaa ccggaacatt tggaaagatt ttgtaatatc 360 



211 



<400> 537 

cttgtaagca ttatgaagaa actgaactta tttattttat tctctttttg tttttcgatt 



600 
660 



ttgcaggaag ctcatttttt agtacttcct acagagttcg acgcatttgg aattgtgttt 42 0 

tgtgaagcat cggcttatgc tgtacccagc attgccgcca atgtgggtgg agtgagtcaa 4 80 

ccggtacgtg aagggaaaaa cgggtatttg ctcatgccgg atgctacagc tgaagattat 540 

gctgagaaaa taaagtcggt tttcgctgac aaagaaaact atctgaaact ccggatgtca 

tcgcggcaag aatttgaaac ccgtcttaat tgggaggtat ggagcgagaa agtaaataaa 

atattggaag aaattgtaga agaacatcat aagaataatg ggaacaaaca acagtga 717 

<210> 536 
<211> 285 
<212> DNA 
<213> B.fragilis 

<400> 536 

gaacttcgca ttctcgtaca tgccggaatc cctgccccga aaacgattca gttcatcagc 60 

ggtcgcttcc tcctccgaat tcaggcagta gctcattatt tccgcttccc gccgggtaag 12 0 

tatgcccttg aagccatgat tggcatgcaa cacagagtcc gcccctttaa aagggaactt 

ctcgtccgac aacagaagag actcgcatat cgctttcaag ccttcgtcgc ctttacctct 

gcggagcgtg taaaagaagt aaaacaggtc gatactgctt tgtaa 2 85 

<210> 537 
<211> 267 
<212> DNA 
<213> B.fragilis 



180 
240 



60 



atcacttggg gacaagccaa ttttgcagcg attgattcac ttattaaaaa agaactgcct 12 0 

caaggttcgg aggttggtat ttccgtgtat gacctgactg cccgaaagac actttacacc 180 

tatcgtgata ccaaactttc gcgtccggca tctaccatga aacttttgac taccattact 240 

gcgcttggcc cggccggacg cagatga 267 

<210> 538 
<211> 1689 
<212> DNA 
<213> B.fragilis 



60 



<400> 538 

aagaaggaaa tgaaagtgtt ggatttcaaa ccaaggttat tctctacctt gaagaactac 

tctaaggaaa cgtttatgtc agatctgatg gcaggtatca tagtaggtat cgtagcctta 12 0 

cctctggcca tcgcattcgg tatcgcatca ggtgtatcac ccgagaaagg aattattaca 180 

gctatcattg caggattcat catctctctg ctcggaggaa gcaaggtaca aatcggagga 240 

ccgaccggag cattcatcgt catcatttat ggcatcatcc agcaatatgg agaagcggga 3 00 

ttaatcgtag ctacactgat ggccggcata ctcctgatcc tattaggagt atttaaattg 3 60 

ggagcgatta ttaaatttat tccctatccg atcattgtag gctttaccag cggtatagcc 42 0 

gtcactattt ttacaaccca gattgctgac atattcggat tgaatttcgg tggagagaaa 480 

gttccgggag actttatcgg aaaatggatg atctatttcc ggcatttcga cacagtcaac 54 0 

tggtggaacg ctgtcgtaag tattctcagc atcatcatta ttgccattac tccgcggttt 600 

tcgaaaaaga taccgggttc tcttattgct attattgtgg taacgatagg agtatatgta 660 

ttaaagacat atgccggcat tgattccatc gataccattg gcgatcgttt taccatcaaa 720 
tcagaattgc ccgaagcagc catacccacc ctcaactggg aagccatcaa ggatttattc 
ccggtggcca ttacaatcgc tgtattggga gctatcgaat cattactatc ggcaaccgta 
gccgacggtg tgacaggaga taaacacgat tcaaataccg aactgatcgc acaaggaaca 

gccaatctga tcacaccgtt atttggtggc atccccgcaa ccggagccat tgcccgcaca 960 

atgactaata tcaataatgg cggtaaaaca ccggtagccg gtatcattca tgctatagtt 102 0 

ttattgctga tcctcctgtt tctgatgcct ctggcgcaat acatcccaat ggcctgcctg 1080 

gcaggcgtat tagtcatcgt atcatataat atgagtgagt ggcgtacatt caaagcattg 1140 

ctgaagaatc ccaaatcgga tgtgaccgta ttgctgatca ccttcttcct caccattata 1200 

ttcgatctga ctattgccat cgaagtaggt ttggtgatcg cctgtatcct gtttatgcga 12 60 

cgtgtgatgg aaacaaccga gatatctgtc atcaaagatg aaatcgatcc gaatgacgaa 132 0 



780 
840 
900 



212 



ctggacattg 
attaatggtc 
ggtgaccgtc 
ggtattcaca 
ctctcgggag 
ctgggaaaac 
ataaattaa 



ccgtatgcga 
cgtacttttt 
ctaaagtacg 
acctgaccag 
taaacgagaa 
aaaacatctg 



agagcatctg 
tggtattgcc 
catcatccgc 
cctttgtaaa 
agtacacaaa 
cccgaatata 



ataatccctg 
accaaatttg 
atgcgtaaag 
atgtctcaaa 
ccccttgaga 
aatgtagcgt 



ccggcgtgga 
aagaaacaat 
ttccattcat 
aggaaaagat 
agtcgggctt 
tggacagagc 



ggtatatgaa 
ggcacaattg 
cgattcgacc 
cactatcgta 
ctatgaatta 
caaagaaatt 



1380 
1440 
1500 
1560 
1620 
1680 
1689 



<210> 539 
<211> 2433 
<212> DNA 
<213> B.fragilis 



<400> 539 

tttaatgctg 

atattcccat 

cgtggagtag 

ggtaccacca 

cgtaaagttt 

gatctcacct 

ctatcgactg 

acccgcatgc 

tcgattaccg 

actcttttcg 

gtacctattt 

tccgaaatgc 

ggtatcggaa 

aagttcacca 

acatttgacg 

gccttcgaac 

ccttcattag 

aatgataacc 

ctgtacgaca 

acgctgactt 

gcatatttcg 

aataaagaat 

tcaactttcc 

ttccaattag 

aacatagaca 

aaatttgttc 

gaagtaatga 

agcagtcagg 

ggcatcatgc 

agccttctcc 

cgccagtttg 

acatatttcg 

caacctaccg 

ttaagcggaa 

aaatacgaaa 

actgctaacg 

ggtataggag 

ggacacggct 

gcccagctgg 

gatgcattag 

aatttctcag 



ttaaatgtat 

cgttactttt 

tctatgacga 

tcggaacaac 

acaagataaa 

cgagaagtgt 

tagaagtgtt 

ctttacgtcc 

aacagggggc 

gatcgtatgg 

taaagaatgg 

agggagtaga 

acgacctcgg 

atgaaggaga 

tacagtccgt 

gttcggataa 

aatggcgtcc 

gtactcctta 

tgccacacaa 

atgccgcacg 

gctcttcata 

ataacatgcg 

agcttgactt 

gattcgacta 

ctatcaacgt 

ccgaaatccc 

cttttaataa 

acggcacaag 

tgactccggt 

atgcagccag 

aagtgggcat 

atatccttac 

gttattttga 

gtatacttga 

acagcccggc 

gatggattca 

tctatttcgt 

ccatgaccaa 

cctatagtat 

gatacaattc 

cagtaatctc 



gaagcaaatt 

tgccacagaa 

aactgatacg 

aaccaatagt 

tgtcagcttt 

agcgcaactc 

cggagaacga 

cagtgaacag 

acttaccgta 

cggagtaaga 

agtccggata 

aagcatacag 

aagtgcaggc 

agtatctttg 

tctggataaa 

ttatcgtccg 

ggatgacaaa 

taccagttca 

caaattcctg 

catcacccgc 

taaagtggat 

tagaagaacg 

tatcggcaga 

caaaaacacc 

actggccccg 

ggtagaatcc 

gtatataaag 

tgcaggtccc 

aaagaacatt 

acgaatggag 

caagtccgat 

taaaaacctc 

taaagcggga 

aaacctccaa 

ctttaaaaat 

atatcgattc 

aggtaaacgt 

cgaaaagcct 

ccacaagttt 

gtattatcgc 

ttaccatttt 



tatagtaccc 

ccagagtctg 

ccattggctt 

gaaggccgat 

gtaggatatg 

tcgtttacgc 

tataaacaac 

atacagagta 

accgatgtcg 

gaaagcatgt 

gattccgatt 

gtcatcaaag 

ggagtaatca 

cgggcaggca 

aatcagacga 

gtcatccatt 

acaagcgtta 

gtcaatctat 

ggattcaaaa 

cagctgacag 

aatacaagta 

atttcacgtt 

gatattttca 

gatctgtcga 

agcatctcga 

aattcttcaa 

gctatcctgg 

actaccggag 

aatttgttcg 

aacggagatg 

tggctgaata 

tcttatagta 

agtctgaaaa 

gtaatgatgg 

ggttcagccc 

gacaaaggag 

cctgttaatg 

tttgacatgc 

actgcccgtg 

ggaggttata 

taa 



tgcttttatt 

tcgatagagt 

cggccactgt 

ttattctgcg 

ccactcaaac 

tcttaccgga 

cgaaaaaact 

tttctgtaat 

caagaaatgt 

ccatccgcgg 

tccgtaccgg 

gttccgcagc 

acgtagtaac 

gctggggatt 

tcgctttccg 

ccaatcgcgt 

ccatagaaat 

cgaaagatac 

acgataatgt 

ataatatcag 

cttccgtaaa 

cattacgcga 

ccggccctgt 

tcaccaatta 

atgtgctacc 

gttatggaat 

gactgagata 

atgcctggaa 

gttcatatac 

aaataggtcc 

accgcctgcg 

cttatcatcc 

gaaaaggtat 

gttacgctta 

cgatgaatac 

tattaaaaag 

attttgcaat 

ccggttatac 

tatacctgaa 

tcaatcagat 



agttttacta 

ccctgccatc 

ccaaatagaa 

taatctggca 

ccgtacagtc 

cgataattta 

ggatgccatc 

ctcagaaaaa 

acccggagtt 

ataccgggga 

ctctgcttta 

cgtcacacaa 

caaaacaccc 

gttccgcccc 

tatgaacggt 

atacatcaac 

ggattatctg 

ggaagagaat 

aaacaacaag 

tgtgcgggca 

aaccgtagtc 

tgaccgcaac 

taaacataca 

tacccctgtc 

tgtcgcggtt 

aatggcacag 

cagttatatc 

tccgatgttg 

aactactacc 

gtcgaagacc 

ctttaacctg 

gggaactact 

tgaaaccgaa 

tctggatgca 

cccgaaacat 

actatcagcc 

caagccggac 

tacaataaac 

caacctgttc 

cgatccacgt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2433 



<210> 540 
<211> 1119 
<212> DNA 



213 



<213> B. f ragilis 



<400> 540 

acacatgcag 

tatggttcac 

aagaagtttg 

caagaggctg 

gatttctacg 

ccgggaaatt 

atgggagcat 

ccgatcttaa 

acaagcggag 

caatggtcta 

gaccataact 

tttgtctgga 

tacccgaccc 

tat accgcaa 

ggcgatacaa 

gtgggccgcg 

ataaatgtca 

gggaaagagt 

ataaaaccct 



tgatgagtat 
tggactctcg 
tcaaagacag 
taagtcgttt 
ggcagaagtc 
acggaccgag 
gggttaccga 
tccacgaatt 
aacagattta 
ttgtccttac 
tcccggctgt 
taagcaagct 
tcaacagcta 
actatgattc 
ccgttcggag 
gacactcttt 
actacgccaa 
atggaatcac 
acgaaatttc 



ggccatcaat 
ctggaatcgg 
tcgtttcgat 
tatgcctatt 
caacgaccgg 
tgtcaccgac 
ttccgtagga 
caaccattca 
tgctgcggtt 
cgaagcaatg 
cgaaatcacc 
ggtcgatgaa 
tatgccgcgg 
tatccgtccc 
cgacatcaaa 
caattatggt 
tgacaaccgt 
tcttttagga 
tttcaagacg 



ctcgataacc 
aatcaagtag 
gccttttatc 
tacaaaagca 
tttcacatta 
aaggaaaaca 
atggttgttt 
ttcataaact 
ggcgaacaga 
gtacgtgcgg 
aaagaaaccg 
ctcgagaagt 
ctggcagaag 
aaagtagttt 
actattactg 
cacctcggaa 
accgtgatta 
ctctctttcc 
gcagagtga 



agttcaatct 
gcccattcat 
actcgaacga 
tagataccca 
tactttcaat 
tacataatgt 
atccgcctga 
tcgacccgga 
tggcgcgcca 
cagtcataaa 
tcatacaaaa 
attcctctga 
cttatacagg 
cgatcgacga 
tccacttcga 
tggaagcaat 
tcggagtaga 
gcactccgga 



gcccgcagat 
aaaactgcta 
gaacttatat 
atggtataat 
gtccaatggc 
cttttcagtc 
attaatcttg 
aatgttccgc 
agcctatgga 
atacatgaaa 
aacacgtggc 
ccgcacgaca 
ctttgcccaa 
atttaccaac 
ccgtcctttg 
gcctaaaatt 
attgttaccc 
aggagatgcc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1119 



<210> 541 
<211> 1122 
<212> DNA 
<213> B.fragilis 



<400> 541 

atacctatat 

gcctgtttct 

tctactaccg 

atccctttga 

atactgatag 

aataatttat 

ggaggtccca 

attttcgaca 

gaaccgatcc 

aaagccggtt 

gacagcacca 

atggtattct 

atgttcaacg 

attgaattag 

aaaagtgtct 

ttcttttctg 

aaatccaata 

tcattcatac 

gagaatgatg 



tagatcccat 
ttgtactctc 
aatgtcagac 
aagaatgggc 
gtaatatccg 
cccttttcga 
ccgaatatag 
tagccaacaa 
cggaaagcaa 
atatacagaa 
tcctggctaa 
acaatgaatg 
ataccatttt 
gaaaatacaa 
ttgacaatgc 
caagggccaa 
gcattcaaat 
ctaagtgtat 
aaaacccggt 



aattaatccg 
cctgtttgtc 
tatcgacttc 
aaaatccata 
cgcaacaatc 
tctgtcgggc 
cggcattaat 
aattaaaaca 
tatcaaagaa 
tattaccgga 
aataccctat 
ctatccattc 
ttccatcgat 
aatagcagaa 
agcgacttta 
taaacagaat 
ttcttacccg 
gtctgacgac 
gatcatatta 



ttacacaata 
tgctccatgt 
tctacccttt 
cactttgtcc 
ctacataaag 
aaatttattt 
aatgcttgga 
tataactgga 
gtatttcccc 
aatgaacctc 
ggcaaatcct 
catgcaaatg 
aatcaatatc 
gatgcccgct 
actcccatcg 
tatctttttt 
gaaaattcat 
ggaaaatatc 
gcggaaaaat 



caattatgaa 
ttacttcatg 
ttgacggcca 
aattagaaac 
acaaaatact 
gtaacatagg 
cagatgatga 
acggaaaatg 
ttgcatcagg 
acaaaattta 
ttcaaaaagg 
gccggacttt 
aacccgttcc 
acacattaac 
gaaaatggga 
actatgacct 
tcgccatacc 
tgatttcgta 
aa 



aaacaaatcc 
taacaaagaa 
accggaaaag 
caatgattcc 
ggtacatcac 
tagcaaagga 
aggcattcac 
gataaagacc 
taataatata 
cctattcaaa 
agagatgaca 
ctttaaagag 
acgttggtac 
cgatcccaga 
taataaactt 
gaaagaaaag 
ggaagaacat 
tgaaatacag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1122 



<210> 542 
<211> 2898 
<212> DNA 
<213> B.fragilis 



<400> 542 

tcatgtaaat ttgctccctc atataaagca tatattataa atcactataa acaaaaggtt 

atcgctatgg aatataattt cagggagatc gaaaagaaat ggcagaaaat ttgggtagac 

aaccatacct accaggtaaa tgaagatgca tccaaacaaa aattctatgt actcaatatg 

ttcccctatc cttcgggtgc cggactgcac gtaggtcatc ccttgggata tattgcctca 



60 
120 
180 
240 



214 



600 
660 



840 

900 

960 

1020 

1080 



gacatttatg cgcgctataa acgcctgcaa ggatttaacg tattgaaccc gatgggatat 3 00 

gatgcttacg gactgccggc cgaacaatat gctattcaaa ccggacagca ccctgcaatt 3 60 

accactgtta ataacatcaa ccgctatcgt gaacaattag ataagattgg tttctctttc 420 

gactggaatc gtgaaattcg cacttgcgat ccggaatatt atcattggac ccaatgggca 480 

tttatcaaaa tgttcaacag ctattattgc aatgatgaaa aacaggcacg tcccatcgaa 540 
gagttaatag aagcttttag taccaacgga acacaaggta tgaacgtagc ctgtggcgaa 
gaaatggact tcactgccga cgaatggaat gctaaaagcg aaaaagaaca acaggaaatc 

ctcatgaact accgtatagc ctatttgggc aatacaatgg taaactggtg tccggcattg 720 

ggtaccgtac ttgccaatga tgaagtagtt gacggtgttt ccgaaagagg cggttatccc 7 80 
gttattcaaa aagtgatgcg tcagtggtgc ctgcgcgtat ccgcttatgc acagcgtttg 
ctcgacggac ttgaaaccgt agaatggacc gactcgctga aagagactca acgcaactgg 
ataggccgca gcgaaggtgc tgaaatgaac tttaaagtga aagattcgga tattgaattt 
actatcttta caacccgtgc cgacacggta tttggagtta ctttcatggt gcttgccccg 
gaaagtgaat tggtagccaa gttgaccact ccggaacaaa aagccgaagt agatgcttat 

ctggaccgta ccaaaaaacg taccgaacgc gaacgtattg ccgaccgtag cgtaagcggc 1140 

gtattctccg gaagctatgc cattaaccca ttgaccaacg aacccatacc ggtatggatt 12 00 

agcgattatg tattagcagg ttacggtaca ggtgccatca tggctgttcc tgcacacgat 12 60 

agccgtgact atgcttttgc caaacatttc aatctggaaa tccgtccgct gatcgaaggt 132 0 

tgcgacgtca gtgaagaaag cttcgacgcc aaagaaggca tcatgatgaa ttctccccgt 1380 

ccgggagctc ccgaaggcgg actcgtattg aacggattga ccgtaaaaga agcaatcgcc 1440 

aaaactaaag aatatatcaa ggcaaccgga ttaggccgtg tgaaagtcaa tttccgtctg 1500 

cgtgatgcaa ttttctcacg tcagcgctat tggggcgaac cattcccggt ttactacaaa 1560 
gacggaatgc cttacatgat tgatgaaagt tgcctcccgc tggaattgcc ggaagtagcc 
aaattccttc ctaccgaaac cggtgaacca ccattgggac atgccactaa atgggcatgg 

gacactgtca acaaatgcgt gacagacaat gaaaacattg ataatataac cattttccca 1740 

ctggaactga atacgatgcc gggatttgcc ggttcatccg cctattatct gcgctatatg 1800 

gatccgcgca atcacgaagc tctcgtttct cccgccgtgg atcagtactg gaaaaatgta 1860 

gacttatatg taggaggtac ggaacatgct accggacact tgatttattc acgtttctgg 1920 

aataagttcc tgcacgattg gggtatctcc gtagccgaag agcctttcca gaaacttgta 1980 
aatcagggaa tgatacaagg acgaagtaac tttgtctacc gtatcaagga taccaatact 
ttcgtatctc tcaatctgaa agatcaatat gaagttactc ctatccacgt agatgtcaac 

atcgtatcca acgatatcct cgacctggaa gctttcaagg cttggagacc cgaatacgaa 2160 

actgccgaat ttattctgga agacggcaag tatatctgtg gatgggctgt tgaaaagatg 2220 

agtaaatcta tgttcaatgt ggtaaatccg gatatgattg ttgaaaaata cggtgccgat 22 80 

acactccgta tgtacgaaat gttcctcgga ccggttgaac agtccaaacc ttgggatacg 2 3 40 

aacggaatag atggcgtaca tcgtttcatc aaaaaattct ggtcattgtt ctatgacaga 2400 

aacggcgaat atctggtaaa agacgaaccg gctaccaaag aagaactaaa agcactccat 2460 

aagttgatta agaaagtaac cggtgatatc gaacagttct cttacaacac ttcagtaagt 2 52 0 

gctttcatga tctgtgtcaa tgaactttca agtctgaaat gcaataaaaa agaggtattg 2580 

gagcaactca tcgtagttct tgcacctttt gctccgcatg tatgcgaaga gttatgggat 2 640 

acattaggaa acatcacctc tgtatgtgat gcacaatggc cggctttcaa cgagcaatac 2700 

ctggtagaag atacggttaa ttacaccatt tctttcaatg gtaaagcacg tttcaatatg 2760 

gaatttccgg ctgatgccgc cagcgatgcg attcaagcca ctgtacttgc cgacgaacgc 2 82 0 
tcgttaaaat ggacagaagg caaaacaccg aagaaagtta tcgtagtccc gaagaagatt 
gttaacattg ttatttaa 



1620 
1680 



2040 
2100 



2880 
2898 



<210> 543 
<211> 753 
<212> DNA 
<213> B. fragilis 

<400> 543 

atgagtttct ctttagttac agtaacttac aatagcgcac agacactacg tgatactata 60 

acttctgtat tatcacaaac tcatcaagct atagagtaca taataataga tggtttttcg 12 0 

aaagataaca ctgtggcgat tataaaagag tatgagccat tgtttaatgg gcgcctgaag 180 

tggattagtg aaaaggacaa tggcttgtat gatgcgatga ataaaggttt tcaaatggca 

acaggagatg tgattgggat tattaattct gacgatttaa tatctgatcc taatgcaatt 

gaaaaagtga taaaatgctt tgaatcagat acttctattg atgctgttta tgctgattta 3 60 

tattatgttg ctcagaacga tatatctaaa atagttcggt attggaaatc agggggacaa 42 0 



240 
300 



215 



cgtcctttct 
tatcagagat 
cttcgtttga 
atgcgattag 
tgtataaatg 
ttattaccca 



gtaaagggtg 
atgggttgtt 
ttgataaaga 
ggggaactac 
cttttaaaaa 
aaatcaggca 



gcatccggct 
cgatctggat 
gcatattaaa 
aagtaagaat 
gaacggaata 
atattttcaa 



catcctacat 
tttaagttcg 
ttatattatc 
ctatctaata 
aaagtgagta 
taa 



tttatgtgaa 
cagcagattt 
ttcctgaacc 
ttaggaaagg 
tgttatatcc 



gaaggaagta 
tgagttaatg 
tttagtcagg 
aaatcttgaa 
tttatatcgt 



480 
540 
600 
660 
720 
753 



<210> 544 
<211> 636 
<212> DNA 
<213> B.fragilis 



<400> 544 

aataaaagat 

tacgaaatgc 

accatcgatt 

gttccgggca 

ggagctatct 

gcgccgaaac 

ttcggcagct 

tcgggatggg 

aacggaagca 

catgcttact 

atcatcgact 



ttgaatttat 
ctaaacttcc 
accattatgg 
ccgaatatga 
tcaataatgc 
cggcaaagaa 
ttgaaaactt 
catggctgtc 
atccggtacg 
acctcgacta 
gggatgtcgt 



gaatacatta 
gtacgcaaac 
taaacatctt 
aggaaaaaca 
cggacaggtg 
cgaaccggca 
caagaaagag 
cgttgacaaa 
cgcgggactg 
tcagaaccgt 
agaaaaacgg 



ttaatgtctt 
aatgcgctgg 
caaacatatg 
gtagaagcca 
ctgaaccata 
ggcaagttgg 
ttcaacgcag 
gacggaaagc 
aaaccgttac 
cgtgccgacc 
ctgtaa 



taatatttac 
aacctgtaat 
taaacaatct 
tcgtagcctc 
ctctgtactt 
gagaagccat 
cttctgtagg 
tgcacatcac 
tgggatttga 
acgtaaacaa 



gaccatgact 
cagtcagcaa 
caatagcctg 
ggctcccgac 
cctgcaattt 
caaacgcgac 
attgttcggt 
caaagagccc 
cgtatgggaa 
actgtgggag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

636 



<210> 545 
<211> 381 
<212> DNA 
<213> B.fragilis 



<400> 545 

agatttgaat 

ttctttcttc 

atcagggtcg 

gtatcgaatg 

ttcggcaaag 

tttcagggtt 

ctgttcgtca 



cgggcgatgg 
aactcgcgca 
cagttgtctt 
cagggaatgc 
gggtgggcgg 
ctccgggtca 
cctttggttg 



actggacacc 
acagatcccg 
cgaaaatcgg 
cgagatcatg 
catctatcaa 
tggcgccgct 



tgtcaggaaa 
attgtcatat 
gttgaggtaa 
cgcaacatcg 
aaggcgaatg 
caggcgtatg 



agaaatctgt 
tcaaatttaa 
gagcggtcgt 
gttactgtat 
tcgttttggc 
ccatgatgca 



gaccctcttt 
aaggcaaatc 
aaatgcgttc 
catgcagctt 
gacagaaagc 
ggtagatatc 



60 

120 

180 

240 

300 

360 

381 



<210> 546 
<211> 852 
<212> DNA 
<213> B.fragilis 



<400> 546 

tcggtacgat 

aatctgtcaa 

gaagccacta 

catatcatga 

aaacaatata 

gtggaccttt 

acgatggttg 

tatctcggga 

ggtgataccc 

cccggtggta 

caggagaagc 

gaagtgttcg 

tctatagcca 

gatactctga 



ccgtgacttt 
gcataatgaa 
acctgatccc 
aaacctacag 
tcatcaagga 
ccaacaatac 
ataccgggct 
acgaacgttt 
tgaaggctca 
agttcggcgc 
ccgacgggga 
attatatccc 
aggcgggccg 
gagacaatac 



gtctcttccc 
agccgtcatt 
caaacctatg 
ccattacggc 
gtatttcgcc 
gaccaccatt 
gaacacccaa 
cctgctgacc 
cgagtcttcg 
cctgcagctc 
ccgtaactgg 
tgagggtgac 
gatgcatgct 
agaattgaat 



gtaagtaatt 
ttagccggtg 
gtggagatcg 
atcaacgatt 
aactatttcc 
ctggacaacc 
acgggcgggc 
tatggtgacg 
ggctgcctcc 
gatctcgata 
atcaatgcgg 
tccaccatct 
ttccgtcata 
gaaatgtggg 



tccggttatt 
gcttcggcac 
gtggtaaacc 
tcgtgatctg 
gtcataacag 
attccgagaa 
gtatccggcg 
gtgtcaccga 
tttcccttac 
cggacaaggt 
gctattttgt 
ttgagcggca 
cgggtttctg 
atcagggagt 



aaccttcaat 
ccgcctgagc 
catcctctgg 
ctgcggttat 
cgatatgacc 
ctggaaagtc 
tgtacagaaa 
cctgaacatc 
ggcctacaaa 
cctctctttc 
gtgtgaaccc 
acccctcgag 
gaaaccgatg 
cgctccctgg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 



216 



aaagtgtggt aa 



852 



<210> 547 
<211> 1125 
<212> DNA 
<213> B. fragilis 



<400> 547 

aatgtgggat 

tttgataatt 

agctggctct 

cct tttacgg 

ctt cgtgccg 

gagattgttt 

gaaacctacg 

gatagcgtga 

atctggggct 

ggagccgctg 

gataaacacg 

tgggctttag 

gatatccgaa 

tatatgctgc 

ttcggtcccc 

gagtatgggc 

cttttgatgt 

atcgggcaga 

tatgatgttt 



cagggagtcg 
tttatcgggg 
ccatctggtt 
ctcgagacaa 
atatccgcga 
ttcatcttgc 
aaaccaatgt 
aggtaggtgt 
atcgtgaaaa 
agattgctat 
gaaaatccat 
accgtatcat 
gtccgaaagc 
tcgcacaaaa 
attccgagtc 
ggggtgaact 
tggatatctc 
cggtcggatt 
gtgttgacca 



ctccctggaa 
caaacgtgtc 
gcatgaattg 
tttcgtactt 
tggtgagcgt 
tgcccaacct 
aatgggaaca 
gatgattacc 
cgagcctatg 
tgcttcatgg 
cgccagtgta 
tccggactgc 
tatccgtccc 
gatgtgggac 
catctcgaca 
gcgtgacctt 
caaggctcgg 
gacggtggac 
gataaaagat 



agtgtggtaa 
cttgtcaccg 
ggggccgagg 
tccggtatcg 
ataaaggcta 
ctggttcgct 
atccatgttc 
acagataaat 
ggcggttatg 
cgtcgttctt 
agagctggta 
atcaaggctt 
tggcagcatg 
gcccctactg 
gtttgggatg 
tctactccgg 
ttctgtctgg 
tggtataaga 
tatttattga 



gccgtatggg 
gtcatacggg 
tgattggtgt 
gcgagaaaat 
tctttcagga 
tgagttatga 
ttgaggcagt 
gttacgagaa 
acccttattc 
tcttcaaccc 
acgttatcgg 
tggaatctgg 
tgcttgaacc 
actattgtga 
tggccacccg 
atgcattgca 
gctgggagcc 
gataccggga 
aatga 



aattgatata 
ttttaaaggt 
ggctcaagac 
taaggccgac 
atatcaacct 
catccctgtt 
ccgttctacg 
taaagagcaa 
cagtagcaag 
cgagcaatac 
tggtggagac 
agcggctatc 
gttgagcggt 
gggctggaac 
ggttgtgtcc 
tgaagcccgg 
taggatgaat 
agaagaggta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1125 



<210> 548 
<211> 186 
<212> DNA 
<213> B. fragilis 



<400> 548 

ttaatgatgt cagttatttg gctctttaat agtcctgagg aatcagattg ttcattagga 
gtaataatag ggatattatt atcccaagca ttgtcttcat tgccacagga cataaaaact 
aaaaaaaata gactgagaaa gagtggtaca aaatacttca taatgtttag gttaggttat 
aaatga 



60 
120 
180 
186 



<210> 549 
<211> 1434 
<212> DNA 
<213> B. fragilis 



<400> 549 

accaagaggt 

tttgcaactc 

tcttccctga 

cgtgaggcga 

tttacacttt 

cgtcccgatg 

gggatgcagg 

cagcgcatcg 

agcggaacac 

aatacggtag 

acgggtaaag 

tttgtaaaag 

cataaaaaag 

aacaaaggaa 



cttacctttt 
aaaaattgat 
gttttatgct 
tggaagtagt 
ccacttccgg 
tcccggtcat 
ccggtgcatt 
agacggcact 
tcaaccggag 
cacgtattgc 
aactgatagc 
tcaatctggg 
gtgcgtttac 
ctattttttt 



aaaccaacgt 
gcctatgctt 
gaaacgcgcc 
tcgttcggaa 
tgaagaagga 
tctgatgact 
cgactttatc 
cgaactgact 
ccatatcata 
tcctaccaat 
cgaagccatc 
aggaatttcg 
agatgccact 
ggatgaaatc 



acctttggtt 
ttaattatag 
ggttatcagg 
gctccttctc 
ttgacgcttt 
gcctggggca 
acgaaaccct 
gccactccca 
ggcaaaagcc 
gctcccgtat 
catatcaata 
cagagccttt 
gccgaccgta 
ggggatctcg 



ttaaaccaaa 
atgatgatag 
taattgcagt 
tgatcctgat 
taaaacaagt 
gtatacagtt 
ggaataacgc 
aagacactcc 
gtgggttgat 
tgatcacggg 
gccaacgtgt 
tcgaaagtga 
tggggcgttt 
atccgtcgtg 



aaagagtagt 
cggggtccgc 
gaccggcccg 
ggatatgaat 
aaaggttttt 
ggctgtacaa 
tgctctgttg 
gcaagagcaa 
ggaggtattg 
tgaaagcgga 
ccgccagcct 
aatgttcggt 
tgaaatggcg 
ccaggtaaag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 



217 



ctgctgcggg 
acggatatcc 
acttttcgtg 
agagaacgca 
attaacaatc 
ccttttccgg 
gggaaggaag 
gtggcaactt 
cttcaggcac 
agtcgtgcag 



tattgcagga 

gggtggtttc 

aagacctgtt 
gagaagatat 
tgccccgtac 
gcaatatccg 
tgttggatgc 
cttcttcatt 
tggaacgcta 
cattgtatcg 



ccagaccttt 
ggctacgaat 
ctatcgtatc 
accattgctt 
agaattctcg 
tgaattgaag 
aatcgatttc 
tgcgggaatg 
taaaggaaat 
ccgtttggag 



gaggtgttgg 
gccgacctga 
aacctgataa 
gccagacact 
tcggatgcgt 
aacctggtag 
gagaaccaat 
accttggatg 
ctcagccagg 
aaatatgata 



gagacagtcg 
gtaagatggt 
ccgtaaaact 
ttgccgaccg 
tgaacttcct 
aacgtacgat 
accaacgtca 
aaattgaaaa 
tggctaccgc 
tcggtgataa 



cccgcgtaag 
gagcgaacac 
gcctgcactg 
tcaggcggag 
gagccgttta 
tttggtcagc 
tgacgaaagt 
gcaaacgatt 
attgggcatc 
gtaa 



900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1434 



<210> 550 
<211> 324 
<212> DNA 
<213> B. fragilis 



<400> 550 

tgcgcggatc 

caactcaaag 

gatcacgcca 

aaagtagatg 

gatcatgccc 

tatgcaaaag 



cttctcagcc 
ataacaaaac 
ataacgatat 
tacgtatgca 
aatttatgct 
acaaagacga 



cctggtgaag 
gaaacaggtt 
agccgtcgac 
ggggttgggc 
aaagcatgaa 
atga 



acttatccta ctttcgacac catgcttcag 
acgctggttc cgttcatgtt tgtggcaggt 
tggaaagaag cccttgaaaa agaaggtttg 
gaaatacccg ccatccaaca actgtttatc 
atggtggata taatgaagaa aaaagccaaa 



60 

120 

180 

240 

300 

324 



<210> 551 
<211> 1503 
<212> DNA 
<213> B. fragilis 



<400> 551 

cttcatgctt 

cacagggtac 

gtcatgatgt 

cctctgtctc 

aaagttaagg 

gacaaaggtg 

agccgcatcc 

ttgaaccgac 

aaatttcact 

gtcttgcagt 

tgggtttatt 

ttatccggta 

cgtcgatcac 

cattatgtaa 

atgtcactgg 

cgggaaataa 

ctaaccgaat 

tacatagtaa 

ccgttgaatt 

atccatctga 

atgtatcgcg 

agttgttatt 

tggaaatact 

cctacacttc 

agcggagttg 

taa 



tccgcatctt 
tgggcacact 
accacggatt 
cttctctgcc 
gaatccgttt 
aacacaatct 
accgggtagc 
tggatcaatg 
ttgccgacac 
tcactacccg 
ttacttggct 
tcggatgcct 
gaaagcaaaa 
cgggaattgt 
cggaagtgcc 
agaagggagc 
atccggacgt 
aacggagtga 
tggacaaaaa 
aggtcgaatt 
accgcagttt 
acattcatcc 
ggatgtatac 
gtaaaagtgt 
tactcggagt 



atactttata 
gctcagtatc 
tccacgtgca 
ttctgtcagt 
agaccgatac 
tccggccgac 
ctctctatgg 
gatacccttc 
agaaaagcac 
taacgaacgt 
ccgccaagat 
gatgactatt 
aggaaaattc 
ttttggactt 
ggcatggatc 
acccaaaccg 
acgccaggtg 
aggagatctg 
acaggttact 
gattgataaa 
attaccggta 
ggaaacagcc 
cgctttgcac 
actgtgggta 
gagatatatc 



atgatgatca 
ctgtttttag 
tcacaagctg 
gaaattactt 
ctcggacaaa 
tccgttcagg 
tgtaacgcac 
ggcggtctga 
cagctataca 
ttctgggcat 
gccgctttat 
gccggattgt 
tcaccctatc 
tttgttctga 
agcaaaccgg 
gttcagtatc 
gaatggagta 
tacatcgatg 
gacgcagtaa 
ttcgaaactt 
tggaaaatca 
acagtccgat 
agactacgca 
ttgctgcttg 
gaacgaaaat 



cgaaattcat 
tttggttcct 
aaaaattaga 
ctcgcctgcc 
ccatattcca 
cacttccggt 
cgatagacag 
aaagagagtt 
ttggttcgca 
ggctgggagc 
ggagcattac 
gggtaggtat 
gtaaaaagtg 
cattctgttt 
tattggacag 
tattggatta 
atttccgttc 
cctccgactc 
ggacaatcca 
actaccgcga 
cagtagacga 
acgtcaatag 
tacaaggact 
gcggcacggt 
gcagaaagaa 



gtataccatt 
atcggctttt 
aaagctggag 
cgaaggagaa 
tatccgtacg 
tatagacgga 
aatcgatact 
tcccatctat 
aagcggagaa 
cattcctcat 
ggtaatctgg 
ggatgtatgg 
gtatcactgg 
cagcggtatg 
gaaccccaca 
tcggcaaata 
aaaaccctac 
tctgccccat 
tggggacagc 
catgagtagg 
tcccgaccac 
cactgcccgt 
gaactcctct 
ttgttcatta 
aacaagaaga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1503 



<210> 552 
<211> 519 



218 



60 



240 
300 



<212> DNA 
<213> B.fragilis 

<400> 552 

ctaaacacta cacaaacgat gaataagaaa ctgttgaaac aaatagtaaa cgaacggcgc 

tctaactcct ggctgttcat agaactgctg ctggtaagca ttgtgctttg gtatgtagtc 120 

gattatatgt ttgtcaccct ttatacctat tttgaacctc gcgggttcga tattgagaat 180 

acctaccggg tggagttcga ttacctgata gagaaaagtc ccgactacat agccaaccgt 

acggatgagg aagcacatgc agatatgcgt gagttattgg atcgccttcg tcgtcgtccc 

ggagtcgagg cggtgagtat gtcgcaaaat tctttcccgt acaacggaag taacagtggg 360 

atggacgtac gcctggatac catggagagc aagtacaaca tccgtcgttg ggtgacgccc 42 0 

gacttcttcc gtgttttccg ttatcaggga gccaaccgga gaaactccgg aacaattagc 480 

tgctctgttg aaggagggta cttttatggt atcgcgtaa 519 

<210> 553 
<211> 1044 
<212> DNA 
<213> B.fragilis 



60 



<400> 553 

aaagtatata cactgctttg caaaaggaga ttgttgatct ttggagtaaa cctaaataag 

atgaaagtat atattaaaaa tatttctaga aaaaaagggg agttccagtt ctatatgcat 12 0 

cagtttgtag aagcttgcat cagacaaaac ataccatttg tgaatgaatt gcatgtgtgt 180 

gtgcggttga aactgagtgc tgtgatgatc aaactgggac attggataaa ttttttcttt 2 40 

tgtcggtgta ataataaagc gattattgtc tctacttggg gtggtggatt aatgtatacg 3 00 

tcatttccgt atagtctatt gtatgaaata attccagtat tttgggatag ttggcctttt 360 

aattgggaag agcagattta ttcgttaaga agattaaatt gtagaacatg ttttgtgact 42 0 

tctagccaag ttgcacaaag gataaaagaa acactaccaa acataaatgt ccattggctt 480 

cctgaaggta tagatatatt ggattatgtg cctggtcagg atttgacaga acgaagcatt 540 

■■■■■■■ 600 

660 
720 
780 
840 
900 
960 



gagatttatg agcttggaag acaaaaggct gattatcaca agattttgtg tgatttgaaa 
tcagaaggta tcttttcgag ttttctttgt aatgaatacg atataaatgg aatgactact 
aaactagcat ttcctactgc aaaagctttg ttgaaggctt tacccaatat aaagatagtg 
atttcttttc ctcaagtgga tacacatccg gaaaaagttg gtaatataga aactctaact 
caacgatatt gggaggcaat gcttagccga aatctgatag tgggtcgggc tccaaatgaa 
ttgatccagt tgataggtta taatcctgta attgatgtgg attgggaaga tcctaaaaaa 
caactttcag atatattact taatataagc tcttttcaaa aattagtcga tagaaactat 
cgaacggcaa gaaaaatatc ttcttgggat aacagggtca aagatattat aacaattctt 102 0 
cggacttctg gctatgaaat atag 1044 

<210> 554 

<211> 1161 

<212> DNA 

<213> B.fragilis 



60 



<400> 554 

agaagattaa ccagaagaat aaaaagaaaa aataaaatgt tttcagatga attagaaaag 

atttcctggg aagagacgac taaagccatc tattccaaaa ctgacgctga tgtgcgccgc 120 

gcattgtcga aagaacactg cgatgtaaat gattttatgg cattgatttc gccggctgcc 180 

gctccatatc tggagacgat ggcacgtctc agccggaagt atacgatgga acgcttcgga 240 

aaaacaatct cgatgttcgt gcctctctat attacaaatt cttgtacaaa ctcgtgtgta 300 

tactgcggct ttaaccacaa caacccgatg aagcgtacca tccttacgga agaagagatg 360 

gtgaacgagt acaaggcgat caaaaagctg gccccctttg agaatctgtt gctggtgaca 42 0 

ggcgagaatc ctgccaaagc cggagtggac tacatcgaac gtgccctctt gctggcaaag 48 0 

ccctactttg ctaaccttca gattgaagta atgccactta aagcagaaga atatgaacga 540 

cttacacatg caggtctgaa cggggtcatc tgctttcagg agacgtataa caaagccaat 600 

tacaacatct accacccccg cggcatgaag tctaaattcg aatggagggt caacggattc 660 

gaccgcatgg gacaggccgg agtacacaag ataggaatgg gcgtactgat cggactggag 72 0 

gaatggagaa cggatatcac catgatggcc tatcatctcc gctacttgca gaagcattat 780 

tggaaaacga aatatagtgt caacttcccc cgcatgcgcc cgtcggaaaa cggaggcttc 840 



219 



cagcccaatg tggtgatgaa cgaccgtgag ttggcacaag tgacttttgc gatgcgcatc 
ttcgaccatg atgtagacat ctcctactct acccgcgaaa gcgcagcctt ccgtaaccac 



900 
960 

atggctacgc tcggagtgac caccatgagt gcagaaagca aaacggaacc gggaggatac 102 0 
tttacctatc cgcaagcact ggaacagttt cacgtaagcg acgagcggaa agccgtggag 108 0 
gtggatgcag cactacggtc gctgggacgg ataccggtat acaaagactg ggacacggcg 1140 
ctgacgctac cccaatgctg a 1161 

<210> 555 
<211> 1668 
<212> DNA 
<213> B.fragilis 

<400> 555 

ttaaacataa ctcttgatat aatgatgaaa agtaatgaaa acaacggagc agtaactaaa 60 
agttttgcta aaaagatgga gagcatcagt cctttcgaat tgaagaacaa actgattgaa 120 
atggctgacg agagcatcaa gaagatagcc cacaccatgc tgaatgccgg acgtggaaat 
ccgaactgga ttgctaccac tccgcgcgaa gcgttcttcc ttttaggtaa attcggactg 
gaagagtgta ggcgtgtgat gtacctgccg gaaggaatag ccggtattcc gcaaaaagac 
ggaattgccg cccgctttga gactttcctc aagaccaacc acagccagcc gggggcagag 
ctgttgaaag ggacgtatca atacatgttg ctggaacatg ccgccgaccc ggataccctt 42 0 

480 
540 
600 
660 



180 
240 
300 
360 



gtccacgaat gggcggaagg agtggtaggc gatcagtatc cggtgccgga ccgcattctg 
caatttaccg aaatgattgt gcaagactat ctggcacagg agatgtgcga ccgtcgtccg 
ccgaaaggca aatacgattt gtttgccacc gaaggcggaa cagcagccat gtgctacgtt 
ttcgactctc tgcaagaaaa cttcctgctc aataaagggg atggaatcgc cttgatggta 
cctgtcttca ctccttatat tgaaattcct caattgagac gctatgaatt taacgttacg 72 0 

780 
840 
900 
960 
1020 
1080 



gaaatatctg cggatcagat gacgacagac ggattgcaca cctggcaata caaagacgaa 
gatatagacc gcctgaggaa cccgcagatc aaggcactct tcattaccaa tcccagtaac 
ccgcccagtt atacactgaa tcccgagact gccgcacgga ttgtagatat cgtgaaaaaa 
gacaatccga acctgatgat tattacagat gacgtatacg gaacattcag tccgcatttc 
cgctcactga tggccgaatt accacaaaac actttgtgtg tctactcttt ctccaaatat 
ttcggagcca cgggatggag ggatgccgtg atcgctctgc acgaagagaa tatcttcgac 

cggatgatag cccacctgcc ggaagagcag aagacaattc tcaataagcg ttactccagt 1140 

ctgactctta cacccgagaa actgaaattc atcgaccgca tggtggctga cagccgccag 12 00 

gtagctctga accacaccgc cggattatcg ctgccacaac agacgcaaat gagcctgttt 12 60 

gcttctttcg ccattctgga taaggaaaat cggtataaaa acaaaatgca ggagattatc 1320 

cggcgtcgct tgaaagccct gtgggataac accggattct cactcgtaga cgatccgctg 13 80 

cgtgtaggtt actacagcga aatagatatg ctggtatggg ccaagatatt ctatggagaa 1440 

gaatttgtca gttatctgaa gaaaacttac agcccgctgg atgttgtttt ccgcctggcc 1500 

aacgaaacct cactggtatt gcttaacgga ggaggttttg ccggaccgga atggagcgta 1560 

cgtgtatcac tggctaacct gaatgaaaag gattatgtga aaataggtca gggaatcaaa 162 0 

cggatactgg atgaatatgc cgtgaaatgg caagaatcac ggaaatag 1668 

<210> 556 

<211> 1788 

<212> DNA 

<213> B.fragilis 

<400> 556 

acgaataaaa caataacctc gcctccggcc cctctcccgt cgatcactct tcgggaaaag 60 

aatacggggc tgagagatcc taataacgaa tgtatggaac aaagaataaa atttccccgc 12 0 

tctgagaagg tatatctgtc cggcaagtta ttccccgaaa tccgtgtagg tatgcgaaaa 

gtagagcaag tgcccagcac aactttcgaa ggagaaaaga aagtgatcac tcccaatccg 

catgtgtaca tctacgatac cagcggtcct ttcagtgacc ccgacataga aatcgacctg 

aaaaaaggcc tcccgcgcct gcgtgaagaa tggatactga acagaggaga cgtggaacaa 

ttgcccgaga tcagttcgga atacggacgc atgcggcggg atgacgggag cctcgaccac 42 0 

ctccgttttg aacatatcgc actgccctac cgggccaagg ccggccggca tatcacccag 480 
atggcgtatg ccaaacaggg cattgtcact cccgaaatgg aatatgtggc tatccgtgag 540 

aatatgaact gcgaagaact gggcatcgag acccatatca cacccgaatt cgtacgtcag 600 

gaaatagccg aaggacgggc ggtgctgcct gccaacatca accatcccga agccgaacct 660 



180 
240 
300 
360 



220 



atgattatag gccgcaactt cctggtgaaa atcaatacca acatcggcaa ctccgccact 72 0 

acctcgagca tagacgaaga ggtggagaaa gcaatgtgga gctgtaaatg gggaggagac 7 80 

acattgatgg atctttcgac cggagagaac atacacgaaa cgcgggaatg gatcatccgc 840 

aactgtcccg ttccggtggg gaccgtacct atctaccagg ctctggaaaa ggtaaacgga 900 

aaggtagagg acctgacctg ggaactgtat cgcgacacac tgatcgagca gtgtgagcag 960 

ggagtggact acttcaccat ccatgcgggc atccgccggc ataatgtgca cctggcggaa 102 0 

aaacgcctct gcggcatcgt atcccgcggc ggaagtatca tgagcaaatg gtgcctggtg 1080 

cacgaccggg aaagcttcct ttacgaacac ttcgatgaca tctgcgacat cctggcacaa 1140 

tacgatgtcg cagtgtcgct cggcgacggc ctacggcccg gatcgaccca cgacgccaat 1200 

gatgaagcgc aatttgccga gctcgacaca atgggcgaac tggtggtgcg cgcctgggag 12 60 

aaaaacgtac aggcatttat cgaaggaccg ggacatgtgc cgatgcacaa gatacgcgaa 132 0 

aacatggaac gccagattga aaaatgccac aatgccccgt tctatacgct cggcccgctg 1380 

gtgacggaca tcgctccggg atacgaccac atcacttcgg ctatcggagc ggcacaaata 1440 

ggatggctgg gaacagccat gctatgctat gtgaccccta aagagcacct cgccctgccc 1500 

gataaagaag atgtacgcgt gggagtaatc acttataaaa tagccgccca tgcggccgat 1560 

ctggccaaag gacacccggg ggcacaggta cgcgacaacg cactgagcaa agcccggtac 162 0 

gaattccggt ggaaagacca gttcgacctg tcgctcgatc cggaacgtgc attctcttac 1680 

ttccatgccg gacggcatac cgacggagag tattgcacca tgtgcggacc gaatttctgc 1740 

gcgatgcgac tgagccgcga tctgaagaaa actcaaaaac aaaaatag 17 88 



<210> 557 
<211> 774 
<212> DNA 
<213> B.fragilis 



<400> 557 

ccgataaaaa caaagaagca tacagccatg gaaataacac ttaaaaatca gttcattact 

ttgtggaata cttattttcc acaagccgga cttccgataa cattccaata ctcggcagat 12 0 

acacaaaatc tcccgatagt ggaagctccg aaaggacatc ggtgcatcat tgcacagttg 180 

acccaggtac agcgtggaaa aactctctgc atgcaggcgg attctgtggg atgccgaggt 240 

ggaaaacggt acacaaactt cacggacaag atgtttcccg gattcgaatg tttcctttca 3 00 

cacaatgaac agggcgaagg agaacgatac aagcagactc cagagctggc agctgccgct 3 60 

ctggcacagt tgcctgcact tcctgtcaag ggagaaaacc tgatcttcaa acgttgggat 42 0 

aagctggaag cggaagacat gccggaagtt gttatctttt ttgtatctgc cgacatcctc 480 

tccggtctgt tcacattggc ttgttttgac aatgtagctc ctgatgcagt gatcgctccc 540 

tttggtgcag gctgtgcttc tattatctat catccatacc gggaacaact ggacagaacc 600 

aatcgggcgg tattgggatc attcgaccct tctgcacgca aatgtatgaa acccgatctc 660 

ttgtcttttg ccattccgtt taacaagttc aagagtatgg tgtcacaaat ggaagaaagc 72 0 

ttcctaaaga cagcaacgtg ggatgtaatc aaaaagagaa taggctcgtc ataa 774 



60 



<210> 558 
<211> 468 
<212> DNA 
<213> B.fragilis 



<400> 558 

caagcgagaa ccttgttcgg ttcactttta acaactatct ttgcccaata ttcacacatt 60 

aaaaagatta agcaaatgaa aaaaattatt ctcggagcat gcgctgttct tttcacgctt 12 0 

gcttcttgcc aacaggccaa acaaaaagtt ttcgaactgg ctgccgaaca agtaaacaaa 180 

caatgcccca tcactgtcga tgaaatgaca agaatggaca gcaccactta ttcaggtaag 240 

gacaatacat ttacctattt ctatacctta agcggccagg ctgacgatcc taccatgtca 3 00 

gaacaactga agaaatcatt ggaagaaacc ctgccggaaa caataaagaa cactgaagag 3 60 

atgaaagtgt acagagaatc ggatgtgacc attaaataca tctatctgtc aggcaaaaca 42 0 

aaggaagagc tgattcaagt aacagttact cccgatatgt ataaataa 468 



<210> 559 
<211> 1227 
<212> DNA 
<213> B.fragilis 



221 



<400> 559 

atggaactga ctctcctctt gattattgca gccttactgg ttgccctgct tgtattgaca 60 

cttacccgca acaatcgcgc acaaagcgaa gagatgcaac gggcattgcg ccaacaaatg 12 0 

caggaaaacc gggaagagtt gaatcgcagt attcgcgagt tacgcatgga aatgacgcaa 180 

accctgaatc agggtttgca acagctgcaa gatgccatgc ataagaacat gatgaccacc 2 40 

ggagaactgc aacgccaaaa gttcgacgca atggcacgcc agcaggaaac gctgatacag 3 00 

tccaccgaga agcgtctgga cgacatgcgc gtgatggttg aagagaaatt acaaaagact 3 60 

ctcaacgaac gcatcggaca atctttcgag atagtccgtt cgcaacttga aaatgtgcaa 42 0 

aagggcctgg gcgaaatgaa gtcgctcgca caagacgtag gcggtctcaa gaaggttctg 480 

agtaacgtga aaatgcgcgg aacgttcggt gaggtccagc taggcgcact tctggaacag 540 

atgatgagtc cggaacagta tgaagcgaat gtcaagacca agaaaagcgg aaccgaattt 600 

gtggagttcg ccatcaaact tccgggaaaa gatgatgcca acagcactgt ttatctgcca 660 

atcgacgcca aattccccaa agatgtttac gaacaatact acgatgcttt cgaagccgga 72 0 

gatgccgcat tgatggaatc gtgcggacgc caactggaga caaccatcaa aaaaatggcg 7 80 

aaggatatcc acgacaagta tgtcgatcct ccgtttacaa cggacttcgc tatcttattt 840 

ctccccttcg aaagcatcta tgcagaagtg atccgccgga caagcttagt tgaaacgcta 900 

caaaaggatt acaagattgt agtaaccgga ccgactactt tgggagctat cctgaacagt 960 

ttgcaaatgg gattccggac actcgccata cagaaacgca caggcgaggt atggaccgta 1020 

ctgggagctg taaaaaccga attcggaaaa ttcggaggac tgcttgagaa ggtccagaag 1080 

aatctgcaaa gcgcaggtga ccagttggaa gaagtgatgg gaaaacgtac gcgcgccatc 1140 

gaacgcaaac tccgtcaggt cgaagaactc ccccacgagg aaagccggag aatattaccg 1200 

atagacgatg gcggagaaga tgactga 1227 

<210> 560 
<211> 423 
<212> DNA 
<213> B. fragilis 

<400> 560 

tgtaatgtcg caaaaaatgc gatgatttta ggcgcaaaac gtctggtggt tacgatttac 60 

atccagtatc acttatgcct aaaatatgaa tttgctttgg tgagagtgaa agaacttctg 12 0 

ccattggttg atgataatat ccctgcgaat gataaggatg cagtggaact ctctgttatg 180 

tccgacatcg ttattgcata tgggaaagaa cattatccga tagaaaaacc aactgttgca 240 

gaattaatag aactttatct tgaagaaaaa ggaatgagcc aaaaacaact tgccattgag 300 

attggaataa gtctttcacg ggtgaatgat tatattgcag gacgttcaga acctactttg 360 

aaaatagccc gtttgctttg tcggatattg aatattcctc ccgttgcaat gttgggtttt 42 0 

taa 423 

<210> 561 
<211> 756 
<212> DNA 
<213> B. fragilis 

<400> 561 

ggcaaatatt ttaaaccaat gggaagagcg ttcgaatata gaaaagctac caagctgaaa 60 

agatggggca acatggcccg tacatttacg agaatcggta aacaaattgc tatcgctgta 12 0 

aaagccggtg gtcctgatcc cgaaaacaac ccgcatctgc gtgcagttgt cgctactgca 180 

aaacgtgaga acatgccgaa ggataacgtg gaacgcgcta tcaagaatgc catgggtaaa 2 40 

gaccagaagg actataagga aatgaattat gaaggttatg gtcctttcgg tattgcggta 300 

tttgtagaaa cggctacaga taacacaacc cgtactgttg ccaatgttcg tagcgttttc 360 

aataagtttg gcggaacact gggtacttca ggcagtcttg attttatgtt cagctggaag 42 0 

tcaatgttca ccattacaaa gaaagaaggc gtggatatgg acgatctgat tctggaactg 480 

atcgattacg gggtagagga agagtatgat gaagacgaag atgaaatcac gctttacggt 540 

gatccgaagt cgtttgccca gattcagaaa tatcttgaag agaatggctt cgaggtgaaa 600 

ggtgctgagt ttacccgtat tccgaatgac gaaaaagatc tgacaccgga acaacgtgcc 660 

accattgata agatggtaga acgcctggaa gaagacgagg atgtacagaa tgtgtacact 72 0 

aacatgaagc ctgcagataa cgaaggcgaa gagtaa 7 56 



222 



<210> 562 
<211> 2373 
<212> DNA 
<213> B.fragilis 

<400> 562 

cagataaatt ttatgcccga ctatatcgaa gaacttaatg aaagccagcg tgcggcggtg 



60 



840 
900 
960 



ctctacggtg atggcccttc gctggtcatc gccggtgccg gttccggaaa gacgcgtgtg 12 0 

ctcacttata agatagccta tctgctcgag aacggttaca atccctggaa tatcctggca 180 

ctgactttca ccaacaaagc tgcccgtgaa atgaaggagc gtattgcccg gcaggtgggc 2 40 

gagcagcgtg cacgattcct ttggatgggt acgttccatt cggttttttc ccgtattctt 3 00 

cgtgccgagg cgtcccatat cggctttacg tcgcagttca ccatctacga ttcggcggac 3 60 

agcaagagcc tgattcgttc catcatcaaa gagatggggc ttgacgagaa gacctataag 42 0 

cccggcagtg tgcaggcacg catctccaat gcgaagaacc acctggtgtc tccttcggga 480 

tacgcagcca acaaggaggc gtacgagggc gatcttgccg caaagatgcc tgccatacgg 540 

gatatctaca gccgctactg ggagcgttgc cggcaggccg gagcaatgga tttcgacgat 600 

ctgctggtct atacctatat ccttttccgc gactttcccg acgtgctggc acgctatcgc 660 

gagcagttcc gctatgtgct tgtcgacgag tatcaggaca ccaactatgc acagcacagc 72 0 

atcgtgctgc aactgacaaa ggagaatcag cgtgtatgcg tggtgggcga cgacgcgcag 780 
agcatctact ccttcagggg agcggacatt gacaatattt tgtatttcac caagatatat 
cccgatacca aagtcttcaa gctggagcag aactaccgtt ccacccagac cattgtccgt 
gcggccaaca gcctgatcga aaagaacgag cggcagatcc ccaaagaggt gttctccgag 

aaggaacggg gtgaggccat cggggtcttt caggcttaca gtgatgtgga agaaggcgac 102 0 

attgtgacca ataaaatagc gcaactgcgt cgcgagcacg attatgaata ctccgacttc 1080 

gccatccttt atcgtaccaa tgcccagagc cgtgtcttcg aagaggcttt gcgcaaacgg 1140 

ggcatgcctt ataagattta cggcggcctc tctttctatc agcgcaagga gatcaaagat 12 00 

atcatagcct acttccgcct ggtggtcaac cccaatgacg aagaggcgtt caagcggatt 12 60 

atcaattatc cggcacgcgg catcggcgat accacggtgg gcaagattat tactgccgcc 13 2 0 

accgataaca atgtcagcct ctggaccgca ctctgcgaac ccattacgta cgggctttcc 13 80 

atcaataaag gtacacatac caaattgcag gattttcgtg cgctgatcga gcagtttatg 1440 

gcagatgtga ccgtaaagaa tgcttatgaa ataggtacgg aaatcatccg tcagtccggc 15 00 

atcatcaatg aagtctgcca ggacaattcg cccgaaaatc tcagccggaa agaaaacatc 1560 

gaggaactgg tgaacggtat gaatgatttt tgtgccatgc gtcaggaaga ggggaacacg 162 0 

aacgtttctc tgatcgactt tctctccgaa gtatccctgc tcaccgatca ggattccgac 1680 

aaggagggag acggcgagaa ggtgactctg atgacggtac attccgccaa aggactggag 1740 

ttccgcaacg tattcgtggt ggggatggaa gagaatcttt tccccagcgg gatggcgggc 1800 

gattcacccc gtgcgatgga agaggagcga cgcttgttct atgtagccat cacccgtgcc 1860 

gaagagcact gtttcctctc gtttgccaaa acccgtttcc gttacggtaa gatggagttc 192 0 

ggcagcccca gccgtttctt gcgggacatc gacacccgtt tcctgcaact tccgcaggag 1980 

gccgctttag gccggagcgt cgacgaaggg gccggccgct tccgccgcga gatggaagag 2040 

gggtattcgc gccgttcgtc ttccgaacgc ttctctgccc gtccgtcggc cgaccgtccg 2100 

gaacgcgaac ggccgaaggc gcagatcatc gcgccgacgg tcccccgtaa cttgaaaaag 2160 

gtaagtggga ctacgctctc cccatcgtca gcttccggag ccggcgtcgc cggcgtacag 2220 

cccggacaga ccatcgagca cgaacgcttc ggcctgggtg aggtgatccg cgtagaaggt 22 80 

acgggcgaca atgccaaagc taccattcat ttccgtaatg caggcgataa acagctgttg 2340 

ctgcgtttcg ccagatttaa agtaatagaa taa 23 7 3 

<210> 563 
<211> 219 
<212> DNA 
<213> B.fragilis 

<400> 563 

cttaaatttc taaagcaaat gttgaaagaa aaagcaggtg aaattgcagg taaaatctgg 60 

aatgcactga atggaacaga aggactgact gccaagcaga ttaagaaagc aactaaattg 12 0 

gtggataaag atttgttcct tggcctcgga tggctgttga gagaagataa gatctctact 180 

caggaaatcg aaggtgaact cttcgttaca ttgaactaa 219 



<210> 564 



223 



<211> 1329 

<212> DNA 

<213> B.fragilis 



<400> 564 

gtaagttatt 

aatctggttt 

accagatatt 

gatgtagtat 

tcttctgtat 

aatattcgct 

gtaggtgtat 

acaaaaagaa 

atcaaaaatg 

gaagttggtg 

tattggtcgc 

gtttcttttt 

ataataacag 

agtataattt 

acttttgatg 

attttagaat 

atacaactaa 

gttaaactat 

aatttatcta 

cttaattgtc 

tgtccggaat 

tttaatgcgt 

aaacgatga 



atttagacca 
tttcttttat 
caaattgggg 
tactttttat 
atcttgtttt 
tctcgaatga 
tttgtttgcc 
tcttttatta 
ttggttacat 
aatctaattc 
ttatcttagc 
tgttgtttgt 
gattgatata 
ttttgtttat 
gaataaataa 
cttatctaat 
aagaagtatt 
tgagtataag 
cttatatcgc 
tatatggaat 
acataatctc 
tcaatacgat 



atacgttttt 
tcttctaagt 
aataaaaaca 
gcttatttct 
tgtaggacta 
gttgctcttg 
tcaaatacac 
tattctatgt 
acctgtcttt 
cgttttacat 
gaaagaggga 
atttgttaat 
tttagaattt 
cagtttattt 
agtgctgaga 
atcatattct 
gaattactct 
tgaaccattg 
agatccatat 
gattgctgtt 
atggggggta 
gttggtatgg 



tatatgatct 
ctattaattg 
gattctattt 
ttttttcgat 
tttgcatata 
ataatattat 
attattactt 
gtactaactt 
gtgataggac 
acatttgttt 
attatacaga 
aattttggtc 
tatacaaaat 
attattatgg 
agaataggaa 
tctgttaatt 
tctaatggta 
gataatgtcg 
cttgattttg 
atgttattcg 
gttgtctttt 
gtaatatata 



tttttaaagt 
tatcagatct 
ggctttttat 
ttaaaagaat 
gtgtattacc 
gtggtgtagc 
tcccggtttt 
tttcttgttt 
aaagtttaga 
tattgactcc 
cacgaattag 
gtacttctct 
taagtgtctc 
gtaatgtgag 
atacccaata 
tctataagat 
gaaattcatt 
cagagtttca 
gatatgcagg 
aaaggtatga 
gtatattgat 
tatgtaataa 



aaaaactcgt 
attattattt 
tgcaatcata 
agtaaaccca 
actatcagag 
agcttatttt 
tacaaacagg 
tatttatgag 
tatttatgga 
tatattattc 
aaattgtatt 
tttgatgttt 
aaagtttatt 
gtctggaagt 
tgaaacatct 
gaatgatgta 
aaagccaata 
aacacaacaa 
ggttgttgtt 
gaaaaagaat 
ggggtgtttt 
aatattgcta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1329 



<210> 565 
<211> 1356 
<212> DNA 
<213> B. fragilis 



<400> 565 

acaatacatt 

gaatactaca 

aactacggcg 

gatttctggc 

ttgggtgtta 

atggccctga 

acggtagcct 

gtt tttgtgg 

ttatctccca 

caggctgtca 

gcccttggct 

atcggcacaa 

tataccgatg 

tgctggtgca 

gacctcccgg 

gtgaccgata 

gaagcccggc 

atccttcccg 

aaggaagatg 

cagacacgca 

tccacgggtg 

cataccctct 

ggtacgatcc 



atatgagtaa 
aagaggttca 
gccgttattt 
tgaccgccgg 
aatattgctc 
cctcgccgca 
gcggctttcc 
atgtcaccat 
aaactaaagc 
aagatttctg 
ctacttatac 
gtagctttta 
atcctctgct 
tcgggggtgt 
taggttacga 
tgcaggccgc 
ggtcgaactt 
aagcgcagaa 
cgggcttcac 
acctgtttgc 
agggataccg 
ggatcggtgt 
gtgactttgt 



gaaagaaaca 
cgggtcctcc 
tgatgaccgc 
tccgtgggcc 
cctgaccaat 
gttgggtgag 
cacaaccgtg 
ccccgagtac 
ggtaatgatt 
tgataaacat 
aattgacggt 
tcctccacac 
gcacaaactg 
tgacaacacg 
ccataaatat 
catcggctgt 
cgcctacctg 
gaactccgac 
ccgtaacgac 
gggcaaccta 
cgtaatcggc 
ctatccgggc 
ctcttcccgt 



ctcaagcaac 
cgctccttcg 
gaactggtga 
cggaagtttg 
tcgggctctt 
cgcagaatca 
accccctgca 
aatatcgacg 
gcccactctt 
aacctctggt 
gtagaaaaga 
cacatgacga 
gtcaattctt 
tgcaaatacc 
gtctattccc 
gcccagcttg 
aaagaaggcc 
ccgagctggt 
ctttcccaac 
ctgaagcatc 
aatctggaag 
atgacccgtg 
aagtaa 



aaattctcga 
aacccggtaa 
acctggttga 
aaatccgttt 
cggccaacct 
ggcggggtga 
tccagtatgg 
tgactcaact 
tgggtaaccc 
tggtagagga 
aaacaggtac 
tgggagaggg 
tccgtgactg 
gtttcagcaa 
atttcgggta 
agaagctgga 
ttgcgggtac 
tcggcttcct 
acctggagag 
ccgcctttga 
gtactgatta 
ccatgctcga 



tttaacccgg 
gagttttgtc 
ctcatccctt 
tgccgaatgg 
tcttgccttt 
tgaagtgatc 
tgcggtccct 
ggaagccgca 
gtttgatttg 
taattgtgat 
gatcggccat 
cggtgctgtc 
gggccgtgac 
acagttcggt 
taacctgaaa 
ctccattgtc 
atctggcctt 
tatttcggtg 
caggaagatc 
tgaaatgcgt 
cgttatgaat 
ccacatgatc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1356 



224 



<210> 566 
<211> 903 
<212> DNA 
<213> B. f ragilis 



<400> 566 

agagaacatt 

atggagcaac 

actgtgattg 

atcattatag 

aaaatatcct 

attgtaaaat 

acccacatat 

ttatatggtg 

ttgagtgatt 

ttgcttaagg 

tatagattat 

gatgcaataa 

atacaggaca 

gtgcttttgt 

cataaaaaga 

tga 



atttgatgta 
gcacttattt 
aggctactat 
atggtggtag 
attgggttag 
cgactggaga 
tggaagattg 
atgtcatatg 
ttgaatcata 
aaaatatatt 
atcataaggg 
ctggaatatc 
accgacataa 
catttataat 
gattgttgag 



tgtaaagaaa 
acctttggtg 
tcttagtatc 
tacagatgga 
cgaaccggat 
atggatccat 
cataagatgc 
caaatttgat 
ttttcctata 
tgatacttct 
gtgtactttt 
ttctactaat 
aataagaaaa 
aaatggattc 
aaataagcgt 



agaaacaacc 
tctgttatta 
attgggcaaa 
acaatagaag 
aaaggtattt 
tttctaaatg 
tttaatgaga 
tttggaaatt 
tctcatccag 
tatcagatat 
gagtacatac 
cctctattgc 
tgtattagaa 
ttactaaatg 
tttaacgaaa 



attttaaaat 
cggtgtgtta 
catattccaa 
tcatcaagaa 
atgacgctat 
caggagatgt 
agaaagtgaa 
tacttttaaa 
ctactcttgt 
ccgctgatta 
ctataccatt 
tttataatga 
ttgctataat 
attatgtgcg 
tagatattaa 



ttataatttg 
taatgctact 
tatcgaatat 
atatgaaaag 
gaataaaggt 
ttatttgaac 
agctgatgtt 
acctggtgct 
gaaaggagaa 
tgaactgtta 
ggttttgttt 
aaatactcgg 
aaaagtacgt 
aaataattat 
taattttatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

903 



<210> 567 
<211> 957 
<212> DNA 
<213> B.fragilis 



<400> 567 

cttatgtact 

attgctgata 

acgctacggg 

cagtttgagt 

gacgacattc 

ttgatgtttt 

attgtatgta 

ggagggtatt 

tttgtggagg 

aattttcgca 

gtgattcttt 

ttgcttgtgg 

gaaaatattg 

atcccgcatg 

tatctgttaa 

atggtgtata 



atctgataat 
aatgcaatat 
gtggaggtat 
atccttggtt 
gctctacctc 
atcaatgggg 
caggcattat 
cttgggttgt 
aggatctgat 
agaaggccaa 
tcctgatagg 
tttatggagt 
gtttgccaca 
tgatggtttc 
ctcccggttg 
tcttatttat 



cttagttctg 
catcgataaa 
tattttctac 
catgctggct 
tcagggattg 
attattcaat 
caatgcttat 
attacttgct 
atacactatg 
atgctttgcc 
taaactaata 
agatagtgtg 
tcggaaacac 
actgatttac 
gggctattgt 
gaagaaatat 



ctattcctgg 
ccgaacgaac 
ttgggtgcat 
ctgactttga 
cgtttagttt 
ctgccttggt 
aactttatgg 
ttagctttta 
ctttgtgctg 
ggggatgtgg 
attcgaacag 
ctgactatta 
ttatatcagt 
atgacatcac 
tatttattgg 
ttccatttgc 



cagaactttt 
ggagttcgca 
tggcttattt 
taacctttat 
ttcattttac 
ggactatcct 
atgggattaa 
tcaatgtgca 
tgttggtatt 
gatcggtcag 
agaatttcag 
tccatcggct 
tgatggcaaa 
aagccataat 
gcacaattgt 
atccggctat 



ttatttccgc 
tacccggatc 
tctgacaaat 
tagttttgtg 
tgcaatggcc 
tgttgccttg 
tggcattacg 
aattgtccgt 
taattttttc 
tattgccttt 
ttggattgtt 
tatgctacat 
tgaactggag 
tattgttggt 
catactaagt 
gaaatga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

957 



<210> 568 
<211> 1488 
<212> DNA 
<213> B.fragilis 



<400> 568 

ataaacgaat 

gcgctgagtg 

acacagtctg 

cgcacgtttc 

agcaagcagt 

aagttaggat 

gtttcattgt 



attatatgaa 
cgcaggaaac 
tagatgcagc 
gcgccgatct 
ataacagtta 
tgaacggtgc 
cctcttcact 



acgatacttt 
gcaggaaatc 
agtagctctg 
tcttccggaa 
tcagaatgaa 
tttgtctatc 
cgactttatg 



ctactttctg 
accttgaatg 
aatgagctga 
gtaaacttca 
gacggctctt 
gaccagaaca 
aaacagttag 



cttttgcatt 
aagccattgc 
agacagctta 
gcggtacatt 
actcgtttgt 
tctggtttac 
gttccggagg 



ttgctcgttg 
actggcacga 
ttgggagtac 
gcccagttac 
ccgcagtaat 
gggaggtaaa 
aagccggcag 



60 

120 

180 

240 

300 

360 

420 



225 



840 
900 
960 



ttcatgtctg tccctatcgc ccttcaactg actcagccca ttttcggagt aaacaacctg 480 

aagtggaacc gccgtatcga accggtgcgt tacgaagagg cgaaagccgc ttttattact 540 

gcgacggaga cggtgaccat gaatgcgatc acttacttct tcaatttgct gtcggcaaag 600 

gagacactgg gcactgcccg acagaatcag gtgaatgccg accgtttgta cgaggtggcc 660 

ggggctaaac gaaaaatggg tcaaatctca gaaaatgaac ttctgcaact gaagcttgcc 72 0 

gccctgaaag cccgggctgc cgtgacggat gcagaaagca atctcaatgc ccatatgttc 780 
cgtttgcgct ctttcctggc cataggaaac gacctgatac tggaacctgt ggtaccggaa 
tcggctccca acctaaagat ggaatacaac caagtgctga acaaagcact ggaacgcaat 
tcgtttgccc acaatattcg tcgtcgtcag ttagaggccg aatatgaagt tgcaacagcc 

cggggaaatc tcaggagcgt cgatcttttt gccaatgtag gttatacggg actgaataaa 102 0 

gacttatctc ctgcctatca caatttgcta gacaatcagg tggttgaggt cggtgttaaa 1080 

attccaatcc tcgactgggg caaacgccgt ggaaaagtga gggtagccaa gagtaaccgg 1140 

gacgttaccc tgtccaagat aaagaaggaa cagatggatt tcgatcagga catcttcctg 12 00 

ctggtcgagc atttcaacaa tcaggcacag cagctttcca tagccaatga ggccgataag 12 60 

attgcgcaac agcgctataa gacgagtgtc gaaacattcc tgatcggtaa gatcaataca 1320 

ctcgatctga acgacgccca gaactcgaaa gacgatgcgc gccaaaaaca tatcaacgaa 13 80 

ctgtactggt attggtatta ctattatcaa ctccgtagcc tgacgttgtg ggatttccaa 1440 

aacaatactc ccctggaagc tgattttgaa gatattgtaa aaaaataa 1488 

<210> 569 
<211> 2406 
<212> DNA 
<213> B. fragilis 



60 



<400> 569 

aatggtggat ataatgaaga aaaaagccaa atatgcaaaa gacaaagacg aatgaaaaga 
atcatattag ctgcattggg gagtgctcta ttacttccct cacaggcaca acagaaaaat 12 0 

180 
240 
300 
360 
420 



aaagaatata ctaacttcaa cgattctgta ttttcaatca acgaagtagt agtggcaacc 
aactacagac gcaagaccga tgctttgaaa ctggatgttc cggcaaagtt cattcctatt 
tcaaccaact ccattacttc tggaatgctt gagaaacgaa acatccggga tatacaggaa 
gcctcccgtt tccttcccgg tgtgcgcttt cgcacctctt acggagcgtt tacccaattc 
tcaatccgtg gattcgataa ttctgtaatc atggtagacg gagtacgtga cgaacgctcg 

tctattgaca actcttatcc gttcatggac ttatcggctg tggaaagcat cgaactgtta 480 

aaaggtccgg cttcagtact ctacggacaa tccgctgtgg gtggtgtcct caatattgtc 540 

cgcaaggctc ctgtaagcaa gcaaagtgtc tatgcccgcc tggcttatgg cagttactat 600 

aacaagcagg ccacaatggc tttgggtggt aaactgatag gaccattgaa ctaccgtgcc 66 0 

agcgtcaatt ggcaggatca ggagggatgg agaagcaatg ctaccaaacg tctctccggc 72 0 

tatctggcct taggagggca tttgacagaa aatgacgaat tggatatccg tatcggagct 780 

aaccgcgatt tctatccgac agaaatcggt ttacctccca caatgtctta tgacatcctc 840 

tcagccacag acggcagcaa atatctgagt aagggggatg ccctgcccgg actgaacaag 900 

aaagcccgct acaacagtga atcggacttt atgtacaacc gtggattcaa tgcttccgcc 9 60 

atgtataagc acacattcag cgaagctttc aaattgatgg agaaattgtc ttatacctat 102 0 

gacgacattg actacttcgg taccgaatca ctggactacc tcacaagcga ccgtcccatc 1080 

tatgatcatt attacatgac caaagacaaa cagggcaatg ataccaaaaa gtatatctgc 1140 

ctggactcca tctactacag ttacccgcta cgcttttcac atatcgctaa aactgtgaac 1200 

aatcaattgg aggcaagcgg aaagttctat acgggagacg ttgcacacaa ctatttgggc 12 60 

ggttattctt ttgtatcctt gatgcgtgac tcttatatgg cctatggcaa tggaagcacc 1320 

ggagccaccg gtcccggaac cacaggacat agctcggtat acaaccctca cagcattggt 13 80 

tggatggaag ctcctttcag atttgttact gcacagaaaa catttaccca cggattttat 1440 

ctgcaagact tggtggaatt cagtgataaa ctgaaaatga tgctggccgg acgttacgat 1500 

ctttttatgt ataagactgc taacctgaac accagtgacg gaggacgcca ttatgataaa 1560 

ccggatgacg atgcttataa taaaataacc aatggtgcct tcaccttccg tgccggattg 162 0 

gtatatctgc ctattgaaaa actatctgtt tacggttcat acggtactta cttcaagcct 1680 

atccgcgcat tttatgacgc taacaccatt tatatcgaca aggatggaaa agagttcact 1740 

cccgtaaatg gtaaagaggt attcaagccg gagaaaggtt tccaggtaga agtgggtgca 1800 

cgatacgaga tcacacgtac attgcagact aacgtaagtt tgttctatat caataaggat 1860 

aatatccgcc agactcttgc caacaaaggc gatattgcca acggcgtaga actggacaag 192 0 

aaagttgtgg gacaagtagg caaaatggat tccaaaggat tcgatattga cattacctgg 1980 
agtcccatct acaacttgtc gatgagcgcc ggatacggat ataccgatgc aaaggtacgc 



2040 



226 



gatttagccg 
attcctaaaa 
ggactgggag 
acgagctcgt 
aatgtgcgcc 
cttggtaatc 
ttataa 



acaaccctta 
acacattcta 
taaacttttc 
tcgatgctta 
tgggagtcaa 
agttaatacc 



tatgccgacc 
tgcttttggc 
taccagtttt 
ctggctgacc 
catcaataat 
aagcatgcct 



acttcaagca 
gcttacaccg 
caggacaaag 
gacctcggtt 
ctgttcaata 
cgtaacttca 



aaggcaaaca 
tatccaaagg 
tataccgcaa 
tctcttatac 
aagaatactg 
tgctttccgc 



atatgcctac 
cgtgttaaag 
ctcggataat 
attaaagagc 
taatcaggca 
atcttatact 



2100 
2160 
2220 
2280 
2340 
2400 
2406 



<210> 570 
<211> 285 
<212> DNA 
<213> B. f ragilis 



<400> 570 

gtaaggtttc 

attttaggtt 

acaaaaaggc 

ttcttcgatg 

gctaacgtaa 



ataagataaa ggaaatattt ttatttgaat tatacaaata ttcactctac 
tccgccggct tttctctcct gtaggctctg tctgcaccat acagatatat 
gaggctaccc tcttggagta gcctcgcaca aaccagttaa tctaaagtta 
gctgcctgcg ccgctgccag acgtgcgata ggcacacgga acggagaaca 
ttcaaaccta ctttatggca gaacttaaca gatga 



60 

120 

180 

240 

285 



<210> 571 
<211> 900 
<212> DNA 
<213> B.fragilis 



<400> 571 

tcaatgaatc 

ctgaaagata 

aatttagtgt 

ggtaaagcgc 

cttcagggta 

atttttatta 

catcctttaa 

gaagggtggt 

ggacctaatc 

ttaagtattg 

cttcttccta 

agttttcgtg 

attccttatt 

ccgatcaatt 

gctgtacgag 



ttctttttac 
aatatcaaat 
cggatgttcc 
attctattcc 
ccaagaattt 
gtaccgttgc 
atggaacgac 
gtgctatgca 
ctccgggaaa 
ccgggggaaa 
tgttaataga 
aactggagat 
ggttagctaa 
cattgaaact 
aactcaagtg 



tggtgcttct 
taggactgtg 
taaactgaat 
taaaacagaa 
atgtactgct 
cgtttatggt 
tccttatgct 
taatgtaaag 
tttaggagcg 
agcccggaaa 
aaagggtggg 
ggtgatatgt 
aagtatggct 
tcgtaaaatc 
gaagcccatg 



ggctttttag 
gggctgactc 
atcaagtacg 
gaagagaaac 
ttagagaata 
tgtgattcgg 
ttgagtaaga 
ttaagtatac 
atgattcgtg 
agtgttctga 
atatataatg 
aatcaattga 
gtcattggag 
acaagctctt 
aatgtgctgg 



gcagcaatct 
ctcgggataa 
atgttgtgct 
aacttttttt 
gtggtattcc 
gtgagaatat 
ttaaggctga 
ttcgtccttc 
ggattagaaa 
tggttcagga 
tttgtgacag 
ataaaaaaag 
attgcctggg 
tgacctttag 
agacttttct 



ttattctcta 
ttatactata 
gcatgctgca 
tgatgtaaat 
taaagctttc 
cacggaagag 
gaaatacttg 
attgattgcc 
tggtaagtac 
tattgcaaat 
ttaccagccg 
accaatatct 
taaaaaagct 
taatgaaaag 
tatagaatga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 



<210> 572 
<211> 1437 
<212> DNA 
<213> B.fragilis 



<400> 572 

tctggctgcc 

cggaagagac 

tatataaaaa 

gggaagagag 

agtccttcgc 

tcaggaggaa 

atcgacatca 

attgtctttg 

ctgcccgatg 

cgattagtga 

ggatttgatc 



accctggtaa 
gtatagcaga 
gaatgaagac 
cggaagaggg 
cttacctttc 
cgcgccatca 
ccgttcctct 
cttcgggtga 
cggagattcg 
tgccctacga 
gtgcattgat 



atgccattct 
cgaccggtat 
ctcagaaaca 
tacaccccat 
accttccgtc 
cgacattgtt 
cgaccaggta 
ccctatcttt 
gctctatccc 
tgatatgcgc 
agaacgtact 



ttgctatccc 
acagacgttt 
acccgaccta 
aaccttttca 
aaggcattga 
gccccactgc 
ttcgcccgtt 
ttcggctttg 
tcttttaact 
accatctccc 
cccaagatgg 



gatgcggaac 
tatttattag 
ctctttcttc 
ccgttatagg 
tagaccaagg 
tccctgccgg 
atgcaggaca 
ccaacacgat 
ctttgcaaac 
ttaccggacg 
gaatcctcac 



aactcagacc 
aatcgttcat 
tctccctgtc 
tctcgacgac 
atgtgtcttt 
agcaaagtgg 
tccccatatc 
ccaccgacgt 
gctggcacac 
cccatggcat 
cgatcgcgaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 
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catacgcctg ctaccattgc aagccggatg ctggactatg gttacaacga ttataccatg 72 0 

tacataggtg aacatctggg acatccggca aaagaactga ttcgccggat gacgcttgaa 780 

gaggcagccg cagaaacatt tgaatatccc aattgcctga ttttaactac aggtgacgga 840 

ctgcaatctg taaacggagg aatactctcc cctcgcttct tcggcatccc cgacgaagca 900 

ttcgaactgc ttgacggacg tgcccggatg atcaccaaag cccctatccg cctgctcacc 960 

ttgagtgcct tggaactaaa ccgacggact tctttctggg atatcggctt ctgcaccggc 102 0 

tccgtctcta tcgaagcccg attacaattt ccccatctgc atgtcacatc ttttgagatc 1080 

cgccccgaag gcaagcgact gatggaaatc aacagccggc gtttcggtac tccgggcatt 1140 

accaccgtca tcggtgactt tcttgaaaca gataccgcca tttatccctg tcccgatgca 1200 

gtttttatcg gcggacatgg cggacggctg aaagaaatca tatcccgggt ccggcataag 1260 

cttcttcccg gagcacgcat cgttttcaac tccgtatccg aagaaagtaa aacgcatttc 1320 

atcgaagccg ccaacgaatc cggactttgt tttctcggcg gaacaagagt cgcaataaat 13 80 

gaatataatc caatagagat cctggtggct tcagctccgg actctcctta tctataa 1437 

<210> 573 
<211> 1899 
<212> DNA 
<213> B. fragilis 

<400> 573 

atcgcccgcc atcccgctgg ggaaaagatt ctcttccatc cccaccacga atacgttgcg 60 

gaactccagt cctttggcgg aatgtaccgt catcagagtc accttctcgc cgtctccctc 12 0 

cttgtcggaa tcctgatcgg tgagcaggga tacttcggag agaaagtcga tcagagaaac 180 

gttcgtgttc ccctcttcct gacgcatggc acaaaaatca ttcataccgt tcaccagttc 240 
ctcgatgttt tctttccggc tgagattttc gggcgaattg tcctggcaga cttcattgat 
gatgccggac tgacggatga tttccgtacc tatttcataa gcattcttta cggtcacatc 
tgccataaac tgctcgatca gcgcacgaaa atcctgcaat ttggtatgtg tacctttatt 
gatggaaagc ccgtacgtaa tgggttcgca gagtgcggtc cagaggctga cattgttatc 

ggtggcggca gtaataatct tgcccaccgt ggtatcgccg atgccgcgtg ccggataatt 540 

gataatccgc ttgaacgcct cttcgtcatt ggggttgacc accaggcgga agtaggctat 600 

gatatctttg atctccttgc gctgatagaa agagaggccg ccgtaaatct tataaggcat 660 

gccccgtttg cgcaaagcct cttcgaagac acggctctgg gcattggtac gataaaggat 72 0 
ggcgaagtcg gagtattcat aatcgtgctc gcgacgcagt tgcgctattt tattggtcac 
aatgtcgcct tcttccacat cactgtaagc ctgaaagacc ccgatggcct caccccgttc 
cttctcggag aacacctctt tggggatctg ccgctcgttc ttttcgatca ggctgttggc 

cgcacggaca atggtctggg tggaacggta gttctgctcc agcttgaaga ctttggtatc 960 

gggatatatc ttggtgaaat acaaaatatt gtcaatgtcc gctcccctga aggagtagat 102 0 

gctctgcgcg tcgtcgccca ccacgcatac acgctgattc tcctttgtca gttgcagcac 1080 

gatgctgtgc tgtgcatagt tggtgtcctg atactcgtcg acaagcacat agcggaactg 1140 

ctcgcgatag cgtgccagca cgtcgggaaa gtcgcggaaa aggatatagg tatagaccag 12 00 

cagatcgtcg aaatccattg ctccggcctg ccggcaacgc tcccagtagc ggctgtagat 1260 

atcccgtatg gcaggcatct ttgcggcaag atcgccctcg tacgcctcct tgttggctgc 132 0 

gtatcccgaa ggagacacca ggtggttctt cgcattggag atgcgtgcct gcacactgcc 13 8 0 

gggcttatag gtcttctcgt caagccccat ctctttgatg atggaacgaa tcaggctctt 1440 

gctgtccgcc gaatcgtaga tggtgaactg cgacgtaaag ccgatatggg acgcctcggc 1500 

acgaagaata cgggaaaaaa ccgaatggaa cgtacccatc caaaggaatc gtgcacgctg 1560 

ctcgcccacc tgccgggcaa tacgctcctt catttcacgg gcagctttgt tggtgaaagt 162 0 

cagtgccagg atattccagg gattgtaacc gttctcgagc agataggcta tcttataagt 1680 

gagcacacgc gtctttccgg aaccggcacc ggcgatgacc agcgaagggc catcaccgta 1740 

gagcaccgcc gcacgctggc tttcattaag ttcttcgata tagtcgggca taaaatttat 1800 

ctgttaaatc tctattcacg ggcaaagata gaggaaagtt tggacaactc cgaacatttg 1860 

gcccctatat cctgttatat attcagagaa atgaaataa 1899 

<210> 574 
<211> 312 
<212> DNA 
<213> B. fragilis 



300 
360 
420 
480 



780 
840 
900 



<400> 574 
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ttcggcttcc tttttttgtt tactgtttac cgttgcttca agataacagt tagggataat 60 

agcagtattc cagccactgc gcagtcagat ggaggaatgt tgttgaacta tgacgatacg 12 0 

gaaaaccgca cttatttgag gttcaccggg tatccacctc tcataacaca gttgaataat 180 

ataggtaagg agggatatat caatgtgata gacaccaaga gtgtattgaa ggtcagtccc 240 

tcaaataatc aaattgaggt tgctccattt gaagattatg atgcgcacac cacccgttgt 3 00 

gtacaggaat ag 312 

<210> 575 
<211> 207 
<212> DNA 
<213> B. fragilis 

<400> 575 

acacaattcc tattatctat ttatttcaaa tatcaaatca gcaaatatta taagatttac 60 

catcattggg ctaccggaaa atttttggga tattataaat gctcctatta ccagagccgg 12 0 

tattatagat gccgggtgga ttcccgaagt tacaattctt ggagatttat gtcgacaaat 180 
gagtcatatc tgttcagatc catatga 2 07 

<210> 576 
<211> 723 
<212> DNA 
<213> B. fragilis 

<400> 576 

aatattgcta aaacgatgat aacagtttgt atggctacct ataatggaga gaaatatatt 

gaagaacaat tagaatcagt actaatgcaa ttgcattcca atgatgaagt tattatttct 

gatgatggtt ctggcgatag tacagtagac ttgattcgaa ccttcaatga ccctcgaata 

aggcttttag tcggtaataa tttcttttct cctactcaaa actttgaaaa tgccttaaaa 

tatgcaaaag gcgattatat atttttatgt gatcaagatg atgtttggct tcccgataaa 

gtagaaagta tgctgcaata tttgcttcaa tatgatttgg ttgtgtcgga ttgtaaggtg 

gtggatgcgg aattgaatgt aatttctcag tcttttttta tggggcgttc gtcaggaaaa 

ggtttttgga agaatttaat taagaacact tatttggggt gctgcatagc atttagaaaa 

gaagtattag gttacatatt accttttcct agaaatatag cgatgcatga tatatggata 

ggattatctg ttgaaatgca tagtaactct ttttttctgc ctcgtcaact aatactatat 

cgaagacatg gctcaaatgt tagttttgga ggggaaggaa gtaaatactc gttaatgtat 660 

aagataaagt atcggttgtg tatgcttttt tatttattaa aacgaaaata tttgaataaa 720 



60 

12 0 

180 

240 

300 

360 

420 

480 

540 

600 



60 
120 



tga 723 

<210> 577 
<211> 207 
<212> DNA 
<213> B. fragilis 

<400> 577 

gctcgtcata atactccgga agttcgcgga ccaacatcac accttccttc cagaatatcg 
catgatttct ataaaaaggt tcgggatacc gcaaaaacaa taaatcattc agaacaaaca 
actaacacta ttagttattc gactaacagc ataaattatt tacctaatct tagcaggaga 180 
ttagaaaacg gaccttatta caggtga 2 07 

<210> 578 
<211> 1230 
<212> DNA 
<213> B. fragilis 

<400> 578 

tccatacata aaattaatat catggagaag aatattttca aactggacaa tgaacaactg 60 

aaaggaatcg cacacgcatt ccgggagaaa gtggaagagg gattaaataa gaataatgct 120 

gaaatacaat gtattcctac ctttatttta cctaaggcga ccgatgttaa aggcaaggct 180 

ttggttctgg acctgggagg taccaactat agagtggcaa ttgtcgattt ctcaaccgaa 240 
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aagccaatca tctatcccaa taatggttgg aagaaggata tgtcgattat gaagtcgccc 3 00 
ggttataccc gtgaagagtt gttcaaagag ttggccgact tgattgttga aataaagcgg 3 60 
gaagaggaaa tgcctatcgg ttattgtttt tcttatccga ccgaatcgat accgggaggc 42 0 
gatgcaagat tgcttcgttg gaccaagggg gtagacattc gggaaatggt gggacagttt 480 
gttgggaaac ccttactcga ctacctgaat gaaaaaaata aaatcagatt tacaggagtt 540 
aaagtgctga atgacacgat tgccagttta tttgccgggc ttaccgacaa aagctatgat 600 
gcttatattg gcctgattgt agggacaggt acaaatatgg caacttttat tccgtctgac 660 
aagataacga agttggaccc ggaatgtcac gtacaaggct tgattccggt caatctggaa 72 0 

■ ' ' 780 
840 
900 
960 
1020 
1080 

gaaagtcgga aagataaaaa ctataatatc cttgtaatgg agaaattgca ggaacttctt 1140 
cgtgagcttg aactggaaga tgtcgaagtt catattaata gtatggataa tgccaatctg 12 00 
ataggaacgg ggattgcggc attatcctga 123 0 



tcgggaaact tttatcctcc cttcctgact gcggtggacg atactgttga cgcaacttct 
gacagtttgg gtaaacagcg ttttgagaaa gcggtatccg gcatgtatct gggagatatc 
ctgaaagcag ctttcccttt ggaagaattt gaagagaaat ttgatgcaag gaaactgact 
gctattatga attatcctga tatacacaaa gatatctatg ttcaggtagc ccattggatc 
tataacagat cggcccagct cgtcgctgcc tctcttgccg gattaatcgc attgctgaaa 
tcgtataatc gagatatcca tcgggtttgt ctgattgccg agggcagtct tttctggagt 



60 

120 

180 

240 

249 



<210> 579 
<211> 249 
<212> DNA 
<213> B.fragilis 

<400> 579 

ataagacata aagctacaat gataaatccc atacaactca aaagcaccaa tacagctagt 
atcttagaaa atgatttatc caattcttta aaatctcgta atgacaccaa tttagacaaa 
ttgggtactt tagtgtatat ccagttcatt gaaacagatg agaccccgtt tatcacactt 
aatgtcattc caagttgccc agcagcagca ctacctattg tggcaaacag aatcggattg 
aaaaactga 

<210> 580 

<211> 1320 

<212> DNA 

<213> B.fragilis 

<400> 580 

gaaaatagta acaagcacaa aatgattaaa caatatttca agcaggccct tgcacaactc 6 0 

agacagcagc ccttgctgac tacgatcagt gtgttgggca ctgctttgac catttgcctg 12 0 

attatggtag tggtcatgca acagcaaata aaaaccgccc cttttgctcc ggagagtaac 180 

cgtaaccggc tattgcatgt caaacagatg agcacgagca acaaaaactg gagtgatgac 240 

ggatcgagta acggtccgat ggggctgcag acagcaaaag gatgttttga aggattaacg 3 00 

acggccgagg aagtcagtat ctatacgata cccgaaacta tgcaggtggc tttgccccgt 



360 



ggtgtacgta cggggatcga tgccctcgag accgacggag ctttctggag gatattcgac 420 
ttttcgttta tagacggtaa gccttattcg gatgcggaag taaaatccgg gcttccggta 
gctgttataa cagagagtgt cgcacgtctt cttttcggta cgtcccatca ggtgtccggc 
aaggagatct tggtgaatga tgcggtctac cggataagcg gagtggtgaa agatgtgtct 
tcaatggctt cgacagccta tgcacagatt tgggttccat attcatcaac ccatattacg 
ggaggagaca atacctggtg tgacgggatt atgggagtga tgcgagtggt gatcctggcc 72 0 
cgcagttctt ccgacttcga agctatccgt gcagagtgcg aacgtcgccg cttggcttat 
aacgccgggt tgggtgatta ttttgttttc taccgtgggc agccggatga ccaactgacg 
atgtctcagc ataaatgggc aaatgtgcag ccggatatgg cagcctattt tcgtcaacaa 
gtcattatat ttttgattct gttactggta cctgccatca atctgagctc gatgacccat 
agccgtttga gacaacgcgt tgccgagatc ggtgtacggc gtgcgttcgg agctacccgt 102 0 
gggggagtga tggggcaaat tgttgctgag aatctggtac tgactttgat ggccggagtg 10 8 0 
gtcggactgt tgttctgtct gatcatatct tattgttggg gaggtacgct ttttgccgat 1140 
agcagattga tgtaccttaa cacggctccg gttatcgagt ggaaaatgct ttttaaattt 1200 
tctactttta tttatgcatt acttttctgt ttggcactga atctgctgag tagtggatgg 12 60 
ccggcctgga gggcatcgcg gatgtctatt ataaatgctc ttagcggaaa gcttaactaa 13 2 0 



480 
540 
600 
660 



780 
840 
900 
960 
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<210> 581 
<211> 288 
<212> DNA 
<213> B.fragilis 



<400> 581 

ttgttcttca atatatttct ctccattata ggtagccata caaactgtta tcatcgtttt 

agcaatattt tattacatat atatattacc cataccaaca tcgtattgaa cgcattaaaa 

aaacacccca tcaatataca aaagacaact accccccatg agattatgta ttccggacaa 180 

ttctttttct catacctttc gaataacata acagcaatca ttccatatag acaattaaga 240 

acaacaaccc ctgcatatcc aaaatcaaga tatggatctg cgatataa 



60 
120 



288 



<210> 582 
<211> 579 
<212> DNA 
<213> B.fragilis 



<400> 582 

agaaaactca aaaacaaaaa taggatgact gcaacggaaa ggacagcgga ataccggaaa 60 

gcactcgatg tgcctatctc ccaactggag acggaccgga ttgtaaaaga aatcctggat 12 0 

cgaccggaga acttcgacaa catttaccgg ctgacgtcgg acgataaatt attggtgtcc 180 

tggcgggcct tatggatatg cgacaaactg tgcaggcaga agccggagtg gctgatccct 240 

ttcagggaag agctgaccgg aaggttgatg tcctgcgggc acgatggctc gaaacgactg 3 00 

cttctttcca tactctacca tgcacccgca acgaaggtgc cttccgtggc tctgctcaac 360 

ttctgcctgg acgccatgct gtcgccccaa gagagtatcg gcgtgcaatc gctcgccatc 42 0 

cgaatggctt accgcctgtg cgagcccgag ccggagttgc tgtatgagct gcgtaccata 480 

ctggagagta cagagaccga aatgtattcg accgccgtaa aatcggctgt acggaacaca 540 

ttgaagaaga ttaaccagaa gaataaaaag aaaaaataa 579 

<210> 583 
<211> 801 
<212> DNA 
<213> B. fragilis 



<400> 583 

aatgaactca aaacaagatc agaaatggaa aagttaatca ttgcgggacg tgaattcaac 60 

tcccgcctct tcctgggaac aggtaaattc agctcaaacg aatggatgga acagtcgata 120 

ctggcatcgg gcaccgaaat ggtgacagtg gccatgaaac gtgtcgacat ggagagcaca 180 

gaagacgaca tgctgaaaca tattgtacat ccgcacattc agttgcttcc caacacatcg 2 40 

ggcgtacgca acgcggagga agcggtgttt gccgcacaaa tggcacgcga ggctttcgga 3 00 

accaactggc tgaaactgga gattcatccc gacccgcgct atctgctgcc cgactcggtg 3 60 

gagaccctga aagcgactga agaactggtg aaactcggat tcgtcgtgct cccctattgc 42 0 

caggcagatc cggtgctctg caaacaactg gaagaagcgg gagccgccac ggtaatgccg 480 

ctgggagcac ctatcggaac caataaagga ctgcaaacca aggagtttct gcaaatcatt 540 

atcgaacagg ccggtatccc ggtagtggtg gacgccggaa tcggagcacc gagccatgcg 600 

gcggaggcta tggaaatggg tgcatcggca tgcctggtaa acacagctat cgccgtagct 660 

ggcaacccga tagaaatggc aaaagccttc aagcaggcag tagaagccgg acggacggca 72 0 

tacgaggccg gactgggtat gcaggccata gggttcgtgg cggaagcaag ctcaccactg 780 

acggcatttt taaacgaata a 801 



<210> 584 
<211> 330 
<212> DNA 
<213> B. fragilis 



<400> 584 

aaaaacgcca acactaaaac gcaacatcct atcacagaat ccattaaaga aaaaagaggc 

agaaaaacag gagcgcagat accgggaatt atctccaaca atgaaggagt tataaaagcg 

ctgatagaat cctacatatt ggacgcaaaa gaacaaaata tcaagacatg caaagattcg 
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ttggcacgct acatagagga aaaagaactt tttgggaaaa tgagaaatgg agtattcaaa 
ccattagttt tcagcacaat caggaattac gtcaacgaaa tctggaataa gatggaaaga 
aagaaaaaga accaagaagg aaagcgctga 



<210> 585 
<211> 1281 
<212> DNA 
<213> B.fragilis 



<220>< 

<221> unsure 

<222> (1074) , (1086) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



60 



<400> 585 

aagaaaaaca agagacatga catccttatg aaaaatattt ttaaagattt aaaaagtaaa 

gaccacaaac gctatctggg aggtttggac gtcttcagat atattggtcc cggtttattg 12 0 

gttactgtag gttttattga tccgggcaat tgggcttcta attttgcggc aggttcagaa 180 

tttggttact cactgttatg ggtggttacg ctgtccacca tcatgctgat catattgcaa 240 

cacaatgttg ctcacttggg aatcgtgaca gggctttgcc tttcggaggc ggcaacgcag 3 00 

tatacgccca agtgggtatc gcgtcccata ctggggacgg ccgtacttgc ttccatctct 360 

acatcactgg ccgagattct gggaggggcc atagcgctgg aaatgttgct cgacattcct 42 0 

attgtctggg gggccgttct gactaccgtt tttgtttcca tcatgctttt tacaaattca 480 

tataagaaaa tagagcgctc catcattgct tttgtatcgg tgatcggctt gtcgttcatc 540 

tatgaactct ttttggtgga tattgactgg cctatggcag tagaagggtg ggtgacgccg 600 

gctataccta aggggagcat gctcattatt atgagtgtgc tgggtgctgt ggtgatgcct 660 

cacaatcttt tcctacattc ggaggtgatt cagagccacg aatacaataa gcaggataca 72 0 

gcgtccataa agaaagtgtt gaagtacgaa ttgtttgata cgctcttttc aatgattata 780 

ggatgggcca tcaacagtgc catgattctg ttggcagccg ctaccttctt taaaagtggc 840 

attcaggttg aagagctgca gcaggcgaaa tcattgctcg aacccctgtt gggaagtaat 900 

gcggctattg tttttgcttt agccctgctt atggcgggta tctcgtctac gattaccagc 960 

gggatggcgg ccggatctat tttcgccggt atctttggcg aatcatacca cattaaggat 1020 

agccactctc aggtaggggt tatcctgtcg ttgggcattg cattgctact gatnttctta 1080 

tcggcngatc cgtttaaggg tctgatcatc tctcagatgg tgctgagtat ccagttgccg 1140 

tttacggttt ttttgcaggt cggtctgacc tcctcgcgta aggtgatggg cgattatgtc 12 00 

aatagtaaat ggagcacgtt tgtgctttat accattgccg tgatagtgac agtgttgaat 12 60 

ataatgttgt tgttctcgta a 12 81 



<210> 586 
<211> 288 
<212> DNA 
<213> B.fragilis 



<400> 586 

atgaacgaca agccgatcac cgatacaaaa gcaatgatgg agcgctctat tttcttatat 

gaatttgtaa aaagcatgat ggaaacaaaa acggtagtca gaacggcccc ccagacaata 12 0 

ggaatgtcga gcaacatttc cagcgctatg gcccctccca gaatctcggc cagtgatgta 18 0 

gagatggaag caagtacggc cgtccccagt atgggacgcg atacccactt gggcgtatac 240 

tgcgttgccg cctccgaaag gcaaagccct gtcacgattc ccaagtga 



60 



288 



<210> 587 
<211> 1347 
<212> DNA 
<213> B.fragilis 



<400> 587 

aattgcttaa atctaatatt atatatacct 
ttctcgtcga tgaatattac ggcgagtatt 
gtaatcgcta actctccggc agcatcggtg 



attatgacag ttttacgttc gatgaaggat 6 0 
ctattgtttg tcacagcgat tgccgctgcg 12 0 
tatcaggagt ttttgtcgca tgaacttcat 180 



232 



tttcgcatcg 
ttcattaatg 
gagttactgg 
tgtggcggaa 
ggcgggcaag 
agcctgttgg 
gtcgatgata 
gaatatttgt 
gctaccaata 
tcgggtatcc 
cagttgaacg 
atgggagcaa 
tcggcttccg 
gtgaattatc 
gaaggtgaag 
tttctgggga 
ttggggatga 
gtatcgcttt 
gccaaactcg 
cattgggtct 



gaggctttaa 
acggtctgat 
taggcgagct 
tggtagtgcc 
gactggctat 
gcaagcgtgt 
taggcggtat 
tatgggcggc 
agattttctt 
atagtacgat 
tcggtacata 
acaacatcgt 
accgtgtcat 
ttgtccttcc 
ttattggcgg 
tttattcttt 
actggaagaa 
tcatcgccaa 
gtgttttatc 
tgcccaaaag 



tttactttcg 
gacgattttc 
ttcctcgttc 
tgttgtcatc 
ccctatggca 
tccgttgagt 
attggtgatt 
gctgctttac 
tttagttgtc 
ttccggtgtt 
tattgagcgt 
actgaccaat 
cagtcccctt 
gttgttcgct 
ggttaccctt 
tacctggctg 
tatatccgga 
tctttcgttc 
cggtacggta 
aagataa 



catgcgggac 
ttcttaatgg 
cgtaaagctg 
tattccatgg 
accgatattg 
ctgaaaatct 
gccattttct 
gtcctgttat 
ggtgtggtta 
attctggcct 
atccggcgca 
caacagatag 
cagtcgcttg 
tttgtcaatg 
gcggttgctt 
gctgtcaaaa 
gtggcgttac 
ggctccgccc 
atggcgggta 



acaatctgac 
tcggactgga 
cactgccatt 
tttgtgcccc 
ccttttcttt 
tccttacagc 
acagttcaca 
attttatagg 
tctggtatct 
ttgtcattcc 
ttatcagtac 
ccaagctgaa 
aggataacct 
caggtgttat 
tgggattatt 
gcggtcttac 
tgggtggaat 
atcctgtatt 
tcttgggata 



gatgattgag 
gattaagcga 
cattgccgca 
gggcactgaa 
gggagtgctc 
gtttgcggta 
cgtggcttat 
taagaaggga 
tttcctgcaa 
ggccaaacca 
attccccgaa 
agaggttgag 
gcatggtgca 
gtttagtggc 
ggcaggcaaa 
tccgatgcct 
aggctttacg 
attgaaccag 
cctggttttg 



240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1347 



<210> 588 
<211> 1014 
<212> DNA 
<213> B.fragilis 



<400> 588 

cctaacctaa 

tttatgtcct 

aatgaacaat 

tccaaaatca 

agccctaaat 

gtt cctctgc 

atcaaatcag 

att ataaatg 

acaattcttg 

gcttttcttg 

gatagttctt 

atgcgacatc 

gctaaaaaat 

agtgctctca 

gaacaaaatc 

ggttatcaac 

gaaataataa 



acattatgaa 
gtggcaatga 
ctgattcctc 
attttgatga 
gggaaaatgt 
tagcacagga 
caaatattat 
ctcctattac 
gagatttatg 
agttcctaca 
catctggtgg 
cacaagtaat 
ttccaggaca 
acacgctttc 
cgggacaaga 
tgggagatct 
aatataaaaa 



gtattttgta 
agacaatgct 
aggactatta 
aaactttaat 
ttctatggtt 
caatcctgaa 
aagatttacc 
cagagccggt 
tcgacaaatg 
cagcaaatgg 
cgattatagt 
taaaaaattt 
gcacaatgga 
agaaaatgca 
tattgcagag 
agcaaaacaa 
tgatggaaaa 



ccactctttc 
tgggataata 
aagagccaaa 
aatacagaac 
cttcagaaac 
cacaattcct 
atcattgggc 
attatagatg 
agtcatatct 
cttaaagaac 
cggctgactg 
catgataacg 
gagggcgatg 
aatctggcaa 
aagaatatgg 
aataaatggt 
ttacaaacaa 



tcagtctatt 
atatccctat 
taactgacat 
tttataagaa 
aagacacact 
attatctatt 
taccggaaaa 
ccgggtggat 
gttcagatcc 
atggcaatga 
aagcagaaaa 
caagaaaagc 
cggtaagaca 
aagagttcgg 
atctatttaa 
cagaagaacg 
aattacatcc 



ttttttagtt 
tattactcct 
cattaattat 
cttgatttta 
ccatttatgt 
tatttcaaat 
tttttgggat 
tcccgaagtt 
atatgatgaa 
aagttcctcg 
gcgcttcctt 
tagtgaagca 
tgtttattgg 
tgatgcacat 
taactccatt 
tttatttaag 
ataa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1014 



<210> 589 
<211> 429 
<212> DNA 
<213> B.fragilis 



<400> 589 

attttatcaa 

aaccctttaa 

atgaaactga 

gaatggatgg 

atcgcttatc 

gaagcggaat 

aatgtggacg 

atcgtatag 



accggaatac 
atcatatacg 
gatggaaaaa 
cggagcgcct 
agaagcagaa 
tccacaagaa 
aacagcgatg 



attcgatccg 
tatgaaacag 
acggattgtc 
ggaagcgttg 
cggagacttc 
gtatgatccc 
gacgacattc 



acttaccttt 
aagaaaagac 
tttgagaaag 
accgaccatc 
aggttggtga 
acaaaaatag 
caggtggaga 



ggggcgataa 
cggcatcaca 
gatacactga 
tgcaatacgg 
aagcgacact 
aaggcgcagt 
acttcatgga 



tttatctatt 
aactgaagcc 
aatgtgtgcc 
gcacgcagcc 
gatctactat 
agtctactgg 
gtggagaccg 



60 

120 

180 

240 

300 

360 

420 

429 



233 



<210> 590 
<211> 2484 
<212> DNA 
<213> B. fragilis 



240 
300 



480 
540 
600 



840 
900 



<400> 590 

aaacataaaa aacttaatga tatgaatatc tcttataatt ggctgaaaga gtatgtcaat 60 

ttcgatctga cgcccgatga agtggcggct gcgctgactt ctatcggact ggaaacaggt 12 0 

ggagtagaag aagttcaaac gattaaaggc gggttggaag gtctcgtgat tggcgaggtg 180 
ctgacttgcg tggaacatcc caattcagac catttacaca tcacaaccgt aaatttgggt 
aacggcgaac ctactcagat tgtgtgcgga gctccaaacg tagctgccgg acagaaagtc 

gttgttgcca ctttgggcac gaagctctat gatggtgacg aatgttttac tattaagaaa 3 60 

tcaaagattc gcggcgtgga gtcgatcggt atgatatgtg ctgaagatga aatcgggatc 420 
ggaacttcac atgacggtat catcgtattg ccggaagatg ccgtaccggg tactcttgcg 
aaagattatt ataatgtaaa gagcgactat gtacttgaag tagatattac tccgaaccgt 
gctgacgctt gttcacacta tggggtggca cgcgacttgt atgcttatct ggtacagaat 

ggcaaacagg ctgcactgac cagaccgtct gtcgatgctt ttgctgtcga aaatcatgat 660 

ctggatatca aggtaactgt agaaaacagt gaggcatgtc cacgttatgc aggtgttact 72 0 

gtgaaagggg ttactgttaa agagagtccg gaatggttgc aaaataaact tcgcatcatc 780 
ggtttgcgtc ctattaataa tgtagtggat atcacaaatt acattgtgca tgctttcggg 
caaccgcttc attgctttga cgcaaacaaa ataaaagggg gcgaggtgat tgtgaaaaca 

atgccggaag gcacaacgtt tgtcacgttg gatggcgttg aacgtaagtt gaatgaacgt 960 

gatctgatga tctgtaacaa agaagacgct atgtgtattg ccggtgtttt cggaggtctt 102 0 

gattccggtt ctacagaggc cacaacggat gtgtttctcg aaagtgcata tttccatccg 1080 

acatgggtgc gtaagacggc ccgtcgtcat ggcttgaata cagatgcttc tttccgtttt 1140 

gagagaggta ttgatcctaa tatcacgatc tactgcctga aattggcggc tatgatggta 12 00 

aaggaacttg ccggaggtac catttcttcg gagattaaag atgtctgtgc tgctcctgca 12 60 

caggatttta ttgtcgagtt gacttacgag aaggtacaca gcctgattgg taaagtgatt 1320 

ccggtagaga cgataaagag cattgttacc agtcttgaaa tgaaaatcat ggacgagacg 13 80 

gccgaagggc tgacattggc cgtacctcca taccgtgtgg atgtacagcg tgactgcgac 1440 

gtgattgaag atatcctgcg tatttacgga tataataatg tggaaattcc atcgacactg 1500 

aagtcgagcc tgactacaaa aggagattgt gacaagtcga ataagttgca gaacctggtg 1560 

gctgaacagt tggtaggttg tggtttcaac gagattctga ataactcttt aactcgtgcc 162 0 

gcttattacg atggtttgga aagttatcct tccaagaatc tggttatgtt gctgaatccg 1680 

ctaagtgcag atttgaattg tatgcgacag acactgttgt tcggtggatt ggaaagcatt 174 0 

gcccataatg ctaaccgtaa gaacgcggat ttgaaattct ttgaattcgg taactgttat 1800 

cactttgacg cggagaagaa gaatcctgaa aaggttttgg ctccttactc agaggattat 1860 

catttgggac tgtgggtgac cggtaaaatg gtatcaaatt catgggcaca cgcagatgaa 192 0 
aacacttctg tctacgaatt gaaggcttat gtggagaata ttttcaaacg tttaggattg 
gatttgcact ctctggtagt gggcaacctg agtgatgata tttattctac ggccttgacg 

gtaaatacta aaggtggcaa gagactggct acattcggtg tcgttaccaa gaagatgctg 2100 

aaagcttttg atgttgataa tgaagtctat tacgctgatt taaactggaa agagctgatg 2160 

aaagcgattc gttcagtaaa agtaagctat aaagagattt ctaaattccc ggctgtgaaa 2220 

cgtgacttgg ctctgttgct ggataagaag gtacagtttg ccgagattga aaagatcgct 2280 

tatgaaacag agaagaaact cttgaaagag gtttctttgt ttgatgttta cgaaggcaag 2340 

aatcttgaag ccggaaagaa atcttatgct gtcagcttct tgcttcagga tgaaagccag 2400 

actctaaacg ataagatgat tgataagatc atgtcgaaac tggtgaagaa cctggaagac 2460 

aaactgggag ccaaactcag ataa 2 484 



1980 
2040 



<210> 591 
<211> 192 
<212> DNA 
<213> B. fragilis 



<400> 591 

tcccacaaaa atagaaggcg cagtagtcta 
attccaggtg gagaacttca tggagtggag 
cacggatttt cacagatgaa tgatttctat 



ctggaatgtg gacgaacagc gatggacgac 60 
accgatcgta taggacgcca ccacagatta 12 0 
tttgaggaat caagtgaaga catctttggg 180 



234 



aatccggggt aa 



192 



<210> 592 
<211> 579 
<212> DNA 
<213> B. f ragilis 



<400> 592 

aacgtaaaaa 

agtttccttc 

tatacggttt 

aactctctgc 

gcctcaataa 

cttttccaac 

atactaacct 

atcaccggca 

aaaaacatag 

atcgtactgc 



ataagcttat 
tcaaaaccgg 
tccttatttg 
tggcctctgc 
tgatcgcttt 
ggacagttac 
atctgcttat 
tctcttcagt 
taggagaacg 
ttagtgtcgt 



ggaacatatc 
tttctatcca 
tataggtcca 
cccacacata 
ttgtttcaac 
atacatattg 
acagttgttc 
tgctgttttt 
gaaattacga 
tagcacagga 



attcatttac 
cgatggggaa 
tgggctacgg 
ctgacccttt 
tgttttgcag 
aatttttatc 
ttcgcttttc 
atactgatat 
cttgaaatac 
aacaactaa 



taatcggttt 
tctggttgtc 
aacaatcgcc 
cagtatatgt 
acacctcaaa 
ccggactgct 
cgggagtcag 
ccggactgag 
tcttcatcac 



catcgtttta 
agcacttgtc 
aacagagata 
cacgctcgaa 
acagcggaca 
aatggcgggc 
cttcggactg 
cctactcttg 
caatctgttt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

579 



<210> 593 
<211> 723 
<212> DNA 
<213> B.fragilis 



<400> 593 

gagaacggaa 

gtgtcgttag 

aatgcgcttt 

ttctttgcgc 

catactcaaa 

aaagcaaatt 

tatacgcatc 

gccgaacaag 

atgggtacat 

tcaatggcct 

gctatcggag 

cttgaattta 

tga 



attacgactt 
cacaggaaac 
tctggatatc 
gtgctgtact 
aatctcttgc 
cccttaccgg 
gggacaatgc 
aattgggacg 
taatcccaat 
acaatatgca 
tggtcaccct 
tcagtaaaac 



gaaatactct 
aactaaagaa 
caacgggcta 
gcttgcaggc 
cgagcagttg 
agacagaagt 
agcttattgt 
ttcacgtaca 
gggtcccgca 
agtagcattt 
gcaaatccgg 
cctaatccat 



tcatcaccaa 
ataacaatta 
cttgtcccgg 
ggcttcttcg 
gaagaactga 
acccccctgc 
gagcgattgc 
tttgtcaagc 
cttgtaggtc 
gccaccactg 
caacgttggt 
ggcacgaaac 



tctgtttatc 
taaatatgga 
tagtagttct 
gagaatttta 
ctcccgacaa 
aacgctgtgt 
ttgctaactt 
tgggtcccat 
tggccacagg 
tagtaggtat 
atgcccgtga 
aaacttctac 



gtactgctta 
aacaatatca 
gttgttacta 
tcgtagggta 
tatcgaagaa 
atacaagctt 
tgaagtagat 
gctcggacta 
agacattgct 
ggttatagct 
aataaacgac 
acaaccagaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

723 



<210> 594 
<211> 948 
<212> DNA 
<213> B.fragilis 



<400> 594 

tgctgcttga 

aaggcccgtg 

cttgcggcca 

atcgtgctgg 

aaggtgctta 

gtctacgagg 

acagagcggc 

ggacgcggaa 

atgtttgcca 

acggaccgtc 

gtgtcctggc 

tccaaatcga 

ttgcgattgg 

cctctgatcg 



acttaaataa 
cacaatatac 
aaatgcccga 
tcgacaatgt 
tcagcaagga 
acgcctatat 
agaaagaacg 
accgcatcta 
aagacgaaaa 
gctatgtagc 
tgacagaccg 
tcacccacta 
aaaccggacg 
gagacggacg 



aaaaatgaat 
cggctatctg 
tgccagccgt 
gatcacaaca 
caagaacaag 
catcgtagtg 
tactgcccag 
cgtggttcat 
gacacaatac 
ggtggttacc 
taccttgtac 
tcgcaccatc 
taagaatcag 
ttatgggata 



atgaagagaa 
gtgaaagaac 
accaagctaa 
caattcaact 
aaagaattcc 
gagaaaaagg 
catattttaa 
cgcctggacc 
acgttgcgtg 
ggcgagatgg 
gtcagctcaa 
aaacgtgcca 
atacgtgtac 
gacggtgggc 



tcaaacgaac 
cgatggaatt 
agtctctgtt 
ttcctctgca 
gccatccgct 
aaggattgct 
gcgaatatgt 
gggatacttc 
accattggca 
agaaagacag 
gcagctatga 
atggctactc 
acatgcagga 
ccaatcctct 



tccggccgag 
aatggatttc 
gagcaaacga 
accaggcatg 
actgaagata 
ttccgttggc 
aggtcgttcg 
gggattaatg 
cgacatcgtg 
cgacacggta 
tgatggcggt 
gctggtagaa 
tctggggcat 
cgggcgcctg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 



235 

gctttgcatg ctttcaaact ttgtttctat catccggtga cagatcagct aatggagttt 900 
gaaacccctt accctcctac attcaagaag ctatttctga agaaataa 948 

<210> 595 
<211> 1806 
<212> DNA 
<213> B. f ragilis 

<400> 595 

aaatatatga aaacagccat tattgtcata tcagaagccg gcatagcact ggccaagaca 60 
ctggaacagg aacttcccga atcagagatc ttttctaccg gcacagacac agattgccac 12 0 
tctatttcca atcttcagga ggccgttcct gagatattcc ataaattcga tgctattatc 
ttcatcggag ctatgggaat ttgtatccgt gccattgctc cccatattga agacaagcat 
aaagatcccg ccgttgtctg tgtagacagc acaggacgtt atgctgtctc tgtcctgtcc 
ggacatattg gtggagccaa cggactgacc cggtatgtgg caagcattct gggagccgaa 360 
cctgtgatta ccacccggag tgaccgtacc ggtctttggg ccctcgatac tcttggcaaa 
aaatacggtt ggcaaacagt cccggccgaa tcatcagata tgaatcatct gatcacactc 
tttgtagatt gcaaaccaac agctctatta ctcgacattc gcgacgaagg cacaacacag 
ttggaacata ccttgcctcc tcacgtcgat gtattctaca aatttgagga tatggatctc 
cggaaatatg acttgctcct gcttgtcact ccatttattt acaacacctc tgacactccg 
gcactctact acgtcccacc ggtattgcat atgggagttg gactggcccg cgatgcccat 720 
ccggtggata ccgtcattac ccatctgatg gatgttgtgg tgcaagccaa catgatccct 
cttgccatac gtaccgtatc ttccattgaa gaaaaaaaag acgaaccggt gctcaaacta 
cttgcagagg cttatcagac ccggctttac accgccagtc aactcagcaa aatagaggtg 
cccactccaa gtgaagtggt caacaagcac atgggtactc ccagtgtatc cgaagcctct 
gccctactct cttccggagg cggtccctta ctcctgccca aacaaaaagg cgctaacttt 
actgtagcca tcgccatgga cgccgcctcc gtacgtcagg ggcacatcga aattgtcgga 
gccggtcccg gcgatccgga gctgatctcc gtacgcggac gtcgctttct cgaagaagcc 1140 
gacctgatac tttatgccgg cagtctcgtc ccccgcgaac tgacagaatg tgccaaagcc 12 00 
ggtgctacca tacgcagttc ggcttccatg actctcgaag agcaatttgc cttgatgaaa 1260 
gagttttatg accgtggaca gttggtagtc cgtctgcata caggcgaccc ttgtatctat 1320 
ggtgccatcc aggagcaaat gaatttcttc gaccaatatg gtatgcatta ccacatcact 13 80 
ccggggatct cttcatttca ggctgccgcc gctgctctcc aatcccaatt caccattccg 1440 
gagagggtac agaccatcat cctcactcgc ggtgaaggtc gtacaccgat gcccgagaaa 1500 
gagaaactca gcctgctggc acgttcgcaa agcaccatgt gcatcttcct cagcgcaggc 1560 
gtagtcgatc aggttcagcg agagctcctc gagcactatc cgcccactac acctgtagct 162 0 
gcctgttatc atctgacctg gaaagacgaa cgcatctttc gcggacaatt acaggattta 1680 
gctaagatcg taaacgaaaa ccatctgact ctgactacca tgattgtcgt aggcgatgcc 1740 
atcgataatc gggaaggact gtcacgacta tattctcacc aatttaaaca cttattccgt 
aaataa 

<210> 596 
<211> 489 
<212> DNA 
<213> B.fragilis 

<400> 596 

caagatatga actatttaga atcagaaatc tccgctcttt atgcttctgc tcatgaactt 

tgctatctgg gcatggacgg tcggccgatc tacagtgatc aattcacccg tctgaatcgt 120 

gatgtttttt ctcaggctaa tgctttgtac gacaagcatg gtgatagtga tgaagaagag 180 

gcccggttgt gtctgtcgct cctgatggga tataatgcga ctctctataa taacggtgac 240 

aaggaggagc gtatccaaca tattctggat cgttgctggg atgtactgga acatctgcct 300 

gcctctctgc tgaaagtcca actgctggtt tattgttacg gagaggtttt cgacgaggaa 

ttggcccggg aagctcaggc tatcatcgat acgtggcagg acagagagtt gtcggaagac 

gagcgtgagg tgatggaacg cctgaaggat gtgcaagaga atccgtatcc ttggagtgag 



180 
240 
300 



420 
480 
540 
600 
660 



780 

840 

900 

960 

1020 

1080 



1800 
1806 



60 



gtggagtga 



360 
420 
480 
489 



<210> 597 
<211> 411 



236 



<212> DNA 

<213> B. fragilis 

<400> 597 

gaagaaatga aaggctattg gaagatttta ctgatactga tgctcgctgt cggattcgcg 60 

tcttgcgagg acgatcaggg agagattgaa tatgtcatta ccgggcgggc atggaccggc 12 0 

gatgtgggga tgaatgccca taatggtgaa cccctgttca gtacctttga gttcgggaac 180 

gacggttttg gagtggagac ccagttctat gcttcagacg gtcttttgta tgatcagttt 2 40 

cgctttcagt ggtattggga agattcttat aatcgtaatt tagtattgaa ttacggtaag 300 

aacggtatct cttatatgga cgacgtaagg atatacggag atcggataac cggtgccttt 3 60 

tatctttcgg acgatgcccg gggatttaac tttgaattaa ggatggaata a 411 

<210> 598 
<211> 3981 
<212> DNA 
<213> B. fragilis 



60 

120 

180 



<400> 598 

gatatatcga acgaaaatgc agaaagaaaa caagaagata acatgaaagt attgacctta 
tttcgtcata aaagaacact gtacatagcc ggaagcgtat tgcttctggc tattgccttt 
actatcggct accgctattg gatggccccg acacggatcc tgattgtcaa tccgctaccg 

gcacaagctg ccgacatagt attgaacaac gatagccgga atatagaagt tacttgcata 240 

caaaccgaaa atttggagtc ctttaagggc tatgatgccg tagttctcta tggacgcagc 3 00 

ctcaacctga acgatcgaca aatgaaggag gcggaacgtg ccgcatcggc cggtattcca 3 60 

cttttcacga tttcactgcg taacttcaat acaattatca acaggaatat cacccctgag 420 

caggaagcca tgcttatgca atatttcggg gatgcctgcc gacagaatta ccggaacgga 480 

ttacgttatc tccgacacat tgccacaccg acacgctgga acattgaaac ttttgatgcc 540 

cctcttcgcc tacccaacaa tctattttat catcaagaat atggaaaata cttcgagact 600 

cagaaagccc ttgaacaata cctgcgtcaa aaaggtattt tccatgaaaa cggacctaaa 660 

atcgctttca tctccggagt cagttttcca atggaaggta acagagcaca tgtagacaca 720 
ttaatatcca aaatgacaca agccggattt aatgtttatc ccatagcagg aaaggaaaag 
cgggaagaga tgctacgttc tctacatccg gatgcattgg tttaccttcc catgggaaga 
cttggagatg attcgctgat taactggctg cataccgaaa acatccccat tttcaatcct 
ttccccctta ttcagtcacg ggaagagtgg cttgatccga tgaaacccgt cagtggcgga 

acccttacag ctcgtgtcct cgtccccgaa atagacggag gaatgacacc tttgttaatt 102 0 

gctacacaga atttacacaa aagcggatat tatctgcacg aaccggaaat ggaaagagtg 1080 

gataacttca tcagccatgt acacaaatat ctggatttac gtactaaacc caactcggat 1140 

aaacgtatcg ccatctgtta cttcaagaca ccgggcaaag atgcattatt ggccagtgga 1200 

atggaagtga ttccgtcact ttacaacttt ctgaaaaggc tacgcaccga aggttatgat 12 60 

gtcagcgggc ttcctgctac tgtcgaggag ttcggcaaac aaatctaccg ggatggagct 13 2 0 

gtaatgggtt catacgctac cggagctcaa gaaaagtttc tacagacagc ccatccggtt 13 8 0 

tggctgacta aaacacagta tgaaaagtgg gtacatgaag taatcgaacc ggataaatac 1440 

aaagaagtta ctgaacgtta cggagatgct ccgggccatt tactgaccgg aacaaaccct 1500 

caaggagaag cacaattagc cattgcctgc ctccgcttcg gcaacatcct gcttttccct 1560 

cagccacgtc ctgcattggg ggacgatgat ttcaaacttg ttcatggcat gccggtcgca 162 0 

ccgccacaca gttatctggc accttaccta tatgtacaaa aagggtttca ggcagacgcg 1680 

ttaattcact tcggcacgca tggcaacctt gaatatactc cagggaaaaa tgtagccctt 1740 

tctcataatg attgggcaga tgctttggta ggcgacttac ctcacttcta ttattatact 1800 

accggtaacg taggtgaagg tatcattgcc aaacgtcgca ctcatgctgt gcttgtcacc 1860 

cacttgactc ctccctatgt ggaaagcgga atgcgtcaac gatacacttc tttactggaa 192 0 

gacattcaca aaatactttc cgaagacata gagaaaaacc ggactttggg aatccgcata 1980 
aaaaaagagg tcataaagtt ggggctacat cgtgacctca aattagattc tgtatccagt 
cgtccttata ccgccgaaga actggaacgt attgatctat ttgccgaaga gatagccaat 

gaaaaaacga ttggagctta ttataccctc ggtgaaacct attctgcgag agacctgctt 2160 

accaccacac ttgcagtcag tgccgatcct ttagcctatc aaatggcgaa acgtgatcgc 2220 

gataaaggaa aaattacgac cgaacagtta caagattttg gctacatcac ccatcactat 2280 
ttacccatag ccaaacaacg gttaatcccc ttgttacaaa atccacctaa ggataccaca 2340 

2400 



780 
840 
900 
960 



2040 
2100 



gggatcgccc ccgaattgca agaggcactc cgttatcatg cgcttttagt ttcatccacc 
ggtaacgaat tgaacgccat gctacgcgga ttaaaaggtg gcacagtatt tccggctccc 2460 



237 



2940 
3000 
3060 



ggtggagatc cggtactcaa tccgaatgtc ctgccgacgg gacggaacat gtatagtatc 252 0 

aatgtagaaa caactccggg catattgtca tgggaagaag gcaaacgatt ggcagaagcc 2580 

acactgaaag cctatcgtga gaatcacagc ggaaagtatc cacgaaaagt aagctactct 2 640 

ttttgggccg gtgaatttat cacgaccgaa ggggctacgc tggcacaagt attctggatg 2700 

ttaggcgtag aacctgtacg cgacaaaatg ggacgtgtgg tcgatctacg cttagtgcct 2760 

tcctcagagt taggccggcc cagagtcaac gtcgtcgtac aagtgtcggg acaactacgt 2 82 0 

gacatagcgg gttcccgact gactatgcta accgatgccg ttcgccttgt ttcggccgca 2880 
gacgacaaag cataccctaa ttatgtctct tccggtacac gcttgcagga aaaactgctg 
gtagaaaaag gagtatcacc caaaagagca cgtgagatgt cagtcatgcg tgtatttggt 
cctgtcaaca gcggatatag taccggtatg atggcatata cggaaaagag tgaccgatgg 

gatcatgaat cggagttagt agacggatat ctgaacaata tgggagccgc ctatggtgat 3120 

gaagaggact ggggaggtat gcaaaaagac ctttttgctt ccgccctttc cgaaactgat 3180 

gtagtgatac aaccccggca aagtaatacc tggggaccac tttcacttga ccatgtatac 3240 

gaatttatgg gaggtctgtc gttgacagtg aagacactga ccggtaaaga accggatgcc 3300 

ttaatggctg actatcgcaa tcgaaacaac aaacggatgc agaatatcaa cgaagcaatc 3360 

gctgtagagg cgagagctac cgtgctcaac ccaactttcg tgaaagaacg gatgaaagga 3 42 0 

ggtgccacca ccgcgcaaat gttcggtgaa atattccgta atatcttcgg atggcatgcc 3480 

acccgtccat cggcaatgga taaagagatc ttcaacgatc tctataaaat gtacattgta 3 540 

gatgaaaacc atttgggtat ccgggactat ttccaaagaa ttaatccggc ttcttatcag 3 600 

gcaatgacct cagtcatgct tgaaagtgcc cggaaaggat actggaaagc gagcgacgaa 3 660 

caattgaaag taacagcccg actacatgcg caaatcaccc gcgaagccgg tgccgcctgt 372 0 

acagaatttg tatgcgataa ccgaaagctt cagcaatttg tagaaggtca cttggacaac 3780 

aatgactctg aaagttatcg tctggttatg caagaagtcc atcaggcagg aaacgaaaaa 3 840 

ggaaaagata tcgtattgaa agaggagaaa ctcacgaaaa cggaaaaccg gaaaaagaat 3900 

gtggtaaatg gcatccttac cggcgttatt gttcttttag cattcggtgg agtaatatac 3960 

ctgctgaaac gtaaaaaata a 3 9 81 

<210> 599 
<211> 522 
<212> DNA 
<213> B.fragilis 

<400> 599 

ttagttgccg acaaagatac ttttttaata tttttgcaag aaataaaaaa atataaaatg 60 

acaaaggaag aaaggataag ccgtgctact gagcttttca agagcggcta taattgttcg 12 0 
cagtctgtag tagctgcatt tgccgatatg tatggattta ctgaagagca ggcgctgcgt 
atggcagctt cgtttggcgg aggtatcggg cgcatgcgtg aaacatgtgg cgctgcctgt 
ggcatgtttc tacttgccgg actggagaag ggggcaattg acggagccga tcgtgaggga 

aaggctgcca attatgcttt ggtgcaagag cttgcggccg aattcaagaa acgaaatggt 3 60 

tcgttgaatt gtggcgaact gcttggttta aagaagaaag caccggtgtc gtccgagccg 42 0 

gaagcccgga cagaacagta ttatgccaaa agaccttgtt cgaaaatggt agaggaggca 480 

gccagaattt gggcagaata tctcgaaaaa gagaagaaat ag 522 

<210> 600 
<211> 288 
<212> DNA 
<213> B . fragilis 

<400> 600 

gcggaaaacg agactcgaac tcgcgaccct aaccttggca aggttatgct ctaccaactg 60 

agctatttcc gcaatgtagt gcccagaaca ggactcgaac ctgcatgcct ctcgacacac 12 0 

gcacctgaaa cgtgcgcgtc taccaattcc gccacctggg cattgactaa tcagaaacct 180 

gccgttaaaa aaaatggaga ggaacagata accgacgttc ttgttgagcg gaaaacgaga 240 

ctcgaactcg cgaccctaac cttggcaagg ttatgctcta ccaactga 2 88 

<210> 601 
<211> 1812 
<212> DNA 
<213> B.fragilis 



180 
240 
300 



238 



<400> 601 

cgcatcaaag ataacgttaa taatgtatac aatgatacaa gaatagatcg tttaacaaaa 60 
cactttcttg cacaggctgt ttttaatgag aaattaaacc taaacaaatt aactatggat 12 0 
tggattgtac atcaacttag ggtacacccc gagctggcta tcttcctgac cctttttgtg 180 
ggcttttgga ttggaaaaat caaaatcgga aagttcagcc tgggagttgt aacaagcgta 2 40 
ttgctggtag gagtccttgt cggacaactc gacatcaccg tcgacggacc tatcaaatct 3 00 
gttttctttc tgctttttct tttcgccatc ggctataagg tcggtcctca gtttttccgc 360 
ggactgaaaa aggacggact ccctcaaatg gggtttgccg ccatcatgtg tgtattctgc 42 0 

480 
540 
600 



780 
840 
900 



ctgatcatcc cttggatact ggctaagatt atggggtata atgtaggtga ggctgccgga 
ttactggccg gatcgcaaac catctctgcc gtaatcggcg tggccggaga cacgattaac 
gaactgaaca tctctccgga aaccaaagaa gcatataata acattatccc ggtgtcctat 

gccgtaactt acatcttcgg tacggccggc tctgcatggg tattgggttc actcggcccg 660 

cgactgctgg ggggactcga taaagtgaaa gctgcctgca aagaactgga agccaaaatg 72 0 
ggaaataacg aagcggatca acccggattc atggcagccg cccgccccgt tactttccgt 
gcttataaaa tagccaacga gtggtttggt gacggcaaac gggtgtcgga tcttgaaagt 
tattttcagg aaaacgataa acgcctgttt gtggaacgag tgcgccaggc aggagtcatc 

gtaaaagagg ttagtccgac ttttgtactg aagaaaggcg atgaagtggt actgagcggc 960 

cggcgcgagt atgtgatcgg tgaagaggac tggatcggtc ctgaagtatt ggacccgcag 102 0 

ttgctggact tccctgccga ggtattaccc gttatggtca cccgcaaaac ggttgccgga 1080 

gaaaaagtca gcaccatccg ggccctgaaa tttatgcacg gtgtcagcat tcgccgcatc 1140 

aaacgggcag gtatcgacat accggtattg gcccagaccg tggtcgacgc cggtgacatg 12 00 

gtggaactgg tgggtaccaa acatgaagtg gatgcggcag ccaaacaact gggatatgcc 12 60 

gaccggccga ccaaccagac agacatgatc tttgtcggac tgggtatttt gataggagga 132 0 

ctgatcggcg cactcagcat tcacatggga ggagtcccca tcagcctcag cactagcgga 13 80 

ggagccttga tcggcggatt attcttcgga tggctacgca gcaaacaccc tactttcgga 1440 

cgtattcccg aacccgctct ctggatactg gacaacgtgg ggctgaacat gtttattgcc 1500 

gttgtgggca ttgctgcagg tcccagcttc gtgcaagggt tcaaggaagt gggtttaagc 1560 

ctgttcatcg taggcgcact ggccacttcc attcctctga tagcaggcat actgatggca 162 0 

aaatatatct ttaaattcca tccggcactg gtattgggat gcacagccgg cgcacgtacc 1680 

actacggctg cattaggagc catccaggaa gccgttgaaa gcgaaactcc tgctttggga 1740 

tatactgtga cgtatgctgt cgggaatact cttctgatta tctggggagt agtgatcgta 1800 

ttacttatgt ag 1812 

<210> 602 
<211> 1788 
<212> DNA 
<213> B.fragilis 

<400> 602 

tatcagatgg ataaaatcag aaatttttgc atcattgctc atattgacca tggtaaatca 60 

acattggcgg accgtttgtt ggagttcact aataccattc aggtgacaga agggcagatg 12 0 

cttgatgata tggacttgga aaaggagagg gggattacga ttaaaagtca tgccatacag 180 

atggagtaca cttataaggg ggagaagtat attctgaacc tgatcgatac tccggggcat 240 

gttgactttt catacgaagt atcccgctcg atagctgcct gcgaaggtgc gttactcatt 300 

gtggatgcgt cgcaaggagt ccaggcacag accatctcga atctttatat ggctattgag 360 
cacgatcttg aaatcattcc gatcattaac aagtgcgaca tggcaagtgc catgcccgaa 
gaggtggaag acgagatcgt agagctgctg ggatgtaagc gggatgaaat tatccgtgcg 

tccggtaaga ccggtatggg tgtggaagag atactggcag cggtcatcga gcgtatacct 540 

catcctcaag gtgatgaaag tgcgccgttg caagctttga tattcgactc cgtattcaac 600 

tcattccgtg gaatcatcgc ttattttaag ataacgaacg gagtcatccg tgctggtgat 660 

aaggttaagt tcttcaatac cgggaaagag tatgttgcag acgaaatcgg agtgttgaag 72 0 

atggaaatgg ttccacgcaa ggaactccgg acgggagatg taggctatat catttcggga 780 
attaagactt cgaaagaggt gaaagtggga gatacgatca ctcacgtagc ccgcccttgc 
gataaagcga ttgcgggatt cgaagaggtg aagccgatgg tgtttgccgg agtttatccc 
atcgaagccg aagaatttga agatctgcga gcttcacttg agaagttgca gctgaatgat 

gcctcactga cgttccaacc ggaatcatcg ttggccttag gcttcggttt ccgttgtggc 102 0 

ttcctgggat tgcttcacat ggaaattgta caggagcgtc tggatcgtga gttcgatatg 1080 

aatgtcatca ccacagttcc taacgtatct tatcatattt acgacaaaca aggtaatatg 1140 



420 
480 



840 
900 
960 



239 



gttcggcata cattgaagtt tgatgataac ggccggctgg cagagactta tgcctattat 
catgtaaatg aaagtcagcc ttaccgcacg gagaccaata atctgacctg gtcgtggata 



acggaggtgc ataaccccgg cggtatgccc gatccgacta tgatcgacca tatagaagag 1200 

ccttatatca aagcttctat tattacaacg accgattata tcggacctat catgacgctt 1260 

tgtctcggta agcggggcga attgttgaag caggaatata tctcgggaaa ccgcgtcgag 132 0 

ttgttctata atatgccgtt gggtgaaatt gtgatcgact tctacgacag actgaagagt 13 80 

atttcgaaag gttatgcttc gttcgattat catccggatg gtttccgtcc gtccaaattg 1440 

gtgaaactgg atattttgtt aaacggtgaa tcggttgatg cgctttctac cctgactcac 1500 

ttcgataatg cttacgatat ggggcgtcgg atgtgtgaga agttgaaaga actcattccg 1560 

agacaacagt ttgaaatagc tattcaggcc gctatcggtg ctaagattat agctcgtgaa 162 0 

acgatcaaag cggtgcgtaa agacgttacg gcaaaatgtt acggaggtga tatcagccgt 1680 

aaacgtaagc tgcttgagaa gcagaaaaaa ggaaaaaaac gtatgaagca gatcggtaat 1740 

gtggaagtgc cgcagaaggc attccttgcc gtgcttaaac tggattag 1788 

<210> 603 
<211> 717 
<212> DNA 
<213> B. fragilis 

<400> 603 

aataaaaggg aatttatgga aagatacagc agacaaacca tgcttccgga aataggagaa 60 

gcaggacagc taaagctaaa agctgccaaa gtactgattg taggcgtggg aggactcggt 120 

tctcccatcg ccctctatct ggccggcgcg ggagtgggta ccatcgggtt ggcagatgac 180 

gacgaagtga gcctcagcaa tctgcagagg cagatactct acacggagga ggaagtgggc 

gacctgaagg ctatctgtgc ctccatgcgg atcagcgccc tcaacaggga gataaaagtg 

aatgcctgtc cgggaaggct aagtaaagaa aatgcacgtg atctgatagg ccagtatgac 3 60 

atcatcgtgg acggttgcga taactttgca acccggtatc tgctcagcga tgtctgttcg 

gagctcggga aaccgtatgt atacggtgct atctgcggat ttgaaggaca ggtgtccgtc 

ttcaactacg gagaaggaac tcaacggaaa acttatcgtg acctctaccc ggacgaagaa 540 

ggaatgttac acatgcctcc tcctcccaag ggggtggtcg gagtgacacc ggcagtaacg 

ggcagtgtgg aagcatgcga agttctcaaa atcatttgtg gattcggaga ggtcctggca 



240 
300 



420 
480 



600 
660 



60 
120 



ggcaaactat ggacaattga cttgcggaca ttgcaatcta acatattttc actataa 717 

<210> 604 
<211> 447 
<212> DNA 
<213> B. fragilis 

<400> 604 

caaatgacac ttatgaagac attgaatttt atgaaaacgc tattcttatt ggtagctata 
gtaggcctaa gctcttgtgg tgacaagtat tattcagatg attatctacg aaatagcaat 
gcaaagctct gtggcaaaac ctgggtaaat gattcggaga agaatgatgt agacgagtgg 180 

240 
300 

gacgatacga tggaaggtat tgtttttgac tatggagtga acggggtgac ttatttcgat 360 
aacgtgtggg tacgtgagca taatctgtcc gggaagctga acggaaaggt agttgtattt 42 0 
gtcgattcaa aatataacag aaactaa 447 

<210> 605 
<211> 1779 
<212> DNA 
<213> B. fragilis 

<400> 605 

atgccggttt ggagtatact atcacttatt ataaaaaaca acatgaaagt atctgactat 60 

ataatatcgt atatcgagtc ccggggagta catgtcatat tcggatatat aggtggaatg 12 0 

atcacccatc tggtcgattc tgtttctcag aatccgaata tgcaatttat tcaaacttac 

cacgaacaga ccgctgctat cgctgcagaa ggctttgcga aagaatccgg actttttgga 

gttgctattt cgaccagtgg tcctggagct actaatatga tgacgggtat tgctgacgca 

tattttgact ctattccggt tctttatata acgggtcagg tgaatacata tgaatacaaa 360 

tatgataagc ctgtccgtca gcaaggtttt caggagacgg atattgtaag tatggttaag 42 0 



180 
240 
300 



480 



600 
660 



780 
840 
900 
960 



240 

tccgtcacta aatatgccaa attgatagat aaggctgaag atattaaata tgaactggat 

aaagccttat atattgcttt gtcgggtaga aaagggcctg tactgctgga tctgccaatg 540 
gatatccaac gggaggaaat taatcaggaa acattgatcg gatattccgg tgagagtatt 
ttaaataatc ctttgatagc ctgggaggaa atcaggttat taatggagtc gtcccatcgt 

cccatgttgc ttttaggggc aggatgttgc aattcggata tggttttgct gaatgatttc 720 
ataagacggc accatttccc ggttattact tctttaatgg gtagaggggc tattgatgaa 
acatacgata attacattgg gatgataggc agttatggta accgttgtgc taacatggga 
gttgccaatg ccgatttgtt gattgcatta ggaaccagat tggatactcg acagaccggt 
gcccggttgg atcaattttt atcaaatggg cacatcattc atgttgatat tgatgacaac 

gaactggaat atcatcgttt attgaatcgt aaaaaagtga attgtaccat tgattgcttt 102 0 

ctacagaagg aaaaagaaat gccgatttct ttaggggaca tttcagagtg gaattttttc 1080 

ctgcatgggc tcaagcaacg atatggtcag gatgcagaaa tagagcgttt tgttgaaaac 1140 

aaatctccat atcgcttcat gcagtatttt gattctttga ctcaaaccga cgatgttata 1200 

tgtgcggata taggtcagaa tcaaatgtgg gcggctcaaa ccttacggtt aaaatccggg 12 60 

caaaaatttg taacaagtgg cggacttgcc ccaatgggct tttcattacc ggtagccatc 132 0 

gggtgttcgt ttgccaatcc aaataaaaaa gttttttcta taaatggtga tggaggtttt 13 80 

catatggcta tccagtcttt gatgcttatt tctcaatata atcttcctat taaggtaata 1440 

atattgaata atgcttcttt aggtatgatt actcaatttc aacatttgta ttttgatgat 1500 

cgaatgtgtg gaactacttt gaatggaggc tacagagtgc cggatattaa atctctctct 1560 

acggcttatg gcttacctta ttttagattg actgttgatc ggttggatga tcctgatttg 1620 

cgggaagaga tgcaggcagc ccacaactgt attattgaat gtgtggtaga aggcttgact 1680 

agtgtttctc cgaaattgga atatgataag cctatttcca agcctttacc tttattgcca 1740 

gaagaagaat ataaggagaa tatgctatta gaggcttga 1779 

<210> 606 
<211> 789 
<212> DNA 
<213> B.fragilis 

<400> 606 

cgggcagtgt ggaagcatgc gaagttctca aaatcatttg tggattcgga gaggtcctgg 60 

caggcaaact atggacaatt gacttgcgga cattgcaatc taacatattt tcactataaa 12 0 

ggttggtttc tgattaagtt aattagtaac tttgctaaac ttaacagttt aacaaaagaa 

atgaaactta tcgtagtaac gacgcctact ttctttgtag aagaagataa gattatcact 

gctctttttg aagagggact ggatattctg catctcagaa aaccggaaac accggctatg 

tattcagagc gcctgttgac actgattccg gagaaatacc acaaacggat tgtcacgcac 3 60 

gaacacttct atctgaaaga agaattcaac ctgatgggaa ttcatctgaa tgcacgaaat 42 0 

cccaaagaac cgcatgacta ttcgggacat atcagttgtt cgtgtcactc ggtggaggaa 

gtgaagaata aaaagcactt ttatgattat gtattcatga gcccggttta tgacagtatc 

tcgaaagagg gatataactc accctataca gccgaagaac tgcgcctggc agccaaagac 
aagatcattg acaacaaggt gatggctttg ggaggtatta cgccggataa catactggaa 



180 
240 
300 



480 
540 
600 
660 



gtgaaagatt tcggattcgg aggtgcagta gttttaggag atttatgggg caaattcgac 720 
gcttgctccg accaggatta cctggcagtg atagaacact tcaagaagct gaaaagaatg 
gcggactga 



780 
789 



<210> 607 
<211> 330 
<212> DNA 
<213> B.fragilis 

<400> 607 

tccatggcac gaaacaaact tctacacaac cagaatgata ctgacccgat gggaacagta 
gccaacttat tcgatgtagc catggttttt gctgtggcat tgatggtagc actcgtcagc 12 0 

180 
240 



60 



cgattcaata tgaccgaaat tttctccaaa gaagattata cgatggtaaa gaatcccgga 

caagagaaca tggagattat cacaaaagaa ggtaaagaga ttaaacgata tactccatcc 

gaacagaaag aatcatccgg taaacgagga aagaaagtag gtgtagccta tgaactcgag 300 
aatggaaaga tcatttatgt ccctgaataa 330 



<210> 608 



241 



<211> 924 
<212> DNA 
<213> B.fragilis 



<400> 608 

tttcgatcta gtaatttaat tttaataagt atattgatga aaggtattgt cttggccggt 60 

ggttcgggca ctcgcttata tccgatcacc aaaggagtca gtaagcagtt gcttccgata 120 

tttgataagc cgatgatcta ttatcctatc tctgtactca tgttggcggg gattcgtgaa 180 

atattgatta tttccactcc atacgattta cccggctttc aacgtttgct gggtgatggc 240 

tctgactttg gagtacgttt tgagtacgcc gaacaacctt ctcccgacgg tttggcacag 3 00 

gcatttatca ttggtgagaa gtttataggt ggtgattctg tatgtctggt tcttggcgat 3 60 

aatatctttt atggacaaag ttttacccgt atgctgcgtg aagcagtcca tacagccaaa 42 0 

tcagagaaca aagcaactgt ttttggttat tgggtcagcg atcccgaacg ttatggggta 480 

gctgagtttg acaaggctgg gaatgttctc agcatcgaag agaaacctac tgttcctaag 540 

tccaattatg ccgttgtggg tctttatttc tatcctaata aagtggtgga agtagccaag 600 

agtattcagc cttcccctcg tggagaattg gaaatcacga cggtcaatca acggttcctg 660 

tccgatcggg aactgaaggt ccagcttttg gggcgcggct ttgcctggtt ggatacaggt 72 0 

actcatgatt ctttgtccga agcaagtaca tttatcgagg ttattgaaaa acgtcagggt 780 

ttgaaagtgg cctgtttgga aggcatagcc ctgaggcaag gctggatttc tcctgaagag 840 

atgaaagcat tggcaggtcc gatgctgaag aatcaatatg gacaatatct gttgaaagtt 9 00 

atcgatgaat tatccataaa gtag 924 



<210> 609 
<211> 1437 
<212> DNA 
<213> B.fragilis 



<400> 609 

tataattcaa ggagtactgt ggctagaaag aaaaaagaac ttcctctgct ggagaaggta 60 
acaataacgg atgtggctgc cgaaggaaaa gccatcgcaa aagtagatga cctggtcgtt 120 

180 
240 
300 



tttgtacctt acgtagtgcc gggcgacgtg gtagatttgc aggtaaaaag aaaaaagaat 
aaatacgccg aagctgaagc ggtgaagttt cacgaactct caccggtacg tgccgttcct 
ttttgccagc actatggcgt atgcggcggg tgtaaatggc aggtattgcc ctacgcagaa 
caaatcaaat acaaacagaa acaggtggaa gacaacctcc gccgtatcgg aaagatcgaa 3 60 
ttgccggaaa tctctcctat cttgggatct gctaaaacag agttttaccg gaacaaactg 420 
gagttcacct tctcgaacaa acgctggctg acagcggaag aagtgaaaca ggacgtcaaa 
tatgaccaga tgaacgcagt aggattccac attccgggag cattcgacaa ggtgctcgcc 
atcgaaaagt gctggttgca ggatgatatc tctaaccgta tccgcaatac gatccgcgat 
tacgcctacg agcacaacta ctctttcatc aatctccgtt cgcaggaagg aatgctccgc 
aacatgattg tacgtacctc gagtaccggc gaactgatgg tgattctgat ttgcaagata 72 0 

780 
840 
900 
960 



480 
540 
600 
660 



acggaagagc atgaaatgga tctcttcaag cagttattgc aatatgttgc cgaccaattc 
ccggaaataa cctctctcct atacattatt aataataaat gtaacgacac gatcaatgac 
ctcgatgtac acgtattccg tggcaatgat cacatcttcg aggagatgga gggacttcgt 
ttcaaggtgg gaccgaaatc gttctatcag accaactcgg aacaggcata caatctttat 

aaggtggcac gcgactttgc cggactgaca ggtgacgaat tggtatatga cctctatacg 102 0 

ggtaccggaa ccatcgccaa ctttgtgtca cgccaggcac aaaaagtgat cggcatcgaa 1080 

tatgttcccg aagccataga agatgcaaaa gtgaatgccg agatcaatgg aatagagaac 1140 

accctgttct ttgccggaga catgaaagat atcctgacac aggatttcat caatcagtac 1200 

gggcgtccgg atgtaatcat caccgaccct ccccgggcgg gaatgcatca ggatgtggta 12 60 

gacgtaatct tatttgccga acccaaacgg atcgtatatg ttagttgtaa tccggctaca 1320 

caagcgcgtg acctccagtt gctggatgtc aaatatcgtg tgaaagcagt gcaaccggta 13 80 

gatatgttcc cccacaccca tcacgtggaa aacgtagtgc tgcttgaact taaataa 1437 



<210> 610 
<211> 507 
<212> DNA 
<213> B. f ragilis 



<400> 610 



242 



tataatccaa 
aagtttaata 
ctttgcacgg 
agcgaagcaa 
gtcggctctg 
gcaggactga 
tttgctgccg 
ctcgaacgtg 
caatggagca 



gcagtaatac 
gaaaggaaaa 
caatagcatt 
agactgaaga 
ccggatacgt 
aaatcagcga 
actctgtgca 
cggacaacca 
tttctaaaca 



aaggtttatt 
aacttttatt 
ttcggcatgc 
agcagttata 
atggagtgaa 
aacccaaaaa 
agcagaactc 
atggaaaaac 
gaaataa 



gcatttatag 
atgaagaaaa 
aaatccaata 
ccgggaagtg 
gtgaagaaag 
gacaacgcta 
tacacacccg 
gatacgatca 



aatataatta 
cttatttatg 
aagccggaca 
ataaagacga 
attgtattcg 
cttacgccac 
agtctgaagg 
gcgtcagttg 



tcaaaacatg 
gacggccatg 
ggacaccgca 
acacggttgc 
tcccttcgaa 
ttacattgta 
aagtatcctg 
caagaacggt 



60 

120 

180 

240 

300 

360 

420 

480 

507 



<210> 611 
<211> 945 
<212> DNA 
<213> B.fragilis 



<400> 611 

ccagataaaa 

ggatcgcata 

tcaatgtctt 

gataacccta 

gcctggacgg 

gactttatga 

ttgggtagtc 

tttcctgtca 

tgtgatttac 

gagtcaaacc 

atggatctga 

gccgtaatga 

actgcgatcg 

gagttgcgtt 

gtgtcgaaat 

gagtatacta 



gattatttat 
ttgcagaagc 
cgttgaacaa 
tctggatatc 
gggtcctgtc 
attctttgct 
aggccgaata 
atagctatgg 
gtgggattga 
aatggctgat 
ctttagggca 
aagtatgttc 
aattaagagt 
ttggtgcgtt 
ttgtaaagtc 
tcacttatta 



tgaaatgaag 
cttgctggca 
ttgtacctct 
ccaagcttgc 
tggcaataga 
ttatatagca 
tggggatttt 
atacgtgaag 
ttggtactgg 
ccccggtctt 
acaacgctac 

agggaaaact 

acttcttgaa 
gccctatcgt 
gtttgggcac 
taaaaaacaa 



atattattga 
aacgatgtta 
ttcattgatc 
tcttttagcc 
tacgattggc 
gagaaaagta 
gatgggatag 
tccatggttt 
ttaagagttt 
ttgactaata 
gcttatttat 
ccctgcggtg 
catttgagag 
gccggtcaac 
tttgagaata 
catgaaagta 



cgggggcgac 
acctgatgat 
atgtacaagt 
ctgatattat 
cggtccaact 
acgtttctaa 
tctctgaaaa 
cccggatggt 
tttccgtata 
tgcttgacaa 
atgtgaagga 
tctataattt 
atagattaaa 
caatgttggt 
caccgttaaa 
tctga 



tggtttttta 
aactaaacgg 
cataaactcg 
tattcattca 
gtctaatatt 
atttatagct 
tgccggcttg 
tggttctttc 
cggagagcgg 
tatggctggt 
ctttgccaat 
atcatcttct 
tccggctttt 
gcaaggtgat 
tgccggtttg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 . 

720 

780 

840 

900 

945 



<210> 612 
<211> 261 
<212> DNA 
<213> B.fragilis 



<400> 612 

cgaaggcgaa gagtaatgaa atacgtttat aaaacccagg gtacttgcag cacaaacatc 

gaattggagg tggagaataa tattgtgaag gaagtagctt tttggggtgg atgtaacggt 

aatctgcaag gaatttcacg tctggtgacc ggaatgcctg tgtcggatgt cattacgaag 

cttgaaggga tccggtgtgg ggctcgctct acttcatgtc ccgaccaact atgccgtgct 
ttgcacgaga tgggtttcta a 



60 

120 

180 

240 

261 



<210> 613 
<211> 618 
<212> DNA 
<213> B.fragilis 



<400> 613 

cctatgttga 

gcgcgcatgg 

ccggaagagg 

atcctgattc 

ctgggcaaaa 

attggcggaa 

tatctgggca 

ggactcgaag 



gcctacaatt 
ctctcgaagg 
tggaggcagt 
tcgacgacca 
aagacatgcc 
cggccaatac 
taggcccttt 
gctacacctc 



tatcacccat 
gggctgtaaa 
ggcactgcaa 
cgtagagctg 
catcggcgaa 
ctttgaagac 
ccggtttacc 
catactggca 



caaacagaga 
tggatccaac 
ctgaaacctc 
gccaaaaagc 
gcccggcaga 
gtaaagctgc 
accactaaga 
cagatgaacg 



attactccta 
tgcgcatgaa 
tctgtaaagc 
tggaagtgga 
tgctgggcga 
attatgccgc 
aaaatctgag 
aggccggtat 



tctggaatcg 
agaggcatcg 
taaagaagct 
cggggtgcac 
agcattcatc 
cggagccgac 
cccggtactg 
ccggataccg 



60 

120 

180 

240 

300 

360 

420 

480 



243 



gtagtagcca tcggagggat cgtagcggaa gacattccgg ccattatgga aacgggggtg 540 
aacggcatcg ccctctccgg agcaatcctg caagcaccgg acccggtaga agaaacaaaa 600 
agaattctaa acatataa 618 



<210> 614 
<211> 894 
<212> DNA 
<213> B.fragilis 



<400> 614 

tattgggttg gctattgtat aataatagag ctgaatttca aaaaaaatac gtattatatg 60 

agtaatcaaa gagaagctgg aataacagct tttttacctg tatataatga agaaaaacgc 12 0 

cttaaaaatg tattggagtg ttttcagtgg tgtgatgaaa tcttgttatt agataaagga 180 

agtgttgacg atacggtaaa aatagcaaaa caatacccta atgtcacggt tttaacaaaa 240 

gagcataccg aaaaatatga ttccaatgaa attgaatatt ttattaagaa ttgtacaaca 3 00 

gagtggtgta tgattgttac agcaagtgat ttgattcatc ccaaattggc gcggaacatg 3 60 

aaagaactta taaataactg taatttcgaa tatgatattg tctctgtacc gtataaacca 420 

tattttctag gttgttgtga gaagtattct ccttggtata cagagcacat gaataagatt 480 

tttcgggtaa gtgtattgaa tcttaatctc aactctgtac atgctgtttt aactcctaca 540 

tcttcccgct tgtatcagat tcctttcact gatcctaaag ttgcttatta tcatttgact 600 

catcaaagtg ctgagagtat tatagaaaga aatgtaagat attggaaagg agaggcttct 660 

tcttcggaac ccttatccct aataaataaa atcataataa ggactgtgct tcgttttgtt 72 0 

tttttgcgag gtgggttgtt caaaggacgg caagctttag ccttatttta ttctttctta 780 

agctattata tgatgactta tgtatgtaaa tgggaatacc agaatggaga agtagaaagt 840 

atatacactg ctttgcaaaa ggagattgtt gatctttgga gtaaacctaa ataa 894 



<210> 615 
<211> 360 
<212> DNA 
<213> B.fragilis 



<400> 615 

gcgaacaatg ggcattgggc gttgagcttc agttcaacca tgaaatacaa ggagcatatc 6 0 

tcgacaaaca catttgccat tgctccttac gcacgttttt cttattatga aaacaagatc 120 

gtacgtctgt ttgtcgatgg cggatttggt tttgctacca ctaaggtgaa agatggcggt 180 

gatgctgtaa acggtttcga gatcggtctg aaaccgggta tcgctatcaa gttgaaccag 240 

catttcagcc tggttgccaa atgtggcttc ctgggatata aggacgatta tatgggtaac 300 

ggttttggct ttagcgcaag cagtgaagac cttacattcg gattccatta cgaattttaa 3 60 



<210> 616 
<211> 291 
<212> DNA 
<213> B.fragilis 



<400> 616 

gtgtgtttgg ggataaatga aagtaaatac aaagaggggt gcgtgcggga aaaagaaaaa 60 

acggaagaaa tacacaatcc cttcggcagt actaactgca tcaggttctt agggtataat 12 0 

ctcagccacc gggtggcacc cctttgtctt tacggaggca aagtaagtga aaaagaaaat 18 0 

aagaagaaag taaatcgggg attatttcat cgcggagtca tgcggaattt gtttttgccc 240 

tttctcaaaa ctcaccataa aaaacacaga gtaacacaga ggcgatttta g 291 



<210> 617 
<211> 357 
<212> DNA 
<213> B.fragilis 



<400> 617 

agacaaaggg gtgccacccg gtggctgaga ttatacccta agaacctgat gcagttagta 
ctgccgaagg gattgtgtat ttcttccgtt ttttcttttt cccgcacgca cccctctttg 



60 
120 



244 



tatttacttt catttatccc caaacacacc tatacgaata 
aaagaagtgg aaacagccgc aagcacactg gcccagcttg 
gaaaacggtg tagccatcgc cgtcaacaac cgaatgatac 
ttcgggctgc aagagaatga taacctgatt gtgattaaag 



tgaaagtaca 
ccacccaact 
cgcgtccgca 
cagcctgcgg 



agtgaacaac 
gcaacttccc 
atgggacgga 
aggatag 



180 
240 
300 
357 



<210> 618 
<211> 2730 
<212> DNA 
<213> B. fragilis 

<400> 618 

aatcgtatta tggacaaaaa aagagtttat acctttggta atggactggc agaaggaaag 60 

gccggtatgc gaaacttact tgggggcaaa ggtgcgaacc ttgccgaaat gaatctgatc 12 0 

ggtgtccccg tacctccggg cttcacaatt acaactgacg tttgtaccga atattacgag 180 

atgggacagg aaaaggtcgt atctctcctg aaagaagaag tcgaaaaagc tattgcaaat 240 

attgagaacc tgatgcgttc aaaatttggt gacgtagaga atccgttgct ggtttctgtg 300 

cgttcgggtg cacgtgcatc catgccgggt atgatggata cgatcctgaa cctgggtttg 3 60 

aatgatgaag tggttgaagg tctgacccgt aagaccggaa acgctcgttt tgcatgggat 42 0 

tcttaccgcc gttttgtaca gatgtacggt gacgtagtat tgggtatgaa acctgttaac 480 

aaagaagacc aggatccgtt tgaggcgatc attgaagaag tgaaacatgc caaaggcgtg 540 

aagctggaca acgagctcga ggtggaagat ctcaaggaac tcgtgaagaa atttaaagct 600 

gccgtaaagg cacaaacagg caaggacttc ccgacttgtg catacgaaca gctttgggga 660 

gctatctgcg ctgtgttcaa ttcatggatg aacgaacgtg ccatcctgta ccgtaagatg 72 0 

gaaggaattc ccgatgaatg gggtactgcc gtaagtgttc aggcaatggt gttcggtaac 7 80 

atgggcgata cttccgcaac aggtgtatgc ttctcccgtg atgccgctac gggcgaggac 840 

ctcttcaatg gtgaatatct gatcaatgca caaggtgaag acgtggtggc gggtatccgt 9 00 

actccgcagc agatcactaa gatcggttcc cagcgttggg ctcagcttgc cggtgtgagc 960 

gaagaggaac gtgcatcaaa atatccttct atggaagagg ctatgccgga gatctacaag 102 0 

cagttggatg aattgcagac caagcttgaa aatcactaca aagacatgca ggatatggag 1080 

ttcaccgttc aggaaggcaa actttggttc cttcagacac gtaacggtaa acgtaccggt 1140 

gctgccatgg taaaaatcgc catggatctg ttccgccagg gcatgattga cgaaaagacc 12 00 

gcgctgatgc gtgtagaacc caataaactg gatgaattac ttcacccggt attcgataag 12 60 

tctgctttga aacaggctaa agtgctgact cgcggtttgc cggcttctcc gggtgctgct 1320 

accggtcaga tcgtattctt tgctgacgat gcagccgaat ggcatgctgc cggaaaacgc 1380 

gttgtgatgg ttcgtatcga gacttcaccc gaagatttgg ccggtatggc agttgccgaa 1440 

ggtatcctga ccgcccgtgg aggtatgacc tcacatgcag ccgtggttgc ccgtggtatg 1500 

ggtaaatgct gtgtttcggg agccggtgca ttgaatatcg actacaaggc ccgtacagtg 1560 

gaagtggatg gtgtattgct gaaagaggga gatttcatct ccttgaacgg tagtaccggt 162 0 

gaagtttatc agggtaaagt agaaacgaaa gcagccgaac tgtcaggcga ctttgccgat 1680 

ctgatgaagt tggctgataa atatacccgt ctgcaggttc gcaccaatgc cgacactccg 1740 

catgatgccg aagttgcccg taatttcggt gcggtaggta tcggtctttg ccgtacggaa 1800 

cacatgttct tcgaaggtga aaagatcaaa gccatgcgtg agatgattct ggcagaaaat 1860 

gctgagggac gccgcaaagc tcttgccaag atcttgccat atcagcaagc cgacttcaag 192 0 

ggaatcttca aggcaatggc cggttgtccg gtgactgtac gtctgctcga tcctcctttg 1980 

catgaatttg ttcctcacga tctgaaagga cagcaggaga tggccgatac aatgggagta 2 040 

agcctgcaat atatccagca gcgtgtcgaa tcgctctgcg aacacaaccc gatgttaggt 2100 

caccgtggtt gccgtttggg aaatacgtat cccgaaatca cacagatgca gactcgtgcc 2160 

attctgggtg ccgctcttga actgaagaaa gaaggaatag agacacatcc cgaaattatg 222 0 

gtgccgctga caggtattct ttacgagttc cagcagcagg aaagtgtgat tcgtgccgaa 2280 

gcagacaagc tctttgaaga ggtgggagac cgcatcgact tcaaagtcgg aaccatgatc 2 340 

gaaattcccc gtgcagctct gactgccgac cgtatcgctt cgtctgccga gttcttctcg 2400 

ttcggaacca acgacttgac tcagatgact ttcggttact ctcgtgacga tatagcttct 2460 

ttccttccgg tttatctgga gaagaagatt ctgaaagtag acccgttcca ggtactcgac 2 52 0 

caaaatggtg taggtcagtt ggtacgtatg gcaaccgaaa aaggccgtgc catccgtccg 2580 

gacctgaagt gcggtatctg tggtgaacat ggcggtgagc cttcatctgt taagttctgc 2 640 

cataaagtag gtttgaatta cgttagctgt tctccgttcc gtgtgcctat cgcacgtctg 2700 

gcagcggcgc aggcagccat cgaagaataa 2 73 0 



<210> 619 



245 



<211> 1419 

<212> DNA 

<213> B.fragilis 



<400> 619 

attatcatta tgaaacaatc caaaattatc gtagccggca ttgggccggg aagcgaacaa 60 

gatatcactc ctgccgtgct cgccgctgta cgcgaagcag atgtagtggt gggatataaa 12 0 

tattatttcc gttttatacg tgattttgtc cgtccggacg ccgagtgtat cgacaccggg 180 

atgaaacgcg aacgtgcccg cgccgaacag gctttcgaat atgccgaaca aggaaagacc 240 

gtttgtgtca ttagctccgg agatgccggt atctatggca tgacaccctt gatttacgaa 3 00 

atgaaacgcg aacgtcagag taacgtagag atcattgcct taccgggaat cagcgctttc 3 60 

cagaaagcgg cctcactact tggtgcaccg atcgggcatg acttctgtgt catctctcta 42 0 

tcagacctga tgacaccatg ggaacgtatc gagcgccgta tcctcgctgc agcccaggcc 480 

gactttgtga cggctgtata caatcccaag agtgatgggc gctattggca aatttatcgt 540 

ctgcgcgaaa tctttctgcg cgaaggacgc tcaccggaaa cccctgtagg ctatgttcgg 600 

caggctggtc gtgaagaaca ggaaatacac atcaccactc tcgccgcatt cgatccggaa 660 

actgtggata tgtttacggt cgttctgatt ggtaactcac aaacatatac atttaaccaa 72 0 
aacataatta ctccacgggg atactatcgc gaaacacgca gtgaagcaac cggtatcgga 
caagacatca tgatacgcag tttccgcacc atcgagacgg aattgaagaa ccgtgatatt 

ccactcgacc ggaaatgggc cttattgcat gctatccata cgacagccga cttcgagatg 9 00 

gaacgtttgc tttacactga tcccaatgct gtggcctctc tctatgacgc catccgcaca 960 

ggaaatctgc ggactattgt aacagatgta acgatggcag cttccggcat ccgtaaaggt 102 0 

gcattgcagc gtctgggtgt agaagtgaaa tgttacttga acgatgaaag agtagccgaa 1080 

atggcaactt caaaggggat cacccgtaca caagcgggca tccgcctggc tgtggaagaa 1140 

catcccgatg cactctttgt ctttggtaat gcccccacag cactgatgga actttgtgat 1200 

ctgatccgga aagagaaagc gcaaccggca ggtatcgtag ccgctcccgt agggtttgtc 12 60 

catgtagaag agtcaaaaca catgacaaag cccttcaccc gcatccccaa actgattgtg 1320 

gaaggacgca agggcggaag taatctggct gccaccctgg taaatgccat tctttgctat 13 80 

cccgatgcgg aacaactcag acccggaaga gacgtatag 1419 



780 
840 



<210> 620 
<211> 591 
<212> DNA 
<213> B. fragilis 



<400> 620 

agtagaattt ttattccaat gaatataata aaaacatcaa ttgaaggtct tgttatcctt 60 

gagccccgtc tgtttcagga tgaccgtggc tactttttcg aatccttcaa tcagggggag 12 0 

ttcgaatcaa atgtatgtca aacgactttt gttcaggaca atgaatccaa atcgagctac 180 

ggtgtcattc gcggtctcca ttttcagaaa cctccttttg cccaaagcaa actggtacgg 240 

gtaatcaagg gtgcagttct tgatgtggct gtcgatatcc gcaagggttc tcccacattt 3 00 

ggaaaacatg tttcggttga attgacagaa gacaatcacc gtcagttttt tattccgcgt 3 60 

ggcttcgcac atggttttag tgtgctgagc gaagaggtca tcttccaata caagtgtgat 42 0 

aatttctatc atccggaagc tgaaggggcg attgcctgga atgatccgga tttgaatatc 480 

gactggaaga taccacaaga ccgggttata ttgagtggta aagactacac acatcctctg 540 

ttacataaca tagaattaca gtttgatata aacaatacat tatatgagta a 591 



<210> 621 
<211> 423 
<212> DNA 
<213> B. fragilis 



<400> 621 

tcaatgattt ttatggcaac aacctttgac atacaattgc cacactatcc acgtggcttc 60 

catctgatca cccgtgacat cctttctctc cttccggacc tgccggaaaa cggactgctg 12 0 

gttgtgttca tcaagcatac ctcagcaggc atcactatca acgaaaatgc cgatccggac 180 

gtgcgtcatg acttcaatac gtttttcaac aaactcgtac ctgacggtgc cccttatttc 240 

gtccacaccc ttgaaggccc ggacgatatg agcgcacaca ttaaggcttc actaatcgga 3 00 

acctcagtca gtatccccat ccggaatcac cgtctgaacc tcggaacctg gcaagggatc 360 



246 



tacttgtgtg aattccggga cgggggcgac aaacgcaaac tgagtattac cattttggag 
taa 



420 
423 



<210> 622 
<211> 471 
<212> DNA 
<213> B. fragilis 



<400> 622 

ccgggaggta 

acaacatccg 

gctttgaact 

ctcaattcgg 

gagcgtcggg 

cgtagttgta 

atggatcgtg 

gctgtctctg 



tgcttcgcta 
catctatcga 
ggtatgcctt 
aggggatcga 
ttcgtaagct 
tcgatgccat 
aatatcatcg 
cgaattatga 



ctccggtgtc 
gtcttcgatg 
gcgtatcact 
gaatttcatc 
tgttcccgca 
taaagaaagc 
tcccatcatc 
tgaatccttg 



cccaaagaac 
gagcgctctc 
tacgggcgtg 
cccatgcact 
gttcataatc 
aggagcgcca 
gttcctgatt 
ctttatttcg 



atcctgacgt 
aatctatcct 
aactggcttt 
acgaatatac 
tggtttttgt 
cgcttcctat 
cccaaatgcg 
aacccttttg 



gaacgacatg 
atcttcttcc 
gcaggagtac 
cattaaaaac 
ccgttcctcg 
ccgttacatt 
taatttcatg 



60 

120 

180 

240 

300 

360 

420 

471 



<210> 623 
<211> 1311 
<212> DNA 
<213> B. fragilis 



<400> 623 

tcgttaatca 

gtcgctatca 

attgccgaag 

gtgaaaccga 

agccggttaa 

atgatggaac 

ctgatgatta 

caactcaatc 

aaattgtcgg 

accagcaccg 

gaccgcggct 

gctgaaaaga 

acagcgggca 

atggaggata 

ttcattaccc 

ctgaatgacc 

atccggttgc 

tttgaacaag 

cagatcatca 

atatcaaaag 

caaggtatcg 

ttgcgtacat 



tcaatcacat 
tgggttttct 
cattgatgct 
tgaacaccat 
gccatgtggg 
aactgaaaaa 
atgcttcccc 
ccatggcgat 
aaatcgattc 
tccggctgaa 
tccagcatcc 
aagcatacga 
tcacttccac 
tttgcgatgt 
gctttgccga 
ttgcatttac 
aactgatatg 
tattggtaaa 
ttcgtacttc 
agactgaagc 
gtttgatttt 
acgccgacgg 



gtcagtaaag 
gatttatata 
gctcctgatg 
cggcagtggt 
gcagcaggag 
cgaacggctg 
catgggagtt 
gaagatgatg 
tccgcttgcc 
cgactcgagt 
tttctatctg 
gaaagtgatc 
attggatacg 
gatgcgcgta 
tgtggtgaag 
ctgcaagcgc 
tgacgaatct 
tattattaag 
attgcctaca 
aaaactcttc 
tattcgcgaa 
actgacaaga 



ggattcttct 
tccgaaactg 
ctgtatctta 
atggaacttc 
gccgaccgtg 
cgcttgcgag 
attatcatga 
ggggtccgtc 
ctggaattgg 
atctataaat 
atggaaggat 
cgtatgattg 
gtagagcagg 
tgtaccgagc 
attcccgagc 
tttatggaag 
ctggacgatg 
aatgcagccg 
gctatcgaag 
agtcccttct 
gttctgagcc 
ttcaggattc 



tcatacttgt 
ttgtcgtgaa 
ttttgtttta 
tcagagaaca 
tggtaaatgt 
aacagaatca 
cgctggacga 
cggaagaagc 
ccgccattcc 
gcacccactc 
tgaccgatga 
cgcatgaggt 
cattgtacga 
gttgtttttc 
cccgttttac 
ggatgtgcaa 
tgaagctgga 
agagtatcgg 
tggtggataa 
tttctaccaa 
gtcatggctg 
tatttccgtg 



tttcttcttg 
gtatctctat 
ccggaaaatc 
ggacttcagt 
tttcaatcgc 
ttttctcgac 
agaggtatct 
cgaggggagg 
gaatggagca 
ttcatttgtg 
agtgatgaaa 
gaataataca 
gtcggaaggt 
tatgagccat 
tccgaccaac 
tgaccggaac 
tgcgtctctg 
acaggacggg 
cggacccggt 
acccaacgga 
cacgttttca 
a 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1311 



<210> 624 
<211> 291 
<212> DNA 
<213> B. fragilis 



<400> 624 

gtgtatttaa 

attgcaactt 

cgtaatgatg 

agcgaacaat 

ctcgacaaac 



ttaaaactac 
tatctgttaa 
atgccaatac 
gggcattggg 
acatttgcca 



taagaagatg 
ggcacaagac 
cacttccttt 
cgttgagctt 
ttgctcctta 



aaaaagattg tattgttttt atttgttgct 60 

ctttacatgg gaggtaccgt aggtttgtgg 12 0 

aaactggctc cggagatcgg atacaacctg 180 

cagttcaacc atgaaataca aggagcatat 240 

cgcacgtttt tcttattatg a 291 
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<210> 625 
<211> 462 
<212> DNA 
<213> B. f ragilis 



<400> 625 

aatataaaaa tgatggaaaa ttacaaacaa aattacatcc ataagccata tctgttttta 60 

gcaatcttat tttctttgct tagttgccaa aaagaggtgg tatcaaaggt aactttcgaa 12 0 

agaaagttat caggaataaa accagaaacc gaatttagac ttgattctct gagaaatgat 180 

aaatggcaaa aatgctatat tattccaccg tatcaacagt acaattctgc attaaatagg 2 40 

ataaagttga gaaagcatga tttaaataaa ataaaggaaa atgcaatctc tgatggaata 3 00 

aatacatttg tgtttataaa taacgatgga tcaatatcaa tagaaacagt ttcaagatct 3 60 

atcattgata ttcaagacac attgtcagac tccatatttc ttttttatcc cacaacaata 420 

atgaaaatgg atagtaaaag aaaaattata gacataaaat aa 462 



<210> 626 
<211> 1188 
<212> DNA 
<213> B. fragilis 



<400> 626 

acaataaaaa tgagaaaagt tctgtctttt tcggcctttt tgattattgg ccttttgcta 60 

tcacaatact tgccgttatt ggcaggtgaa ggatatgcta ccgtaaaaat tgtatctaac 12 0 

attcttcttt acatctgcct gagttttatt atgattaacg tagggcgtga gtttgaagtt 180 

gataagaccc gttggcgaag ttatgccgga gactacttca ttgcgatggc tactgccgcc 240 

atgccttggt tcctgattgc tatctattat gtatttgtgc ttttgccgcc agaattctgg 300 

aacagttggg aggcttggaa agagaatctg ctgttaagcc gtttcgcagc tcctacatcg 3 60 

gccggtattc ttttcacgat gctcgcggct attggactta aatcaagttg gatttataaa 42 0 

aagattcagg tcctggcaat ttttgatgac ctcgatacca ttttgttaat gattcccctg 480 

cagataatga tgattggttt gcgctggcag ctgatcgtgg ttgtctttat tgtcttctta 540 

ttgctttcat tgggttggaa acagttggga aggtataact ggcgtcagga ctggaaagcg 600 

ataatgggct attcggtgct tgtatttgtt gctacccaag ccgtttacta ttttagcaag 660 

cagctctatg gcgaagaggg gagtattcac atcgaggtgt tgttgccggc ttttgtgctg 72 0 

ggtatgatca tgaaacacaa agaaatagat actcctgtcg agcataaagt ttcaacaggg 7 80 
gtttcgttcc tgtttatgtt cttggtaggt atgagcatgc cgcatttcat tggggtgaac 
tttgccgaga cacatgccgg aacccattcg gtgacaggtt cgcaggaaat gatgtcgtgg 
ggaatgatag cacttcacgt attgattgtt tcactgcttt caaatatcgg taagctgttt 
cctgtgttct tttaccggga taggaagttc agcgaacgcc tggcgctttc tatcggtatg 
tttacccgtg gtgaagtagg agccggagtc atctttattg ccctcggata caacttgggt 

ggtcctgcat tggttatttc agtgctgacc attgtattga atttgattct gaccggtatc 1140 

tttgtactat gggtgaagaa gttggcattg cgaagctata caacttag 1188 



840 

900 

960 

1020 

1080 



<210> 627 
<211> 936 
<212> DNA 
<213> B. fragilis 



<400> 627 

tctaaaaaaa taaatatggc aacaatatat gacgggatca actatttccc ggtgggcgta 60 

aacttcatgg aagagaacgc aatggaagtg atagaagcta aatacggaat aaagggctcg 12 0 

gcaattgtac tgaaactgct gtgtaagata tacaaggagg gat act teat ccgttgggat 180 

gaagagcagt gectgatett tgccaacaag gcgggaaggg aagtgcaggc cgctgaggtg 240 

cagggcatca ttgagatcct cttcatcaaa gggataatgg acaaaaacag ctatctggaa 300 

aaeggaatae tgacctcgga aaacatacag aaggtatgga tggaggegae aaageggaga 3 60 

aagagagagt tgteggaact cccctacctg atggtgaaaa eggaaaagga aaaggaaaac 42 0 

gataaacegg aaaaggaaag cgacaaaccg gaeaatgeat ccacacaaca ggaaattgaa 48 0 

cgacccaagc cgcttaaaga aggaaaagta gctggcagca caggagatgt ageegttage 540 

ccgggaaatg tagtacacga tgtagccgtt aacgeaaaaa atgcatgcaa ttccggacaa 600 

agtaaagtaa agaaaagtag agcaaaggaa aataaagaat tacccccctc agttcccccc 660 
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gaggggaagg aggaagaaag gaaggaggat tctgtttctc tcccgatacc gggatacgct 72 0 

780 
840 

acggtgtggc ggctgattgc caacacttgc tggagtgaca taggggcaaa aggaaggtat 90 0 
ctgatagcgg cactgaacag ggcaaaaagg aaataa 93 6 



ttcaatacaa tgacacacaa ttatccggga ctgacggata cgctccaaag attggggatc 
aacgaggtaa gcgaggtaaa tgccattctc aggctatcgg actatggcag aaagggaacg 



<210> 628 
<211> 801 
<212> DNA 
<213> B. fragilis 

<400> 628 

tatatgaaaa tatcagtgat tattccttgt tttaatcaag gaaaatattt ggctgaagca 



60 
120 



300 
360 



ttagattctg tagtaatgca gaccttttct gattgggaat gtattatcat aaatgatggt 
tcgattgata attccgaaaa tgttgcttta tcctatgtag aaaaggaccc tcgttttcat 180 
tatatatgtc aaaaaaatca aggagtatgt atagccagaa atagaggtat agcaatggca 240 
caaggagagt atatcttatg ccttgatggt gatgataaaa tatctcgtaa ttttttggaa 
tgtatgtatc ctattttgga tgaagaacaa tctgtgaagg tagtaacaag tactgttgtg 
caatttggta aaatccatcg tgtgatacca tcaactgatt actctttaga aaagttaatg 420 

480 
540 
600 
660 
720 
780 
801 



gggcgaaatc tatttgtgat tacgtctatg ttccgtaaag ttgattttga aaaaacggaa 
ggttttaacg aaaatatggc aaagggctta gaggactggg atttttggtt gtctatgtta 
gagtctggtg gtgaagttgt ttgtgcaaag caggctattt tttactatag aatcagaggc 
tattctagaa ataaaagtat ttctgaagat tattattcat tattacgtaa aactatatac 
gaaaatcata aacacttatt ttctaccatt ttctttaatc cgaagtattc atttgagtat 
tatttgattg caaaatctta tgaatataag ttaggtaagt tattatttag accaatacgt 
tttttatatg atctttttta a 

<210> 629 
<211> 765 
<212> DNA 
<213> B. fragilis 

<400> 629 

aagatgaaaa ttataactta taatgtgaac ggacttcgtg ccgcagtaaa caaggggctg 

cccgagtggt tggccgagga aaatcccgat gtgctttgtc tgcaggaaac caaactgcaa 120 

cccgaacaat atccggcaga ggcttttgag gcacttggat ataaagcata tctctattcg 

gcacagaaaa aaggatatag cggagtagcc atcttgacca aagtagagcc cgatcacata 

gaatatggca tgggaattga agaatatgat aacgaagggc gttttattcg tgcggatttt 

ggtgatttgt ctgtggtgag cgtttaccat ccttcgggca ctagcggaga cgaacgccag 

gcttttaaga tggtctggct ggaagcattc cagaagtatg tgacggaatt gcgtaaatca 

cgtcccaatc tgattctttg tggggattat aacatttgcc atgaaccgat cgatattcac 

gatccggttc gtaatgctac caacagtggt ttcttgcccg aagaacggga atggatgacc 540 

cgtttcctgt cggcgggctt cattgattct ttccgtacgc tttatcctca aaagcaagag 600 

tatacttggt ggagttaccg tttcaattcg cgtgccaaga acaaagggtg gagaatcgat 660 

tattgtatgg tcagcgagcc ggtacgctct ttgctgaaag aagccgttat tctgaacaac 720 

gccgttcact ccgatcattg tccgatggcg ttggagatcg gctga 765 

<210> 630 
<211> 582 
<212> DNA 
<213> B. fragilis 

<400> 630 

aattacagaa ttatgaaaag aaatcttgtg tttgtattgt ttgccctcgt ttcggttgtg 60 

ggcttttctc aagtgagctg gaatgccaag gtgggaatga atatcagtaa ctttaccggt 12 0 

gattttgaca tgaatgccaa agtaggattc aagataggag gtggcatgga gtatggattt 180 

aatgaaatct ggtcgttgca accctctttg tttgtatctt ccaaaggtgc caagaaggac 240 

gaactgagtg tgaatgctgt ttatctggaa ctgccggtta tggctgctgc gcgtttcaaa 3 00 

gtagccgata atactaatat cgtgttgagt gcaggtcctt attttgcttg cggtatcgcg 3 60 



60 



180 
240 
300 
360 
420 
480 
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ggtaattcca aagtagatct gggcaaaggg cgcttggaag tcgatacctt tggtgacgac 42 0 

ggtctgttga aacgcggcga tgtaggtctt ggtatcggtg ttgccgcgga gtttggcaag 48 0 

attatcgcag ggcttgacgg tcagttcggg tttgtcgatg tcatggacaa cgtaaatggt 54 0 

aagaatctga atctttctat tagcgtaggt tataagttct ga 582 

<210> 631 
<211> 2871 
<212> DNA 
<213> B.fragilis 

<400> 631 

actgacacct tcgtggtact tttctacttt ttctgtcggt cgctttatgt tgtgcgaagg 60 

catcacgtta ctgttgatgg cactgatgat cttggcaggt atctggttcc cggcccgtca 12 0 

gtctatgaaa atccagccgg cagaggcatt gcacgaagaa taaaagctta tcgcaatatt 180 

ttatcatcta tctgcgttgt tttcaagatt ataacgtata tttgcgttca caatcctgat 240 

acaacgcaga tttttgctat gaacaaacga ttatatacta tatttcttat atctgtcttc 3 00 

ttattactcc cgggattttc cactgctgcg gaacgtattt ataatgttct cttcgttcag 3 60 

tcgtatgctc cggaaacacc ttggcataat gacttggtcc gggggttgaa agacggtttc 42 0 

ggtgaatcgg gattgaaagt gaatattaca accgaatttc tggatgccaa tttctggact 480 

tatcaatccg aaaagctgat tatgcgtcgt ttttgcgaac gggcccgtga aaggggaaca 540 

gacttgattg ttaccgtcag tgatgaggct tttcatactt tgctgacatg cggagactcg 600 

ttagccttgc agttgcccgt tgtctttttt aatataaaat atccggaagg cagcctgatt 660 

gattcgttac ccaatgtgtg tggatatacg gcgaatcccg atttcggaga attactgagg 72 0 

caggcgagtc gccttttccc tacccggacg gaggtggtct gtatctccga caatagcctg 780 

ttgagttcga aagggaaaga tgattttatg aacgaatggg aaggttttgt ggaggagcat 840 

ccggaatata cggtgacctt ttataattcg cagaccgata caaccaataa aattattgct 900 

tctacctgct atccccgcaa tacacataag acattgatca tcgctcctaa gtggtcgtct 960 

tttatgtcct ttatcggtcg taactcaaaa gcgcctttct tttcgtgcga gaaccttgcg 102 0 

ttgactaatg gtgcctttgg cgcgtatgat gccgattcgt atgcttcggc gcatgaagta 1080 

gggcggacgg ctgccgatgt gctgcgtggc aaatctccct cggaagtggg aatcattgaa 114 0 

tctcctttga agtttatgta cgatttcaag cagcttgtgt tttttaaggt agatcccaag 1200 

caggcgagtg ctatcggcgg aaccattatt aacgagccat acatggagaa atatcgtatg 12 6 0 

ctgtacatct tgttgtatag ttcgatattg gctttactgg tgttcctgat agtgtggctg 132 0 

tatcgtataa accgtcgtga atcccgccgt cgcatccatg cccagacccg cctgctgata 13 80 

caaaaccggt tggtggccca atgtgatgaa tttgataatg tgtttcattc catccgtgat 1440 

ggtgtgatca cttacgacac cgatttccgt attcatttta ccaatcgttc tttattgaag 1500 

atgctgcatt tgcccaaaga tgaagccgct cgaccttacg aaggcttgcc ggcaggcagt 15 6 0 

atttttaaga tctacaataa tggtaaagag attctccgtc ctatgctcaa gcaggtggtg 162 0 

actgaagaaa gcagcgtagt cattccggaa aattcattca tgcaagaggt gcacagtggg 168 0 

agttattttc cggtttcagg agaggttgtt cctatccgtg cacatggaaa aattacgggg 174 0 

atggccctct ctgcccgtaa tatatcggat gaagagatgc agaagcgttt tttcagaatg 1800 

gcggtagacg agagttctat ctacccttgg caatataata tccgcaccgg tctgtttact 1860 

ttccctgcgg gctttctgac acgttttggt tttgcggaaa ataagactac catttcacgt 1920 

gacgagatgg accggatggt tcatccggac gatcaggaat cggcatacga ggttttcaac 19 80 

cgggcactgg cgggcctcag tcagagtacg cgtatgagtt tccgtcagct tagtggtgac 2040 

ggtaattatg agtggtggga atatcggacg tcggtgcttt caggattgac aacagacacg 210 0 

ccctacagta ttttgggagt atgccagagt atacaacgct ataagacgac cgaagaggaa 2160 

ttaaccgcag cgcgtgataa agcacttcag gcggataaat tgaaatcggc tttccttgcc 2 22 0 

aatatgagcc atgagatacg tacgccgctg aatgcgattg taggtttctc tgatttgttg 22 80 

agcgatacca gcgggtttac ggaagaagag gttaagctat ttatagagac tatcaataag 2 3 40 

aattgtggac tgttgctggc acttatcaat gatattctcg atttgtcccg tatagaatcc 2400 

ggcacaatgg attttcagtt tgccggtcac aatcttccgt tattgatgaa gaatgtatac 2 4 60 

gattcacagc gtttgaatat gcctccggga gtgcagttgg tgctgaagtt gccggagaat 2 52 0 

agcaaaaagt atctggtgac ggataatgtc cgcctccaac aagtggtaaa caacttgatc 2 58 0 

aacaatgccg ttaagtttac gacccaaggt tcgattacat tcggatatac cgaagaagaa 2 640 

cccggctaca cttctctttt tgtcgaagac accggaaaag gtatttcgga agatggattg 2700 

aggcatatct ttgaacgttt ctataaggta gatagcttta cccagggtgc ggggctcgga 2 7 60 

ctgagtattt gccagactat cgtaggacgt ctgaacggga cgattaccgt cgcttcagaa 2 82 0 

gaagggcacg gaacccgttt cactgtccgc cttccggata tttgcgaata a 2871 
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<210> 632 
<211> 1146 
<212> DNA 
<213> B.fragilis 

<400> 632 

ttgactatga tagattttac ccaattcccc tctccgtgtt atatcatgga agaggagctg 60 

ctgagaaaga acctcagcct gataaagagt gtagccgatg atgccggagt tgaaatcatc 12 0 

cttgctttca agtcttttgc catgtggcgt tcatttccca ttttcaggga gtacatcgga 180 

cactccacgg ccagctccgt ctacgaagcc cgtttggcgc tcgaagagtt cggcagtaag 240 

gcgcatactt attccccggc ctataccgag gcggacttcc cggagatcat gcgttgcagc 3 00 

agccacatca cgttcaattc cctgtctcaa ttcagccgct tctatccgct gaccgtggcc 3 60 

gaaggcagcg gcatctcttg cggcatccgt gttaatcccg agtattcgga ggtagagacc 420 

gaactctata acccgtgcgc tcccggcacc cgtttcggga tcactgccga tctgttgccc 480 

gcccgtttgc cgcaggggat cgaaggtttc cattgtcatt gccattgcga gtcatcttcg 540 

tttgagcttg agcgcacttt gcaacatctt gaagagaagt tctcgccgtg gttttctcaa 600 

atcaagtggc tcaacctggg cggcggccac ctgatgaccc gcaaggatta tgatacccgg 660 

catctgaccg gcttgttgca aggattgaaa aagcgctatc cgcatttgcg tatcatcctc 72 0 

gagcccggtt cggctttcac ctggcaaacc ggagtcctca cctccgaggt ggtggatatt 780 

gtcgaaagcc gcggcatccg tacggccatt ctcaacgtca gcttcacctg ccacatgccc 840 

gactgtcttg aaatgcctta tcagccctcc gttcgcggag cggtgatggg agaggaggga 900 

ccgtttgtct atcgtctcgg gggcaattcc tgcctgagcg gagattacat ggggtcttgg 960 

agtttcgacc atgaactgca ggcaggcgaa cgaattgtct ttgaagatat gatacattat 102 0 

acaatggtaa aaacgaatat gtttaatgga attcaccatc ctgccattgc tctgtggaca 1080 

gcggatggca aagccgaaat cttcaggcag ttttcctacg aagattatcg cgatagaatg 1140 

agttga 114=6 

<210> 633 
<211> 1935 
<212> DNA 
<213> B. fragilis 

<400> 633 

ataatcatga tacttatatt tggaggaacc actgaaggac gggctgccgt caatgtaatc 60 

gaagaagccg ggaaacctta ttattactca accaaaggtg acgaacagga tatctacctg 12 0 

catcatggca tacgcctgag cggcgccatg acccggagaa ccctgaaagc tttctgtcgc 180 

caaaacgaca ttcgcctttt gatagatgcc gcccacccct ttgccgaaaa gctgcatgat 240 

acagtaaccg atgttgcgca tgatctcggc attccctgca ttcgatacga acgcatttac 300 

gaccgctctt acctcaaccc gattttcgaa gacaactgcg accctgatga tttgcctttt 3 60 

aaatttgaat atgacaatcg ggatctgttg cgcgagttga agaaagaaaa agagggtcac 420 

agatttcttt tcctgacagg tgtccagtcc atcgcccgat tcaaatcttt atggaccaag 480 

aagaagtatg aatgttactt ccgtattctc gaccgtgaca gctcccgtga aatagcccgt 540 

caagccggat tcccggaaga tcatctggtg tactatcatc ccgaaacgga gaatctgccc 60 0 

caactcctgc aagaactgtc tccccaggca gtcgtcctga aagagagtgg gaagtccgga 660 

ggattcaccg aaaagaaaga catgatcctg gaatatgggg caactcctta catccttctc 72 0 

catcccgaat tagaatatta cgatataaca gtagacggag taaacagtct ccgccgtacc 780 

cttgagaaaa tgctgcccga ttacttccca ttgcgtagcg gactcactac cgggagttgt 840 

gccgcagccg ctgccatagc tgcttttcga aaactgaaaa atcccatact cgaggatttt 900 

aaccggaata tccataccgt ccttcccagt ggcgaaacga ttgagattcc ttgccaatcc 960 

gtatccggaa cattctccga cgagaaaatt gaagtcagcg ctaccgtcat caaagatgga 102 0 

ggagacgacc ccgatgtaac cagtgggctg ccgattgtaa ccactttaac cctgaacctc 10 80 

gcagaagcga aacaggctaa taacgcacct gtacaaactc cggaaacatg ggagttcgtc 1140 

ttccatggtg gcccgggtgt aggaacagtt accctgccgg gactcgggct cgaagtaggt 12 00 

ggtccggcca ttaacgccac tcccaggcaa atgattatcg acaatctgag gaattgcatc 12 60 

cggtactact atcgatacct gccaaacgtt cccatccatg tcaccatctc agttcccgga 1320 

ggtgaagaag tcgccgcacg tactttcaac ccccggctgg gtgtcgtagg cggaatctcc 13 80 

atcatcggta ccagcggtat tgtgaaaccc ttttcttccg aagctttcgt ccgttccatt 1440 

cgtaaggaga tggaagtggc acgtgctaca ggtgcttgcc gtattgtcat caattcggga 1500 
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gccaaaagcg 
cactatggca 
ctgaccttgg 
cacagcaaga 
tgtaccccct 
atccttcctg 
cactgcgatg 
aaaatcatac 



aaaaatatat 
atttcatagg 
gagtgatgat 
aagtcaccat 
ccagcataga 
aaaccgaact 
tgctgctccc 
agtaa 



<210> 634 
<211> 228 
<212> DNA 
<213> B.fragilis 



tcgcaatctc 
cgaaacgatc 
gggtaaagct 
gaacaaggag 
ggcaatcgac 
gcaggctttc 
caatggagaa 



tatccggaac 
ggaattgcag 
gtgaaactgg 
ttcctcaaag 
catatcattc 
tgttccctgt 
ctaaccatcc 



ttccgccgca 
ccgaactcgg 
cggaagggca 
aaattgcccg 
tggcacgtga 
tgatcgaaca 
ttctgattac 



ggcttttgta 
catctcgcgg 
cctcgacaca 
acggtgtgga 
actttggaac 
atgccaccgc 
cgaagaagga 



1560 
1620 
1680 
1740 
1800 
1860 
1920 
1935 



<400> 634 

ttatcgtatg tttcatcaat agcccctcta cccattaaag aagtaataac cgggaaatgg 
tgccgtctta tgaaatcatt cagcaaaacc atatccgaat tgcaacatcc tgcccctaaa 
agcaacatgg gacgatggga cgactccatt aataacctga tttcctccca ggctatcaaa 
ggattattta aaatactctc accggaatat ccgatcaatg tttcctga 



60 
120 
180 
228 



<210> 635 
<211> 1353 
<212> DNA 
<213> B.fragilis 



<400> 635 

tttcaaagag 

tactcttcat 

gcaaaaaaat 

atacaggtat 

aaagcacatt 

cgtttagctt 

ctgattgttt 

agttggaaaa 

cctttcttat 

attcagcaaa 

tatgttggag 

agatacagaa 

aagaaagaga 

gtttttcagt 

cttggaatga 

actaaagtac 

tcattttcta 

gctttatgtc 

ttgttcctaa 

tactttagat 

gtttttttaa 

tattctctag 

gaatttcaaa 



tgttttttaa 
taagtaagat 
taacattagt 
tttttgaatt 
tagattggaa 
ctgtattacg 
tgtttatagg 
ttccttggtt 
ccttcttaga 
ctgtaagttt 
gatgttcgag 
ttttgttttt 
tatttccatt 
ttttcaatcc 
cattaagtgt 
ccaatttgtc 
agatactagc 
ttattcagtt 
taatgtcatt 
gctttaagaa 
tagtctttcc 
ctgctttggt 
aaaaatacgt 



aaatatatat 
cctcagtggc 
agaacaagga 
ggggcttaac 
aggaaaggat 
attatgcgtt 
tggctatgtc 
actgctttgt 
aggattaaat 
gctgatatta 
tttagctggt 
gaatatatat 
tcaatggaag 
gattctgttt 
gataaacggg 
taaattggtg 
tgtattggtg 
taatatactg 
atcctctgta 
agaacctttc 
gtgtactctt 
gggtttaata 
attatatgag 



aattttttgc 
tttggaggct 
tactattata 
tctattatca 
gatttagtag 
aaatattatt 
tttttttcaa 
gtttcaacat 
ttaatgaaag 
tgggtgggac 
ggtgtggcta 
gggaaagtaa 
gtagctgtgg 
gccacaatag 
gtctcatctg 
tcattacgag 
cttttgagtt 
catattgcgt 
ttcacacaga 
ttaagggtat 
tattatagat 
ttgggttggc 
taa 



aattagatgc 
ttctaacggt 
cttttatcag 
ctcagtttgt 
gaaaagagtt 
cctatttggc 
taaatagtaa 
ctttatcgtt 
aggtttgttt 
tgataggagg 
ttcttatttt 
ctattcattt 
gatggcttag 
gtagtgctgc 
tttcaatgaa 
attttaaaga 
gtatgggatt 
ctaagttgct 
ttacatcctg 
ctttaattaa 
taagtggatt 
tattgtataa 



tgctgtcatt 
ttatttaatc 
tgtattgtat 
tgctcacgag 
tcatctctct 
tataggatta 
tattggggtt 
cttcttaaat 
tatccgtttt 
tatgaagcta 
tgtttcttac 
tattaattac 
cagttctttg 
tgctgggcaa 
ctggatatac 
attggataaa 
tatcattgta 
tccgatgtcg 
ttgggctatt 
tctagctgtt 
ggtgttgtct 
taatagagct 



<210> 636 
<211> 186 
<212> DNA 
<213> B.fragilis 

<400> 636 

agtctggtgg tgaagttgtt tgtgcaaagc aggctatttt ttactataga atcagaggct 

attctagaaa taaaagtatt tctgaagatt attattcatt attacgtaaa actatatacg 

aaaatcataa acacttattt tctaccattt tctttaatcc gaagtattca tttgagtatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1353 



60 

120 

180 
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tctgacgcat cacgcatgtt tatgatcggc gaagtcagcc ctgaaaaaca gagattggta 
caggtcacta aagaatgtat ggagatcggc atagctgccg cacagccttg ggcccgttta 



180 
240 
300 



atttga 186 

<210> 637 
<211> 918 
<212> DNA 
<213> B.fragilis 

<400> 637 

aataccgcat tccccttttt attccctaca tttgtactac gaataaaaaa tcaagatagt 60 

aaaatgaaaa aatttatcaa aggagtccgg tttactcctt ccaattatcc ggatgaaata 120 
gaagataaaa tacagaaata cagaaaacaa ggttacaaac ttcctccacg caaggtattg 
cgcacaccgg aacaaattga aggtatccgt gaaagtgcaa agatcaacac agctctgctg 
aaccacattg cagaaaatat tcgtgaagga atgtccaccg aagagatcga tcgtttggtc 

tacgatttca ccacgtccca tggggctatt ccggctcctc taaactatga aggatttccc 3 60 

aaaagcgtct gcaccagcat caacgatgta gtgtgccacg gaatccccag ttcaaccgaa 42 0 

attctaaaaa gcggagatat tatcaacgta gatgtttcta ccatttacaa tggttatttc 480 

540 
600 

ggtgacgtag gtgcggctat tcaggaacat gccgaaaaga acggttatag tgtggtgcgc 660 

gacttatgcg gacatggagt gggaatcaaa ttccatgagg aacccgacgt agagcacttc 72 0 

ggacgccgcg gtaccggtat gttgattctt ccgggaatga cttttaccat cgaaccgatg 780 

atcaacatgg gaacgtatga ggtctttgtc gactctgctg atgactggac agtctgcaca 840 

gacgacggat tgccgtctgc acaatgggaa aatatgattc tgattactga aaccggaaac 900 

gaaatattga cttattaa 918 

<210> 638 
<211> 1011 
<212> DNA 
<213> B.fragilis 

<400> 638 

tgcgttaaat tgtatattgt tttgaagaat aaatttcgta atttgttttt ggcctttggc 60 

atcctggccg tcatcatcat gctgttcact ttcgatgtgt cctatgatga actgctggac 12 0 

aacctgcggc gtgccggatt ttatctgccg ctggttctgg tcctttggct ttttatttat 180 

ctgattaata cgctttcgtg gtatatcatt cttcgtagta gcggtccggt gaattcattg 240 

tcgttcgccc ggctctataa attcacggtt tccggttttg cgttaaatta tgtcactcct 3 00 

gtgggactta tgggagggga accttaccgg attatggagc ttacttcata tgtcggagtg 3 60 

gagcgtgcca catcttcggt catcctgtat gtgatgatgc atatcttttc gcatttctgt 42 0 

ttctggctca gctcggtgtt gatttatgtc tttttttatc cggtcggttg ggggatgggc 480 

attgtcttgg ggttgatcac gctcttctgc cttttacttg tcactctttt tatcaaaggc 540 

taccggaacg gtatggctgt ggcatgcgta cgtctgggta gtcatatccc tttcttgaag 600 

aagcgcgcag tccgcttcgc ggaacttcac aaagaaaaac tggaaaccat agaccgtcag 660 

atagcactgc ttcatcaaca gcgcaaaagt accttttatt cggcattggg gttagagtat 72 0 

accgctcgca ttgtggggtg tcttgaagtc tggttgattc tgaatgtatt gactacggat 780 

gtcagttttg tcggttgcat tctgatcgtc gctttctctt ccctactggc caatctgctt 840 

ttctttcttc ccatgcaatt agggggcagg gaaggaggtt ttgccctggc ggtagccggt 900 

ttgtctttat ccggagcata tggggtattt gccgctctga ttacgcgggt gcgggagatg 960 

gtctggattg ttatcgggct ggtgctgatg aagataggaa atcggagata g 1011 

<210> 639 
<211> 849 
<212> DNA 
<213> B.fragilis 

<400> 639 

cgcccgactt cttccgtgtt ttccgttatc agggagccaa ccggagaaac tccggaacaa 60 

ttagctgctc tgttgaagga gggtactttt atggtatcgc gtaatgtgtt cgagtcacgt 12 0 

tataagatcg atttgaaaga ctatgtaggc aaagagtttt gtctggatca ggacactgcc 180 

catttatcca aattggtggc agcgctccaa gtggtccgtt atgatgattt ttcctccggt 240 
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gcttatagcc ggtccgcagt catcttgtta ccggagaatc ggttggcttc gggcaatgaa 300 

atctgtttgc gtacgaacaa aaacgaatct gccgcctttg ccgaacaatt gatgaaggat 360 

gccccctccc agtatcgggt gggcaatctt tttctgacta aggtaagttc attccgggac 42 0 

atacggcata cgttccagtt ggacgatgtg aatactttac gtaactatct ggtaggtatg 480 

ggctttttgc tgctcaatat ttttctgggg ctattgggaa cattctggtt ccgcacccag 540 

cagcgtaaag gtgagatggc gctgatgatg gcagtaggag ggagtaagca aagcgtattc 600 

tttcgtttgc tgagcgaagg ttggctgatg cttcttctgg ttactccgtt ggctattggt 660 

gtcgatttct atatcgcaaa gagtgaactg acaccttcgt ggtacttttc tactttttct 720 

gtcggtcgct ttatgttgtg cgaaggcatc acgttactgt tgatggcact gatgatcttg 780 

gcaggtatct ggttcccggc ccgtcagtct atgaaaatcc agccggcaga ggcattgcac 840 

gaagaataa 849 



<210> 640 
<211> 441 
<212> DNA 
<213> B. fragilis 



<400> 640 

tctgaaattc cgataacaac gccacctttt gccgtcccat ccaatttggt tatagccata 60 

gcagttactt ctgttgcaag tgtgaattgt tttgcctgct caaaggcatt ttgtccggtc 12 0 

gagccatcaa gcaccaacaa cacctcgttt ggagcatcag gcactacttt cttcattaca 180 

tttttaattt tcgtcagctc gttcatcaag ccaactttat tgtgcaaacg tcctgctgta 240 

tcaataatca ccacatcagc gttattagct actgcagaac ttaacgtatc aaacgcaaca 3 00 

gaagccggat cggcccccat cttttgttta atgaccggaa catccactct ctcgccccat 360 

atcaccaatt gctccactgc tgctgcacga aaagtatcgg ctgcccccaa atatacagat 42 0 

ttaccggctt tcttaaattg a 441 



<210> 641 
<211> 1092 
<212> DNA 
<213> B. fragilis 



<400> 641 

acgaaattaa cgggaaagat gaataaatat atattgctga tagtattgtt atttctggtt 60 

tctggcagaa ttgctgcaca gtcagtgact gtggatgcca aaatagactc tttgcaaatt 12 0 

ctgataggag aacaggcgaa agtgcaatta caagtagcga tggatgcgaa gcagagggct 180 

gtttttccgt catttacaga tacattggta cgtggtgtgg aaattgtgga tattgctaaa 240 

cctgatacac aatatttgaa tgatcgccag agaatgctga ttactcaaga atatacggta 3 00 

acttcttttg attcggcatt atattatatt cctcctatgg gagtaaagat tgataataaa 360 

gagtataaat caaaggcgtt ggcattgaag gtatattcaa tgcctgtaga tacgttacat 42 0 

cctgatcaat tttttggtca gaagactgtc atgaaagctc cctttgcctg ggaggattgg 480 

tatggtttga ttgcttgctc ttttctggca ttaccattgc taggattgct tatttatctg 540 

atcatccgta tccgtgataa caagcctatt attcgtaaag tgaaagttga acctaagttg 600 

cctccacatc aattggcgat gaaagagatt gagcgaataa agactgagaa aatttggcaa 660 

aaaggacaat cgaaggagta ttatactgaa ttgacagatg ctcttcgtac atatattaaa 72 0 

aatagatttg gctttaatgc attggagatg acttcatccg aaataataga taaactgctg 780 

gaatttaatg ataaagaagc tatatcagat ttgaaatatt tattccagac agctgattta 840 

gtaaagtttg ccaagcatga tccgcagatg aatgagaatg atgcgaattt gatcaatgca 9 00 

attgacttta taaatgaaac aaaacaattg gaagaagaga atcagaagcc gcaacctact 960 

gaaatcacga taatagagaa acgatcttta cgtacgaaga tattgctgat ctgtggtatc 1020 

gtattcctgt cggcagctct tattgcgact ttcgtctata ttgggttgca actctataat 1080 

ctatttggat aa 1092 



<210> 642 
<211> 288 
<212> DNA 
<213> B. fragilis 



<400> 642 
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gacatgagaa caataacatt taatgaactc cgtaaaatta aagattcatt gcctagcggt 

agcatgcata gaatagcaga cgaactcaat ttaaacgtcg atacagtgag aaacttcttt 

ggtggtcata attttaagga agggaaaagt gtcggaatac atcttgagcc tggtccggat 

ggtggattgg ttatgattga tgataccact gtgctcgatc gggcattaag aatattggat 
gaattaaata tgagtaaaga agaagctacc gaatctgtgc aggtttaa 



60 

120 

180 

240 

288 



<210> 643 
<211> 699 
<212> DNA 
<213> B. fragilis 



<400> 643 
atctcccgtg 
ttttttaaat 
gaattttcca 
tctactacag 
' atatcatcta 
acttccacat 
ttcgcctctg 
gttattaacg 
attggcgtat 
ggt ctattta 
tctattttaa 
aat ttgaatg 



ctataatttc 
tatctgtagt 
ctactttatc 
cttcactacc 
caacctctgt 
ctgacgacac 
tttcaaccac 
gaagctcagt 
cttccaagac 
tgatatctct 
tacgttcacc 
tacccaatcc 



ttcggcagtc 
ttcttgccct 
caatccaccc 
tttatgtatg 
cctttcggat 
aacatcctgc 
cttctcctct 
tgtttcggat 
tgtattctca 
taaagaaggg 
cgtattgaca 
tttgattttc 



aatttaggag 
gtatctccag 
tcttctcgaa 
ctaacatctt 
tccttcataa 
ccttttgagg 
gcttttgctt 
atattccctg 
ttcagcacta 
tcaggagtaa 
tttacacttt 
acatattga 



aaccttcgtc 
tgacctggat 
cctcagcaat 
caacagtttc 
cttccgaaac 
tttccggttc 
cctctctctc 
attcttcctc 
ctgtttcgaa 
aagaaatctt 
cacgactatt 



ttctatcacc 
tggctcttca 
ggatgaacct 
aagtatgtca 
gtcttctgca 
tactttccca 
tacggtctct 
caattcctct 
atgtgaaaaa 
agtatgtccc 
gacccctatc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

699 



<210> 644 
<211> 723 
<212> DNA 
<213> B. fragilis 



<400> 644 

gtagtaatgt 

gcgggtgcgt 

aaagacagtg 

aagtctacta 

gatgcgatgg 

caaatttatc 

gaagcttata 

gcattagctc 

caggatcaga 

aatgatcaaa 

gaaatgtcta 

gttcaggata 

taa 



atatgtcaag 
ttgcacaaaa 
tttttgtgga 
tttccatgta 
aacaatacgt 
ataatatggg 
agatgtcttt 
aaaagttgct 
ataaagatga 
ataaagatca 
aggagaatgc 
aagtgaaaaa 



aaataaatat 
agccgagcgt 
tgctgaagtt 
caacttgggt 
tgcggcaacc 
agtgttattc 
gagaaataat 
gaaggatcaa 
tcaacaaaag 
acaacagcaa 
ggaacagctg 
gcaacagact 



atattgtttg 
gattatatcc 
aattaccgta 
aatacgttgt 
agtatagaaa 
cagtctggta 
ccgaaagacg 
cagcaaaatc 
caacaagata 
caacctccta 
ctcaattctg 
cttcaaggaa 



ctctgttgct 
gtaaaggcaa 
aagcattaga 
cgcagcagca 
aagataaagc 
aagattatca 
atgaaacccg 
agcagaatca 
aaaaagatca 
aatcagagaa 
tgatgcaaga 
gacgtctgga 



atcgctttct 
tcgtctgttt 
ggcaaatcct 
aaagttcaaa 
taaactggga 
aaaagcggtt 
ctataatctg 
agatcagaat 
gaataagcag 
aaatgataac 
tgaaaaaggg 
aaaagactgg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

723 



<210> 645 
<211> 192 
<212> DNA 
<213> B. fragilis 



<400> 645 

gaaatagatt tcacctattt ctctttttgt gttacgctat tgttcaggat cagatcaagc 
aaaaaaaaga ggctgcccaa tttctcgaat caggcaaccc ctttcacaaa aatctttgcg 
aaaaaagatc tatttgatca ttacatcatt ttctcgattt cttcgaattc cggtcccata 
ttcaggttgt aa 



60 
120 
180 
192 



<210> 646 
<211> 1068 
<212> DNA 
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<213> B.fragilis 



<400> 646 

atcctacttt 

atattaggaa 

ttgttatcga 

gaattggcat 

cgtgcaggag 

atggggtcat 

ccgatgattg 

ggagaagcaa 

tctcaaatta 

gatgctgccg 

ggacctataa 

cctcatattt 

ctgcgtgatt 

gcttcgcttg 

caatataaaa 

aatgcctttc 

tatactacag 

gatttctgct 



tatcattact 
ttgaatcttc 
atgtggtatc 
cgcgtgctca 
tgaccaagga 
tgctggttgg 
atgttaatca 
atgaacagcc 
tattggtgaa 
gagaagcaat 
ttgaccgttt 
ctggtttgga 
ggatgaaaga 
aagcaacggt 
ttaatgaagt 
gtgagcatgc 
ataatgccgc 
ctatagaaca 



tttgcatcgt 
ctgtgacgat 
tagccaggct 
ccaacagaat 
agagttgagt 
agtatctttt 
tctgacagga 
cgattttcca 
agcatataat 
tgataaatgt 
ggcacgtcag 
ttatagtttt 
ggatcccgat 
agtagacatt 
ggctgtagcg 
agagaaatat 
aatgatcgct 
gcctgcttat 



ttaaatatca 
acatcagctg 
gtacacgaag 
atagtacctg 
gcggtagcct 
gcaaagggtt 
cacgtgttag 
ttcctttgtt 
gatatggaga 
tcgaaagtaa 
ggcaatccga 
agcggattga 
ttcatcgaac 
ttgatggata 
ggcggtgttt 
ggttggaaaa 
atcaccggat 
tcacgtgtaa 



aaaagcgtat 
cagttattaa 
cttatggtgg 
tggtgcatga 
ttacaagggg 
ttgcacgttc 
cgcattttat 
tgctggtatc 
ttctgggaca 
tgggacttgg 
aggcttatac 
aaacgtcgtt 
accataagaa 
aacttcgtaa 
ctgctaacaa 
tttttattcc 
attttaaata 
ctttgtaa 



gagtacaatt 
agatggttat 
agtagtaccg 
agcgttgaag 
gcctgggttg 
attaaatatt 
taaagaggaa 
cggaggtaat 
gactattgac 
ttatccgggg 
ttttagtaaa 
tctttactct 
cgatttggca 
agcagccaag 
cggtttgcgt 
caagttcagc 
tcaggataaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1068 



<210> 647 
<211> 651 
<212> DNA 
<213> B.fragilis 



<400> 647 

agaagatacc 

ttgttgctgc 

aaggctattc 

aaatatcgtc 

ctattggctc 

gttatgattg 

ttagaaaaag 

ggaatgattg 

tctgctaaaa 

atcggtgctg 

gctattgtgg 



ataaaattga 
tgctaccatt 
gtaaattcgg 
cggatgtaaa 
gtccgcaatt 
cattggatat 
ctaaacggct 
tctttgcagg 
tgtttttgga 
ctatcaatct 
tgattaccga 



taaaaagatg 
gctggcagcc 
agatccggtg 
attctggttg 
cggctctaaa 
ttcaaattct 
tatatctaaa 
agatgccttt 
atcaatcagt 
tgctgcccgt 
tggtgaaaat 



tttcgatttg 
ttctaccttt 
ttgatggcac 
ttatttactg 
ttggagacag 
atgcttgccc 
ttggtagatg 
actcagttac 
ccttcgttga 
agttttactc 
catgaaaggg 



aagaacctgc 
attctaatta 
aattgatgcc 
ccataggtct 
taaagcgcaa 
aagatgttca 
gtatggagaa 
ctataacaag 
tatctaaaca 
cgcaagaagg 
ggagctgttg 



atatttatat 
tagaaagagg 
tgatgtatct 
atttgctgta 
aggggtggag 
gcccagtcgt 
tgacaaggta 
cgattatatt 
gggtacagct 
agtgggacgt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

651 



<210> 648 
<211> 984 
<212> DNA 
<213> B.fragilis 



<400> 648 

tgtattatac 

gaaactttag 

gcggtggccg 

atcacatcgg 

gcagcagaag 

gctgcattac 

aaaaagcctt 

ggtaaactag 

acttttcgtg 

gtcattaaac 

gcagtagcta 

gttggcttga 

gctccaaacg 



aaataattgt 
ataaaggatt 
gaaaatcgaa 
acgtaggggt 
ataaatatgt 
tgactgaaaa 
atgttattat 
cttatcaatt 
cagcagcagt 
aaaagatggg 
ataacgctga 
tgaacgagct 
aggtgttgtt 



aatcatggga 
atctaaaacc 
ggtggatgat 
agagactact 
aaatacacaa 
taattcggat 
ggtggtgggt 
taagaaagcc 
ggagcaattg 
ggccgatccg 
tgtggtgatt 
gacgaaaatt 
ggtgcttgat 



ttttttagtt 
aaagaaagtg 
gaagtgttgg 
cttaatatca 
gagttgaatt 
gatgttgccg 
gtaaacggag 
ggtaaatctg 
gtgatatggg 
gcttctgttg 
attgatacag 
aaaaatgtaa 
ggctcgaccg 



ttttctcaaa 
tattcagcaa 
ataatctgga 
tcaagcgtat 
cgattcttcg 
atttcgatgt 
ttggtaaaac 
tatatttggg 
gcgagagagt 
cgtttgatac 
caggacgttt 
tgaagaaagt 
gacaaaatgc 



ggaaaagaag 
gattgcccgt 
agaggtactt 
cgaaaaacgt 
tgaagaaata 
tccggtagag 
aaccactatt 
ggcagccgat 
ggatgttccg 
gttaagttct 
gcacaataaa 
agtgcctgat 
ctttgagcag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 



256 



gcaaaacaat tcacacttgc aacagaagta actgctatgg ctataaccaa attggatggg 840 

acggcaaaag gtggcgttgt tatcggaatt tcagatcagt ttaagattcc ggtgaaatat 900 

atcggattgg gtgagggtat ggaagaccta caggtgttcc gcaagaaaga atttgtagac 960 

tccttatttg gagagaatgc atga 984 

<210> 649 
<211> 1200 
<212> DNA 
<213> B.fragilis 



<400> 649 

ataagagata 

tcaataaata 

aaagctcaga 

gtaaaccaga 

ttgccgcgtg 

gagaagaccc 

aactattcaa 

gttatcttgc 

gtatttaact 

gatgatctga 

gatgtcaaac 

aataaagaac 

attgatctta 

gctatccccg 

acgaactttg 

tttcattttt 

aaattagctg 

gatgtggtga 

attgacatta 

ggtattgctc 



aagtcagaat 
aacgaattat 
tattgaagaa 
tacagccggt 
cattgaagat 
ctaaagtgga 
tgaaagcatg 
tacatgtcta 
atcagattag 
atactctgtc 
atacgtgtgt 
atcgtcccag 
ttggtagtgt 
aaaatactcc 
accaacgtga 
ctgtttctct 
gtattaagga 
tgagtaatga 
tcacattaac 
gtaagatgat 



cccaattggc 
ggaagagaaa 
tgttcttgag 
tgtttcatcc 
taccgagagt 
gcatcgcact 
tgaatttggt 
ctttacgcca 
tgatgaagag 
ggagaagatt 
cttgcgcgaa 
aatcattata 
aactgccgag 
tttcaatcgt 
tttgattgct 
tattcattta 
ttatttccaa 
ttttctgaat 
ttcttataaa 
cttccattca 



aaaacaattg 
ttagtaactt 
aatgagggta 
ggtgtgcgtt 
tctgcctggc 
aagaaggtcc 
ttcaattttg 
atttatgctt 
actgtgaaga 
aagcaaaaag 
ggtatcccgg 
atgggtactc 
ataattgaac 
ttcaatgaag 
ttcgattcat 
tcggatgtaa 
aagcaatatc 
agtttggata 
cggaatatat 
gatacaccgt 



ggatttctta 
tagctattct 
tagagacgta 
tacgtattaa 
ttgctgaaag 
ttattccggt 
ctaaatcttt 
catcattgcc 
atgtattgca 
ttgcatctgg 
aagaagaaat 
gaggtaagaa 
gtagtcatac 
tgaaacggat 
tcatcaatgg 
aagatacatg 
cggatcttga 
attacattaa 
tttcccgttt 
tgttagttat 



tttgttgaat 
gacttataca 
tattcacaat 
agaaagtgat 
tatagtagga 
tgatttctct 
tgatgcggag 
atatggagat 
taaagtacat 
agagtttcct 
actcagatat 
tcagaaggat 
tacagtactg 
tgcctttatg 
acttagtcca 
gaatgaaata 
gatttattat 
aacaaatcag 
gttcaatccc 
taacggatga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 



<210> 650 
<211> 369 
<212> DNA 
<213> B.fragilis 



<400> 650 

cttttattca 

acagcctcgc 

tcatctttgg 

gcatctacag 

tctataccta 

cctatcaaca 

tacacgtga 



acgcattcat 
taaagatcag 
tgggtcccaa 
cttcaataat 
ctttatttaa 
gttcatcacc 



cggtattttt 
tctggtaccg 
gccccctgtc 
ctcatctgcc 
ttcgcgcccc 
aatagttatt 



cccgccaata 
aagtatttgc 
actagtacaa 
cgatctcgta 
atccatgccg 
atctccgcaa 



cccgttttac 
aaagagtctg 
tatttgccct 
cagacacgac 
aattggtatc 
acatagggct 



gttttcaaag 
tttggtaata 
tttcatcgaa 
acgaattacc 
agtcacctgt 
cttacaaagt 



60 

120 

180 

240 

300 

360 

369 



<210> 651 
<211> 1848 
<212> DNA 
<213> B.fragilis 



<400> 651 

ttgatgataa 

gccatgacta 

gctgtagtag 

gattttcgtg 

cgtatgcaat 

gctacggctg 

atggtttcta 



aaacttcaga 
cgcaagcgca 
taggtgacca 
tgccttctat 
cgattaatgg 
aaggagagta 
attcggtaaa 



tgaaatgaga 
ggctgacgga 
gttcaggtta 
taaaggattt 
ggttactaat 
ttctattcca 
aattaaagtt 



aaattgattt 
aaggtagtgt 
tcgtatacgg 
gaggtgttaa 
aatagtatta 
ggtgcaacga 
cttccttcgg 



tcttattgat 
ttactgcatc 
taaatacgat 
tgggacctaa 
cctttactta 
ttacggctga 
ataaaaccgg 



tgcactggta 
tgcccccgat 
aaaagtacga 
ccgttcacaa 
tatcttaatg 
cgggaatcag 
gaatacggca 



60 

120 

180 

240 

300 

360 

420 



257 



gatggcaaag gaactgcatc gtctggcaat caaagtggaa cgtcttcctc tgtttctaac 480 

caagacttac ttataactgc gacggccaat aaaacgaatg tatatgagca ggaagccttt 540 

ctattaactt ttaagattta cacaagagag tctcaacttc gctttgagaa tgtgaagctt 600 

cctgatttta agggatttca ttcacaagaa atagagatgc ctgctaatgc aaaatggtca 660 

caggaacatt ataagggaaa gaactatttt acgacagtgt atcgtcaatt tgtattgttc 720 

ccacagcagt ctggaaaact gactattgag cctgcacgtt ttgatgctac cattgctaaa 780 

gctgtacaat ctgatgatcc gtttgatgct ttctttaatg gtggaagcaa ttatgtgaat 840 

gtaagtaagg tgattgtaac tcctaagatt acggttaatg tgaatccgtt accgacaggt 9 00 

aaacctgcca atttctccgg aggagtaggg gagtttagta tcacttcttc cattaattct 960 

aaagaagtga aaacgaatga tgctatcaca ataaagttag tgatttcggg aaccggtaat 102 0 

ttaaaattaa ttgcaaatcc ggaaataaaa ttcccggaag attttgatgt atatgatcca 1080 

aaggtggata gcaaagttcg tttgactcaa gaaggacttt ccggtaataa agttattgaa 1140 

tatcttgcta ttccgagaca tgcaggagtt tataaaatac caggagtatc cttcagttat 1200 

tttgatatca agtcgaaatc ctataaaaca ctgaataccg aagattatga ggttaaggtg 1260 

gaaaaaggag caggaaatgc agatcaggtt attgccaact ttactaataa agaagatctg 132 0 

aaagtattgg gtgaagatat tcgctatata aaactaaatg atgtgaagct tcagccaaaa 13 80 

gataatcttt tgtttggctc tttactttat tggctattct acattgttcc cgccgtggtg 1440 

tttattgtct tctttatagt ttatcgcaaa caggctgccg agaatgccaa tgtagccaag 1500 

atgcgtacaa agaaagccaa taaagtggct actaaacgaa tgaagttagc tggtaaattg 15 60 

cttgcggaga atagcaagga ggcattttat gatgaagtac ttaaagctct atggggatat 162 0 

atcagtgata aacttaatat tccggtttct cgtttgtcta aagataatgt agaagagaaa 168 0 

cttagaaatt atggagtcag tgacgaatta ataaaggatt tcctgaatac tctgaatgaa 1740 

tgtgagtttg cacgttttgc tccaggagat gagagtcagg ctatggataa agtttattca 1800 

tcgtcattgg aagttatgag taagatggaa aattcaataa aacgctaa 184 8 

<210> 652 
<211> 282 
<212> DNA 
<213> B. fragilis 

<400> 652 

aactataaaa ataagataga aatgtcgaag atttgtcaaa ttaccggaaa gaaagccatg 60 

attggcaaca atgtttcaca ctcaaagaga agaactaaaa gaacctttga tttgaacttg 12 0 

tttaacaaaa agttctacta tgtagaacag gactgctgga tcagcctgag cctctgtgct 180 

gctggtttgc gtattattaa taagaaaggt ttggacgctg ctttgaatga tgcggttgcc 240 

aaagggtatt gtgattggaa aaccattaaa gttgttggct aa 2 82 

<210> 653 
<211> 840 
<212> DNA 
<213> B. fragilis 

<400> 653 

gtcatgaaaa aaatattgtt tattgctttg ggtttgttaa tggcagtaac ttctttcggg 60 

caggattcgt taataacaga ttctactcag atgatacagg gagatactgt cagtatccat 12 0 

aatgcagagt tttccggttc caaattagaa gatgccacaa aagctgaggg agatagtgca 180 

tatatcagaa atgactttgc gtctgcaatc cagatttatg aatcactttt gcgtaaaggc 240 

gagtcggccg atgtatacta taatcttggt aatagctatt ataaaataaa tgagatagca 300 

aaagccattt taaactatga aaaggccttg ttgcttcagc ccggtaacgg agatattcgt 3 60 

gccaatttgg agatagctcg tggtaagact gtagataaag tagaagttgt tcctgagata 42 0 

ttctttgtta catggacaaa ggcattaatt aatagtatga gcgtggattc atgggccata 480 

tgggggattg tgagtttctt gctgctaatt gtctctctat atttctttat tttctcgaaa 540 

caagtggtgt tgaagaaagt cggttttatt acaggcatta tctttttgat agttgttgta 600 

atggcaaata tttttgcttc taagcagaaa gaagagttgt tgaacaggga tactgcgata 660 

ataatgagtc cgagcgtaac ggttagaagt acacctagtg aaaatggtac cagcctattt 72 0 

attcttcatg aagggcataa ggttaacatt aaagatgatt caatgaaaga ctggaaagaa 780 

atccgccttg aagatggaaa agtgggatgg gtgccggttg gttcaattga aattatttaa 840 



<210> 654 



258 



<211> 207 
<212> DNA 
<213> B. fragilis 

<400> 654 

aaagaggaga aactgattat ggcaaagaaa gcaaaaggta atagagttca ggtgattctg 60 

gaatgcacag aacacaaaga tagcggtatg ccgggaacat ctcgttatat cacaactaag 12 0 

aacagaaaaa atactactga aagacttgag ttgaagaagt acaacccaat tctgaaaaga 180 

gtaacagtac acaaagaaat taaataa 2 07 

<210> 655 
<211> 3390 
<212> DNA 
<213> B. fragilis 

<400> 655 

tatttaaacg atgcaaaagt aatgataaaa gtaggattca taacgaacta ttttatattt 6 0 

ttgttctcaa aatcaaaaca accgcctatc agaacgctga agaagactgt acgttgggtt 12 0 

atcggtatca tattaggtat atatatcgga actattattt tgctgaatat tccatatatt 180 

cagcgaaata tgactacgtt tgtcacaaaa gaactatccc ggactttggg tacagaactg 2 40 

actatcggta agatagacat tggattatta aaccgtatca ttatagatga tgtattgctc 3 00 

gacgatcaat cgggaaaaga aatgctcaaa attacacgtc tttctgccaa atttgatatt 3 60 

attcctttat tcaacggaaa aatcacaatc agcagcgtgc agttatttgg ctttaatatc 42 0 

aacttgaata aacccgctcc gcacatggag ccaaatttta aattcgtctt ggacgcattt 480 

gcatccaaag atacagtaaa aacaaaaaag gacattgatc tacgtattaa ttccatatta 540 

atacgccgtg gtaaactatc ctacgacgta ttatcggaag aagaaacgcc cggaaagttt 6 00 

aacccgcaac atatcaaact acacaatatc attgccaaca tttcactcaa ggcacttcaa 660 

aacgattcga tcaatgcagc catcaaacgc ctgagtgtag acgaacaatc gggctttgaa 72 0 

ctacgaaagt tgagcctgaa agtcattgct aataacaaag gcatgaaaat agaaaatttc 7 80 

gcaatagaaa tgccgggtac cgaaatgaaa atggatacta tccgaatgga atatgacagt 840 

ttgaaagcac tcaaccattt tgccgataac gttcgcttct ctttccgtac tttaccatct 900 

catgtgactt tgaatgacat ttcagctttt gtcccggcat tatccaattt taaagaaaaa 960 

ctagatctca acattgatgt agaaggtacg ctcaatcaac tgaattgcag aacattggaa 102 0 

atcaacgcag gagataagtt ccgactaaaa ggagatgtat ctttacaaga cttatcacgt 1080 

cctcaagacg cttacgttta tggacatctg gccaatctct ctgccaacaa agaggggatc 114 0 

ggatttttag tgcgcaatct aagcccgcac tataatggcg ttccccctgt attacaacac 12 00 

ttaggaaata cctcttttca tggtgaaata tcagggtact ttacagactt ggttatgtac 12 6 0 

ggcctgttcc gtactgacat aggctccgtc caaaccgact taaagcttag ttccgataaa 132 0 

gctaaagcgc tgttttctta ttcaggaggc gtaaagacca ccgattttga gttagggcaa 13 8 0 

ctgctaggaa acaagcaatt gggcaagatt acctttaatc tggatgtccg aggaaaccac 144 0 

tacaagagcc aatatccttc cattacgtta aaaggtttga tagcatccct cgaatacagt 1500 

aattataaat atgaaaacat cacgctggac ggagaattca aacgcggtgg ttttgacggt 1560 

aaagtggcat tgaatgacga aaacggttcg gtacatttaa acggcaacat caatgtagtc 162 0 

gaaaaagtcc ctacttttaa cttcaatgca gtcatagaca aaatacgtcc acacgacctg 1680 

aatctgacaa aggagtatcc ggatgcagaa ttttctttga agctaaaagc taatttccgg 1740 

ggtggttcca tcgatgaaat gatgggtgaa atcaatatag acagcttaca atttaccgca 18 0 0 

ccagagaaga gctattttct ggataatatc aacatcactg caacccgcca agataaagag 18 60 

aaccaattga aactaacctc cagtttctta aaagcaagta tcgaaggcaa ttacctgtat 192 0 

catacgcttc cggcaagtgt tatgaacatt atgcggcgat atattccttc actaattcaa 19 8 0 

ccggataaaa agcctattaa aaccaataat aactttagtt tcgatattca catatttaat 2040 

acagaactgt tgtcgacagt atttgacatc ccattgaaaa tatattcaca ctcgactgtg 2100 

aaaggttact tcaacgatca ggcacaacgc ttgcgtgtag aaggctattt tccacgttta 2160 

caatatcaaa acacgtttat cgagtcaggt ttggtacttt gtgagaatcc taccgatcag 222 0 

ttcaaagcaa aggtccggtt caataaccta aagaaagaaa gtgctgtaag catctctctg 22 8 0 

gacgcacaag ccaagaatga cactataaat gccaatatca actggggtaa caatgccatc 2340 

agtacttata gcggacgatt atctgccgcc gccagtttct tccgcgcagc cgaagaaaag 2 40 0 

tccccactga aaactgtcgt agatattaaa cagacagaca tcatcctgaa tgatactcta 2 460 

tggcaggttc atccgtcaca agttgtcgtc gattcaggaa aaatagatgt gaatgatttc 2 52 0 

tatttttcac atcaggaccg tcatatccgt atcaatggac gcatttcaga acaagccaaa 2580 



259 



gatacattaa 
tttgacgatg 
aaagaaccgg 
ctgggggcta 
gcacacatgg 
aaaccggaaa 
caatacttca 
ttctacggaa 
aaaataggaa 
ggcattagct 
ggcaaactga 
aacatgttac 
ggaaccggta 
actaccaatc 



aagtagaact 
tggacttcaa 
tcatgaacac 
tggatattta 
aagaagaagg 
gcaaactaga 
tgcgttccat 
agttcaaggc 
tactgaatac 
tcgataatat 
atttccgaca 
ttatgaatac 
atgcaatgtt 
gtaacaccaa 



aaaggacatc 
aggagacgct 
ccgcctgcat 
cggtgcgtgg 
agtgtcgaag 
cttgaatata 
tgttgaagac 
actcaatatc 
ttcgttcacc 
acgcatagca 
tttcagggac 
gaaagaaaat 
gattggaaac 
ttttgtctag 



aatgtaggct 
acgggaacag 
tttaaaaact 
aagaacgaca 
acgcatgtca 
gagacggatc 
ttgcatggac 
gaaggaaacc 
gtaacagaca 
gatatggaag 
ctaagttatc 
ccggatatta 
ccgcaggaat 



acgtattcga 
catatgccag 
tcacctttaa 
tgcgtgctat 
tcgggcatgt 
atacgaatat 
gtaccagtgg 
tgatgacgga 
ccatccgttt 
gacatcaggg 
atttcgaatt 
acttttacgg 
tacaagtcaa 



tgtggtaaac 
tggcatttta 
tgatgcctca 
ctttcttgat 
ctatccgtta 
ccaatttctc 
taaagcccat 
cgcatcctta 
gtccaccagt 
tactatgaac 
caacgtaaac 
aaaagtatat 
tgcagctgtc 



2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3390 



<210> 656 
<211> 1479 
<212> DNA 
<213> B.fragilis 



<400> 656 

aagataagtt 

gaactactgg 

tttttcttat 

ggtacattca 

cgtattaaaa 

atcataaata 

gaagatacgc 

cttccgttaa 

gaaacagagg 

tcagatgtgg 

gttgtagatg 

gaagctgtag 

gtagtggaaa 

gataatttaa 

atagcacggg 

ccaaagaagg 

atagtagttg 

ttttctaaaa 

atacgggaaa 

aagactccga 

gtagataaaa 

atcccggtca 

attaaagaag 

tggccctata 

ggtactgtac 



taaagaaaca 
ttaacagaca 
tgatagaaca 
aattgatagg 
tagagggaca 
gacctttttc 
caatagagga 
taacagagac 
cgaatgggaa 
aagttgcaga 
atattgacat 
tagaaggttc 
attctgaaga 
aaaaggtgat 
agattcaaaa 
aagttaaacc 
ttatgtcgct 
aagaatcgga 
ttccattgga 
atcaacagga 
cttcagaaag 
agccggattc 
gtgaaactct 
tcgttaaaca 
ttaaaatacc 



tttccgtatg 
tgaggtttct 
agctttggac 
ggtcaatagt 
tactaagatt 
acatttcgaa 
attggaggaa 
cgtagagaga 
agtagaaccg 
agacgtttcg 
acttgaaact 
atccattgct 
gccaatccag 
agaagacgaa 
agcagaagta 
ggaaaaccaa 
ttgcggtgca 
acagtcaata 
tactgttgca 
gataaaacag 
tgaatcggta 
agtgaattat 
gactagggta 
taatcgggga 
ggaattggtt 



aatgaaaagt 
caagaagatg 
gctgatcaat 
cgtgaaagtg 
tcttttactc 
acagtagtgc 
gaatcaggga 
gaggaagcaa 
gaaacctcaa 
gaagttatga 
gttgaagatg 
gaggttcgag 
gtcactggag 
ggttctccta 
tcgactatcc 
aaatctcctg 
gcattagttt 
accacagaga 
aaagcggata 
atgtctgagc 
tcccgggaaa 
acaattaccg 
tctctacggt 
gtgattaaga 
aaaaaatga 



tgactataca 
ctgatgtttt 
atgtgaaaat 
taaatgtcaa 
ctgacccttc 
tgaatgagaa 
atatatccga 
aagcagagga 
aagggcagga 
aggaatccga 
ttagcataca 
aagagggtgg 
atacagggca 
aattgactgc 
ctgtaaaaaa 
ttccttactt 
ttatttatta 
ctgtagagaa 
ctattgtcaa 
gtgtcaatgt 
agagcactaa 
gtactaaggc 
tttatggtac 
atccgaataa 



agatcttgtt 
cgtccgggag 
caaaggattg 
tacgggtgaa 
tttaagagat 
tacagtcttg 
aacaactgag 
gaaggtggtt 
tgttgtgtcg 
aaggacagag 
taaaggtagt 
attggataaa 
agaaactaca 
cgaagaaatt 
ggaaaagcgc 
aatagtaatt 
tccggatttg 
aaaagaaccg 
ggtggtagca 
gtcggagaaa 
aacagtggca 
gacttataca 
aaaagattta 
tgtaccgtat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1479 



<210> 657 
<211> 2543 
<212> DNA 
<213> B.fragilis 



<400> 657 

gaagactgct 

cgtacattga 

gatttaagcc 

acaaacccta 

acggagactc 

atccgctggt 



tgaacaagac 
ctactccatg 
cgttcaccgc 
taagaaatca 
ttctgtatat 
agacgggcaa 



agaattataa 
tcggtcatcg 
agaattctct 
gccagaatcg 
tttgcgatgg 
ggtaacttcg 



agattaacat 
tttcacgtgc 
acggaatgat 
taggtgaagt 
tacgtatggc 
gttctgtaga 



cgaggaggaa 
ccttcccgat 
ggaactggga 
acttggtaag 
tcaggaatgg 
cggcgacagt 



atgaagtcat 
gttagagatg 
aatacgtcag 
tatcacccgc 
gcaatgcgct 
cctgctgcca 



60 

120 

180 

240 

300 

360 
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tgcgttacac tgaagcacgt ctgaacaaat taggtgaaga aatgatgcag gacctctaca 42 0 

aagagactgt agatttcgaa cctaacttcg ataatacgct gatggaaccc aaagtgatgc 480 

cgacacgtat tccgaatttg ctggttaacg gtgcttccgg tattgctgta ggtatggcaa 540 

ccaacatgcc gccccataat ctgtctgaag tcatcgatgc ctgcgaagca tatcttgaca 600 

ataaagatgt gaccgtagag gaactgatgg aatatgtaaa agcgcccgac ttccctacag 660 

gaggatatat atatggcata agcggcgtac gtgaagccta tcttacggga cgcggacgcg 720 

tggttatgcg cgcgaaagca gaaatcgaat ccggacagac acatgataag atcgtcgtta 7 80 

cagagattcc ctacaacgtg aataaggcag aattgattaa agcaattgct gatcttgtca 840 

atgaaaaaag aatagaaggc atatcaaatg ccaacgacga gtccgaccgt gaaggtatgc 900 

gcatcgttat tgatatcaaa cgggatgcaa atgcaagtgt agtgctgaac aagctctata 960 

aaatgacagc cttgcagacg tcattcggtg taaataacgt tgcactggtc aacggacgcc 102 0 

ctaaaatgct gaatttacgc gacttgattg tttacttcgt agaacataga cacgatgtgg 1080 

taattcgtcg tactcaattt gacctgcgta aggccaaaga acgtgcacac atcttggaag 1140 

gtctgattat cgcttcggat aatattgacg aagtaattcg tatcatccgc gccgccaaaa 12 00 

caccaaacga tgcaatctcc ggactgatgg aacgcttcaa cctgagcgaa attcaggcac 12 60 

gcgccatcgt tgaaatgcgc ctgcgccaat taacaggtct gatgcaagat cagctccatg 13 2 0 

ctgaatacga ggaggttatg aagcagatag catatttgga aagtatcctg gccgatgatg 13 8 0 

aagtatgccg taaagtaatc aaagacgaat tgctggaagt aagagctaaa tatggtgacg 144 0 

aacgccgttc tgaaatcgtt tattcatcag aagaattcaa tccggaagac ttttatgcgg 1500 

atgatcagat gattatcacc atctcacaca tgggatatat caaacgtaca ccattgacag 1560 

aattccgtgc tcaaaaccgc ggtggagtag gctcgaaggg tactgaaacc cgtgatgaag 162 0 

actttgttga gcacatctac ccggcaacaa tgcacaacac gatgatgttc tttactcaaa 1680 

agggtaaatg ttactggctg aaggtatatg aaatacctga aggaacaaag aactctaagg 1740 

gccgtgctat ccagaacttg ctgaacattg actcggacga tgctgttaat gcatatttgc 1800 

gtgtgaagag tttgaatgac caggaatata ttaacagtca ttatgtactg ttctgtacca 1860 

agaatggcgt tataaagaaa acatctttgg aacaatactc acgcccgcgc cagaatggtg 192 0 

tcaatgcaat tactatacgt gaagacgacc gagtaataga agtgcgtatg accaacggaa 1980 

acaacgaaat catcatagcc aaccgtaacg gacgcgcaat acgtttccat gaagcagcag 2 040 

ttcgcgtaat gggccgtaca gctaccggag ttcgtggtat cacactggat gacgacggac 2100 

aggatgaagt aataggcatg atttgcatta aggatctcga gacagagtcc gtaatggttg 2160 

tctccgaaca aggctatggt aaacgttctg atattgaaga ttatcgtaaa acaaaccgtg 2220 

gcggcaaagg tgtgaagacc atgaatatta ccgaaaaaac aggtaaactg gttacaatca 22 80 

agtctgtaac agacgaaaac gacctgatga tcattaataa atcgggtatt acaattcgtc 2 340 

tgaaagtagc tgatgtccgc atcatgggcc gtgcaactca aggagtccgt ctgatcaatc 2 40 0 

ttgaaaaacg taacgaccag atcggttctg tatgtaaagt tacatccgaa agcctggaag 2 460 

atgaagttcc ggaagaagaa agagaaggaa atattccaag cgatccggaa acgaatacac 2 52 0 

cggtaaatga aacagaagaa tag 2 543 

<210> 658 
<211> 996 
<212> DNA 
<213> B.fragilis 

<400> 658 

attataaata gaatggtttt tgccaatatt gaatatttgt ttttgctgct gttgcttgtg 60 

ccatatattg tatggtacat aatgaagcgg aaaaagactg agccgactct tcagatttct 12 0 

gatgcacgag tatatgcgca tgcccctaaa agttacaaga attatctgct tcatgtaccg 180 

tttggtctgc gtatcatcac tctaatattg attattttgg ttttggcacg tccccaaaca 240 

acaaacagct ggcagaacag cgaaattgaa ggtattgata ttatgttggc tattgatgtg 3 00 

tcgaccagta tgttggcaga agatttgaaa cccaacaggt tggaagctgc caaagatgtg 3 60 

gctgcggaat tcatcaacgg tcgcccgaat gataacatag gaattacact gtttgccggt 42 0 

gaaagtttta ctcagtgtcc tttgacagta gatcatgctg tattacttaa cttatttcag 480 

gggatacagt gcgatattat tgaagatgga acagcagttg gtatgggaat tgccaatgca 540 

gtaaccaggt tgaaagatag taaagcaaaa tcaaaagtga ttattttgtt gacggatggt 600 

accaataata aaggagatat ttctccattg accgctgctg aaattgcgaa gagttttggt 6 60 

atacgtgttt atactattgg tgtgggaaca aacggtatgg ctccatatcc ggtacgggtt 720 

ggtggtacaa cacagtatat taatactccg gtagagattg atgagaagac tcttactcaa 7 80 

atagcaggta ctacggatgg gaactatttt cgtgctacca gtaattcgaa gctgaaagag 840 

gtttatgaag agattgataa attggaaaaa acaaagttga atgtgaaaga gtacagtaaa 900 
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cgtcaggaag aatatcgttg gtttgcgtta gctgcattct tatgtatatt gcttgaggta 960 
ctgttacgga actcaatatt gaagaagata ccataa 996 

<210> 659 
<211> 870 
<212> DNA 
<213> B.fragilis 



<400> 659 

atggaaacaa 

tctaacaata 

tccgaggtgc 

gcgcgtttca 

ttgatggtag 

atggtgacag 

ggagttattt 

catattctat 

atacgtcttg 

ctgtctgact 

catgatgtag 

ctgatgcgga 

ggagtacgta 

tttaccaaaa 

ttactgaatt 



gtgaaattct 
tctttgcagg 
gcgaatatca 
ataaaccttt 
atgtttccgg 
aaattgcagc 
tcttctcaga 
atatcattcg 
ctttggaata 
ttatcgatca 
ttgccattca 
ttaaagatgc 
gggcacatca 
gtaatgtcga 
tgtttgctaa 



aaaaaaagtt 
ccagtatcat 
gtttggcgat 
tgtgaaggtg 
cagtttggaa 
cactcttgca 
cagaatagaa 
tgaattgatt 
tttgacgaat 
ggaaaatttt 
agtatatgac 
ggaaaccgga 
tgaatggtgg 
ttctgtttcg 
acgaaattaa 



cgtcggattg 
tcagctttta 
gatatccgtg 
tttgaagaag 
tttggcactg 
ttttctgcta 
aagtttattc 
gattttaagc 
gtaatgaaaa 
aagaatgcaa 
aggcgagttg 
catgaacaat 
gtgaataaac 
gtacgtactg 



aaataaagac 
aaggtagggg 
acattgactg 
aacgcgagtt 
taaaacagct 
tccagaacaa 
ctcctaaaaa 
ccgatagtcg 
gacgttgtac 
tgaccatagc 
ccgagttgcc 
ggatcgacac 
aaactgagtt 
atcaggatta 



gcgtggactg 
tatggctttt 
gaatgtgaca 
gacggtaatg 
gaagaaggac 
tgataaaatc 
aggacgtaag 
tcggaccaac 
cgcttttatt 
gaataggaaa 
ggcagtaggt 
atcttcggct 
agatgaaacg 
tgtcaaagca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

870 



<210> 660 
<211> 1365 
<212> DNA 
<213> B.fragilis 



<400> 660 

gggtatggaa 

gaatgcatga 

gattcagaac 

gaaaaaccga 

gaagaatcga 

aaactttttg 

cctcaggttg 

aaagcttacc 

tatctgaaaa 

acaggaagac 

tccaatggtg 

ttatataaga 

gaatggatac 

atgcgtgaac 

aatatgctgc 

caatttcgta 

ggagagacag 

aggatggggg 

gactctattc 

ggaatttcta 

cgtatagaag 

gaagttttaa 

gtgattgatt 



gacctacagg 
aaagaaaaac 
agttgatgcg 
caggagagat 
ttaatatgat 
taatgggctg 
ataagtttta 
atgaggagct 
tatcagaggg 
atgtatctcg 
taaaggagtt 
agcaaatgct 
gtttacatta 
gagataatgt 
aacgtatgcg 
aggaagttcc 
aagaggattt 
cttttactta 
ctcaggagct 
ctgaattaag 
gtgaatatta 
taaggtgtga 
ccgatgaatt 



tgttccgcaa 
tatagatatc 
tcaattggaa 
agctgttatc 
tctggagttt 
tctttcggaa 
tggaaaattc 
tcatattgag 
ttgtgatcgt 
tcctatagaa 
tcaggtaatt 
tccggagttg 
tgcatatccg 
atgtaaatac 
tcgtcatgtg 
cgggattcat 
tgaggaattg 
ttcagaagaa 
aaaacaggca 
tgcatcaaaa 
cattgggcgt 
aggggataat 
cgatcttttt 



gaaagaattt 
ataacattag 
gaagccggat 
aatacctgtg 
gcacaagaaa 
cgttatttaa 
aattggaagg 
cgtactctta 
aaatgctctt 
gaaatcttgg 
gctcaggaat 
atcgaacgca 
gcacatttcc 
atggatattg 
acgaaaaagg 
ttgcgtacga 
aaagagtttg 
gaaggaacat 
cgattggatg 
gttggacaaa 
acagagtttg 
ttaatgattg 
ggtgaaataa 



gtagactcct 
ggtgctcaaa 
atgatgtaac 
gttttatcgg 
aagaagaagg 
aggaattagc 
ggttgttaca 
ccactcccaa 
attgtgctat 
atgaggtgag 
taacttacta 
tttctgagat 
ctgaagagtt 
cattacagca 
agacttaccg 
cgttgatggt 
tacgtaaagt 
atgctgcggc 
agttaatggc 
aaatgaaagt 
attctccgga 
gtaactttta 
tttaa 



tatttggaga 
gaacctggtg 
ccatgactcg 
tgatgcaaaa 
caatttggag 
gattgagatt 
ggatttgggt 
acattatgct 
accgattatt 
atatttggtt 
tggagtggat 
tccgggtgta 
gttccgagtg 
tatcagtgat 
gttgattgaa 
gggacatcca 
gcgctttgac 
taattatgaa 
tattcagcaa 
cattattgat 
agtagatcct 
tcaagtacaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1365 



<210> 661 
<211> 1248 
<212> DNA 
<213> B.fragilis 
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<400> 661 

ctttgtaaga 

caggtgactg 

gtaattcgtg 

tcgatgaaaa 

attaccaaac 

tttgaaaacg 

caggcaatgg 

agttggtttg 

accactgtga 

attattcatc 

gaatcttggg 

ttgattcgtt 

aatactgaga 

actccgattg 

gcagaaagtt 

gaatatttta 

gtatcgatgg 

gtaaaaggtg 

ggtcccagtg 

aaaagtgaaa 

agagcaagca 



gccctatgtt 
ataccaattc 
tcgtgtctgt 
gggcaaatat 
agactctttg 
taaaacgggt 
tacctgaaga 
agaaagatgg 
tgagtgaaga 
gaacatttac 
aaatggcatt 
taagattgac 
gtgccaaact 
aaattttgat 
gtaccggtgg 
aaggaagcat 
aaacgcttga 
cgatgaaagc 
gggg'gactga 
tttgcactat 
ataatgccct 



tgcggagata 
ggcatggatg 
acgagatcgg 
tgtactagtg 
caaatacttc 
attggcggga 
ttgcatcgtt 
taaggtactt 
ggttattcca 
tgtacaaaac 
acctgcttgt 
gggacgcggg 
tgaggcaatt 
aggcgaactt 
aagcattgca 
agtagcgtat 
aaaaagaggt 
gctaaaaact 
agagaagccc 
gaagcaggaa 
tctattgctt 



ataactattg 
gggcgcgaat 
gcagatgaga 
acagggggct 
ggtaccagac 
aaaataccga 
attaataatc 
gtttctatgc 
cgtttgtgtg 
tatcccgaat 
ctaaaactgg 
cagaatcgta 
ttaggagaag 
ttaaagaaga 
gcccgtatca 
gccaatgaag 
gcggtaagtg 
gactgtgctg 
gttggtacgg 
actaatcgtg 
cgaaaattgg 



gtgatgaact 
taaataaagt 
ttattgaagc 
tgggacccac 
tgatctttag 
tgaatgcgtt 
gggtagggag 
cgggagttcc 
caaagtttcg 
cggtgttggc 
cttatttacc 
gcgagataga 
atattttgga 
agaatttgac 
cttcggttgc 
taaagacgga 
aggaaacggt 
tagctacgtc 
tttggattgc 
gaagagagat 
tgaaatga 



gttgatagga 
aggtatagag 
tgtagatgct 
caaagatgat 
cgaggctgtc 
gaataaaagt 
tgcttctgtc 
tcaggaaatg 
tacgggtgca 
tgagaaattg 
aaaaccgggt 
agcttgtgta 
tgaggaagat 
tctttctacc 
cggcagttcg 
gttgctgagt 
cattgaaatg 
tggaatcgca 
tgctgcctat 
gaatgtggag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1248 



<210> 662 
<211> 1005 
<212> DNA 
<213> B.fragilis 



<400> 662 

ttagtattta 

agtgctttcg 

ttggtagagt 

ccgggtttgg 

agccgcatac 

agtcagaagg 

gctgatgaaa 

gaacgtcaag 

gctactcaga 

cgttttatgt 

cgccaaaata 

atcgaggcac 

gtagatatcg 

atgattggat 

tatgcgttta 

gatgtacttc 

gatgaaattg 



tggctgaatc 
ttaccaatct 
cactgttaat 
caaaaacttt 
agtttacgcc 
atgaaagttt 
taaaccgtgc 
taactattgg 
acccgataga 
taaaggtgat 
tcaatggaga 
gtaaagtggt 
tgtttgctac 
tcggtgggtc 
tcaaacgtcg 
gtcatcgtat 
tcagcaaaat 



aattgacatc 
tacaacaggc 
cggactactt 
ggccattaag 
tgacttattg 
ccaagtgaag 
tcccgctaaa 
taaagaaaca 
acaggaaggt 
tattgattat 
gaaatttaat 
tcgtcaagtt 
acgttatccg 
acctcgtgcc 
tggctatgtg 
cggattgaca 
attgaataag 



cgcgaattga 
atggaccaaa 
tctgatggac 
acattggcct 
ccagctgacg 
aaaggtccga 
gttcaaagtg 
ttcttgttac 
acatatccgc 
ccgaaacagg 
gtgaagccta 
tatttagacg 
gaaaaatacg 
tctattaatt 
attccagaag 
tatgaagcgg 
gtagaagtgc 



acgagcggat 
tcattgtggg 
acgtgctgct 
cgttgatcga 
ttgtcggtac 
tttttgctaa 
ctttgcttga 
ctgaaccgtt 
ttccggaagc 
aagaggagaa 
tcttgaaggc 
agaagattga 
atttgaaaga 
tagctttggc 
atgtacgtgc 
aggctagcaa 
cttaa 



tgaaagacag 
tcagaaacat 
tgaaggtgta 
tgcaaaatac 
gatggtttac 
ttttgtattg 
agcaatgcag 
ccttgttttg 
tcaggtagac 
attaattatt 
tgaagagatt 
acgttatatt 
attgaaagat 
tgcacgtaca 
tgttgcgcat 
tgtgacttct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1005 



<210> 663 
<211> 1257 
<212> DNA 
<213> B.fragilis 



<400> 663 

aacagaagaa 

ttttcaatgg 

gaagcgaaaa 

aacgaagcat 

tatattcaga 



tagacataat 
ttttactgat 
gcattgccgg 
taactaaccc 
aaagaatcaa 



attaataatt 
ggcagtaagt 
agaagtaaaa 
tgaaacaaag 
cgaaaaggag 



aatcaaacaa 
tttgcattcg 
cctgacttcg 
gataatgcgg 
atggaaaatg 



caatcatgaa 
ctcaggagaa 
caaaagctga 
caacttggga 
cttatctgag 



aagagtatta 
aaatgtaaaa 
acaactgatt 
cgtagcaggt 
aaaaccttat 



60 

120 

180 

240 

300 
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gatacattga aggtatacaa tagcgtactg aatatgtaca attattacgt taaatgtgac 3 60 

gaactggcac agattcctaa tgaaaagggt aaaattaaaa acaaatacag aagcgccaac 42 0 

tcaaaaacaa ttctggcaga acgtcctaac ttgattaacg gtggtattca atacttcaat 480 

ttaaataaga acgaagacgc attaaaatat tttgcagctt atgtagatgc agctacactg 54 0 

cctatgatgg aaaaagaaaa cttgctggaa aaagacacca ttttgccaca ggtagcatat 600 

tatgccactt tggctgcaga tagagtaggt gacaaagatg ctgtcatgaa atatgctcaa 660 

tatgctctga aagacaaaga aaatggccaa tttgcaatgc aattgttgac agatgcttac 72 0 

aaagctaaag gtgatactgc taaatgggta gaaaaattgc aggaaggtat tgttaagttc 7 80 

cctgaaaatc aatatttctt cgcaaatctg gttgactact atagcagctc caaccaaaat 840 

gataaagcaa tgcagtttgc tgatgatatg ttggctaaag atccgaataa caaattatat 900 

ctgtatgtga aggcatatct gtatcataat atgaaagatt atgagaaagc aattgagttc 9 60 

tataaaaaga ctctcgacat agatcctgca tatgcagaag catgctcaaa tttaggtttg 102 0 

gtatacctgt tacaagcaca agaatatgct gacaaagcac cggcagatat caatgacccg 1080 

aattatgcaa cagcacaagc tgagatcaag aaattctacg aagctgctaa accgtattac 1140 

gaaaaagcaa gagagctgaa acctgatcag aaagatttgt ggttacaagg tctttaccga 12 00 

gtatattaca acctgaatat gggaccggaa ttcgaagaaa tcgagaaaat gatgtaa 1257 



<210> 664 
<211> 303 
<212> DNA 
<213> B.fragilis 



<400> 664 

aattgtagga 

aatacaaaat 

caggaaagca 

gaacgtattg 

ctggcgttta 

tga 



ttttgaataa 
atacttctga 
atgctatagg 
tcatcaatcc 
agccgagtcc 



taaagaattt 
actgataaca 
aatacaggga 
cgtcactaag 
tatattaaaa 



acttctgaac 
tctctgctgt 
tttggtactt 
ttgcgactgt 
gataagttta 



tttctcgcag 
ctgatattac 
ttgaggtaaa 
tggttccacc 
aagaaacatt 



attggggtat 
tcaggaattg 
aaagaaagca 
caagttagta 
tccgtatgaa 



60 

120 

180 

240 

300 

303 



<210> 665 
<211> 441 
<212> DNA 
<213> B. fragilis 



<400> 665 

ttaccgatgg tgaaaatcat gaaaggggga gctgttgaag ctggcaaaaa ggcggctaaa 60 

aagggaattc aggtgaatgt attgggagta ggcttacccg atggagctcc cattccgatt 12 0 

gagggcagta acgactttcg tcgtgaccgt gaagggaatg taattgtgac tcgtctgaat 18 0 

gaggcaatgt gtcaagagat agcaaaggaa ggaaatggta tttatgttcg tgtagataat 24 0 

tctaattctg ctcagaaagc tattaatcaa gagattaata aaatggctaa atcggatgtt 3 00 

gaatctaagg tttatacaga ttacaatgaa cagttccaag tgattgcatg gatgatattg 3 60 

ctcttgttat tggtggaaat gttgattctg gaccgcaaaa atccattgtt taagaacatc 42 0 

aggttgtttt ctaataagta g 441 



<210> 666 
<211> 216 
<212> DNA 
<213> B.fragilis 



<400> 666 

agttttctat cttcaaaaca ggatgcaaaa atactaaaaa tgggtgaact gtgctcattc 60 

tttatctata attttaagag aggtcatacc aaggttatat ataaatcagc cggtttatgg 12 0 

atcgacaacc ggccggttta tggtttctct aaagacaaga gatcaccatt tcctgccctt 180 

ctcttccaac gggaaccata catattagag aattaa 216 



<210> 667 
<211> 1551 
<212> DNA 
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<213> B.fragilis 
<400> 667 

acaaacctcc cccctcgtct gttacaagga ctcataaacc taaaccgtaa cagacgtatg 60 

gaaaaaaaga aaatacccgt cgcactgatg atagcggcag gaatgctctt atacaacaac 12 0 

accgtcgcgg cgcagagcct ccctcccaca caggaaactt cgcaacatca gcttagcttt 180 

aacgaggcgc tgcaactgct gcacaaaggc aaccaaagcc tgaagatagc cgacaaaggc 240 

atcgacatag cccgtgccga acgtgggaag ctgaatgctt tctggatgcc cagcctgcaa 3 00 

tcgaccggag catttgtaca cctttcggag aagatagaag tgaagcaacc gctttctcaa 3 60 

ttcaccgatc cggccaaaga cttcgtacat tccatcttgc cggacgataa aatcatatcg 420 

tccatactcg atcaaatcgg gacgaacacc ctcatctttc cgttggcacc gcgcaacctg 480 

accactgtcg acctgaccgc cgaatgggta ttgttcgccg gaggcaaacg tattcatgcc 540 

actaagatag gcaatacgat gatagacctt gcccgtgaga accgggcaca gaccgatgcc 600 

acccaacgaa cactgcttgc cgaaagctac tacggattgc gcctggcaca agaaattgtc 660 

ggtgtccgcc tggaatcgta caaagcactg aagctgcatt acgagaacgc attgaaactg 72 0 

gagtccaccg gtatgataga taaagcggca cgcctctttg cccaagtcaa catggacgaa 780 

gcactgcgtg aactggaagc cgcccgcaag gaagaggccg tggtgcaacg caccctcaag 840 

accttgctga atctggagac gagcggagac atctcacctt cctctcccct gtttatcaac 900 

gatactcttc ctccgaagat ggagtttatg caggtagtgg gcatcagtaa ctatctgctc 960 

aaccaactga gtcttcaaga acacatggcc aagcagcagg tccgcatcga ccagagtggc 102 0 

tatctgccca atatcgccct tttcggcaaa caaactcttt attcacatgg catacagagc 1080 

aacctgttgc cccgcaccat gatcggggta ggcttcacct ggaacctttt cgacggactg 1140 

gaacgggaga agcgaatccg gcaatcacgc ctgacacaac aaacccttgc actaggacag 12 0 0 

gagaaagcgc gtgacgacct gtccgtcggg gtagacaaac tatacaccgg cctgcaaaag 12 60 

gcactcgaca acgtgcgggc gctgaacacc accatcgaac tgagtgaaga actggtacgg 132 0 

atgcggaaaa aagccttcgc cgaaggaatg gtcacttcga cagaggtagt agacgcagaa 13 8 0 

accttacttt cgaaaacaaa agtagccaga ctggcggcct actacgaata tgacgtgacg 1440 

ctgatgaatc tgctggcact gtgcggaata ccggaacagt tcggaagcat gaaggacatc 15 00 

acctctcttc ccattacgga gaacagaaga aatgaaatag aaatcgaatg a 1551 

<210> 668 
<211> 201 
<212> DNA 
<213> B . fragilis 

<400> 668 

aaaacacaac cgtctttcgg agtaaatgct tacttctttt ggtcgaaata tcggctcttc 60 

gaagtgaaaa cacagactct tccccttccc aagcccggtt ttctgggcaa agaagcagaa 12 0 

agttccggga cattctcccc tatatgggta agaaaacaaa aagggatagt tagcagaaaa 18 0 

tgccattacc tgtctttttg a 201 

<210> 669 
<211> 435 
<212> DNA 
<213> B.fragilis 

<400> 669 

tttctatata ctctgccaga ctttgtatgc tggctaccgg gagcatgggg atctggaatc 60 

catttgtgcc attttctctc ttccggtacg ccattggaaa gattgcaatc ggtgttcgga 12 0 

catatcaatg ggatgcctgc tttgtcaagt gcttccgaag caaagaaaat gtgtatgttt 18 0 

ctgcgttgga tgattcgtag ggattctcct gtcgacctcg gtatttggcg gagtttcagt 240 

ccctcagatc taattatccc tcttgacact catgtacatc gcatctcgac tgatcttgga 3 00 

ttgaccaatg cacgtaaatg cctgaaaaca gcacgttgca ttactgatgc gttgcgggaa 3 60 

atatggccgg atgatccggt aaagggagat tttgctcttt tcggatttgg tatcaacgaa 42 0 

ccggtgaaaa gttag 435 

<210> 670 
<211> 807 
<212> DNA 
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<213> B.fragilis 



<400> 670 

caaggagggg 

attgccgaat 

gagaacgtcc 

aaagatttag 

gtactcctct 

ccgctttttc 

gaaatattcc 

atgacgtatg 

aagggacgaa 

gcggtcacac 

ttatatggag 

ggatggcaaa 

aacgcactaa 

ccagagatcg 



caggcatggc 
ccgacctccg 
taatgattga 
gtgagaacta 
cacaatgtga 
atatcctcat 
tttccgtctc 
cacgagtggc 
tcgccacatt 
ccgtaaatac 
agatgggatt 
gcctgagacg 
gggatgccaa 
ctgcactggt 



taactggata 
cacatgggca 
tgacgaaagc 
tctggaaaag 
cgacgaacta 
tcaggaactc 
cagtggcgaa 
gacttgctat 
tcgcagtcgg 
tcccctatca 
caggacagta 
cttcaagggt 
cttcatcatt 
aatataa 



accctcaaac 
aacttgggat 
ctgacccaat 
attatcaaag 
tttctattga 
ggccagttga 
cccatcgcac 
agttccatcc 
acgatggaac 
aaccttgtcg 
cgcgaccttc 
atgggactgg 
gtccgcaaag 



aactgtcgga 
atatcacttc 
atcttgatgt 
aaaaggaact 
aaacccagaa 
ttacagacga 
gggtggcgaa 

tccggactct 
tgatgttcga 
gcgcgcatgc 
tacaatacgc 
ttacgtataa 
acggaaacat 



gaaacgcggt 
atcgaggata 
tcaccagacc 
ggaacgtgaa 
actacaccaa 
tcatgaacgg 
acgtaacaaa 
gggtgaacat 
taaatgcaat 
ctataatgtt 
cacccagaac 
gagtgtgatg 
cgagctgtca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

807 



<210> 671 
<211> 1242 
<212> DNA 
<213> B.fragilis 



<220> 

<2 21> unsure 
<222> (1135) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 671 

gagaaaagat 

aatctaatta 

acctctcgac 

atggctacca 

gacaacacgg 

gtaacccgcc 

tatggttatc 

actctgaact 

ttcgaaagct 

ggagtgaacg 

atttataacc 

tttcaggtgt 

acagccggtc 

ctgttgcctt 

ggcatcctgc 

tttatcattt 

atcatcatca 

tttccggtac 

cattacaccg 

ccttcggcag 

agacgagcca 



attctcagtg 
tgcaacactc 
gcctctactt 
tattcggcaa 
ccacttcgag 
acttcgttga 
tctcgatccc 
attattatca 
cgctcgcacc 
aacagcagat 
cgtcgctgga 
tggtactgct 
aatggttgca 
atacgcttat 
atataccttt 
ccactcaggc 
gtatcgtatc 
tcaacatgta 
aaattacaca 
tgatactctg 
taatcagtcg 



ggactctgtg 
ccctataacc 
cggtgtctgc 
cgggcagatg 
agatattacc 
cgaagccgag 
tccccgcttt 
ctatgccctg 
cgtagctctc 
agaaaccttt 
ttactcggtc 
tatcacggta 
ggcggcgggc 
cttcagcctg 
tcaagggagt 
actggcattg 
catggtcggc 
tccgttggtg 
aaccatgctc 
tatcttcccg 
taaatatgaa 



ctactctgtg 
cgtgtcatac 
ctggtacttc 
gaaaatatcc 
cggcggatgt 
gcacgcaaag 
gaacaagata 
ctctcggtag 
tcccccatcg 
ctactccccg 
tacctgagcc 
tatgccgtag 
ggcgacatca 
atcggtatat 
tggctgctgc 
ttcatcttct 
tcgctgggag 
cgcgacgcct 
tactatggcg 
ttattggcgt 
aacatccggt 



gtaaatcaaa 
aatgtgagtg 
cgttgttcac 
ccatcggcat 
ctgccgtacc 
cggtacagca 
tgatatcggg 
gaggtgagtt 
tgatgaaagc 
tgcaagccaa 
agcctttctt 
gaagcgaaat 
cggttgccgt 
tgggcaactt 
tcaacgtcat 
ccctgttccc 
ctactttatc 
cttatctgtt 
gcggatttat 
tggcgatgct 
aa 



actcaaagac 
gcaacggatg 
gctctttttc 
tgtcgaccgg 
caccttccgg 
gaaagaaata 
gcaggacgcc 
gatggccgcc 
tgtggcgctg 
caatcatccc 
ttttgtactc 
aaagttcggt 
tacgggcaag 
tgtcatgttt 
gacagtgctc 
tgccgtggca 
cggtgtcacc 
tccggtgcgc 
ccacntatgg 
tccgcacctc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1242 



<210> 672 
<211> 753 
<212> DNA 
<213> B.fragilis 



<400> 672 

ataaagaatc tatattttat ggagattttc tggaaaacta ttgcgtatta taattctgct 60 
acatggatct atcagttact gatcattgtc gccggcctgc tgttgacagt gatgcttata 12 0 
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aagaatcccc gtccgtgggt aaagatgggc atgaagctat atatgatttt tctgtatttg 180 

tggattgcta tcgcatatta tgccatctgt tgtgacgagc gcagttataa cggggcgctg 240 

gctatgttct gggtggttat ggccacgata tgggtatggg atgccatcac cggatatact 3 00 

actttcgaac gtacatataa atatgatatc ctttcgtacg tattgttgat tttaccattt 360 

gtatatcctt tggtatccat tgcgagagga cttacttttc caggcattac atcgccggta 42 0 

atgccttgct cggtaacagt tttcacgatc ggtctgcttt tgttgttctc ccgtaaggta 480 

aatatgtttt tggtgctgtt cctgtgccat tggtcgctga tcggcttatc gaagacttac 540 

ttctttaata ttccggagga tttccttttg gccagtgcaa cgatccctgc cctatatctc 600 

tttttccggg agtatttcct caacaacctg catgccgata caaagcctaa agcaaagtac 660 

attaattggt tgcttgtctt tgtatgcgta tctatcggaa tcttacttac caccaccctg 72 0 

tttctggagt tgatgccggg caaacagccg tag 753 

<210> 673 
<211> 1503 
<212> DNA 
<213> B. f ragilis 

<400> 673 

cgtgacgaac acgatatgaa taagaactta caccctttga tgctggccgg aaccggtagc 60 

gatgtaggca aaagcatcat tgccgcagct ttctgccgta tatttctgca agacggttat 12 0 

catccggcac cattcaaagc acaaaacatg gcattgaact cttatgccac tcccgaagga 180 

cttgaaatag gaagggcgca agccgtgcag gccgaggccg caggtgtgcc ttgccatacg 240 

gatatgaacc cgttgcttct gaaaccatcg tccgatcata cttcacaggt ggtgctcaac 3 00 

ggacgtccca tcggcaatcg gaatgcttac gaatatttcc gccgtgaagg gcgggaggaa 3 60 

ttgcgaaaag aggttcatgc cgcattcgac cgtttggctg cccgatataa tccggtagtg 42 0 

atggagggag cggggagtat ctccgagatc aatcttcgtg acagcgatct ggtgaatctg 480 

cccatggcca tgcatgccgg ggcagatgtg attctcgtgg ccgatataga ccgtggagga 540 

gtgtttgcca gtgtttacgg ttcggtgatg ctgcttcggc cggaagagcg gaagcatatc 600 

aaggggatat tgattaataa attccgtgga gatatccgcc ttttcgagtc gggggtaaag 660 

atgctggaag atctttgtgg tgttccggtg gttggggtgg tgccctacta taaagatatc 72 0 

tatattgagg aagaagactc ggtgatgctt cagaccaaga atatccgtgc cggacaaggc 780 

aaagtgaatg tggctgtcgt gttgcttcgt catttaagca atttcaccga cttcaatgtc 840 

ttggagcgcg atccgcgtgt acacttgttc tacaccaaca atacggacga gttgatgaaa 900 

gcggatatca tcctgttgcc cggttcgaaa agcactttgt ccgatctgta tgagttgcgc 960 

cgcaacggag tggcgcaggc catcgtccgt gcccaccgcg aaggtgccac ggtaatgggc 102 0 

atttgtggag gttaccaact gatgggtagg gaggtttgcg atcccgatca tgtggaaggc 1080 

gagatagaac gcttgccggg attggggtta ctgcctgtca gcacccgcat gcagggagag 1140 

aaggttaccc gccaggtacg gttctgtttt cttgaagaca gcgctgtctg cgaaggatac 12 0 0 

gaaatacaca tgggaacgac cacgcccctt gcggatgttc ctgtttctcc actcaaccat 12 60 

ctggcggacg gaagggagga tgggtatttt gtagaccgca cctgcatggg aacatacgta 1320 

catggcattc tcgacaatcc ttcagttatc gattacctgc tggagccttt cgccgataaa 13 80 

ctgaaagaga cggcttttga ttacaaagca tttaaagaag aacaatacga taaactggca 1440 

gcccatgtcc gtaagcacgt cgacttgccg cttatctatc aaatattgac agacaatgat 15 0 0 

tga 1503 

<210> 674 
<211> 1203 
<212> DNA 
<213> B.fragilis 

<400> 674 

tcagtcgtaa atatgaaaac atccggtaaa ctttcgcaga tttcctttat catcgcacgt 60 

gagtttcgtg ccatcagcac cagctatgcc gtactgttgg tactgatggg aggtatcttt 12 0 

gtttatgggt tgctctataa ctatatgtat gctcccaata tcgtgaccga cgctccggtg 180 

gcagtggtcg acaactcgca cagcagcctt agccggcaat acatccgttg gctcgacgcc 240 

acgccgcaag tagccgtata cgcacaagcc atggactatc gggaagcccg cgaatggatg 3 00 

aaagagggca aggtacaagg cattctgtac attccgcatg attttgagac ccgtgtcttc 3 60 

cagggacgcg aggctgtatt ctcactatac gccaccacag acgcctttct ctattttgaa 42 0 

gcgctgcaag aagccacttc acgtgtatac cttgccatca acgatgccca tcgcatggac 48 0 
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ggtgccgtat tcctcccccc gcagggactg cttgccgtgg ccatggccaa gcccgtaaac 540 

gtgaccggca ccgcactcta taaccacacc gaggggtatg gttcttatct gattccggct 600 

gtcatgatgg tcattatctt ccagacctta ttgatggtta tcggtatgct gacgggtgac 660 

gagtatcagc accgcgctac agaaccgttg cttccggggg gcaggacagt agataaaagc 72 0 

ggactctggg gaggggcaat gcgtcttgtt gccggaaaga cttttgtgta ctgcggactt 780 

tatacggtct tctccatgtt cttgttagga ttattacccc acttcttcag cattcccaat 840 

atcggaaacg gactgtacat taccgctatg atggtacctt atctgatggc gacctctttc 900 

ttcgggctgg cagcctcgcg ttacttcacc gattcggaag ctccgctgct gatgatcgct 960 

ttcttctcgg taggcttgat tttcctgtcc ggagtctcct acccgctgga actgatgcca 102 0 

tggtattggc gcatggcaca ttacatcctc ccggccgcac ccgccacgct tgctttcgtc 1080 

aagctaaact cgatgggagc cgatatggca gacatacagc cggaatacat tacactgtgg 1140 

atacaggtga tcgtctattt cgggctcagc gtgtgggtat acaagaaaaa gctggaagcg 12 00 

tga 1203 

<210> 675 
<211> 966 
<212> DNA 
<213> B.fragilis 

<400> 675 

ccaagcccaa aggctctctg ggaacacttg aagaactggc cttgcagatc gggcttatcc 60 

cagcaaacac ttactcccga gctgagacat cctcaaaata tcatattcgc agccgatcat 12 0 

ggcattgtcg acgagggagt cagcctctct cccaaagaga tcacctggca acaaatcagc 180 

aattttcttc acggaggggc aggtgtcaac ttcctttgcc gccagcacgg attcgagttg 240 

aagattgtag atgccggagt ggattacgac ctcccatacg agaaaggaat catcaacatg 3 00 

aaggtacgca aaagctcgcg taactatctg tacgaggcag ccatgacaga agaagaaatg 3 60 

aatttgtgca tcgagcgcgg agcggaagta gtccgtcagt gtcatgccga agggtgcaat 42 0 

gtgctttctt tgggcgaaat gggtatcggc aacacttctt cgtcttccat gtggatgacg 480 

tgcttcaccc atattcctct cgaactgtgt gtcggagcag gcagcggact cgacaatgca 540 

ggcgtccgtc ataaatataa tgtattgcag caggcactgg accattatca gggagacgga 600 

agcgcacacg acctgatccg ctatttcggc ggactggaaa tggtaatggc aataggcgcc 660 

atgcttcagg cagccgagtt aaagatgatt atcctggtag acggattcat catgacaaac 72 0 

tgcatccttg cagcctccca actttaccct gaggtattgc attatgccat cttcggtcat 780 

cagggagatg aatccggaca taagctggta cttgatgcca tgggagccaa gccattactg 840 

aatctgggtt tacgtctcgg agaaggaacc ggcgccatct gctcctatcc tatcattgac 9 00 

tctgccatac ggatgatcaa cgagatggac aactttgcac atgcagccat caccaaatat 9 60 

ttctaa 966 

<210> 676 
<211> 621 
<212> DNA 
<213> B.fragilis 

<400> 676 

agccatgcaa agataaatat tgtttccgaa atacccatag caatggcaca atattttgca 60 

tccgggaatg gaaatataaa atattaccgt acatttgcta accaaaaata cacagataga 12 0 

ttcatgaaac agatcatact catcaccgga ggagctcgtt cgggcaaaag cagctatgcc 180 

gaacgcctgg cgttatccct ctctcctaat ccggtttact tggccacctc acgtatctgg 240 

gacgaagaat ttcgtcaaag ggtattgcgc catcaagcca accgcggacc ggaatggacc 3 00 

aatatagagg aagaaaaaga attgagccgc cactctttgg aagggcgtgt agtgctgatc 3 60 

gattgtgtaa ccctctggtg caccaattat ttctttgatc tcgaagcaga caccgacaag 42 0 

gcactgactg ctgttaaagc cgagtttgac cgactgacac aacaggacgc gacccttatt 480 

tttgtcacca acgaaatcgg tatgggagga acttcagaaa acctgataca acgaaagttc 540 

actgacatgc aaggatggat gacccagtat atagcctccc gggccaatcg ggtaatacta 6 00 

atggaacggg gattcctgtg a 621 

<210> 677 
<211> 1509 
<212> DNA 
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<213> B. fragilis 
<400> 677 

aagaagaaaa atataatggc aaaagaactg aaagacctga ccaaacgcag cgaaaactat 60 

tcgcagtggt acaatgattt ggtggtgaaa gccgatttgg cagaacaatc ggctgtgcgt 12 0 

ggatgtatgg tgattaagcc ttacggatac gctatttggg agaaaatgca gcgtcagctg 180 

gacgacatgt ttaaagaaac cggacacgtt aatgcttatt tcccgttgct gattccgaaa 240 

tcatttctga gtcgtgaagc tgaacacgta gaaggctttg ccaaggagtg tgccgtagta 3 00 

acacattatc gcctgaaaaa tgccgaagat ggttcgggtg tggtggtcga tcctgctgca 360 

aaattggaag aagagttgat tattcgtccg acttctgaaa caatcatttg gaatacttat 42 0 

aaaaactgga tccagtcata tcgtgatctg cctattttat gtaatcagtg ggctaacgtt 480 

ttccgttggg aaatgcgtac gcgattattc ctccggactg cggaattctt gtggcaggaa 540 

ggtcatacag cacacgcaac gcgcgaagag gcggaagaag aggctatccg tatgttgaat 600 

gtatacgccg agtttgcaga gaagtatatg gcagttccgg tagttaaagg tgtgaaatcg 660 

gctaatgagc gctttgccgg tgcacttgac acgtatacca tcgaggcaat gatgcaggat 72 0 

ggtaaggcat tgcagagtgg tacttcacac ttcttgggac agaatttcgc aaaagcattc 780 

gatgttcagt ttgtaaataa agagaacaag cttgaatatg tatgggctac ctcttggggt 840 

gtttctaccc gtctgatggg ggcactgatc atgactcact cggatgataa cggtctggta 900 

cttcctccgc atctggctcc gatccaagta gtgatcgttc ctatctataa gaatgacgag 960 

cagttgaagc tgattgatgc taaggtagaa ggtattgtgg caagattgaa gcaattgggc 102 0 

atttcagtga aatatgataa tgctgacaat aaacgtccgg gctttaaatt tgccgattat 1080 

gaattgaagg gtgtgcctgt ccgtctggtg atgggtggac gtgacttgga gaacaatacg 1140 

atggaggtaa tgcgtcgtga tactctggaa aaagagactg tgacttgcga tggaattgag 12 00 

acgtatgttc agaatctgct ggaagagatc caagctaaca tctataagaa agcgcgtact 12 60 

tatcgtgact cacgtatcac tacggtggat agctatgatg agtttaagga gaaaatcgaa 132 0 

gaaggcggct ttatcctggc tcactgggat ggaacagtgg agacggagga aaagatcaaa 1380 

gaggagacaa aggcgacgat tcgttgcatt ccgttcgaat cgtttgttga aggtgacaaa 1440 

gagccgggta agtgtatggt gacaggaaaa ccgtctgctt gccgtgtgat atttgctcgt 15 0 0 

tcttattaa 1509 

<210> 678 
<211> 507 
<212> DNA 
<213> B. fragilis 

<400> 678 

gtctatatga aacaagaact aaaggaaaag cttttgcttt tagcggataa atatgaggtg 60 

aaagaattta ttatggacga tccgatacag tttccccatc ggtatactga taaagctgat 12 0 

attgagatct ccggactgat cgctttctgg atcgctaccg gtaatagaaa ggccattatc 180 

aaaagtggtg accggattga tcacgagctt ttcctgaatg ctccatatcg atatatatta 240 

agtgaagaat ggaggaaata tcgggggagt aacatccagt tttttatcgc tattactcct 3 00 

ggaatgattt ctatatactc tgccagactt tgtatgctgg ctaccgggag catggggatc 3 60 

tggaatccat ttgtgccatt ttctctcttc cggtacgcca ttggaaagat tgcaatcggt 42 0 

gttcggacat atcaatggga tgcctgcttt gtcaagtgct tccgaagcaa agaaaatgtg 480 

tatgtttctg cgttggatga ttcgtag 507 

<210> 679 
<211> 345 
<212> DNA 
<213> B. fragilis 

<400> 679 

tctgtggtga agttcattat aacatattcg tttgtcaaat tggtacctta taaaatgttt 60 

tctggatcaa aaagacaggt aatggcattt tctgctaact atcccttttt gttttcttac 12 0 

ccatataggg gagaatgtcc cggaactttc tgcttctttg cccagaaaac cgggcttggg 18 0 

aaggggaaga gtctgtgttt tcacttcgaa gagccgatat ttcgaccaaa agaagtaagc 240 

atttactccg aaagacggtt gtgtttttca aaaatgaata agagttttgc ccaatatctc 3 00 

tatatgtttt cttccattac atatattgtt cttatttggg agtga 345 



269 



<210> 680 
<211> 1002 
<212> DNA 
<213> B. fragilis 



420 
480 



<400> 680 

gagatggaga attcagaatc taaaaaaggc agaaccttaa gtatcgcatt catcgttgta 60 

cttgtggcag tagcactctt caccgtcatc ggaatgattg ccatgcgcca ccagcctctc 12 0 

gtcttgcaag ggcaggccga agctaccgag attcgcatca gcggcaaact gccgggacgc 180 

atcgacacct tcctggttga agaaggccag tgggtgaaac aaggagatac gctggtagtc 240 

atcaacagtc cgactgtaga ggccaagtat cgccaggtgg acgcattgaa acaagtagcc 3 00 

gtagaacaga acaagaagat tgacgccgga acccgcaagc agatcatagc tacggcgcag 3 60 
caattatgga acaaaactca aagcgacctg acattggcac ggacaacgta caaccgtatt 
ctcactttat ataaggacag tgtagtcact tctcaacgta aagatgaagt ggaagccatg 

tacaaagccg cacaagcggc cgaacgggct gcttacgagc aatatcaaat ggctgtagac 540 

ggagcacaaa gtgaagataa agcctcggcc cgctcgatgg tcaatgcggc caacagcacg 600 

gttgatgaag tttcatcact cctggtggat gcccgcctga tcgcaccgga agatggacaa 660 

atagcaacca tctttcctaa acggggcgaa cttgtcgcac cgggcactcc gatcatgaac 72 0 

ctggtggtga tggatgatat acacgtggta ctgaacgtaa gggaagacct gatgccggac 7 80 

ttccgtatgg gaggtacatt cattggggat gtgcccgccc tggcccaaaa aggaatcggg 840 

ttcaagatat attatatcag tccgctgggt agttttgcta cctggaagtc gaccaagcaa 900 

acgggcagct atgatttaca gacattcgaa atccatgctc gtcccaccaa gaaagtggag 960 

gggctgcgtc cgggaatgtc ggtactggta gaaatcaaat ag 10 02 



<210> 681 
<211> 411 
<212> DNA 
<213> B . fragilis 



<400> 681 

gaaccggaga aagcaagtca agaccggccg aagaactact tgcagaactt atcagagaaa 60 

gggaaggtaa tgattgaaat acataccatc gtaacctttg ataaagaaat gaaacggctt 12 0 

agtaagaagt atcattcaat aattaaagat tacgcagctc tgatagaaga tttaaaaaag 180 

aatccgcata taggggtaga cctgggaaac ggcatacgaa aagtacgaat ggctatagcc 240 

agtaaaggga aaggaaagag cggaggcgca cgggtcatca ccgatacatc agccattatc 3 00 

agcgtagaag aaggcagagt taccctactt accatttatg acaaatccga ccgggaaaat 360 

atctccgaca atgagataat aagacttcaa caagaaattc tgaagaagta a 411 



<210> 682 
<211> 498 
<212> DNA 
<213> B. fragilis 



<400> 682 

aaacagtttg 

ggtattggtt 

cttgcggcag 

gccacggctt 

aaggtgtatt 

aagtttgcct 

gagcgtctgg 

atctttatgt 

tccctgacac 



atatgatagc 
ttggagccat 
ccggacatgc 
ctttgttcgg 
gtccgatgac 
acaatatggt 
gcaaatacat 
tggcagtagg 
ggcattaa 



tttagatatt 
atccgatcct 
ctgccgttat 
ggcattggtt 
tgtgctttat 
gttctcgttg 
ggagacgttt 
agctaccttc 



ctttccgatg 
ccgttgcggg 
tgcctgatga 
atcggctttg 
ataccggcat 
attatgagtc 
ttctctaatg 
cccatgtttc 



gattttttgc 
ctttcaagat 
ctttcctggg 
gcagtttgtg 
tgctcccgat 
tgcaaacaat 
gcctggttac 
tgcttcctca 



cgcaatagcg 
gattgcgata 
tgtcgacatt 
gctcgggcgg 
gattccgggt 
gaacgaaccg 
ctgtaccgtt 
caaagctttt 



60 

120 

180 

240 

300 

360 

420 

480 

498 



<210> 683 
<211> 840 
<212> DNA 
<213> B. fragilis 



270 



240 
300 



<400> 683 

aatgacttat gtgctttctc tttcctttct tttttgtact tttgcgcccg atttaaaaaa 60 

gtgagcaatt ttatgactac aaatgaatct ttgatttcca tatccaagtt cattgccgga 120 

tattcggccc atttgatggg agcaggtgtg catacctccc gtgtgatccg taattcaaag 180 

cgcatcggag aagcctatgg agtggatgtg aagttgagtg tgtttcacaa aaacatcatt 240 

ctgactatca ttgacaacga gacgcgtgaa gcctgcaatg aagtgattga tatccctccc 3 00 

catccgatca gttttgaaca taactcagag ttgagtgcct tgagctggga ggtttacgac 3 60 

aaacatctgt ctttacacga attgtcggat aagttcaaca aaatcatatc ggcaccgaaa 420 

atagatccgc tatttgttct tttactggtc ggatttgcca atgcttcatt ctgtaagttg 480 

tttggtggcg atattatttc tatgggcatt gtcttttcgg ctaccatcac cggacttttc 540 

ctgaagcaac agatgcagaa gaagaaaatc aatcattata ttattttcat tgtttccgct 600 

tttgttgcgt cgctttgtgc atcgacggca ctgatttttg ataccacttc ggagatagct 660 

cttgccacca gcgtgcttta tctggttccg ggtgtgccgt tgatcaacgg tgtgattgat 72 0 

attgtagagg gatacatcct tacgggattt gcccgattga cggaagccgc gctactgatt 7 80 

gtcagcattg cgataggcct gtcgtttaca ttgttaatgg ttaaaaacag tttgatatga 840 

<210> 684 
<211> 1743 
<212> DNA 
<213> B. fragilis 

<400> 684 

tcgcctctcc ggggatatgt aaacaagtat tcaatcaaca taacttttta tactatggaa 60 

cttttaagaa acctgtttga gggatacccc aacctttggg gtggaggagt ggcacattcc 12 0 

gtgcttatcc tgtcgctggt cattgcgttt ggcattatgc tgggtaaaat aaaagtagca 180 
ggcatctctc tgggagttac ctggattttg tttgttggca ttgtcttcgg acattttaat 
ctgaatctga acgagcattt attgcacttt ctgaaggagt tcggacttat cttatttgta 

tattccatcg gattgcaggt ggggcccgga ttcttctccg cttttaagaa aggaggattc 3 60 

accctcaata tgttggctat gatcgttgtc tttgcaggag tcatcattac ccttgcattg 42 0 

cattttataa ccggaatacc gattaccacc atggtaggta ttttatcggg agcggtgacc 480 

aacacacccg gattgggtgc tgcgcaacag gccaacagtg acctgaccgg gatagatgca 540 

ccggagattg ctttgggata tgctgtagcc tatccgttgg gcgtagtggg atgcatcatg 600 

tcgctgttag gccttaaata ccttttccgt attaatacca agcaggagga agccgaagcc 660 

gaacagggac tgggacattt acaagagttg acagtccgtc ctgtttcatt ggaggtccgt 720 
aatgaagctc ttcacggtaa acgtattaag gatatacgtc cattggtaaa ccgtaatttt 
gtggtatcgc gtatccggca tttgaacgga aagaaagagt cggaattggt aaattccgat 
actgagcttc atctgggtga tgaaatattg gttattgcga ctccgataga tatagaggcg 

atcactgcgt ttttcggcaa accgatcgaa gtggaatggg aacagctgaa caaagaactg 960 

atttcacgcc gaattctgat aaccaagcct gaactgaacg gtaagacatt ggcgcaattg 1020 

aagattcgta ataattttgg tgccagtgtc acccgcgtca accgttcggg agtggatctg 1080 

gtggcaagtc cccagttgca attacaaatg ggagaccgtg tgacgattgt cggcagtgag 1140 

ttggcggtga gtcatgcaga aaaggtattg ggtaattcga tgaaacgcct gaatcatccg 12 00 

aatctgattc ctatttttct gggtattgcc ttgggatgta tcctgggtag catcccgttt 1260 

atgtttcccg gaattcccca accggttaaa ctcgggttgg caggaggccc gttgattgtt 13 2 0 

tcgattctta tcagccgttt cggcccgcag tataagctga ttacttatac cactatgagt 13 80 

gccaatctga tgataaggga aatcggcatc tcgctgtttc ttgcttgtgt cggtctcgga 1440 

gccggtgacg gatttgtgga aaccattatc catgaaggcg gatatgtgtg gatcgcttac 1500 

ggtatgatta taacaatcgt tcccctgctg ctggccggtt ttatcggacg ttatgctttc 1560 

aagctgaact attatacgtt gataggggtg ttggccggtt ccacaacgaa tccaccggcg 162 0 

ttggcctact ccaatgatct gacatcgtgt gatgcgccgg cagtaggtta tgctacagtc 1680 

tatccgctga cgatgttcct gcgtgtgctt acggcacaat tattaattct ttcgttaggt 1740 

tga 1743 

<210> 685 
<211> 576 
<212> DNA 
<213> B. fragilis 



780 
840 
900 



<400> 685 



271 



atgatgaaac 
gaaagggtag 
gcagtgatcg 
caccacctcc 
cgcgataaga 
gatacaatga 
gtctctgctc 
accttgaatc 
gatctgtttt 
tggcaggcct 



gaatctatac 
agaaagatga 
gcattatccg 
aaagagagtt 
atccgaatgt 
ctgccggact 
agttacagtt 
ggcaagatgc 
ttgtaatggc 
tcgcatataa 



acggaccggt 
tatccggatc 
ttcattgctc 
aatggtagtt 
gctgtcgccc 
gaaagagaac 
tgcccgtacc 
tgttccggaa 
acgcttcgac 
gacaaagaag 



gaccggggaa 
gaggcgaacg 
cctcaggagc 
atgagtcatg 
ggactggcgg 
ggttatttcc 
gtagcccgcc 
gatattctga 
atgcaacaac 
aaataa 



caaccggcat 
ggaccatcga 
atgactggca 
tggctactcc 
ctttctgtga 
tgttgcccgg 
gtgcagagcg 
gctttatcaa 
aggactggcc 



tcatggcgga 
tgaattgaat 
gaagttgctg 
atccgccatt 
gcaagagatg 
tggcacacct 
gcggctctgg 
tcgtctgtcc 
ggaggaacgc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

576 



<210> 686 
<211> 783 
<212> DNA 
<213> B.fragilis 



<400> 686 

cagaatgtca ggcggagaat gtgctttcgg tctgttatgc ggtatacttc cttcagcact 60 

cctgctaccg taccgatact ggatggctat cgtatttccg ctgatcatgc tgtatctgct 12 0 

ttgcacactg atgaaacgga agttgcaagg ttataccggc gactgttgcg gagcactgtt 180 

tcttctaagc gaactgtctt tctacctggg aatagttata ctaatgttta tatagtcatg 240 

gaagtcatat taatccgcca tacctctgtc gatgttccta aaggagtctg ttatggccag 3 00 

actgatgtac ctttgcgaga tagttttgaa gaagaagcct caattacggc tcaacaacta 3 60 

cagaacgacg tatttgatgc cgtattcaca agcccgctga gtcgttgcac ccgcctggca 42 0 

gaccactgcg gttatccgga tgccattcgt gatgcccggc tgaaagagct gaacttcggt 480 

gaatgggaga tgcaggagtt tgataaaata tgtgatccgc gactggagga gtggtataac 540 

gactacttcc atgtagcggc tacaggcgga gagtctttta tgatgcagct tcaacgggta 600 

tcggagttcc tgaatgaagt gagtggaaaa gagtataaac gcatagccgt ctttgcacat 660 

ggaggggtgc tgatttgtgc acaaatctat gcagggatac tgagaatgga agacgctttt 72 0 

aacgcactga caccttacgg cggagtggtc cggctgcaac ttaactcaaa gacagaagaa 780 

tag 783 



<210> 687 
<211> 978 
<212> DNA 
<213> B.fragilis 



<220> 

<221> unsure 
<222> (704) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 687 

gatagagaaa 

ttgtttccgt 

ccgcatcctg 

gggagagccc 

ttttttacct 

atacaaacac 

atggtcttcg 

gtgggacggg 

ttagccgaaa 

gtgcccggca 

aacgagcgtt 

attcttgccc 

aggttcgtgg 

gctttggccg 

gtgtggaagc 

gccgtctgtg 



gtatggatgg 
taccgttagc 
tggtaggatt 
gcgtttggaa 
tcctcttttt 
tgctgatctt 
aagccgtaga 
atacctctgc 
atctgagtga 
tgatggcgta 
accgtcaatt 
gcctgacggc 
gtaaatatgg 
gtattctgaa 
cttttatagg 
tcaatcggca 



aatgttcttt 
ctggttgttc 
cggtaaactg 
agggggaatg 
taaagtgatc 
ttgttgcctg 
ccgttcattg 
actatctgca 
tggggtaatt 
caagatggtg 
cggctgtata 
tttgttgatg 
cagccggcac 
ttgccgtttc 
caacaacgaa 
ggcagaggta 



tggtatatta 
gatcgctggc 
atagcttggg 
atgtcggtag 
ggagagtatt 
gcaggtacaa 
gacgaggggc 
caagaggtgc 
gcaccacttt 
aatacacttg 
gctgcccgga 
atcctcgtca 
gccagcccaa 
ggcgggcccc 
cgagcattga 
ctgatggttg 



gccttgtgta 
aaggcgatcc 
gagagaagtg 
cattgattgt 
ccatcatact 
ccctgatacg 
gtaaacaagt 
gtaccgcagc 
tttggtatgc 
actctatgat 
tcgatgatgt 
ccgnacggtt 
actccggtat 
actattattt 
caaccgagga 
tgttggtgtg 



tttttgcagg 
gtcctggttg 
tctgaatgcg 
gggagtctac 
gacagcgctg 
tgaggtccgg 
ggcccgcatt 
tttggagact 
agtgttggga 
cggttaccgc 
tgccaattat 
ttctttgctc 
ccccgaagct 
cggcgaagag 
tatgaagaaa 
gctgactatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 
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cttctgtctt tgagttaa 

<210> 688 
<211> 399 
<212> DNA 
<213> B.fragilis 

<400> 688 

tttattatct ttgtatttgt actaaaagaa acaagatgta ttatgaaaga gcctgaaaaa 



978 



50 



300 
360 



tacaagcaac cagaagagga aaccacccga ctgtccgagc cgacagtagc ttacaatagt 12 0 

atggcttatc tcgaattaga agcagaaaaa gcagaactga tccggactat tgccaacata 180 

gacagtaaag aaatcatcga taaagtgaaa cagaaacttc acgatgtact cggtttggac 240 
aaaaacaggg aaaccgaacc ggagtgtaaa aaatatattc tcgcaaatat aaaagaagcc 
ttctgcgaac aagaaagagt aagaaccgga gaaagcaagt caagaccggc cgaagaacta 

cttgcagaac ttatcagaga aagggaaggt aatgattga 399 

<210> 689 
<211> 3255 
<212> DNA 
<213> B.fragilis 

<400> 689 

atcaatttaa tagataatca catgaacaag aaactaatcc tatctatttt cgttctggca 60 

ggtgctcctg tgctgctgtc ggctgcagga gaagcacggt tgttgcgttt ccccgctacg 120 

aacggaaatg agatcgtatt ttcgtatgcg ggcgatttgt ataaagtgcc tgcttcggga 

ggtgaagcac aacgcctgac ttcccatgtg ggatacgaaa tgtttccccg gttttctccg 

gacggcaaaa cgattgcatt caccgggcag tacgatggaa atacagaagt gtataccatg 



180 
240 
300 



420 
480 
540 
600 
660 



780 
840 
900 
960 



cctgcaacgg gtggcgaacc actacggata acctacacgg ctaccaatag ccgtgacgac 360 
ttgggtgacc gtatgggacc taacaacatt gttatgacat ggactccgga cggacaacgt 
atcgtgtatc gtaatcgcat cagcgacgga ttctccggta aactctttac tgtagacaaa 
gaaggcggat tgtcagaagt cattcctctt cccgagggag gcttttgcag ctattcaccg 
gacggaaaac aattggcata caaccgggtg atgcgcgaat ttcgtacctg gaagtattat 
aaaggcggta tggccgatga catctgggtg tataatccgg gaaacaaaac agtggagaat 

gttaccaata atgtagctca ggacattttt ccgatgtgga ttggcgatga aatctttttc 720 
ctttccgacc gtgaccgtat catgaatatc ttcgcataca atacgaagac caaacagact 
gtaaaggtga cgaacttcac tgagtatgat gtgaaattcc caagcgtcca tggcaatacc 
atcgtttttg aaaacggcgg atatatttat aagatggatg ctgcggcccg gaaagctgaa 
aaggtaaaca ttacactggc ttctgataat atctatgccc gcaccgattt gaaagaggga 

gcgaattatg tgactgcggc cagcctttca ccggatggag cacggatggt agtgacaagc 102 0 

cggggtgaag tattcaatct gccggtagag aaaggggtta caaaaaatat aactcgttcg 1080 

ccgggagctc acgatcgtga tgcacagtgg tcaccggacg gaacacagat tgcttatatc 1140 

tctgatgcca caggggaaac cgaactttat ctgcagaatg cggcaggtgg cgagccgatg 12 00 

cagtttactc ataagaacga tacatatatc cgtgacttta aatggagtcc ggattctaag 1260 

aagatagtgt atatggatcg taagaaccga gttaatctgc tggatgtggc ttccgggaaa 132 0 

gtttctttat tattgcaaga tccggtggga gtgccgggtg gagttacttt ctctccggac 13 80 

agtgagtggt tgacttatac acggatgggt aaaaatgaaa tcaatgtcgt atatgtctac 1440 

aacattgcgg aaaagaaaga atatccggtg accgacaaat ggtataactc ttcttctccg 1500 

gtgttcagtg ccgacggaaa gtatctgata ttctcttctg cccgtgattt taacccgact 1560 

tacggatcat tggaatggaa ccatgtatat aataatatgt atggtgtgta catcgctttg 162 0 

ctgtctaagg atacatcgtc tcctttcatg cagaaagatg cggaagtggc tgtatcgaat 1680 

gctaccccca aaagcgggga taagaaaccg gcagataaga aggaagtggc cgatgcttcg 1740 

ttggtgaagt tcgatccgga tggcattacc gatcgcatcg ttcgcttgcc cttgtctccg 1800 

tcttattatg gtaactttta ttcggatggc aacaaggtgt actactgggg acgtggtggt 1860 

acgaaaatgt atgacttggc aagtcagaaa gaggaatcga ttgccgatgg agcttcgatg 192 0 

gatgttactt acgatggtaa gaaggcactt ttctttaaag gccgtcagat ttatgtgacc 1980 

aatcttcctt cgggtaagac agaactgacc gctccggtcg atttaagcaa tatgaagatt 2040 

actgtggatt atccgaaaga gtgggcacaa atttttgatg aagcttggcg tgcctatcgt 2100 

gacggattct atcaggagag catgcacggt gtagattgga aagcaattaa agaaaaatat 2160 

gcggtcttgc tgccttacgt taaaactcgt ttagacctga attacattat cggtgagatg 222 0 



273 



ataggtgaat tgaactgtgg gcatgcttat gtcaatccgg gagaaacgga acagcccaaa 2280 

cggatcaata ccggcttgct tggtgcggaa ataactcgcg acaagagtgg ctttttccgt 2340 

ctggagaaga tattccccgg agcatcttgg agcaaagaac tgcgctctcc gcttacggaa 2400 

ccgggtgttg atgtgaaagt gggagagtac atcgtagcta ttgacggtgt gccgactaat 2460 

acggttaaag atatgtattc tttactggtg ggtaaagcag agatacctac tgaaatttcg 252 0 

ctgaatgcca aacctcagct ttccggagca cgtaaggttg tgatcagtcc gcttgccaat 2580 

gaatatcctt tgatacatta caattgggta caggataata taaagaaggt ggaccaggct 2 640 

tccaacggac gtatcggata tatttatatt ccggatatgg ggccggaagg cttgaacgag 2700 

tttgcccgct atttttatcc gcaacttgat aaagaagggt tgattatcga tgatcgtgcc 2760 

aatggtggag gtaatgtttc accaatgatt ttggaacgtc tttcccgtga accttatcgt 2820 

ctgactatgg gtagaggtac cagccatgtg ggaacagtgc ctgatgctgt acaggtggga 2880 

ccgaaggttt gtttgattaa taaatactcc gcttcagatg gcgacctgtt cccgtggggc 2 940 

ttccgcgcac ttggcttggg taagttgata ggaactcgta cctggggagg cattgtgggt 3 000 

atcagcggat cattgccata catggatggt acggacatac gtgtgccatt ctttacgagc 3 060 

tatgacccga agaccgggaa atggattatt gagaaccatg gagtagatcc ggatattttg 312 0 

attgacaatg atccggtgaa ggagtggaat ggagaagacc agcaactgaa cagagccatc 3180 

gaagaggtta tgaaacagct taaagatcgt aaaccgttgc ctccggtacc tgctccgaga 3240 

O O ET C 

gattttagta aataa o^~>~> 

<210> 690 
<211> 1347 
<212> DNA 
<213> B.fragilis 

<400> 690 

aaaccgacaa taatggaatg taggatgatt tctcaatttc tgatagcggc tccttcttcg 60 

ggcagtggaa agacaacggt cagtcgtgga ttgatggctc tgttgattaa gaagggactg 12 0 

aaggttcaac ctttcaaatg cggtcccgac tatatcgaca ccaaatatca tacggcagtt 18 0 

tgcagacgtc cttccatcaa tttggatacc tttatggctt cggccggaca tgtaaaggag 2 40 

ctttatgccc gttatgccac aggggccgat gcctgcatca cggagggtat gatggggatg 3 00 

tatgacggtt acgaccgtga ccggggttcc tcggcagaag tggccggatt actgaattta 3 60 
cctgtcatat tggtggtcga tgccaaatcg gccgcttatt cggtggctcc tttgctttcg 
ggcttcattc actttcggcc cgagatcagg atagcgggtg ttatattcaa tcgggtaggg 

tctccgcgcc attacgaaat gttgcaggaa gtctgtaccg agttgggaat tacctgtttg 540 

gggtatttgc ccaaacagga gagcttggta caggaatcac gttatctggg gctggatttc 600 

agccattcga aaggaacgga cgcactggaa gagctgaccg gattaatgga aaagtacatc 660 

gactataacc gtttgcttga ggaaacgaaa cttcctgctc cgatacctcc tgtttcaaat 720 
atttctctac aggaagattt gaagatctcc gtggcatgca attcggaatc tttctctttc 
atttatcagg aacacctgga tgtgctttgc cgcctgggaa ccgttattct ctttaatccg 
gaggataatc gcccgttgcc tgaaggtacg gacttgcttt atcttcccgg aggctatccg 
gaaaagcatt atgagaaatt gcgtcaggct tggcaaagga tgcagtctat acgtaactac 

gcggagtctg gcggacgagt acttgccgaa tgcggaggaa tgatttatct ctctaaaggc 102 0 

attctccttg accggtcgga gcactcggac agtgaggtcg ggttgcaggc aggggtactt 1080 

ccgttcttta tctcgaatcg taaggctgac aggcgcttga ctctggggta ccggcagttc 1140 

gattataacg gacaacatct tcgcggacac gagtttcact atacacaatt cgagccgaaa 1200 

ccggaagagt cactggaatc agtcactcag gtatacaatg ccaagagaat gcctgtcagt 12 60 

acacctgtgt tccgatataa aaacgtgata gccagctata cgcatctata ctggggggag 132 0 

atcgatttac ttaaattgtt tgaatga 1347 



420 
480 



780 
840 
900 
960 



<210> 691 
<211> 2466 
<212> DNA 
<213> B.fragilis 

<400> 691 

ctgtttccct tccaaaacaa agcaaacatg ataattagta aaaatccttt gggcgacata 

gccaaactaa acagaatttg tgcttcggca cagatcggat ggtgggaagt gaacttcact 12 0 

acaggaaagt gttttatttc ggaaaccctg cttaaatcat tggaagtcag tagtgtatgg 180 

ttagacattg acgagttgat gtctaccgta cgacaggatt atcgtaagcg cattacggat 240 



60 



274 



gagtttacct 

cgcggtaatg 

caattgattg 

tgtgcatgga 

ctcctgaaac 

attctgtact 

aatcaaagct 

cagaatatct 

gttattttga 

gccgaaaacg 

ggatatatgg 

tggttctctt 

cgggtgatgc 

gaactatata 

ggtgtcggcg 

cgggatatac 

ttcgatgaag 

cggagcctga 

gataatacgg 

attatctctg 

ttcgccgttc 

attatcggta 

gaaaaggcga 

gatggtgcgg 

cccaatccac 

gaagcggagc 

ttcctggcaa 

gatcttctga 

aaaaacaata 

gcgggatcgt 

gtgcagaaga 

ccgctcgata 

atgagcaatg 

gcccggcagc 

caggaagctg 

gggcttcccc 

gagttgggga 

cgctga 



ccatacctcg 

tattctggat 

ccaccggata 

atcagcgcat 

ttttgagtaa 

tctttaaagg 

gcctttatga 

gttcggaaga 

actcacccga 

ggacaaactc 

gtatcgacat 

cgctggcaaa 

acagtgagaa 

ataaggaagg 

ataagaaccg 

atgaaagcct 

aacgcaggct 

tgctttatga 

agcgcaacaa 

attattccaa 

gccagtggta 

tcttttcaca 

aagcgggtac 

atcggtggaa 

ggttggaact 

ttcgggccgc 

acatcagtca 

tgacggttga 

cattgcttct 

ttgagtatat 

tgaggaacaa 

cctggttcag 

cgattaagtt 

aacttgaaat 

ttttcgaccg 

tgtgcaaaag 

agggttcacg 



gaagggggtg 

acattgtgcg 

tggccagcgg 

caacaatttg 

tgataccggt 

tgcccgggtc 

agtggcggcc 

tgctccctgg 

cgaactgcct 

gatgatgctg 

cgtagacgga 

tatcattagt 

attgtttcac 

tatgttgctg 

gatcatcgga 

tcgggcaggc 

ttttcagtcg 

tgcggaagac 

tgcgctgagt 

agtggggtat 

tcgtaatctg 

catgcatcct 

ggaacgcttc 

ctggatacac 

ggtagaggtg 

acgggacaag 

tgaaatacgt 

cgatccggca 

gcaattgttt 

gccgaaacct 

ggtgcccgaa 

tgccgacagc 

tacgcatcga 

gttcgtagaa 

ctttatgaaa 

cattatcgag 

cttctggttc 



ttcgaacaga 

ttgagcatgg 

atagagagtc 

ctctattgcc 

gacgaattgt 

tacatcgtgc 

tgtaacgtga 

ttctatcagc 

ccgcttgccg 

gcacctctga 

taccggaagt 

atctgcatgg 

gatatattca 

gactgcaata 

ctgaacctgt 

cgtcccggta 

gagcgaagag 

aacctgtcaa 

aaggtgcacg 

gccaaaatca 

ggagaaagcc 

gatgaccgta 

tttgacggtg 

aagtcttcta 

aactatgata 

gcggaagagt 

acgccgctga 

gagcaggaag 

tcggatatca 

gtctgccttt 

ggagtcgaac 

ggatatctga 

ggcaccatca 

gatacaggta 

gtggacagtt 

aagatgggcg 

acgcttccag 



cctttcctgt 

aggaggagaa 

cggagacaca 

agaactccat 

ttgaagaaat 

gctacaactg 

tcaccctgaa 

agatacatgc 

tgcgtgatcg 

tgcgtgagga 

ggaacagtga 

agctgcgcat 

ccaatattcc 

accggaacct 

ttgaaagtcc 

catttcatct 

gagtgatgga 

actacctgtt 

actttgagaa 

atctgctgga 

atgatacacc 

agtcagtgct 

atctgcgtat 

tggtgactgc 

taacagtcca 

ccaatcggct 

atgccattgt 

agttccgccg 

tcgatctttc 

atcagttctg 

tgcagattga 

atcaggtggt 

ctgtcggcta 

tcggtatttc 

ttgtacaagg 

gacacattgg 

ctttttcttg 



gacttccggt 

tgaagaaggg 

aggttatcaa 

tgccaactcg 

gctggctgac 

gaaaaacgga 

agagaaacta 

taatcgtccc 

cgaagtactg 

aggggtatgg 

agattatcaa 

gatcaaagag 

ggtgggtctc 

cgagatattc 

caatatgacc 

gaaatacgat 

tctcgacata 

ggtcaatatc 

cttcttctct 

tcacaccgga 

tttggcggac 

cgatttttat 

tcgtccggca 

ctatcagtca 

gaaagagacg 

gaagtctgct 

aggcttctcc 

gaccatacag 

aaagatcgat 

tgccatgatg 

cgaggactca 

taccaacttt 

tcggatcgac 

cattgaaaat 

taccgggttg 

cgtaatctcc 

tatacccaca 



300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2466 



<210> 692 
<211> 870 
<212> DNA 
<213> B.fragilis 



<400> 692 

cagacaatga 

aatttcagtt 

gggcgcatct 

ctggcggacc 

gccatttatc 

tttagtgagt 

ctcgatgcag 

ggagaggtgc 

ttcgtgatcg 

gctgccggat 

ggacttcggt 

cggatgccct 

ggcatcccgg 

tcgcttgagg 



ttgaaggaca 
ccaacgtata 
ctgccatatc 
gccacgctct 
tgatagcgca 
atgccgatgc 
tgccggagga 
gtgataagaa 
atcagtcata 
ttcccaatgt 
tggggtatgt 
ggtcggtcaa 
ccggtctctc 
cgataggcgg 



cggagacgat 
taataaggtg 
cgcttatccc 
gcctgccgct 
gacctttcgg 
ctgccgcatg 
cgtgcatatg 
gtatctgacg 
cgagtatttc 
gatccttcta 
cactgcgcat 
tcagcttgcc 
catgaaagat 
gttagaggtc 



tcttataaat 
aacctcgacg 
gaaccggaac 
tcggtttgcg 
ggaacgaaca 
catgggcaca 
gtctggcttt 
gaactgattg 
acgctgaaag 
cattcgatga 
cccggactga 
atcgaagcag 
tatctggcgg 
tggcctaccg 



accgccaccc 
gactgcgtgc 
cttatacgct 
tgacgaatgg 
cggccattct 
aggtcacatc 
gcaatccgaa 
caaagcatcc 
agctttttac 
ccaaacgcta 
tagggcgtct 
gactctacct 
aatgtgcccg 
atactcactt 



gatacggagc 
acatttgtgc 
ggaggcccgt 
tgctacggaa 
gatgcctacg 
gctgtacaca 
caacccgacc 
ccgggtctgt 
ggcgcaggaa 
tgccattccg 
gcgtacgaac 
gctttccgaa 
cctgaaaagt 
tatgctggtg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 



275 



tgcctgcgtt tcggaaaagc tgcgctttaa 



870 



<210> 693 
<211> 312 
<212> DNA 
<213> B. f ragilis 



<400> 693 

gcttctccga 

aaatgggccg 

ataaaattgc 

gcacataagt 

tttagaactc 

gtcagcggat 



tgcgctttga 
aatatccggc 
tcactttttt 
catttcattt 
aacctaacga 
ag 



attacggatc 
aatgaacttg 
aaatcgggcg 
caccagactc 
aagaattaat 



acacgggagg 
gatatggaaa 
caaaagtaca 
tccttgtcac 
aattgtgccg 



tatgcacacc 
tcaaagattc 
aaaaagaaag 
attgcttatc 
taagcacacg 



tgctcccatc 
atttgtagtc 
gaaagagaaa 
tttattatta 
caggaacatc 



60 

120 

180 

240 

300 

312 



<210> 694 
<211> 753 
<212> DNA 
<213> B. fragilis 



<400> 694 

gat ttgccga 

atccgcgaag 

tggctcacgg 

tccgttgccg 

gacgggttgg 

tccatcatga 

cttcttctat 

gcaggcgaca 

cgaaaagaag 

gctttcggtc 

atggctatcg 

ttgcaaggtt 

tacctgggaa 



tgaatatatt 
ttccagcaga 
gcggcatcat 
tattgttggc 
cagatttctt 
aagattcgca 
ggagtctttt 
cgataagcaa 
aagaaagcaa 
tgttatgcgg 
tatttccgct 
ataccggcga 
tagttatact 



agcagcattt 
atgttttaaa 
ggcaggagta 
acttgccgcc 
cgatggattc 
tatcggcagc 
gatgtcgctc 
gctgacctca 
ggccaaagta 
tatacttcct 
gatcatgctg 
ctgttgcgga 
aatgtttata 



atctttttta 
cacgttgtgc 
ctttggttga 
cggttattga 
ggaggaggta 
tacggtgtca 
cccctctcct 
tcacaaatca 
gtatataaca 
tcagcactcc 
tatctgcttt 
gcactgtttc 
tag 



cccgcctccc 
cttactggcc 
gcgcacagat 
tcaccggtgc 
cgaaccggga 
tcgggttgat 
tcgcatgtat 
tcaacttcct 
gaatgtcagg 
tgctaccgta 
gcacactgat 
ttctaagcga 



cttctggcgt 
tttgtccgga 
cctccccttc 
cctacacgaa 
gcggattctc 
tttctacttc 
tacattgatt 
gccttacgca 
cggagaatgt 
ccgatactgg 
gaaacggaag 
actgtctttc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

753 



<210> 695 
<211> 201 
<212> DNA 
<213> B. fragilis 



<400> 695 

tattttatat ttccattccc ggatgcaaaa tattgtgcca ttgctatggg tatttcggaa 

acaatattta tctttgcatg gctttacaaa ataaaagcaa cacatgcggt agtagctcag 

ttggcagagc ggcggcttcc caagccgcag gtcacgagtt cgaccctcgc ctaccgctca 

aaagtttatc tcctcaacta a 



60 
120 
180 
201 



<210> 696 
<211> 531 
<212> DNA 
<213> B. fragilis 



<400> 696 

cttaagagtt 

ataccttccg 

agcactgtat 

atacccggta 

tactttttat 

tcttttttct 

acctccactt 



caaccaacag 
ctcccagcgc 
gtacagagca 
gcacagcaat 
cttcagcagt 
ccttactcat 
ctttcaggcg 



attttcctat 
tttcagctta 
ccaaccatct 
aatatcttcc 
tttcacagca 
attcttgttg 
attgctgact 



ataatcatct 
cctatgatct 
tgtgccaacg 
agtttatctt 
tccatgcgga 
cctatcagca 
agagtagaac 



tctcaatagg 
cccaaaaccg 
gcatcacagt 
taggagcatt 
aaagcaattc 
aagcttctga 
cggaactgac 



caacaccaaa 
tttctcatcg 
aggactcttc 
catcagtacg 
gtccaatatc 
tctcattacg 
aatatcgaaa 



60 

120 

180 

240 

300 

360 

420 



276 



gctgttgcta ttgaaaaggt cggattgtat atccccggag ggacagcccc tctattctcc 
actgtgctta tgttggctat acctgctaag attgccggat gtaaagagat tgtactttgt 



300 
312 



atagcatccg ccaatccgat gccgggagca acttctacag aaccggtaat gacatgcact 480 
tcagccttca ccccgttact tttcataaaa gcatctaaaa ttccgggata g 531 

<210> 697 
<211> 312 
<212> DNA 
<213> B. fragilis 

<400> 697 

ctgagttctc cgattatatt ggtaatggtg aatgtaatca ggagaatgaa ggcaaataca 60 

agtattattg agaaagaaag tatccggttt ttaataaatt taagccagct ctttttaggg 12 0 

acggctttta ttccccaaat tgaatttaat gaactttgta tttctgcaaa aatagtggtg 180 

gcaccaaaaa tagagatacc taagctgaca aaggcggccg acctggagga gtctgtattt 240 

tctgcatttt ctataataga acgaagggcc tcggtaactt cggggcccac aatcggttgt 
agttggacat aa 

<210> 698 
<211> 195 
<212> DNA 
<213> B. fragilis 

<400> 698 

ttgaggctgt tgtcgatcag gttcagcagc gctgacacgt gccttgttgc tttgttatcg 60 

gcagcatatg tagtgacaga gtataaggtt actgctatta ctataagcgc tatattgaaa 12 0 

gaccttccca cagtaaatgt tgttttcccg atgtttgttt tatgcttgtt tatgatgcat 180 

1 qc 

gtagtgcaaa gctaa - L:7 - J 

<210> 699 

<211> 1305 

<212> DNA 

<213> B. fragilis 



<400> 699 

actcttaagt taaaaattat gaaactgatt aaatatccgg accggtcgca atggaatgag 
atcttgaagc gtcctgtgct tgagacagaa aatcttttcg atactgtacg caatattatt 
aaccgtgtga gagccggagg ggactgggtg gtcatggaat atgaggctgt gtttgataaa 18 0 
gctgaactca cctcattggc tgttacttct gcggaaatag aagaagcaga aaaggaggta 240 
cccatcgaac tgaaggcagc tatttatctg gctaaacgta atattgagac atttcattct 3 00 
gcacagcgtt ttgaggggaa aaaggtggac acgatggaag gtgttacttg ctggcaaaaa 3 60 

420 
480 

acaccgcctg ataagaatgg aaaggtacat cctgctatcc tgtttgccgc tcgtctggct 540 
ggagtcagca aaatctttaa ggttggtgga gtacaggcca ttgccgctat ggcttacggg 
acagagagta ttcctaaagt ttataagatc tttggtcccg gcaatcagta tgtcaccgct 
gccaagcaac ttgtcagcct gcgggatgtc gccattgata tgcctgcggg gccgtcggag 
gtagaggttc tggctgacga atctgccaac ccggtgtttg tggctgccga tcttctttct 780 
caggcggaac atggagtgga tagccaggcg atgctggtta cgacttctga gaagctacaa 
acggaagtcg tttacgaggt cgaacgccaa ttaggctatc taacccgtcg cgatattgcc 
gaaaaatctt tggccaacag taagttgata ttggtgaagg atatggagga agctttggaa 
cttaccaatg cttacgctcc tgagcacctg attattgaga cgaaggacta tatggaagta 1020 
gccggacaga tagtcaatgc cggttcggta tttttgggtg ccttctct cc cgaaagcgca 10 8 0 
ggtgattatg cttcgggaac taaccatact ttacctacca acggctatgc caaagcctat 1140 
agtggagtga gcttggacag tttcatccgt aagatcactt ttcaagagat acttcccagt 12 00 
ggaatgtcgg ctattggccc ggctatcgag gtgatggctg ccaatgaaca cctggatgca 12 60 
cataaaaatg cggtaactgt tagattggag gagataagaa aatga 13 05 

<210> 700 
<211> 360 
<212> DNA 



60 
120 



600 
660 
720 



840 
900 
960 



277 



<213> B.fragilis 



<400> 700 

aaaaggaagt ttaacaggta catcttccga gaaaggagac atagacaaaa ttattattac 

caatataaaa taaaaagaat gaaaaagaaa tattatgcag cgttattagc tgtagttgtt 12 0 

attgcattta cgggatataa tgtttatcag agtcaaaagg cggatgcctc tttatctgat 180 

ttggcaatgg ctaacgtaga agcattggca aatggagaac tttcgaatgg aaattgtgaa 2 40 

ggttcttggt cacaagaatg ttgtaagtgt gattatattc attatactta tgcttgtgca 

atagaagtga ctggtaatag ctgctataca gtaagtggat gtagccatta tacaaattaa 

<210> 701 
<211> 207 
<212> DNA 
<213> B.fragilis 



60 



300 
360 



60 



<400> 701 

tgtatatcta cctcattcaa attcataacc gatcaatttt tctcagaaga aacaaagaat 
cagagtaata gttcaacaaa acaaagatta tctaaaatgt catctccaca ttacaatcta 12 0 
acaaaaccac aaattctatt tttctctttt cataaaaaag taataaaacc cgatttgcaa 180 
aaacaaataa aacgactacc tttgtga 2 07 

<210> 702 
<211> 1911 
<212> DNA 
<213> B.fragilis 



60 

120 

180 

240 

300 



<400> 702 

ctttgcacta catgcatcat aaacaagcat aaaacaaaca tcgggaaaac aacatttact 
gtgggaaggt ctttcaatat agcgcttata gtaatagcag taaccttata ctctgtcact 
acatatgctg ccgataacaa agcaacaagg cacgtgtcag cgctgctgaa cctgatcgac 
aacagcctca actatagcaa agaagctccc aacgacagta ttatccaatg gggcaacgaa 
ttggctcctc tcctgaaaaa gcagaaagaa tataaaactc tatttcagtt gaaacaactg 

attgtgacag cttatgcctc acgaggagac atgaacatgg ccatcgacca tgcccgccgg 360 

atgtataagg aagccaaaga attgaactcc cccatcggta tagctctttc cagccgtgcc 42 0 

attggggatg cctacttgaa tgccaacatg cagcaacccg ccatcgaatc ttataaagaa 480 

gctctggaat tgcttgacaa aataccgggt agcgaaatcc tcgaacaaga gattcttccg 540 

aaattcatcc tgaccctgat tcaggcctcc cacatggacg aggtacgcat ctatctgcaa 600 

aagtttgaaa acctgtatgc cgataaccct aatcctacat tccacttttt catatgtgcg 660 

tgcaatgcct actataacat cgagtccggt gatcccgaaa agggaaaagc cgaactggac 72 0 

aaagccagga aaatccacga acaactgaat tatctctacc tgcgtagtat ctacaactat 780 

atattggccc agtactatca agctgtcggg aagtatgaac tggccctgca acaatacgaa 840 

tgcctgacaa aggtccctaa agcacctgcc cccaacaaac acatcggttt gcagcttgag 900 

tgtgcccaac tgctgactca aatgggacga acggaagaag cctatcgtat ctatcaaaag 9 60 

gctaaccggc aaaaagactc tttgaacgct ctgagctatg cccggcaaat caatgaccta 102 0 

cggggaatgt accagataga ccgaatggaa atccggaacc aaattcaacg aaaccaaatc 1080 

atcttgtgga tcatcatagt ttccatcttt atattgatgc ttgttttgct gttgattgtc 1140 

cgcatccggc aggagtccaa ccgacttctc cgctccaaag aagaattgga aatagcccgt 1200 

aagtatgccg agaactcgat acgtaccaaa agtctgttcc tgtcgaacat gagtcacgaa 1260 

atacggacac cactgaatgc actttccgga ttctcatcca tcctgaccga cgaatccatc 132 0 

gacaatgaca cacggtatca atgcaatgac atcatccagc aaaactccga actgctgcta 13 80 

aagctgatca atgatgtaat agacctgtca aatctcgatc ccggcaagct gactttcaat 1440 

tttaaagaat gtgacgccgt caatatatgc cgtaacgtaa tcaacaccgt acagaaagtg 1500 

aagcagacac aagccggagt cagttttgtc acttcactgg atagactgac tttgcgtaca 1560 

gacgaggcac gcctgcaaca ggtattgatc aacctgctga tcaatgccac caagttcact 162 0 

actgaaggaa gcatcaccct gacattagaa aaagaatcag aaaccatagc tctgttcact 16 80 

gtgacagata ccggatgtgg tatcctccgt gaaaaacagg accagatatt caatcgtttt 1740 

gagaaactga acgaaggtgc acagggaaca ggtctgggac tctcgatttg tcgacttatc 1800 

atcgaacaaa tcggagggag aatatggatt gacccggact acaccgaagg tgcgcgattc 1860 

cggtttacac accccgtccg gcccgcaaag ggaaaggagg cagaaagatg a 1911 



278 



<210> 703 
<211> 1170 
<212> DNA 
<213> B.fragilis 

<400> 703 

gaatgccaaa ctgattggag cactgaaaaa atatcaatag gaaaaatgaa aaagaaagta 



60 



300 
360 
420 
480 



600 
660 



840 
900 
960 
1020 



ctttttattg accgtgatgg cacgcttgtc attgagccgc ctgtcgacta tcagctcgat 12 0 
tcactggaga agcttgaatt ctatcctaaa gttttccgca atttgggctt tattcgcagt 180 
aaacttgatt ttgagtttgt catggtgacc aatcaggatg gtttgggcac ctcttctttc 240 
ccggaagaaa ctttttggcc ggcgcacaat ctgatgttga aaactctggc cggagaaggt 
attacgttcg atgatatcct gatagatcgt agtatgcccg aagattgtgc ttctacgagg 
aagccgcgta caggaatgtt gactaagtat atttccaatc cggaatatga tctggagggc 
agctttgtca ttggagatcg tccgacagat gtagaattgg ccaaaaatat aggttgccgt 
gccatttacc ttcaggaatc cattgatttg ctgaaagaaa agggactgga aacttattgt 540 
gctcttgcca ctactgattg ggatcgggtg gctgagttcc tttttgcagg agaacggaaa 
gcagaaatac gcaggacaac gaaagaaacc gatatcctag tagctctcaa tctggatggt 
aagggtactt gtgacatttc taccgggtta ggtttctttg accatatgct tgagcagatt 72 0 
ggtaaacatt ccggtatgga tttaacgatc cgggtgaagg gggacctcga ggtagacgaa 780 
catcatacca tcgaagatac ggctatcgca ttgggtgagt gtatctatca ggcgctgggt 
agtaaaagag gaattgaacg ttacggttat gctttgccca tggatgattg cctttgcagg 
gtatgcctgg atttcggagg acgtccgtgg ttggtatggg atgccgagtt taagcgtgaa 
aagataggag aaatgcctac cgagatgttt ttacactttt ttaagtctct gagtgatgca 
gccaagatga atctcaatat taaggctgag gggcagaatg agcatcacaa gatagaggga 1080 
atattcaaag cgctggcccg tgcgttgaag atggcgttga aaaaggatat ctatcatttc 1140 
gaaatgccgt ccagtaaagg agttttgtaa 117 0 

<210> 704 

<211> 2817 

<212> DNA 

<213> B.fragilis 

<400> 704 

aagaaaaaga cgaccgtcgt gatatggaca ggatgtttaa acgatgatag gtttgtacct 

ttagcaatga aaaaaactat tcaacagctg gtactcgaac gtatccttat attggatggc 120 

gctatgggta caatgattca gcaatataat cttagagaag aagattttcg taatgagcgc 180 

tttgcgcata ttcccggtca actgaagggg aataatgatt tactttgtct cacacgccct 

gatgtgattc gggatataca ccgtaagtat ctagaagccg gtgcagatat cattgagacg 

aatactttta gttctactac tatttctatg gccgattatc atgtacaaga gtatgtgcgt 



60 



240 
300 
360 



600 
660 



gaaatgaatc aagcggctgt aaagctggca cgtgaagtgg ccgatgaata tacggcacta 42 0 

aatcccgata aaccccgttt cgtagccggt tcggtaggtc ctaccaataa aacatgttct 480 

atgtcgccgg atgtgaataa tccggcttat cgtgctgtga cttatgatga aatggctgat 540 
gcttatcagc aacagatgga agctatgctt gaaagtgggg tagatgcttt attgatagaa 
actatctttg atacgctgaa tgccaaagct gctattttgg cggcagaacg tgcaatgaag 

gctacaggag taaaagtgcc tgttatgtta tctgtgacgg tttccgacac cggaggacgt 72 0 
actctttccg gacagacgtt ggaagctttc ctggcttcag tgcaacacgc tgatatcttc 
tcagtcgggt taaactgttc gtttggtgcc aggcaactga aacctttctt agagcaattg 
gccgctcggg ctccttatta tattagtgct tatccgaatg ctggtctacc taatagttta 
ggaaaatatg accagactcc ggcagatatg gcccatgaag taaaagagta tgttcatgaa 

ggattgatca atatcatagg cgggtgctgt ggtactaccg atgcctatat tgcagaatat 102 0 

cctgcattga ttgccggagc aaaaccgcat attccggttt gtaaaccgga ttgtatgtgg 1080 

ctttcgggat tagaactgtt ggaagtgaaa cctgaaataa atttcgttaa cgtgggggaa 1140 

cgttgcaatg tagccggttc gcgcaaattt cttcgtttga ttaatgaaaa gaaatatgac 12 0 0 

gaggcattat ccattgcccg taaacaggta gaagacggag cactgattat cgatgtaaat 12 60 

atggacgacg gccttctgga tgcaaaggag gagatgacaa ctttccttaa tctggtggct 132 0 

tcggaaccgg aaattgctcg tgttcctgta atgattgatt cttcgaaatg ggaagttatc 1380 

gaggccggat tgaaatgtct tcaaggaaaa tcaattgtga attccatctc gttgaaagag 1440 

ggagaggaga aattccttga acatgctcgt acggttcgcc aatatggtgc ggctgtggta 1500 



780 
840 
900 
960 



279 



gtgatggctt ttgatgagaa agggcaagct gatacagcca cccgtaaaat agaagtttgc 1560 



1620 



2640 
2700 



gaacgggcct atcatttgct tgtagataag ataggattca atccgcatga tatcattttc 

gacccgaatg tattggctgt ggcaacaggg atcgaggaac ataataacta tgcggtagat 1680 

tttatagagg cgacggcttg gattaaaaag aatcttccgg gcgcccatat cagtggggga 1740 

gtaagtaatc tttcgttctc attccgtgga aacaactata ttcgtgaagc gatgcatgcc 1800 

gtatttcttt accatgccat tcagaaaggg atggatatgg ggattgtgaa tccgggtact 1860 

tctgtattgt atacagatat tccggcggat gtactcgaga ggattgaaga tgtagtatta 1920 

aaccggagaa gtgatgccgc agaacgattg atagaattgg ctgaccggtt aaaggaggct 1980 

tctgcgggta atacttcggc cgggcaaccg gtaaaacatg atgcctggag ggacggtacg 2 040 

gtagaagaac gcttgcaata tgctttggta aaaggaatcg gggattttct ggaagaagat 2100 

cttgctgagg ctttgcctaa atatgataaa gcggtggatg tgattgaagg accattgatg 2160 

aatggaatga atcatgtggg cgaattgttt ggcgcaggta agatgtttct tccacaagtg 2220 

gtgaaaacag cccgtacgat gaagaaagcc gttgcaatct tacaacctat tatagaatcg 2280 

gaaaaggtgg aaggtactgc ttcggcagga aaagttttgc tggccactgt gaaaggggat 2 340 

gtgcatgaca ttggcaaaaa tatagtctcg gttgtgatgg catgtaatgg ttacgatatt 2400 

attgatttgg gagtgatggt accggctgaa tcgattgtcc aaaaagccat tgaggagaaa 2460 

gtggatatga tcggacttag tgggttgatt actccttcac tggaagagat ggtacatgtg 2 52 0 

gctatggaat tagaaaaagc cggattggat attccattgt tgataggagg agcgactacc 2 58 0 
tctaaactac atacagcatt gaagattgct ccggtttatc acgctccggt tgttcacttg 
aaggatgctt cgcagaatgc gggtgttgct gctcggctga tgagtccgaa atcgaaagaa 

gagttggcaa aagaattatc cggtgaatat gaagcccttc gtgataagag cggcatgatg 2760 

aagcgtgaaa ccgtttcatt gaaagaagct caggaaaaca gattgaaact tttttga 2 817 

<210> 705 

<211> 2367 

<212> DNA 

<213> B.fragilis 

<400> 705 

atcaaatgga cttcgtttgc tgtcataata gttacttttt atgcaacaaa ggtacacttt 60 
tatgaagaaa aaaagagtaa acgaagacaa aaaccaaaat tatttattat ctttgcggcg 120 
ttgaaacatt ccgaacccct ccaacgggag acaaattcgg atttccgatt cagccaaaca 
tccctaattc atcatttaaa gattattgcg tttgttaagt ttccgacctc tgcgtcgcag 
cttcagccgt acgtatttca acgaataaaa ttaaaatatt taaataactc actatactgt 
atgtataaca tcattcaatt gaacgacaag aatttgtcgg aactacaagc tattgcccag 360 
gaattgggta tcaaaaaaac agactcactt aagaaagaag aacttgtcta caaaatcctc 42 0 
gacgaacaag ccatagccgg agctactaaa aaggtagctg ccgacaaact gaaagaggaa 480 
cgcaaagaag ataagaaaaa acgctctcgg gtgacagtaa agaaggaaaa cgccgacaag 540 
gttttctctt ctaccaagaa tggagaacta accaaaacag atgccaaaac acctgcagcc 
aaaacacagc cacaacctaa aacaacagaa ccgaccccag aaacagctaa agaggcaaat 
gccgaaacaa acgccactcc ggccgaatct gtcaaagtga caccttatgc cactccgaaa 
aagaaaccgg gacgtccccg taaaaatcag gtagaaacag aagctaaacc cgcagaagaa 
actaccgaaa aaccggaaac agtaccatcg gcacaagaag aaaagcccgc tgcccaaccg 
gaaacagaaa aacgtcccat cagcaaaccg attctcaaac ccaaaccggc cgttgtagac 
gaagaaagct cgatcctctc ggatatagat gcagacgatg attttatccc catcgaagac 
ctgccttcgg aaaaagtaga attgccaacc gaactgttcg gcaaatttga atcgaccaaa 
gccgaagcag caacagcccc cgaacctgtg gcacaacccc aacgcccgcg tgtgattcgc 



180 
240 
300 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 



ccacgagaca acaataacaa caacaattac aacaacaata ataataacca acgcaacaat 1140 



1200 



aaccagcgtc agcctgtaca acagcgtccc atgccgcaac aaaatgccgc cgaagccgca 

cccgttcagg aacgccgcgt gattgaacgt gagaaacctt atgaatttga tgatatcctc 1260 

accggaaccg gtgtattgga aatcatgcag gatggttacg gattcctccg ttcgtcagat 1320 

tataactacc tctcttcacc ggacgatatc tacgtttcgc aatcccagat caaactattc 1380 

ggtctgaaga ccggtgacgt agtagaaggt gtaatccgtc cgcccaaaga aggcgaaaaa 1440 

tacttcccgc tggtaaaggt ttctaaaatc aacggacgtg atgccgcttt cgtacgtgac 1500 

cgtgtaccgt tcgaccatct cactccgctg ttcccggacg aaaagttcaa gctttgcaag 1560 

ggaggctact ccgactcgat gtcggcacgt gtagtcgacc tcttttcacc aatcggtaaa 1620 

ggacagcgtg ccttgatcgt ggctcagccc aagaccggta aaaccatcct gatgaaagaa 1680 

atcgccaatg ccatcgctgc caaccatccg gaagtatata tgatcatgtt gttgattgac 1740 

gaacgtccgg aagaagtaac cgacatggcc cgcagtgtca atgcggaagt gattgcttct 1800 



280 



acattcgacg aacctgccga acgccatgtg aaaattgccg gcatcgtact cgaaaaagct 1860 

aaaagattgg tagagtgcgg acacgatgta gtgatcttcc tcgactctat cacccgtctg 192 0 

gcgcgcgcat acaatactgt atctccggca tcaggaaagg tactctcggg tggtgtggat 1980 

gccaatgcac tacacaaacc gaaacgtttc ttcggagcag cccgtaacat agagaacgga 2 040 

ggttcgctca ccattatcgc tactgccctg atcgacaccg gttcgaagat ggacgaagta 2100 

atctttgaag agttcaaggg tacaggtaac atggagttgc agctcgaccg caacctaagt 2160 

aacaaacgta tcttccctgc tgtcaacatt gtggcatcga gcacccgccg cgacgacttg 2220 

ctgctcgaca aacagacact ggaccgcatg tggattctac gcaagtatct gtcggatatg 22 80 

aatcctatcg aagcaatgga tttcgtaaaa gacagattgg aaaaaaccaa agacaacgac 23 40 

gagttcctga tgagcatgaa cagctaa 23 67 



<210> 706 
<211> 1143 
<212> DNA 
<213> B. fragilis 



<400> 706 

agtaactttg 

attggagtca 

cactctctta 

gaggcagttg 

aactcaccta 

tttacctatc 

gggttactta 

ttaaataaaa 

caaagagaaa 

tatcgtctag 

tcaactttgt 

attctcatct 

tctgtatcta 

gacaatactg 

aaggatcaat 

cctaatcaac 

aaacacgaat 

agcgcaatca 

accgacaaga 

tag 



caacaccaac 
ttatactttt 
ttcaaaaaga 
ccaacaatac 
acaaaatagg 
agcacaaaat 
tgatggatag 
atgatataaa 
tatggagtca 
aaaatgaaat 
ggaagcgaat 
acaccattac 
cagttgatat 
tagaaacaga 
ttgtctttga 
aacaaaagat 
tgaaggaaga 
ataaattaaa 
ctaatgagga 



acaagatctg 
tattatgatt 
aaaagaagca 
aatccaaaaa 
tacgtatgag 
tcaagacgtg 
tttacaaagt 
aggatatatc 
accatcaaat 
tgttagtgta 
gcctaaaacc 
tctatttgta 
aacatcagat 
aaagcggact 
gaaagacttt 
attattattt 
attttggcct 
gaaaatttta 
atactatgta 



gcaaatatga 
gccataggtg 
ttcaagccac 
tgtaaagata 
actcggacct 
gatagtgaaa 
agtgatatac 
aatactggaa 
agcatccctc 
gattacataa 
aatatctaca 
ctatactata 
cccaatatta 
aattctacca 
gtcctattca 
ttcttaaata 
aaaaacagtg 
gaggaaatca 
ctaatcaggg 



ataaaagaat 
cctcaaccta 
aagttgaaaa 
tacccttaaa 
tctgttcaag 
tattattcgc 
aagccctaat 
taattgtctc 
gcaatgcaga 
tgtatatcga 
tcaatttagt 
ggaaacagca 
tcacggaccc 
taaaagagga 
atgaccgtcc 
ggcctaatta 
atccaaccaa 
acagtaagta 
ataaatcagc 



atttttagta 
tacaatcatt 
tatcttgaag 
tggttttaat 
agataccttg 
tcgccaattg 
aattaaagac 
caaacatcta 
aatgattacc 
ctatagtttc 
agtcgaagtg 
aaagaacaga 
catatcagtt 
gttatcattc 
gatcaaaatg 
cagagtgaac 
taatatgaca 
tacaattatc 
agaaaaaata 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1143 



<210> 707 
<211> 402 
<212> DNA 
<213> B. fragilis 



<400> 707 

atggagattc tgtttcctga actattgtac tcaattattg ttttaaataa acgtttaaga 60 

tacagttacc ttaatataat tatgctgaca tttaccgata actttgagaa tgataaagag 12 0 

ttgatacttc gtgatcatct ggcacttgaa agaaccaagc tggctaatga aagaactttg 180 

tttgcatata tccgtatggc actttacctt ttgactgtgg ggatagggat atttcaaatt 240 

gaaagcattt cacgtttgga tgggctggcg tggggatgta ttatagccgg aatctttttg 3 00 

tttttcttgg gctttgtccg tttcgaacaa atgagaaagc atttgaaaca gtatacgaaa 3 60 
acatgtcgtg atactgagaa tgaatcgtca cggaagaagt ga 



402 



<210> 708 
<211> 1929 
<212> DNA 
<213> B. fragilis 



<400> 708 

aaaatgaact acggattcgt aaaagttgcg gcggccgttc cccgcgtaaa agtagcagat 



60 



281 



180 
240 
300 
360 
420 
480 



600 
660 
720 



tgcaaattta attctgaaag attggagggt cttattacca tagccgaagg taaaggagta 120 
cagattctca cttttcccga aatgtgcatt accggatata cttgtggaga cttgttcgcc 
cagcaacttt tgcttgaaca ggcagaaatg gctttgatac agattctaaa cagtacccgc 
caactggaca tcatttccat actgggcatg ccggtagtag tcaactccac agtaattaat 
gctgcagtag ttatccagaa aggcaaaata ctgggagtag tgcccaaaac ttacctgcct 
aattataaag agttctacga gcaacgttgg tttacatccg ccctacaagt ctcggaaaac 
agtgtgcggc tttgcggaca gattgtcccg atgggcaaca atctgttgtt cgaaactgcg 
gaaacaactt tcggcataga aatctgtgag gacctttggg ctaccgttcc gcccagttcc 540 
tcactcgcac tgcaaggggc tgaaatcatc tttaaccttt ccgccgatga cgaaggtatt 
ggtaaacaca attatctttg ctctctgatc agccagcaat ctgcacgctg catctccggt 
tatgtttttt cgtcgagtgg cttcggtgaa tcgacaacag atgttgtttt tgccggaaac 
ggacttattt acgaaaacgg atatctactg gcacgaagtg aacgtttctg catggaggaa 780 
cagttgatta tcaatgaaat tgatgtggaa tgtatccgtg cagagcgtcg ggtcaacaca 
acttttgctg ccaacaaggc taattgtccg ggcaaagagg ctgtcagaat ttctacagag 
tttgtcaaca gtaaagatct gaacctgacc cgtacattca atccacatcc ttttgttccg 
caaggaaacg aactcaacag tcgctgtgaa gagatcttct cgattcaaat agccggactg 
gcacaacgtc tgctacatac cggagcaaaa acagccgtaa taggtatttc cggaggactt 
gactcaacac tcgccttatt ggtgtgcgtc aagacattcg ataaattggg attatcccgc 1140 
aaagatattc tgggtataac aatgccggga ttcggaacaa ccgaccgcac ttatcacaat 12 0 0 
gccatcgacc tgatgaattc cttgggagtt tcaatacggg aaatcagtat cagggaagca 12 60 
tgtatccaac actttaaaga tatcggacac gatctcaata tacacgatgt aacgtacgag 132 0 
aattcacagg cacgcgaacg tacccaaatc ttaatggata tagccaacca aacatggggt 1380 
atggtgatcg gaaccggaga cctgtcagaa ctcgcattgg gatgggcaac gtacaacgga 1440 
gatcatatgt cgatgtatgg tgtcaacgca ggtattccca agacactggt gaaacactta 1500 
gtacagtggg tagccgaaaa cggtatggat gaaacatcca aagcaactct gctggatatt 156 0 
gtggacactc ctatcagtcc ggaactgata ccggcagatg aaaacggaga aatcaaacaa 162 0 
aaaacggaag acctcgtcgg tccttacgaa ctacacgact tcttcctgta ttatttctta 1680 
cggttcggct tccgcccgtc aaaaatctac ttcctggcac aaactgcatt cagtggagtt 1740 
tatgatgatg aaacaatcaa aaaatggctg caaactttct tccgccgctt ctttaaccag 
cagtttaaac gctcttgcct gccggacgga ccgaaagtag gaagtatatc catcagcccc 



840 

900 

960 

1020 

1080 



1800 
1860 



1929 



agaggagact ggcgcatgcc aagtgatgcc agttcggctg catggctgaa agaaatagcc 192 0 
gaattgtaa 

<210> 709 
<211> 870 
<212> DNA 
<213> B.fragilis 

<400> 709 

ccttacattc atcctcctgt aatccggaaa cgggatcttt gcgtaacgaa aaaaacaaaa 
gttatgcatt tacggacgta ttatcccaca gtagttctct cggatattca tctgggaact 
caacactcca agacagagga agtcactcac ttcctgaaat caataaattg tgatcgctta 
attctgaatg gtgatatcat tgacggatgg catttgcaga aaagcggttt gggtaaatgg 
aaagctaaac atacggattt cttcaaagta ataatgaaga tgatggagaa tttcgggaca 
caagtgattt atgttcgtgg taatcatgat gattttctag ataatctggc acctctgaat 
ttttataata tccggattgt gaaagactgt atctacgaaa gccacggcag acgttattat 
gtgacacatg gagatatttt tgatacggtg actactcaaa tgaaatggct ggctaagttg 
ggcgacacag gatatacttt tctgttatgg ttgaataagg tatacaatct ccgcagaatg 540 
aagcagggaa aaccttatta ttccctttcc cagtctatta agaatagggt aaagactgcc 600 
gtttcttata tttctgattt tgaaaaagag cttgttggcc tggcaagggc taaaaagtgt 660 
gatggcgtga tatgcggtca tattcaccat cctgccaata ctttttatga agatatccat 72 0 
tatctgaatt caggcgactg ggtagaaaca ctttcggctc ttactgaaga tgaagatggt 
aactggacta tccgctattt tgatagtgga ttactaaagg aagataatca taaggaaaaa 
caaactatat ccataacaat agcatcatga 

<210> 710 
<211> 579 
<212> DNA 
<213> B.fragilis 



60 

120 

180 

240 

300 

360 

420 

480 



780 
840 
870 



282 



<400> 710 

cactgcgatc tttgcagcac catattaaaa agaggtaaag ttatgaaaag acatcttatt 60 

ttagtatttg ctttattggc atccagtgtt gccttaaatg cagtcaattc attgccggac 12 0 

gatgataaat ccgacaataa gaacaaaaca gaactcaatt ctgtcgtcaa aaagacatgg 180 

gagttctact ccaccatcaa acaaccttct gccgatgcac tggctaatgc gggtaactac 240 

aaattcgggc aagaagccgg ttatctctat aaccaattca tgaagatcta tgtagtcagg 300 

gaagaagtgg ttcccggaga ccctacccgc cgtaccgtaa ttcgcaaacc cactatctac 360 

aacgcagtac gctccatcga gaaacagctg aacaaagagc tcaaaagcaa ccaaatgacc 42 0 

agagagcaag tagctgcaga gttcacaaat gtactgaaag tagcgatttc tgcctatgat 480 

tccgaaagcg aatcatttga agacgcttta cagataaatc gcaaaaatgc aaccgacctt 540 
ctctccgtat ttcaaaatgt aaaactgaca gaaatctaa 579 

<210> 711 
<211> 597 
<212> DNA 
<213> B . f ragilis 



60 

120 

180 



<400> 711 

aaaacttatc aaaaagcctt tctgaaccat atatttccgt atttttgttt cctgatattt 
ataaatatag aattcacctt attatatata gtggaaatga tagaagttac ggatgcctct 
ttgcaaaaag ctgcgggtga gggaatggat gaatttatcc aggtgtttac agacaagtat 

aaagaagtga ttggcgggga act tact gcc gaaacaatgc cactgttgac aggggagcag 2 40 

cactctttgc tggcctatca gatttttcgg gatgaagtca tgttcggtgg cttttgtcaa 3 00 

ttgattcaga atgggtatgg aggttatatc tttgataatc cttttgcaaa ggtaatgcgc 3 60 

ctgtggggag ccgaggattt ctcgaagttg gtttataaag ctaagaagat atacgatgct 42 0 

catcgccacg atctggagaa agagcgtaca gaggatgaat ttatggctat gtacgagcaa 480 

tacgaggcct ttgatgatct tgaagaagaa tatctggata ttgaggagga ggttacagca 540 

ctggtagcaa gctatgtaga cgatcattta gagttgtttg caaaaatagt taagtaa 597 

<210> 712 

<211> 2031 

<212> DNA 

<213> B.fragilis 

<400> 712 

cccggactac accgaaggtg cgcgattccg gtttacacac cccgtccggc ccgcaaaggg 
aaaggaggca gaaagatgaa gcgactgata cttatcatca tagtatgttg ccgggcctta 12 0 
ggatggtgtc atgcaaatac acaaacggaa acggacagcc tgtatcgggt gacacagtca 
cttccgcatg actcgactcg tctggaaatg ttcaagagac tggcacaaat agagcagctg 
actcccaagt gtatcacctt ctcgggtctg ttgcgcgagg aagccacctt gcagaagaat 
gacagataca acgccatagc cgcctatctg cacacagtgt actactataa ccaaaacaac 
cgggacagcg taaaaaaatg gcttgacaca atggagcctt atgcccgcaa atcgcaaacc 
tgggatctct attttgatgc gctccgcttt cagatagacc tctgcaccta cgaagagcag 
tatgaacttg ccatcaacga agcgaaccag atgtacgaaa gagcccaaaa agtgaactgt 540 
gcccgcggac tgatcggagc aaaacaatgc ctgggcaacg cctatatcag tacagagcga 
tgggacgaag gaatgaaagc attggaagct gcctatcagc tctcttcaca aacagataat 



60 



180 
240 
300 
360 
420 
480 



600 
660 



780 
840 
900 
960 



gcggtagtac gaatctcgat tctctgtcaa ctgatttcca taaccaagga tcagaaaaac 720 
aaccaattac tctccgaata ccttgcgaag ctaaaagaaa cactgcatca ccatacctcc 
acgaacccga tgctcaaaaa ggcattttat gatgtttacc tgttctgcga agtatattac 
acctattatt atctatatgc aggccagccg gaacaagcac ataaaaatct ggttaacgca 
ggcaaatttc tcaatggcaa caccttcttt ctctacaggg tgctctatta cgatgcctat 

gcagcatact tccgggcttg caaagcgtat gaccgggcac ttgccaagat agactccacc 102 0 

atcatgctgc tgcaagagga cttcaacagc aattacatcc accaaaaatt gacaaaagcc 1080 

gacatcctgg cagaagccgg acgaagtgcg gaagccattc ccctgtacat cgaaacgctc 1140 

catcttaaag actctatcga aacgaccgtt ctcgacaaac agatgcaaca gataaaagct 12 00 

aaatacaaca tcgacaaagt ggcattggaa gaggaacagc tgaaaagcta catccaactc 1260 

ggcaccttaa tagtggtggt cattatcctg ataattctgg ttgctttcat gctccgcatt 1320 

tcacacgtac gcaaggcttt ggaacgatcg gaaaaagaaa cacgggaaac aacccgtatg 13 80 



283 



180 
240 
300 
360 
420 
480 
540 
600 
660 



gcggaagaag ccaacgaaat gaaaaatcgc tttttgtcga acataagcta tcacatccgc 1440 

atcccgctga acggtgtagt gggtttctcg caattgatag cctccgaacc caacatgccc 1500 

gatgagctcc gtaaagaata ctcttccatc attcagaaga actccgaaga attgatgcgc 1560 

ctggttaatg atgtactcga tttatcacgg ctcgaagccg gtatgatgaa gttcaacatt 1620 

caggagtacg gactggcgga actctgcaat gaagctacct atatggcacg catgcatagt 1680 

gaagggtgta ccgtcatccg gttggaaaac gaaattgaaa cagacctgaa catccgggta 1740 

gacaccgtcc gtttcacaca ggcgttgcta agtgcactga cgtatccgca gaagtacaaa 1800 

gaaaaacggg aaatcgactt taaggtgaca ctcgacacgg agaagcactt catcaacttc 1860 

cgcatcacca actctccgct ggctgacgaa agatttactt cgcaagaggt atgcatccgt 1920 

cacgaaatca accgcctgct atttgagtat ttcggaggaa gctacaaagt acagacaaat 1980 

ccggacggaa agccggccat cctcttcacg tttccctcag gaagaaattg a 2 031 

<210> 713 
<211> 759 
<212> DNA 
<213> B. fragilis 

<400> 713 

agtgggtggc agccggatac cctttccagt gtagacgaaa agggagaggt gaagtatcat 60 

aaacccgatt gcgctgtgaa aggaagtatg gatttgaacc gtaaattctt attgcgacaa 12 0 

tatctgaaag attatctcag cgctgtagtg ggggataaga tagagggagc caatcattcg 

gatttttcgg atgcctgtct gttgcatcag atagtggata cacctaaggt aagctatcag 

gtagcctatc cacaatcccg gaagagatac cggtatatac gttatacttc gacccccgaa 

aaaacacttc aactggcgga attacagctt ttccgaaaag tggatgatca agagaaaata 

acggctaagg tcatcgatgg cagtaatgct tttattgcag atgaccggtt tgatcgtttt 

aaagtgaatg acggtgatgg attaaccttt tttcttacga aagagaaggg agcattcgtt 

acacttgatt taggtaagcc ggaaaagatt gaaaaaatag tctatatgcc tcgtaacgat 

gataacttca ttcggttagg ggatcagtat gaactgtttt atcaggatgg atttcgtggt 

tggatttctt taggcaggca agtagcctca gaattaacat tgcactatga caatataccc 

caaaattcag tactttggct tcggaattta tcaagaggga gagaagaaac cgtatttcga 720 
aacgaggacg gtcggcaggt tttctttgta aagtggtaa 759 

<210> 714 
<211> 948 
<212> DNA 
<213> B. fragilis 

<400> 714 

aaaagaatta aaatggaaat ccattccgaa agaaagaaaa gacttagttt atccctgctc 60 
ttcaaaataa taaaagatac agtttgggga ttcatagatg acagcgttat gaggttgagc 120 
gcttcattag cctatgcgac tttgttttca attattcctt ttctttccct tctagtcact 
gtcggtgtct ttttccatat ggatttggcc aatcaacttt atgtccaact acaaccgatt 
gtgggccccg aagttaccga ggcccttcgt tctattatag aaaatgcaga aaatacagac 
tcctccaggt cggccgcctt tgtcagctta ggtatctcta tttttggtgc caccactatt 
tttgcagaaa tacaaagttc attaaattca atttggggaa taaaagccgt ccctaaaaag 
agctggctta aatttattaa aaaccggata ctttctttct caataatact tgtatttgcc 
ttcattctcc tgattacatt caccattacc aatataatcg gagaactcag tcaaaaattc 
atctttaagt atccggaagc agccgattcg ttggtaaaag tggtaggaat catcataaac 
atgagtgtca ctaccatcat ctttacactc atatttaaaa tattacccga tgccaaaatc 
aagagcaaag acgtttgcat cggagctgtt gtaaccacca tactgctact gataggtcaa 
tggggaattt ccttttatat aggaatagcc aatgtgggga ccgtctatgg ggctgctgcg 780 
ttcatggtgg ttttcgtcac ctggatttat tattcttcca tcatcatata taccggtgca 840 
gaatttacca aagcatgggc aaacgaaatg ggaagtaaaa ttttccccga cgaatatgca 900 
gtagccacca aaaccattga aatacacgaa gacaagccta tcgaataa 948 

<210> 715 
<211> 192 
<212> DNA 
<213> B. fragilis 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



284 



<400> 715 

aaaacatcta acattcatta tttatctctt ttaaaatatc tggctctcga ttataaaccg 60 
agacagagtc cggcagatat tcttgcttta gtaatgacga cataccatcc tcattctctt 
ctaccaaaac tgttgtctcc catcggtctt cttaaagatc cgttcgtcct ttccctccgt 
atgattcttt aa 



120 
180 
192 



<210> 716 
<211> 2181 
<212> DNA 
<213> B.fragilis 

<400> 716 

tttttttata aactaatgaa cagactcaaa ctttacttac tggcgctgac tgcgctggcc 60 
gtttgttccg caaaggcgga cgagggtatg tggttactgc aattaatgca gcagcaacac 120 
tctatcgata tgatgaaaaa acagggactg aaactcgagg cacaggattt gtataatcct 
aacggagtct cactgaaaga tgccgtaggt atcttcgggg gaggatgtac cggcgagatt 
atttcaccgg aaggattgat attaaccaac caccactgcg gatacgcttc catccaacaa 
catagctctg tagagcatga ttatctgaca gatggatttt gggcaacttc aagagacaaa 3 60 
gaattgccga ctccaggact gaaatttaca tttatcgaac gcatagaaga cattacggat 420 
attgtaaatt taagaattgc cgctaaagaa atcactgaat cagaatcatt cagcagtaca 
tttcttaata aactggctaa ggagttgttt gaaaagagcg acttgaaagg aaaaaaagga 
atcgttcctc aagctttgcc tttttacgcc ggaaataaat tctatatgtt ttataagaag 
gtatatccgg acgtacgtat ggttgccgct cctccttcat caatcggtaa gttcggtggt 
gaaacagaca actggatgtg gccacgccat accggtgact tttcaatgtt ccgtatctat 720 
gctgacgcga atggcgaacc ggcagaatac agtgcttcca atgtccccct gaaaaccaag 
aaacacctga atatctctat caaagggctg aaagagggag attatgccat gattatggga 
ttcccgggaa gcaccagccg ttatctcacc gtctcggaag tgaaagaacg catggaggca 
agcaatgccc cccgtatccg tatccgcgga acccgtcagg acgtgttgaa agaagcgatg 
aatgccagcg ataaagtacg tattcaatat gccaataaat atgcaggttc aagcaactat 
tggaagaact ccatcggcat gaacaaagct atcatcgata acaatgtttt gggaacaaaa 
gcagaacagg aagctaaatt cgctaagttt gccaaagaaa aaaataatac cgactacatg 1140 
aatgtagtgg caaagatcga cgaggctgta gctaaaactt ctccaatcaa atatcaacag 12 00 
acctgtctga cggaaacatt cttcggcggt attgaattcg gtagcccatt tatggtaatg 1260 
gacaaactga aagaagcatt ggaacagaaa aacgattcaa gtattgaagc taacatcaaa 13 2 0 
gtgctgaaag aggtattcaa cgacatccat aataaagact atgatcacga agtagaccgt 13 80 
aaagtggcca aagccctgtt gccactatat gcagaaatga ttcctgccgg acagcgtcct 1440 
gccatctacg atgtgattga gaaagagtac aaaggcgact acaatgccta cgtagatgca 1500 
atgtacgata cttcaatttt ggccaatcag gcaaactttg ataaattcat caaaaaaccg 1560 
actgtaaaag caatcgaaaa agatatagcc actcaatatt cacgtgccaa gtttgacaaa 162 0 
tacaccaatc tggccgaaca aatgggaaaa ttgccggaag actggctttt attacacaaa 1680 
acatatatcc gcggactagg tgaaatgaaa ttgcctgtac catcttatcc ggatgccaat 1740 
ttcact at cc gcctgaccta tggcaatgtg aaaccataca gcccgaaaga tggtgtatat 18 00 
tacaaatact acacaacaac cgacggaatc cttgaaaaag aaaatccgga agaccgtgaa 1860 
ttcgtagtac ctgccaaact gaaagagttg atcgagaaaa aagatttcgg acgctatgca 192 0 
ttgcccaatg gtgaaatgcc ggtttgtttc ctgtctacca atgacatcac aggcggtaac 1980 
tccggaagtc cggtactgaa cgaaaacggc gaattgatcg gttgtgcatt cgatggtaac 2040 
tgggaatcac tgagcggtga catcaatttc gataataacc tgcaacgctg tatcaacctg 2100 
gacatccgtt atgtactctt tattctcgaa aagctgggag gatgcggaca tttgattaac 2160 
gaaatgacga ttgttgaata a 2181 

<210> 717 
<211> 1044 
<212> DNA 
<213> B.fragilis 



180 
240 
300 



480 
540 
600 
660 



780 

840 

900 

960 

1020 

1080 



<400> 717 

gagttaatcc tgattttgca ttggcgaaat atgatttata gaaattggct tgggattatt 
ggggtagctt gtatactttt tgtctcatgc aaacccacgg aggttgacgt gcctattact 



60 
120 



285 



180 
240 
300 
360 
420 
480 



tattattcgt cttttctctt gaaagatact actgttatgc tttctaaagt agaaatggat 
cctatgcagg taaatgcacg gtgtatggtg tgggatggcg ataagcgggt attggttcgt 
acgtcaacta cagattctat ttatgctgtg tttgcttatc cggaaatgaa atttttgagc 
tataccggta gtttgtcaga atataaacag atattagcca aatgcaatga aggcttttac 
ttggtaaaag atgattcgtt atatttatat catttaacag ataaggattt gttgcagaaa 
acgacaactc atttccttta taatagcaat aagattcgat tatctaaaat taagaaattg 
aatgataaaa tgtatacagc tcatgcatat acggatcctt cttataatga tattaggtta 540 
aatgagttct acatgcttga tgctgagaat aatatcttat atcctaaagg gcattatccg 600 
gagcgtactg aggtaagatt taaaacgata tttgacttta agtttgccta tgcgcacgaa 660 
gtatggccaa aaccggacgg aagtcgcatc ttagttaatt atgtgaggac tcgccgtttt 720 
cggatttatg atttgagtgc ccgattattg cacgatgtat gtcttgatta tgcatctaat 
aaatatgttg tggatgcaga tcctaaacgt tggacaacgt ttattagaga ttgttttgtt 
actgataaat acatttattt gctatgtccg gagggtgagc aatccagttt ggttatagta 
gattgggacg gtagaccaat agcgcgttat cgattggatg aaaagatttt tttctttttt 
atagatccgg atagaaatct tttttgtggt attaactcaa ataatgggca gtctttttat 
ttccttgatt tagatataaa ttag 1044 

<210> 718 

<211> 798 

<212> DNA 

<213> B.fragilis 



780 
840 
900 
960 
1020 



60 

120 

180 

240 

300 

360 

420 

480 



<400> 718 

atattgataa aatgtttcat ttgcatacga atccgttgta cctttgtcgt cgaaaaagtt 
gtgtatccta tgaataaagt attgcctttt ttacttttgc tttttgtttt tacctcttgt 
agtcgcaagt ataagattga aggcgcctct tctgtaacca gtctggacgg taaaatgctt 
tttattaaag tacttcagaa tggcgagtgg ctcaatattg attctgccga agtggtgcat 
ggactatttt cgatgaaagg taaagtcgat tcggtagtaa tggctacact ctatatcggc 
gacgaaagca tcatgccttt agtgattgaa aaaggtaata ttcaggtttc aattacaaat 
acagaattgg tagcaaaagg aaccgctctg aacaatgccc tctacgcttt tattgataaa 
aagaattcat tggatgttca gatagaagaa ttgcaacgta aagaagcccg catggtgatg 
gatggtgccg acttggctga tattcatgag caattgactc acgagggcga ttcgttaatg 540 
caagatatga atggctttat caaaaaattt atctcagata actacgaaac agttttaggt 600 
ccaagtgtat ttatgatgct ttgcagcaca ctaccttatc ctgttatgac tccccaaata 660 
gaggacatca tgaaagatgc tccttattcg tttaagaata acaaattagt gaaggatttt 72 0 
attacaaaag cgaaatcgaa tatggagctg attgaagagc atcagcgcat ggaacaaaat 7 80 

7 9 8 

gcgaccttga accattag 

<210> 719 
<211> 1158 
<212> DNA 
<213> B.fragilis 



<400> 719 

ggaaaaacaa actatatcca taacaatagc atcatgaaat ttctgtttat tgtgcaagga 
gaggggagag ggcatttcac ccaagccatt acccttgaag acatgttatt acgtaatggg 120 
caccaggtag tggaggttct tgtcggcaaa agttcgtcac gtaccttgcc cggctttttc 180 
aaccggagta tccaggcacc ggtaaagcgt ttcaccagtc cgaatttttt gcctacagcc 
gaaaataaac gggctgatct gaaaaagagt tttgcataca atctgataca cgtaccggaa 
tattttcgca gtatgtgtta tatcaatcag cgcattaagg aaacaggggc ggaagttgtg 
atcaacttct acgaacttct gaccggactt acctacgcac tcttccgtcc ctccgttcct 
tatgtttgca tcggacacca atatctgttt ttacacaacc actttgagtt tcctcgaaaa 
agtgtgattc aactctccat gttgcgcttt ttcacacgga tgacgagtct gcgcgctagc 
aggcggttgg cactctcttt tcgtaaaatg gaatcggacc ggactgaacg gatatccgtt 
gttcctcctc tgcttcgcag ggaagtgacc gctatgcagt cggcacaggg taactacatt 
cacggatata tggttaactc aggttttgca gatagtgtag aggctttcca tgccttgcat 
cctgaaattc ctatgcactt tttctgggat aaacaggatg ctgacgaggt gactaaagtg 
gatgccacac tgagttttca tcagattgat gatgtgaaat ttcttaatag aatggccggt 
tgcagagcat atgccagtac ggccggtttt gagtctatct gtgaagcgat gtatctgggc 



60 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 



286 



aaacctgtac tgatggttcc tgctcatatt gagcaggatt gcaatgctta tgatgcccgg 

caggccggtg ccggaattat tggagaatct ttcgatttgg agtcgttgct tcgttttgcc 

ggaacgtatg ttcccaaccg ggaatttatt cgttgggttc gtagctgtga acgacagatc 

attggagaac ttgaacgact tgctgatcag cattcggctg tcactgtacc tacattaact 
aattattttc cgatatga 



<210> 720 
<211> 282 
<212> DNA 
<213> B.fragilis 



<400> 720 

agtggtaaaa acaggaaaga aacagatcaa aaaaagagga atttctttta tttattttgt 60 

aaaatagact gtaaatcgct gaatggtctg attttgttgt tttttgaccg gatattagct 120 

tccttaatct tattttttct caaactttgt aaggtaaatc aatgtagaaa gtatgactat 180 

aatgaagttg agactgggag ttcgtgggat gatgtggcta accttggtaa ttatgatgtg 2 40 

gggcatcata tcttgtcgaa ctcaagaaga gaaatgtctt ga 2 82 



<210> 721 
<211> 873 
<212> DNA 
<213> B.fragilis 



<400> 721 

atatttatga aatacttata tgtgttatta gctttttctt ttttgttttc ttgtaaagat 60 
gagaataaaa aacatgcgga atctgttttg agggaatgga tgaataagga aattgttttc 12 0 

■ 180 
240 
300 



ccgaataaaa tgtattttag tattcagggt aaagagaatg ttgattttcg tataaaagat 
accgaatata agattgtcgc ctatgttgat tctgccggtt gcaccagttg taaattacac 
ttgtctaaat ggaaagagtt aatccattat gtggattcta ttcagtctga gcgtgtacag 

tttttgtttt ttttctttcc caagaatgga agagacatat atcatacaat gagaatggat 360 

aaatttacct atccggtttg tgttgacaca ctcgattctt ttaataagtt aaatcatttt 42 0 

cctgacgatg taagattcca gacttttttg ctgaataagg agaataaggt tgtagcagta 480 

ggtaacccca ttcataaccc gaatatcaga gatttatttt tgaatataat ttccggtggc 540 

acttctcttc cagatgaaaa acgtcctcaa acagaggtga agatagaggc tctgtctatg 600 

gacttgggta tgtttgattg gaaaaaagaa cagaaatgta tttttaccgt tgagaatacg 660 

ggaaaagagt tgcttgtgat tgatgatgtc aatacttcgt gcggatgtac tacagtggag 72 0 

tattcgagag aaccggttca gtccggaaag acgatagata ttaccgtcgt ttataaggct 780 

gaatatccgg agcattttaa caagacgatt actgtctact gtaactcgcc tgtttcacct 840 

ttgcaattga aaataaaagg agatgctaaa taa 873 



<210> 722 
<211> 411 
<212> DNA 
<213> B.fragilis 



<400> 722 

tcggttatga atttgaatga ggtagatata cattatttaa ttgcagccat tagtgtgata 6 0 

acttcggcat tggtgtttta cacaatagga gtgtggggag agcgattgca gaagaggttg 12 0 

aaattttggc atctggtatt ttttttgttg ggactgctgg ctgattctgt gggaacggct 180 

ttaatggaga atattgcgcg actcacacac ttgcatgatg aaatacatac tgtgaccggc 240 

attatcgcta tcctgttgat gtttattcac gctatgtggg ctatctggac gtatgtgaaa 3 00 

gggagtgaaa gagccaagga acatttcaac cgtttcagta ttgtggtgtg gtgcatttgg 3 60 

ttgatacctt actgcatagg cgtatatctt ggtatgtcat tgcatcattg a 411 



<210> 723 
<211> 1068 
<212> DNA 
<213> B.fragilis 



287 



<400> 723 

aatctcatga aatactgtct gacatttctc tttcttttgg taatctttac tgggtgcact 60 
tcagatttgc cgaaagatcg gatgttgtat gcttcttttc ctaaggagga gacactacat 12 0 

• ■ ' 180 
240 
300 
360 



840 

900 

960 

1020 

1068 



tctaaggtaa ttcagcttga ttcggtttat atgcgttatc cgtttcgggt acatgtgtcc 
ggtgatcagg ctgttgtcct ggatttacat ggtactgatg tgtattgcca tctttttcat 
tatcctgatt tccattatct gtcttcgttt ggcaggagag gagattcacc ggaagagatg 
ctttcagtag aaacagtgaa atgtatagat ggttcatttt ggactttaga tgccaacaaa 
ggcgagttaa ctaggtttga gtttgtttcg gatagagatt cgcttctgcg tgcagaagcg 420 
atctctttcg ataaagacag cattctgcgt gctcttgatt ttgtggcatt caatgatacg 480 
acttttctga tacctgacta ttcgggagat agccgattct gttgggtgaa ccgacaaggg 540 
aagtttttga agaaaagtgg agtgattcct tcattgaacg aagaagcatt gaaagaggcg 600 
cgtcctgcct tagcacaagc ttggcgcagt tttattgatt ataatcctca taacggagtg 660 
ttggttgctg ctactcaatt aggtgaagtt cttgagattt ataatcttca aaacggtttt 72 0 
catagggtct gtttaggtcc taaaggggaa ccggaattca aacttgcggg cgggtatgct 780 
attccggatg ggatcatggg attctcggat gtgcaggtta cggatgaggc tatttatgct 
gttttccatg gtcacacttt taaagagatt atggcacagc accaaaaaga gggaagagct 
acagatggtg gacaatatat ttatgttttc aacttacaag gggaaccttt atgtaaatat 
accttagatc gttatatcac aggtttccat gttgatgaaa gaaataagac tattacagca 
acagatgtta ataacgacca acccattgtg gagttccgct ttggctaa 

<210> 724 

<211> 564 

<212> DNA 

<213> B. f ragilis 

<400> 724 

gacgaaatga aaaagtttag atgtactgtc tgcggttatg tttatgaagg tgacgcagct 60 
cctgagaaat gtcctttgtg taaagctcct gcaagcaaat tcgtagaagt tgttgaagaa 120 
gaaggtggtg cactcacttt tgttgacgaa cacgtaatcg gtgtagctaa aggttgtgac 180 
gaagaaatga ttaaagacct gaacaatcac ttcatgggcg aatgtactga agttggtatg 2 40 
tatttggcta tgagccgtca ggccgatcgc gaaggctatc ctgaagtagc tgaagctttc 
aaacgttatg cttgggaaga agcagaacat gcttctaagt ttgctgaact gttgggtgat 
tgcgtatggg atactaaaac aaaccttgaa aagagaatga atgctgaagc cggtgcttgc 
gaagacaaaa aacgtatcgc tacacgtgct aaagctttga atctggatgc tatccacgat 
accgtacacg aaatgtgtaa agacgaagct cgtcatggta aaggtttcga aggactttat 
aaccgctatt tcggtaagaa ataa 564 

<210> 725 
<211> 2172 
<212> DNA 
<213> B.fragilis 

<400> 725 

ataatgatga aaagaaactt attatctgct gcgtttgcac tgatggcact ggccgtcagt 60 

gctgacgaag gaatgtggat gctgactgac ctgaaagcac agaatgaagc tgccatgatg 12 0 

gatctcgggt tacaaatccc tatagaggaa gtctacaatc cggatggaat agctttaaaa 180 

gatgctgttg tacatttcgg aggcggatgt accggtgaaa tcatctcggc ggagggattg 240 
gtattgacta atcaccactg cggatatgga gcaattcaac aacatagcag tgtagatcac 



300 
360 
420 
480 
540 



300 



gattatctga caaatggatt ctgggcaatg aaccggaacg aagagttacc ctgcaaaggg 3 60 
ttgacagtaa ccttcatcga ccgtatcctg gacgtgacaa cctatgtaaa cgagcaactt 
aaaaaagatg acgatcccaa cggcatcaat tatttgtctc ccaaatatct ggcaacggtt 
gccgaccggt ttgcaaaagc agaaaatatc caaatcactc cggcaacacg tttggagctg 540 
aaaccatttt acggaggcaa caaatactat ctatttgtaa agacagtcta caatgacatt 
cgcatggtag gtgcccctcc ttcttcgatc ggcaaatttg gagctgatac cgacaactgg 
atgtggccac gccacacagg agacttttct ttgttccgca tctatgcaga caagaacggt 72 0 
cagccggctg aatactctaa agacaatgtt cctttacaag taaagaaaca tttgacaatc 780 
agcctggcag gagttaaaga aggtgatttt acatttgtca tgggatttcc cggacgcaac 840 
tggcgctaca tgatttccga cgaagtgaaa gaacgaatgc aaacaaccaa cttcatgcgc 900 
caccacgtac gtgaggcacg gcaggccgta ctgatggatc aaatgctgaa agatccggca 960 



420 
480 



600 
660 



288 



gtacgcatac attatgcaag caaatatgct tcctccgcta attactggaa aaatgccatc 102 0 

ggtatgaacg aaggtttggt ccgactgaaa gtgttggata ccaaagaaaa gcaacaagaa 1080 

caactgttgg caatgggacg tgagaaaggc gatgactctt atcaaaaggc ttttgatgag 1140 

atacgctcga ttgtggcgca tcgtcatgat gccatgtatc atcagcaagc catcagcgaa 1200 

gcattggtaa cggcactcga tttcatgaaa attccttcaa ccgacggatt gaaaaaagca 12 60 

cttgaaagca aaaatgccac aaagattaaa gaagaaaccg ataagctgaa agcagaagca 13 2 0 

gataaatatt tcgcatctgt tccgtttccg gaagtagaac gactcgtagg aaagaaaatg 13 8 0 

ctggaaacct atgccggata tattccggaa gatcagcaaa tcggtatttt caaagtaata 1440 

gacagccgtt ttaaagggaa caaggatgcc tttatcgatg cttgcttcaa gtactcgatc 1500 

tttggttcga aagagaactt caacaagttt atcgctcacc ccactcttaa caaactggat 1560 

aaagactgga tgatcctctt taaatattcc atcacggacg gactgttgaa aacggcactc 162 0 

gccatgaagg atgccaataa gaactatgat gcagctcata aagtatgggt aaaaggtatg 1680 

atggatatgc gtcaagttgc cggtacgcct atctatccgg atgcaaactc aaccctgagg 1740 

ttaacttatg gtcaggtatt gccgtacgag cccgccgacg ggacagtata caattactat 1800 

acaacactga aaggggtaat gcaaaaagaa gatccggata attgggagtt cgtagtgcct 1860 

caaaaactaa aacaactgta tcatgcaaaa gacttcggac attacgcgat ggaaaacgga 192 0 

gaaatgcctg tttgcttcat tgtcaataca gacaatacag gaggaaattc cggaagcccg 1980 

gtattcaatg gaaaagggca attgatcggg accggattcg atcgtaatta tgaaggcctg 2040 

acaggagaca ttgctttccg gccttcttca caacgtgccg cagtagtaga catccgctat 2100 

actctattta ttattgataa gtatgcaggt gcctcacaca tcatcaagga gctggatatt 2160 

2172 

gtagaagaat aa 

<210> 726 
<211> 1560 
<212> DNA 
<213> B.fragilis 

<400> 726 

aacacgatga atacatttac acttggactg attgtgatag cctatctgct gtcactggcc 

tatcttggtt ttttaggtta taagaaaaca tcctccgcca gtgattacct ggtaggaggt 

cgacagatga atccgtttgt catggcactc tcttatggtg ccacttttat atcagcctct 

gctattgtcg gctttggtgg ggtagcagca gcttttggta tgggtattca gtggctttgc 240 

360 
420 
480 
540 
600 
660 



60 

120 

180 



tttttgaata tgtttatagg tgtagtgatt gcctttattt ttttcggact ccagacgcgg 
cgaatgggtg ctaaattgaa tgtaagtaca tttcctcaat tgttaggcag gcattatcgt 
tcacggggaa tacaagtctt tgttgccgca gtgattttcc tcggaatgcc tttgtatgcc 
gcagtggtta tgaaaggtgg tgctgtcttt atcgaacaga ttttccagat tgattttaat 
atttcacttt taatatttac attggtaata gctgcctatg tgatcgcagg aggtatgaaa 
ggagtaatgt atacagatgc tttacaggca gttattatgt ttggctgtat gctgtttctg 
cttttttcgt tgtatcgggt actggatatg ggctttactg aagccaatca ggctctgacg 
gacatagctc ccttagttcc tgaaaaattt aaggcgttgg gacatcaagg gtggacggct 720 
atgcctgtta ccggttcacc tcaatggtat acattagtca cttcacttat cttgggagtt 
ggaatcggtt gtcttgcgca gcctcagttg gttgtgcgtt ttatgacggt tgaaagtagc 
aaacaactaa accgtggggt ttttatcggg tgtttttttc tgattattac cgtaggtgct 
atttatcatg caggtgcatt gagcaatctt ttctttctta agaccgaagg tgttgtagct 
acggaagcag tcaaagatat ggataaaatc atcccatact ttataaataa ggcaatgccg 
gattggtttg ccgctctttt tatgctctgt atcctttctg ccagcatgtc tacactgagt 
tcacagtttc atacgatggg agcttcggtt ggttccgata tttatggtac ttacaagcct 1140 
cgttcacgtg gtaaattgac taatgtgatc cgtttgggag ttttattttc aattttagtg 12 00 
agttatatta tctgctatat gttgcccaac gatattatag cccgtggaac ttctattttt 1260 
atgggtattt gcgctgcagc tttcctgccg gcctattttt gtgctttata ttggagacgt 1320 
gctacccgtc agggagtgat ggcaagcctt tggataggga ctataggtag tttgtttgcg 13 8 0 
ctggcctttc tgcaccagaa agaggctgcg gcaatggggg tatgcaggtg gcttttcggt 1440 
aaggatgtgt tgatcgaagc ctatcctttt ccgatgatag atccgatatt gtttgcattg 1500 
ccattatcgg tagcagccgt tattattgtg agtttgttaa cagagaaagg gaaaaaataa 15 60 

<210> 727 
<211> 1503 
<212> DNA 
<213> B.fragilis 



780 

840 

900 

960 

1020 

1080 



289 



420 
480 
540 
600 
660 
720 
780 
840 
900 
960 



<400> 727 

acgaataaga aactaataag aaattataaa atgaaaaaga taatattact catcgtatct 60 
gtgtggatgt gcgtttcctg tggaaatctg gaagagatga acattgatcc ggacaatgcc 12 0 
acccagaccc accccaaact cctgcttacc caaatctgca tgaatgcttt taaaagaggg 180 
actgacggaa tgtatgctac caaaaaagta attcaagccg acggagagag tgcagatcaa 240 
tattacaaat ggacccgcgg aagttttggc tactatgaca atctccgcaa tgtacaaaag 3 00 
atgggtgaag aggcagaacg tgtaaatgct ccggtgtata cggcactcac taagttcttc 3 60 
cgcgcctact acttctatga actgactctc cgtttcggag acatccccta cagtcaggcc Aon 
ttgaaaggag aaaaagaaga aatatacact cccgaatatg atgcacaaga ggatgttttt 
gcaggaatcc tccaagaatt gagagaagca gacgaaatac tggcaaatga cgcatctgtc 
attgacggag acatcattta taacggaaat agcacccagt ggaggaaact gatcaactct 
tttcgtctga aagtgctgat gaccctctcc aatcatacaa cagtagggaa tataaatatc 
gcttctgagt ttaaaaacat tgcgacaaac agcccgttga tgaatagcct ggcagacaat 
ggacagttgg tttacctgga tcagcagggc aaccgatatc ctcaattcaa tgcccaatgg 
tccggctatt atatggatga tacatttatc caacgtatgc gcgaacgtcg ggacccacgc 
ttgttcatct tcagcgcaca gaccaacaaa ggaaagactg aaggaaaacc tatcgacgac 
ttcagctctt acgaaggagg agaccctgcc gccccttata gcgatgctat tatcaaagtt 
agtgagggta ccatatcgcc catcaacgac cgtttccgta cagatccgat tgtagagccc 102 0 
accatgctga tgggatatgc cgaattacaa caaattcttg ctgaagctgt tgtacgggga 1080 
tggatcagtg gcaatgcaca aacgtattac gagaaaggta tccgcgcctc attctctttc 1140 
tacgaaaccc atgcaaaaga ttatgccggc tatctgaacg agaacgcagt ggcccaatat 12 0 0 
ctgaaagaac cattggtcga cttcacccaa gcatcgggta ctgaagagca gatagaacgc 1260 
attatcatgc agaaatacct ggttacattt taccaaggca actgggattc cttttacgaa 1320 
caactacgta ccggctaccc ggacttccgt cgcccagccg gaacagaaat ccccaaacga 1380 
tggatgtatc cgcaaggaga atatgataac aacggtacta acgtagaaac ggctattaca 1440 
cgccaattcg gtgcaggaaa tgacaaaata aaccaagcta cctggtggca aaaaaaatca 1500 

1503 

tag 

<210> 728 
<211> 2013 
<212> DNA 
<213> B.fragilis 

<400> 728 

agaatgaaga aattatcctc ttttctgctt ttgttgctgg tcgtatttac ggcgcaggca 60 

cagatacaag agcctgtgaa gtttaaaacg gagctgaaaa ccctgtcggg agccgaagcg 12 0 

gaaatcgttt ttacaggtac gatcgatgcc ggttggcatg tatattctac cgatttaggt 180 

gatggtggtc ctatctccgc tacttttaat gtagagaaga tgtcaggtgc cgaagttgtt 2 40 

ggtaaattaa cccctcgggg aaaagaagtt tcggactttg acaaactgtt cgaaatgaaa 3 00 
gtacgctatt ttgaaaaaac ggctcaattt atacagaaga taaagtttac cggcagtgac 
tattcaatag aagggtatct ggaatatggt gcatgcaacg atgaaaattg tctgcctcct 
acacaagttc cctttaaatt ttcgggtaaa gccgctgcta ctgccgaagt ctcggcaaaa 

gaaacccctg caactccggt aaaagagcca gtcgccactg ttacagacag tattgtagaa 540 
ccgacagcta caactgttac cactgcgata ggcagtgttg acttatggaa gcctgtaatt 
aatgatttga agaaattcgg tgaggcaaac tctcaggaag atatgtcatg gatctatatt 
tttattacag gatttttagg aggtttgctg gccttgttca ctccttgtgt atggcctatt 

attccgatga ctgtaagttt cttcctgaag cgaagcaagg acaaaaagaa aggtatccgg 7 80 

gatgcatgga cttatggggc atccatcgtg gtaatctatg tagcgcttgg ccttgccatt 840 
accttgatat ttggtgccag tgctttgaat gccctttcca ctaatgctgt tttcaatatc 
ttgttctgtc tgatgttgat cgtatttgct gcttctttct tcggagcttt cgaactgaca 

cttcccgcaa aatggagtac ggcagtggat agcaaggcgg aagctacaag cggattactg 102 0 

agtatttttt tgatggcgtt tacattatcg cttgtatctt tttcttgtac aggtcctatt 1080 

atcggatttt tgttggtaca ggtttctact acaggtagtg tagtcgctcc cgcgattggt 1140 

atgttgggct ttgccattgc attggctctg ccatttactt tattcgcttt atttccgtct 12 00 

tggctgaagt caatgcctaa gtctggcggt tggatgaatg tgattaaagt gacattgggt 12 60 

ttcctggaat tagcttttgc tttaaaattc ctgtctgttg ccgatttggc ttatggatgg 1320 

agaattctgg atcgtgagac tttccttgct ttgtggattg ttatttttgc tctgcttggt 13 80 

ttctatctgt tgggtaagat taaatttcct catgatgacg atgatacgaa agtaagtgta 1440 



360 
420 
480 



600 
660 
720 



900 
960 



290 



ccgctgaaca agtcgtatgc ttatgatgag gatatatcca agtatatcaa tttcttgcaa 
acaggacttg aaaattatcg gaaagagaaa tag 



<210> 729 
<211> 1032 
<212> DNA 
<213> B.fragilis 

<400> 729 

tatatgtgga agaagttgtc gctgtatgtt tgtttaataa ctattttgtg ttcttgtcag 

aaacaacgta gtgcttatgc acctcctacg tttccggaag taaagaaaat acatgctcat 

cgtttgtcgg acgaactcct gatcagctat cttttggata tggctgttag tgaggactat 

atctttatat tggctttggc agataatgcc tggttgcagg tatatgataa gactacaggg 

caactgcttg gaagttttgt aacaagaggg cagggaccgg gtgaagcgac tactgcgaac 

atgtgctatt ataatgcacg tgaaaagaaa atttctgtgt atgacgaatc ttctatgaag 



1500 



tctcgtttct tcatggcact ggtttcatta gcttttgctg tttatatggt tccgggctta 

tggggagcac ccttgaaagc ggttagtgct tttgcaccgc ctatgaaaac ccaggatttt 1560 

aatctttata ccaatgaggt acatgccaag ttcgatgatt atgatttagg tatggaatat 1620 

gcccgtcagc ataacaagcc ggtaatgctc gactttacag gatatggttg tgtgaactgt 1680 

cgtaaaatgg agcttgccgt atggaccgat ccgaaggtta gcagcatcat taataatgac 17 40 

tacgtactga ttactcttta tgttgacaat aagactccgc ttactgaacc ggtgaagatc 1800 

atggaaaatg gtacagaacg cactttgcgt acggtaggtg ataaatggag ttatctgcag 



1860 



cgtgtgaagt ttggtgccaa tgcccagcct ttctatgttc tgatagacaa tgagggtaat 1920 

1980 
2013 



60 
120 
180 
240 
300 
360 

ttattgactt atcagtttga taaagacgct gataattggg gagcgttgat agaagaatgg 42 0 

■ ■ : 480 

540 
600 
660 



tctttttatg atttaggagg tacactacgc cgggtgtggg aacttcggaa tggtaggttt 

ttggtagatg gtcagttggg aacaaagtcg gatcagcaaa aacgttttca gatgttggca 

gatgcaaaag tggtggctga ttacaatgat tttcctatag atactccgaa agaacgttcc 

gtttggtcat cgccagcaat tgcgatatct ccggattgta aaaagatggc cgtaggaact 

ttatatggag gaatccttga attatttgat ttatcacaaa acatagaatt aagagcaatc 72 0 

cgaaaatttt atcctccggt cgtgcaatat ttatccggaa ctatccaaaa cacagaggag 780 

actgtttggg gcttttctgc gttatgtgct acagatgaaa ggatttatag tgtatttata 

ggtgacaaga atcccaattt atttaataac ttatctgttt ttgattggga tggccgggaa 

ttaatcaaat ataatactga ttgcctcgtt ttgagaattt gtgcttcaac tcaggaacca 

aataaactgt atgggattgc tttttctgaa actcatgaat tttatctggt ctctttttcc 
ttgggttctt ga 



840 
900 
960 
1020 
1032 



<210> 730 
<211> 777 
<212> DNA 
<213> B. fragilis 

<400> 730 

aacgactacc tttgtgagcg atttgagaat cgtgactacg tatttttaat tatatcactt 
aaaaaagagc taattatgac ttattcacac gaagtggaac acatgtgtgt tgtaaagaag 
ggtcctaacc acggaccggc tcccataccc gaagaaggaa aatgggtaaa atcaaaagaa 
attgttgata tttcaggtct gacacacggt gtgggttggt gtgctcctca gcagggtgca 
tgtaagctga ctctgaacgt aaaagaaggt atcatccagg aagctctggt tgaaactatc 
ggctgttcag gtatgactca ctcagctgct atggctgctg aaatcctccc gggaaaaact 
atcctcgaag cattgaacac agacttagtt tgtgacgcca tcaacactgc tatgcgcgaa 
ctcttcttac agatcgttta cggacgtact cagtcagctt tctcagaagg tggtctgatc 
atcggtgcag gtcttgaaga cttaggtaaa ggtctgcgta gccaggtagg tacattgtac 
ggtactttgg ctaaaggtcc tcgttacctt gaaatggcag aaggttacat caagacaatt 
gctcttgaca aaaacgatga aatctgcgga tacgaattcg ttcacatggg caaattcatg 
gatgaaatca agaagggtac tgatgcgaat gaagcattga agaaagttac cggtacttac 72 0 
ggacgcttca ctgcagaaca gggagctgtt aaacacattg atccacgtca cgaataa 777 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 



<210> 731 
<211> 195 
<212> DNA 



291 



<213> B.fragilis 



<400> 731 

ttattttccg atatgaaatt gaataagact gattatatgc ttgagcgcac atccgatggc 
ggttattatg cttggcttac tgtaaatatg cagtgtaatg cgtatgggga ttcacccgaa 120 
gaagcggtaa aaaacctgga acagaccatg gaagacctgg ttgaagaaat gtatttggta 
gaggatttta tatag 



60 



180 
195 



<210> 732 
<211> 582 
<212> DNA 
<213> B. fragilis 

<400> 732 

atcaatgtag aaagtatgac tataatgaag ttgagactgg gagttcgtgg gatgatgtgg 

ctaaccttgg taattatgat gtggggcatc atatcttgtc gaactcaaga agagaaatgt 12 0 

cttgaagagg ttttatctct tccgctcgcc aataaagaag aactacaaaa agtactggat 180 

cattataaag atgacagcct gaagtatcag gccgtttgct ttctaatcag gaatatgcct 2 40 

tttcatgcag gatacgaggg aaatgctttg aagtattatt accaatattt tgatatttac 3 00 

gcgcaaggaa aattaggacc gcacgaagtg attgattctt taaaagaaaa ttcgttttct 3 60 

gtttcgcaat taaaacggat agaggatatt gccaatattg attcttcttt actggtgcag 42 0 

aacgtggatt gggcttttaa ggtgtggaga gagcagcctt ggggcaagaa tgtaagtttt 480 

gataattttt gtgagtttgt tctaccttat cgattgggag atgaaccact tggattctgg 540 

agagaagata tttataaacg ctataatcca atattagatt ag 582 

<210> 733 
<211> 1026 
<212> DNA 
<213> B.fragilis 



60 



<400> 733 

ggaggaaaag aaattatgat tagagaagta aaatttgaaa gtcaggaccg ccgtatcaaa 
ggtatcatcg aagccttgaa cgctaacggc atcaaagaca tcgaagaagc taacgctatc 
tgcgaagctg ctggagttga tccttataaa acgtgtgaag aaactcaacc gatctgtttt 180 
gaaaatgcta agtgggctta cgtagtaggt gctgctatcg ctatcaagaa aggttgcaaa 
aacgctgctg atgctgccga agctatcggt ataggtctgc aggcattctg tatcccgggt 
tctgtagctg acgaccgtaa ggttggtatc ggtcatggaa atctggctgc tatgttgtta 
cgtgaagaaa ccaaatgttt cgctttcttg gcaggtcacg aatctttcgc tgctgccgaa 420 
ggtgctatca aaatcgctgc aaaagcagac aaagtacgta aagaacctct gcgttgtatc 
ttgaacggtc ttggaaaaga tgctgctcag atcatctctc gtatcaacgg ctttacttat 
gttcaaacac agtttgacta tttcacaggt gaactgaaag tagtacgtga aattgcttac 
tctgacggtc ctcgtgcaaa agtaaaatgc tatggtgcag atgatgtacg tgaaggcgta 
gctatcatgt ggaaagaagg tgtagatgta tctatcacag gtaactctac taacccgacg 
cgtttccaac acccggttgc aggtacttac aagaaagaac gtgtactggc aggtaagcca 
tacttctcag tagcttcagg tggtggtaca ggtcgtactc ttcacccgga taacatggct 
gccggtcctg cttcatacgg tatgactgac actatgggtc gtatgcactc agacgctcag 
ttcgccggtt cttcatccgt tcctgctcac gtagaaatga tgggattcct gggaattggt 
aacaacccaa tggtaggctg tactgtggct tgtgcggtag atgtagctca ggcattggca 102 0 

1026 

aagtaa 

<210> 734 
<211> 351 
<212> DNA 
<213> B.fragilis 

<400> 734 

aagatgaaag agtacattga tgttttaaag aagtggaaag atttcgacgg tagagccaga 60 

agacgtgaat actggatgtt tgtcttgttc atggctattt ttgctattgt cgcaagtatt 12 0 

attgacgcta ttttgggtac gatttgcgta tttgtaggta tttattattt ggctatgctt 180 



60 
120 



240 
300 
360 



480 
540 
600 
660 
720 
780 
840 
900 
960 



292 



ctgcctatga ttgctgttag tatacgtcgt atgcacgata ttggcaaaag cggatggtgg 240 
ttatttatca ctttcgtacc ggtgatcggc agcctttggt atctcttcct gactattcag 300 
gacggacagc cgggtagcaa ccaatacggt gaaaacccta aaggaattta a 351 

<210> 735 
<211> 1056 
<212> DNA 
<213> B.fragilis 

<400> 735 

attggaggag ataagaaaat gaaaacattg caagaattaa cccgtccgaa tatctggaga 60 
ctaaaaccct attcttcggc ccgtgatgaa tatagtgggg cggcagcatc tgtttttctg 120 
gatgctaacg aaaacccgta taacctgccg cacaatcgct atccggatcc gatgcaacgg 180 
gatctgaagt tggaattgtc caagataaaa aaagtagctc ctgcccatat ctttctggga 240 
aatggcagtg atgaggctat tgatttggtg tttcgtgctt tctgtgagcc gggcagagac ^ 
aatgtagttg ctatcgatcc tacgtacggc atgtatcagg tttgcgccga tgtcaatgat 
gtggaatacc gcaaagtgct gcttcacgat gattttcagt tttctgccga tgagttgttg 
gcagttgcgg atgaacggac taagatgatt ttcctttgtt cacccaataa tccgacggga 
aatgatctgc ttcggtctga gataataaag gtgatcaatg atttcgaagg attggtcatt 540 
ctggacgagg cttataatga tttttccgat gaaccctcat ttttgtcaga gttggataag 600 
tatccgaatc tgattatctt acagactttc tcaaaagcgt ttggttgtgc agctattcgt 660 
ttggggatgg cttttgcctc cgaggggatt atcggtgttt tgaataaaat caagtatccg 720 
tataatgtga atcagctgac tcagcaacaa gctatagaaa tgctccacaa atactacgag 780 
atagaacgct gggtaaaaac attgaaggag gagagggggt atctggaaga agctttcgtt 
gagttgcctt gggtattaca ggtatttcca tcgaatgcca acttctttct ggcacgtgtg 
accgatgctg tgaaaattta taattatctg gtgggagagg gtattattgt acgtaatcga 
aattccatat cattgtgtgg caactgcctt cgtgtgactg taggtacacg ggctgagaat 
gccaaactga ttggagcact gaaaaaatat caatag 

<210> 736 
<211> 594 
<212> DNA 
<213> B.fragilis 

<400> 736 

aatataaaac tgcggaataa tttgaattac aaagcattat ttcaaatagc aatgtcattg 60 
atttcctctc cggccaaggc ctgggaggaa attcgtttag aggatagacg ggcagttctt 120 
actgtttttg tctatcctat gattggttta tgtggtttat ccgtattcat tggtgctttg 
tggactaatg gttggggagg accacaaagt ttccagttgg ccatgacgca gtgttgtgcg 
gtggcggtag ctttgttcgg aggttatttt ctggcggctt atgctatcaa tcagatgggg 
ataaaaatgt ttggtatgac caatgatatc cctttggcac agcagtttgc aggttatgcg 3 60 

• ' ■ 420 
480 

ctggtagaag aaaagaatcg gttgcgttac accattttct cgtcgatttt gttaatacta 



ttagttgtca cttttttgtt acatatagta accggattgc ttcctgattt cagtattatc 
ggttggctgc tccaatttta tatcgtttat gtggtatggg aaggagcaag ggttgtgatg 



300 
360 
420 
480 



840 
900 
960 
1020 
1056 



180 
240 
300 



540 



tgtccggcgg taatacaagt tgtgttcaac aagctaacag ctatattaaa ttaa 594 

<210> 737 
<211> 2175 
<212> DNA 
<213> B.fragilis 

<400> 737 

tcgaatcttt gtatttttgt caaaattgaa ttgatacatt atattattaa atcgcacaaa 60 

aacatgaaga agatgcttat ggctgccgga atggctgccg tgatgactgc ttgcggcaca 120 

gccggacaga aagcagccac cgatgccgga aacccttttc tggcagagta ttcaactcct 

ttcggtgttc caccgttcga cctgattaaa gtagagcatt acaaagaagc tttcctgaaa 

ggaatggaag aacagaaaaa agaaatagat gctattgtca atcagcgttc ggttcccgat 

ttcgataata ccatcgctgc attcgatcag agtggagagt tgttaaataa ggtgagtact 

gtgtttagtg gtctgaacag ttgtaacacg aacgatgaaa tgcaggcttt taataaagag 



180 
240 
300 
360 
420 



480 



840 
900 
960 



293 

attactccgt tgctttcggc acatcgggac gatattagtc tgaatccggc tctttttgcc 

cgtgtgaaag aagtttatga acgtcgggag aaactgggat tggataagga gcagaataag 540 

ttactggaag aaacttacaa gaagtttgtt cgtggaggtg ccaatcttga ttctgtggat 600 

caggcgaagt tgcgtcaact caatagtgag atttcgatgt tgcaattgac ttttggacag 660 

aatctgctga aagaaaccaa cgcttttgag ttggtgattg ataagaagga agatctcgcc 72 0 

ggattacccg aaagtcttgt ggcatccgca gccgaagcgg ctaaaggggc aggtatggaa 780 
gagaagtggc ttttcacttt gcacaatccg agtgtaatgc ccttcttgca atatgcagat 
aatcgcgagt tgcgtgagaa aatctttaaa ggatacatca atcgcggcaa caatggcaat 
gaggccgata ataatgaaat cgtgaaaaaa ttggttgctt tgcgtctgga gaaagccaaa 

ttgatgggat atgccgatta tgcttcttat attttggaag accgcatggc aaagaacgag 102 0 

gaaaatgtat atcgtttact gaatcagatc tggactcctg cagtggcgaa agccaaggag 1080 

gaattgtctg atattcagtc tgaaataaag aaggaaggcg ctaactttac ccccgaagga 1140 

tgggattggc gctattattt tgagaaagcg aagaaagcca agttcagttt agacgagaat 12 0 0 

gaagtgcgtc cttatcttga attgaataat gtgcgtgaag gtgctttcta tgtagctaac 12 60 

agactttatg gcattacttt caccgaaatt aaagacattc cgaaaccgca tgaagaggca 132 0 

caggcttttg agtgtaaaga taaagacgga acccatcttg gtgtgctgta tatggacttt 13 8 0 

ttccctcgta atagtaagcg gggaggcgca tggtgtggaa cttatcgttc tcaaacctat 1440 

cgtgacggta aacgtttggc gccggtagtt acgattgtgt gtaactttac caagccttct 1500 

tcgggacagc ctgccctgct tagtgccgat gaggccggta ctttattcca tgaatttggt 1560 

catgcactcc acaatttgtt taaagatgta cactttcatg ccgtatccgg tgtaccgcgt 1620 

gattttgtgg aattaccttc tcaggttatg gagcattggg tattcgaacc ggaggtgctg 1680 

aaaatatatg ccaaacatta tcggaccggt gaagtgattc ctgctgcatt gattgagaaa 1740 

ctcgataaga gtggaaagta tggccagggg tttgccacaa ccgaatatct tgccgcttct 1800 

ctgcttgata tggattacca tgtactgaaa gagattcccc ggaatatgga tgtcactgaa 1860 

tttgaggctg ctgtgctgaa agagcgtggc ttgctaagtc agatacctcc tcgttatcgt 1920 

actacatact tcaatcacat catgaacagc ggctatacgg ctggttatta cagttatatt 1980 

tgggccgaag tgttagatag cgatgctttt gaagcatata aggaaaccgg tgatctgttt 2 040 

aatcaggaag tggcttcccg tttccgtcgt tatattctca ctcccggagg catcgacgat 2100 

gcgatggata tgtataagaa ctttcggggt aaagaaccgg gcatagaacc tttgttgagg 2160 

aataggggac tatag 217 5 

<210> 738 
<211> 738 
<212> DNA 
<213> B.fragilis 

<400> 738 

ataatacaat gtgttttacc attacgttgt tattttaatt atttgttttt atatttgtca 60 
gccaaagctg ttttttacag ttgcaaaatt actctatttg tttcaaatat acactataaa 120 
aggatgttaa taataggaat agcaggcgga acaggctcgg gaaagaccac cgtcgtacgg 180 
aaaatcattg agagtctacc agctggtgaa gtagtattgc tacctcagga ttcatactat 240 
aaagacagta gccacgtacc ggttgaagaa cgccagaata tcaattttga ccatcccgat 3 00 
gcttttgaat ggagcctttt gtctaaacat gttgccctcc ttaaagaagg caagtgtatc 3 60 
gaacaaccca cctattctta tttgacttgt acccgccaac ccgaaacgat ccatattgaa 
ccacgtgaag tggtcataat cgaaggtatc ctggctttat gtgacaaaaa gctgcgcaat 
atgatggatc tgaaaatatt tgtagatgcc gatccggacg aacggttgat ccgtgtgatc 
caacgtgacg tagtggaaag gggccgcact gcagaggctg taatggagcg atatacgcgt 
gtgctgaaac ctatgcattt acagttcatc gaaccatgta aacgctacgc agatttgatt 
gttcccgaag gagggagcaa tcaagtagcc atcaatatat tgaccatgta tataaaaaaa 72 0 
cacatcggta ggccatga 738 

<210> 739 
<211> 1395 
<212> DNA 
<213> B. f ragilis 



420 
480 
540 
600 
660 



<400> 739 

gccatgaaac ggcatctgat aatttactcc ctgctttttc ttcttttctg tgtattgtct 
tgccgcaaca aacaagcagt agctatagag gagtcctctg cacacgatct ggaacaaatc 



60 
120 



294 



aaagatagcg gagaactcgt tgttctgact ctttatagtt ctacttctta tttcatctat 180 

cgtgggcaag acatgggttt ccaatacgaa ctcagtgaac aatttgccaa aagtttagga 2 40 

gtgaaattgc gaatagaagt agccaaaaac gtaccggaac tcatccgaaa gttactaaat 3 00 

ggcgaaggag atattatcgc atacaatatt ccgattacta aagaattaaa agacagcctg 360 

atctattgtg gcgaagaagt aatcacccac caggtaattg tccaacgaac caatgggaaa 42 0 

acaaaaccgc taaaagatgt aaccgagttg gtcggaaaaa acatatatgt gaaaccgggc 480 

aaatattacg aacgattggt taacttgaat aaagagctgg gaggaggcat tctgattcat 540 

caagtaacca atgacagcat taccgccgag gatttgataa cccaagttgc acaaggtaaa 600 

attccttata cagtggctga taatgatgtc gctaagttga atgcgactta ttatcctaat 66 0 

ctgaatacca gtctgtctat cagttttgac caacgcgctt cctgggctgt acgtaaagat 72 0 

tgtccgcaac tggcagcagc agcagacgaa tggcataaac agaatatgac ttcgccggca 780 

tataccgcaa gtatgaaacg atattttgag atcagtaaag caatgcctca ttctcccatt 840 

ttatccttaa aagagggtaa aatctctcat tatgacaact tattcaagaa atatgcgcaa 900 

gagataggtt gggactggcg tctgttggca tccttggcct ataccgaatc gaacttcgat 96 0 

acaactgccg tatcatgggc cggagcaaag ggactgatgc aattaatgcc tgccaccgcc 102 0 

cgtgcaatgg gggttccacc gggcaaagag caaaacccgg aagaaagtat caaagctgcg 10 8 0 

gtgaaataca ttgcagcgac agatcgcagc ctaagcatgg tgccggataa acaggaacgg 1140 

attaagttta tactcgcttc atataatgcc gggctgggac atatttttga cgcaattgca 1200 

ctggcagata aatacggtaa gaataaaacc gtatggacag acaatgtgga aaattacatc 12 60 

ctactaaaaa gcaatgaaga atatttcact gatccggtat gcaaaaacgg atatttccgt 13 2 0 

ggaatagaga cctacaattt cgtcagagac attaactcaa gatatgaatc atataagaag 13 80 

aaaataaaaa gttga 1395 

<210> 740 
<211> 1431 
<212> DNA 
<213> B.fragilis 

<400> 740 

tccaatatta gattagtatc ccggtttgtt gcctcaaagc tccaggattc cgttgggttg 60 

ctgcaaaagg tgttgatgga tttcgcttgt ttgtggaaca atcccatttt cccgttttgt 12 0 

tttcccgcgg gtcctcattt aggtccttca gtagtgtcat ggcgtgcggg gagctgtcgc 18 0 

gagttcgccg atttggtagt gtatgtaatg cgtgctttgg gtattccttg cgggacagac 240 

tatatgccga tgcgtggaga taataacgtg ccgcatttct ggaattttac attggataaa 3 00 

gatggaaaaa cgtatattac ggaatttccc gatcctaatt ggaaacgggc tgtgagtatg 36 0 

tataatccta aggcaaaggt ataccggaat acgtatggct taaactggaa agatgtaaag 42 0 

agacaacagg gaaaaatgat gcatccggcg tttcgaaaac ctctatatca ggatgtcacg 480 

gctgtgtatg ccgacagctt gaatcgtgat ctggtagtgt cttctgatat tttgtgtaag 540 

gaagttcaca aaggagatat tgtctatttt tgcctttcca caaggatgga ttgggtacct 600 

atagcatgga ctgtttttga agaagactca ttgcgctttc aagatacgga aggtagtgtg 660 

attggttgtt tggctacatg gaatggaaaa cgtcttgtga tgcagtccga gccgtttacc 72 0 

tatgataaaa tgtcaggaac gattgctttg ctcactcctc aaagtgaaaa agaagatata 78 0 

accttgtatt ttaagtttcc gctgttctgc gacttaggta tccttcgtat gcccggagga 840 

gtttttgaag gaagtaatga ttcgcagttt cgctctgcag atacattgta ttatgtaaaa 90 0 

caatggcctt tccgcttgaa caacactatt tttccggaga aagaaaagtc ttatcgctat 960 

gttcggtaca aggggccgaa ggggagttat tgcaatatag cagagatggc tttctttgaa 102 0 

gatacctcgg atacgttggc gttgaaaggg cggatcatcg gaactccggg ttgttttcag 1080 

aaagacggct cgcatgatta ttacaaagta tatgatggca atccctatac ttatatggat 114 0 

tataagactc ctgatgaggg gtgggtcgga ttagattttg gcattcctcg ccggataaag 12 0 0 

aaatttactt atattcctcg taattcggat aattttatcc ataaaggaga tgtatatgaa 12 60 

ttattctatt ggcatgacaa gaaatggaat tcgttaggtc ggcaagtggc aaaagcagat 132 0 

tctttaaatt atgtaattcc gaaaggggta gccttatttt taaagaatca tacggaggga 13 8 0 

aaggacgaac ggatctttaa gaagaccgat gggagacaac agttttggta g 1431 

<210> 741 
<211> 720 
<212> DNA 
<213> B.fragilis 



295 



<400> 741 

gaaaggaaaa gtatgaaaac agtagtagac aaagcctctt caaggggtta tttcaatcat 
ggttggctga aaacccacca tacatttagt tttgcaaact attataaccc gtcaagaatg 12 0 
catttcggcg tattgagggt actgaatgat gatagcgttg accctgaaat gggattcgat 180 
acacaccctc accagaatat ggaagtcatt tctatccccc tgaaggggta tctgagacat 
ggcgacagcg taaaaaacac ccggacaatc acacccggcg atatccaggt tatgagtacg 
gggaaaggta tcttccacag tgaatataat ggaagtgaca aagagcaatt ggaatttttg 
caaatatggg tattcccgag aattgaaaat acagagccgg aatataacaa ctacgatatt 
cgtcctttac tgaaaagaaa cgaacttgct ctaattattt caccggacgg taaagtaccc 
gcttccatta agcaagatgc atggttttct atgggaacat ttgacgcagg aaagagtttc 540 
gaatacaagt tgcatcagga aggtaacgga gtttatcttt ttatcatcga aggagatgtg 
gaagttgcag gcaaccgatt gtcacgacgt gacggcatcg gtctttggga tacaaagagc 
tttaaagtgg aaataaccca agaagcgacc ttattgctaa tggaagtacc aatgcgataa 

<210> 742 

<211> 1482 

<212> DNA 

<213> B. fragilis 



60 



240 
300 
360 
420 
480 



600 
660 
720 



60 



180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 



<400> 742 

gtatttcgga ggaagctaca aagtacagac aaatccggac ggaaagccgg ccatcctctt 
cacgtttccc tcaggaagaa attgacttct tcccctatcc gctcgtctac ttctacagaa 120 
agttgtatct ttgcaacctg caaacttaaa cccttcttta tgtataccaa caaacagatc 1 Qn 
tggagtgtca gttacccgat tctcctgagc ttgcttgcgc aaaatgtcat caacgtcacc 
gacactgcct ttctgggacg tgtcagtgag atagccctcg gtgcttctgc catgggtggg 
cttttctata tttgtatttt caccattgcc ttcggattca gcaccggttc ccagatcgtc 
attgcccgcc gcaacggtga agcacgttac ggcgatgtag gtccggtcat gattcaggga 
gtcttgttcc tgttggtcat ggctctcctg ctcttcggat tcaccaaagc gttcggcgga 
aacatcatgc gcctgctggt ctcttccgaa agcatttatg atgccacgat ggagtttctc 
gactggcgca tcttcgggtt cttcttctca tttgtcaacg tgatgttccg ggcactctac 
atcggaatca cccgcaccaa ggtgctcacc atcaatgcag tggttatggc gctgaccaat 
gtggtactgg actatgccct gatattcgga cacttcgggc ttccggaaat gggcatcaaa 
ggagcagcca ttgcttccgt aatcgccgaa gcggcttctc tgctcttttt cctgatttat 
acgtacatca ccgtcaacct gaaaaagtat ggtctcaacc gcttgcggtc gttcgacccg 
gttttgttga tgcgcattct cagtatatcg tgcttcacca tgcttcagta tttcctgtcg 
atggccacct ggtttgtttt ctttgtggca gtggagaggt tgggacagcg cgaactcgct 
attgccaaca tcgtgagaag catctacatc gttatgctga ttccggtaaa tgcactggcc 
accacgacca acagcctggt gagcaacgcc atcggcgcgg gaggcatcaa ctacgtgatg 
ccgttgataa acaaaatcgg gcgcttctct ttcctgatca tgctgggact ggtcatcata 1140 
accgccctgt tcccacaagc attgctctcg gtatacacca acgaaacggc attgatcaat 12 00 
gaatcggtat catcggtata tgtcatctgc gtggccatgc tgattgcctc tgttgctaac 1260 
gtcgtcttca acggaatatc gggtacaggc aatactcaag cagccctgat gctcgaagcc 132 0 
atcacgattg caatctacgg atcgtacatc attttcatcg gaatgtgggt gaaagctccc 13 80 
atcgaatggt gctttacgat tgagattctg tactatacac tgttgctcgc cacaagctat 
atttatttca aaaaagcaaa atggcagaac aaaaagatat aa 

<210> 743 
<211> 1269 
<212> DNA 
<213> B. fragilis 

<400> 743 

tatatgacag gagtagcgga aaagaaaaaa agaatgataa agatttacct attatgggtg 
ctgttagtcg gtagcttgtg ctgttcttgt acaggaaata aacgattgga atatgcgttg 120 
gagtttgccg gggagaatag gggagagctt gaaaaagttt tggaacacta taatgatagc 
ggactgaaac aggatgccgc acgctttttg attgaaaata tgccccgcta ttttagttat 
gaaggatggc aattggatac gttaaaagca attcatgcag ccacagaaca tacggatgga 
tgggtgaata aaaaagatcg caaaaaatgg gaacattttt cttttcggac tttaaagaaa 3 60 
gtttatgatg ctaaagtgat taaagctgag ttcttgattc atcacataga tcaagccttt 420 



1440 
1482 



60 



180 
240 
300 



296 



480 



y ^ > — — ^r---_i — ■ — ■ — 

acttttccga aggagaaagc cagagaagag atgcattttt tgtcatgcga agtagaagat 
gaagagattc ttcctttaat ttga 



840 
900 
960 



gaagtttttg aaaaaagatc ctggaataaa tatttgccat ttgatgattt ctgtgaattg 

attttgccat atcggattgg tgatgaaccc ttggaggaat ggcgtggttg gtatagggag 540 

cgttatgaat ctatattgga ttcgctctat caagggacag atgtggtaga agccaccgat 600 

cgtttagggg cttatttgcg tcaggaaaaa gacttcaggt atagtgttga gctggactta 660 

ccccatttag gtgcaggttt tttgctagct aacagggttg gaagctgtga ggcgtcttgt 72 0 

gattttacgg tctatgtgtt acgtgcgctt ggtattcccg ctgcaacgga tatttatcat 780 
tatggacccg gtaagggagc cggtcatgtc tggaatgtat tgagggatac aaccggtggc 
tatgttcctt tctggtttat tcagactaaa gtggagcggg gcggaagtga taaacgagaa 
aaagggaagg tatacaggcg gtgttttgga gcacagcagg agaaagtatc aggtatccgc 

cgcgatcggt ctgttccgtt tccgctgaaa gatccatatt taaaagatgt tacaagtgac 102 0 

tatttcccgg caaatcaggt tacaatagaa attgatcctc aggttgataa aaagtatatc 10 8 0 

tgcctgggtg tgtttacatt ggaaggatgt atgcccatag atataactgt gcagaaagga 1140 

aataaagcaa cctttatgaa tgtagaaccc ggaattttgt ttcaaccgct atatgataac 12 0 0 

gggatgaagt gggtggcagc cggataccct ttccagtgta gacgaaaagg gagaggtgaa 12 60 

i. j_ x. 1269 
gtatcataa 

<210> 744 

<211> 504 

<212> DNA 

<213> B. fragilis 

<400> 744 

aacgaaaaag gtatgaaaaa gattattaac ccgtggaagg ggatggaagg atataattgc 60 

tttggttgtg cccctaacaa tgaagccggt gtgaaaatgg aattttatga ggataacgat 12 0 

gaagtgatta gcatctggcg tccccgtccc gaataccagg gatggattga tacactacat 180 

ggaggtatcc aggccgtact tttggatgaa atctgtgcat gggttattct ccggaagtta 240 

cagactacgg gggtgacatc aaaaatggag acacgttatc gtaagtcgat cagtactaat 



300 

gattcacatg tagtgctcaa agcgcatatt aaagaagtga agcgtaacat tgtgataatt 3 60 
gaggcacgtc tttataataa agatgaggaa ttgtgtacag aagctctctg cacttacttc 42 0 

y ^ -- j *- 480 

504 



<210> 745 
<211> 1017 
<212> DNA 
<213> B. fragilis 

<400> 745 

cccatattta taatgaaaaa gattatttta agcagcgtat tattgctatc cggcttcttt 
atccaagcgc aacaagctcc cgagaaaatc agctttaatt ccaatggtga atttaagata 12 0 
gcacaattta ccgatatgca cttgggacat gatcaggaga aagaccgaat agtgggagat 180 
atgatcaaag aagtacttga ttctgaaaag cctgacctcg tgatatttac aggagacaat 240 
actactatgg atgaagtccg gcaagcttgg gaagccatat ctgccgaact gtcggcccgc 3 00 
cggatccctt ggacagccgt attgggaaat catgatgacg aatatgccgt aaagcgtgat 3 60 
gaaatcattc gtatcatccg ggaacaaccg tattgtatga tgaaacaagt ggcagaagga 42 0 
ataaaaggag aaggtaacca tattctccct atttacagtt cgaaagacgg aaataaaaca 480 
gccgcattgc tttattgcct ggacacaaat gcttattcga agataaaaac agtaaaagga 
tatgactgga tcggacgatc tcaaatagac tggtactccc gcgaaagccg gaagtacaca 
gaacggaatg agggacaacc attacctgca ttgaccttcc tccatattcc gctaccggag 
tacacccaag catgggaatc gttcgaaacc aaacgttacg gagaccgtaa cgaaaaagaa 72 0 
tgcagtcccc atataaacag cggtatgttt gccaatatgc tggaatgcgg tgatgttatg nQn 
ggtgtttttg ccggacacga ccacgtaaac gattacatcg ctactctcta taacatcgct 
ttaggatatg gacgagcttc gggcggaaaa aatacttacg gagataaaac accaggcagt 
cgtatcatcg tattgaaaga aggtaaacgt gaattcgata cttggcttcg ggaaaaagga 
aatatggcaa aactgaatgt atgtacatat cccggctctt ttgtaaaaga gaaatag 1017 



60 



540 
600 
660 



780 
840 
900 
960 



<210> 746 
<211> 3165 
<212> DNA 



297 



<213> B.fragilis 



<400> 746 

ttagtactta tgaacagtaa atttctcctg ctactctgta gtatgttatt gtgcacatca 
cttgcattcg cacaatcagt caaagtaaca ggtacagtca cagacaaaat gggggcagta 
attggtgcca ctatcatggt gaaaaactca tcaaacggaa ctgtcaccga tatagatggt 
cgttacagca tcgaagttcc taaaaacgca acactactat tctctttcgt aggttacagc 
acagtagaga aagaggtagg taacaacact gtaatcaatg ttgaactgtc cgatgacatt 
caggccatcg acgaggtagt ggtcactgca atcggtatca agcagcaaaa gaagaagatc 3 60 
ggttacacaa cccaacaaat caacagtgag gtattgaatg ccactcccag tctgaatgtg 42 0 
ggctcggccc tttccggaca agtagccggt ctgttggtag ccaaccctac cggtattttc 480 
caggcaccga gtttcaaact gcgcggcaac gcaccattgg ttgtactgga cggagttccg 540 
gtagaaaccg actttttcga catctcaagt gagaatattg aaagtgtcaa tgtactaaaa 600 
ggtacggcag cctcagcttt atacggttca cgcgggaaaa acggagcaat tctgatcacc 66 0 
agtaaaacgg ccaaaaaaga aggcttggaa atcaacttct ctaccaacaa catgatcaca 72 0 
gccggctttg cagtgcttcc cgagacacaa catcaatacg gtagcggttc aaatggtaaa 780 
tatgaattct gggacggtgc agatggcggc atttcggacg gtgacatgac ttggggaccc 
aaattaaatg taggaaccaa agtagctcag tggaacagcc cgatcaggga taaagtgact 
ggaaaagaga ttccctggtg gggagatgta aaaggtactc agtatgatga caaatcgcgc 
tatgaacgta tacctatcga ctgggtatcc catgacaacc tgaaagactt tctgcaaacc 
ggactagtaa ccaacaataa tatctcaata gcttataaag gagaaaaagc acgctacttc 
gtcaccggac aatatgctta ccaaaaggga caggtgcctt ctactgaaat gcacagtgga 1140 
ggtatcaact tcaactctac ctttgatctg gctaaaaact tgcagctgga tgccaatctg 10nn 
gcctacaaca aaatagttgc cccgagttat ccgcgctacg gatacggacc taaaaaccac 
atgtacacca tcgttgtatg gatgggagac gatgtgaacg gtaaagaact ccaaaaacac 
aaatacgttc ccggacagga agggtatcgg caggcaagtt acaattatgc atggtataat 
aatccttact ttgcagccga agagctccag caatccgaaa gtcgggatgt ggtgaacggg 
caagtccgcc tgaattatca aatcctcccc aatctgaaca tacagggacg tgccgcctta 
cgccagaaaa caattcttca ggaaatgaaa gtacccaaaa cttacatgaa ctacggtgac 1560 
tcccgggaag gtgactacaa agtatggaat gaccgtcaaa ctaatgtaga cgctgatgta 162 0 
ctggctacct acactcaaga tctgactccg gatatcctct tcaccctgaa tgccggaact 1680 
tcggtattct accgtaatta ccgtcaggaa tatcagtcta ccgacggttt gattgttcca 
ttcgtataca gtatcaaaaa cacacaaggt ccttccatta ccgatgccaa ccgaaatgaa 
aaatcaatcc gtagtattta tggatcaatc aaccttgatc tttacaaata tgcctatctg 1860 
acgttgacag gacgtaatga ctggtcatct actctggcaa aaggcagtaa ctcttacttc 1 Q ™ 
tatccttctg tcgcactgag tactatggta tccgaataca tcaaattgcc aacatttatg 
gactatctca aaatgtatgg ttcatgggcg gttgtctcta ccgacctgtc tccctaccag 
atcatgtcca cttatacaaa agattccaat tacggttcaa atccatctat ttcctaccct 
tcttctctgg tcaactacta cattaaacct cagaaaacga catcctggga agccggattg 2160 
tcaactgcat tcttccgtaa ccggttatct ttcgacctga cttattatca tacgatcgat 2220 
gaaaaccaga ttatcgacct gaatatttcg aatgcatcag gtttcaccag ccgtaaagtg 2280 
aacggtaacc aatataccac caacggatgg gaaatcatgg ccaatgtaca ggctatcaaa 2340 
aataaagatt ttcaatggga tttctccttg aactggagta agagtgtaaa aaaattgacg 2400 
gaaatatatg gcggacagaa aaagttcggt gacctgaaag tgggcgaccg tgccgatgca 2 460 
ttttacggtt cacaatggca gaaaagtgct gatggagaat tgattctgga tgaaaacggt 2 52 0 
atgcctacta aagacgcata taaacaatat ctgggacatc tggatccgaa cttccgaatg 2580 
ggtatgcaaa atactttccg ctacaaagac ttcacactgt ctgtcgatct ggacggcgct 2 640 
tataaaggag taatctattc tgtattgagc gaaaagttat ggtggggagg aaagcatccg 2700 
gaatcagtgg agtacaggga tgcacaatat gccgtcggac acccgatata tgtacccaat 2760 
ggggtagtcg taaccggagg agagctgaaa cgtgacatcg acggtaatgt aatctctgac 2 82 0 
acacgcacct acaaacgtaa cacgacagcg gtcgattggc aacaatggtg ccagaactat 2880 
ccttatcaag cttatgtatc ttcgaaagaa aatgccaaat ttgccaatgt attcgaccgt 2940 
agctacatta agctccgccg agtggcactg acttacaact tcaccaaact actttcgaaa 3 000 
caaagccccg tgaaaggact tacagctaca gtgtttggca acaacctagc tgtctggaaa 3060 
aaagtcccct ttgtcgatcc ggactacacc ggagacagca acgacggagg tgccaacgat 312 0 
ccaaccgcac gctatatcgg catgggcgtc aacataaaat tctaa 3165 



60 
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1200 
1260 
1320 
1380 
1440 
1500 



1740 
1800 



1920 
1980 
2040 
2100 



<210> 747 
<211> 1251 



298 



60 



240 
300 
360 



<212> DNA 
<213> B . f ragilis 

<400> 747 

ccgtatatta tagaaatgtc tacctacgca ccctttgcca agccgctata cgtaatgctc 
aagcccgtag gggctgtatg caaccttgca tgtgattatt gttattatct ggaaaagtcc 12 0 
cggctttatc aagaaaatcc caaacatgtg atgagcgatg aactgcttga aaagtttatc 18 0 
gagcaataca tcaattcgca aaccatgccg caagtactct tcacctggca cggaggagag 
acattgatgc gtccactctc tttctacaaa aaagcaatgg agttgcagaa gaaatatgcc 
cgtggaagaa gcatagacaa ctgcatacag actaacggaa ccctgttgac cgacgagtgg 
tgtgagtttt ttcgtgaaaa caactggctg gtaggagtct cgatagacgg tcctcaagag 42 0 
tttcatgatg aataccggaa aaacaagctt ggcaaaccct cgtttgtgaa agtcacgaat 480 
ggcatcaatc ttttgaaaaa gcatggagta gaatggaatg ccatggcggt agtgaatgac 540 
tttaatgctg attatccgtt ggacttttat cactttttca aagaattagg ttgccattat 
attcagttcg ctcccattgt ggaacggatc ttcccgcatc aggacggacg tcatctggcc 
tcactggcac agcgcgaagg aggagaactg gcagaatttt ccgtaacacc ggagcaatgg 
ggaaactttc tctgtacact cttcgatgaa tgggtgaaag aagatgtagg cgactattat 
atccaactct tcgattctac ccttgccaac tgggtaggcg aacaaccggg agtatgctcc 
atggcaaaaa catgcggaca cgccggcgta atggaattca acggagacgt ctactcatgt 
gaccatttcg tgtttccgga attcaaactg ggcaacattt acaatcaaac tttggtagag 
atgatgtata gtgaacgcca gactgctttc ggacaaatga aacaaaagtc acttcccacc 
cagtgcaaag agtgcgaatt tttatttgcc tgcaatggtg aatgccccaa aaatcgtttt 
tgtcgcacag caaatggtga accgggacta aactatctgt gcaaaggata tcatcaattt 1140 
ttcaagcatg tggctcctta tatggatttc atgaaaaacg aattgatgaa ccagcggccg 12 00 
ccggccaatg tgatggacgc tatcaaagaa aacaaattga tcatagatta a 12 51 

<210> 748 
<211> 615 
<212> DNA 
<213> B.fragilis 
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60 



600 
615 



<400> 748 

aaagagatta tggtaataaa gaaagcggtt tatgtatggg tgatcgggat actgggaatg 

atttcattcg cagcttgttc atctgcttct caaggagaag tcccttcgac atccaatgct 12 0 

gcgttggata atatttttgc acgtaaaagt gtgcgggctt atttagacaa ggaagtagaa 180 

aaagaaaaaa tagattggat gctacgtgcc ggtatggctg caccatccgg aaaagatatt 240 

cgtccgtggg agtttgtatt ggtcaccgac cgggttgctc ttgattcgat ggccgctgct 300 

ttaccttatg caaagatgct gactcaagct cgctatgcca ttgttgtatg tggagatgta 3 60 

gctcaatctt cctattggta tctggattgt tcggctgctg cacagaatat attattggct 42 0 

gccgaagcac aggggctggg tgcggtatgg acagctgctt atccttatga agaccgtatc 480 

agggttgttc gtaaatatac ggagcttccg gggaatatag tgcccctgtg tgtgattccg 540 
tttggttatc cggcaactgc ccaagagcct aaacagaaat ttgatgagaa aaaaattcat 
tacgataagt tttaa 

<210> 749 
<211> 849 
<212> DNA 
<213> B.fragilis 

<400> 749 

aaagtaacta ttatgacagc aaacgaagtc catttgattt atttctcgcc tacccacacc 

tctaaacaag ttggagaggc aattgttcgt ggaaccggaa taacaaatgt gataaacacg 12 0 

aatttaacac aacaggcaac tcaggattta gtgattgccg aatctgcatt agctattatt 18 0 

gtcgtgccgg tatatggagg tcgtgtagcc cctttggcca tggatcgtct ggcaagtgtg 240 

cgcggaagta atactccggc ggttatcgtg gtggtatacg gtaaccgtgc ttacgaaaaa 3 00 

tcgttgatgg aacttgatta ttgggctatt caacaggggt ttaaagtgat tgccggtgct 3 60 

actttcatag gagaacactc ttatagtaca gaaaaatatc ccgtagctgc cggacgtcct 

gacgaacgtg accttgctgt ggcagccgat tttggaaagc agatttcaga taaaatagca 



)0 



420 
480 



tctgctaccg aaccggaaaa attatatgcg gtcgatgtcc gtaaaatccg gcgtccgcgt 540 
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cagccttttt ttccattgtt tcgctttttg cggaaagtga ttgccttgcg taaaagtgga 600 

gttccccttc cccgtactcc ttgggtggaa gatgaatctt tgtgtactca ctgcggtacg 660 

tgtgcgaaaa tgtgtcctgt aagcgccata gccaaaggtg acgagttgaa tacggatgcc 720 

gaacgctgca ttaaatgttg tgcctgtgta aagggatgcc cacagaaagc cagagtatat 7 80 

gataccccgt ttgccgtact actgtcgcaa tgttttgtta agcagaaaga tccctgtacg 840 

ttggtttaa 849 

<210> 750 
<211> 906 
<212> DNA 
<213> B.fragilis 

<400> 750 

aatgtatata tatatatgaa gaaagaggtt tggataaaac tggtgaaacg aatcggaaac 6 0 

tggattgtga atatctgttt ctattcttgt gtggcttttg ttgcctggat ggtattgcag 120 

gtgttttgcc tgacttcttt caaaattccc tccaattcaa tggaaccggc attgctttcg 180 

ggagacaaaa tactggtgga taaatggacc ggtggggcac gtctgtttaa tatctttgcg 240 

tcattgcgag gagaagaagt ggatatctat cgtctaccgg gtttcggatc gtttcagcgg 3 00 

gacgatgtgc ttgtttttaa tttcccttat caggatggga gcgacagcat cggatttgat 360 

ataatgaagt attatgtgaa acggtgtatt gccttgccgg gtgatacttt ggaaatacgt 42 0 

aagggctatt atcatataaa aggaatcaca gacagtgtgg ggaatgtgca ggcgcaacat 480 

cggattgcac gtgtcagaag ggaagattca catgggatcg tgatggatgc ttttccgtgg 540 

gacggacgtc tgggatggac cattcaggaa ttcggacctc ttccggtacc ggccaaaggg 6 00 

caggtggtga aaatagatac attgtcttgt ttgctttacg gaagattgat ccattgggag 660 

cagaagaaga gactgcggca aaaaggagag gcggtatgtc tgggcgatag tgcaataacg 720 

gaatataagt tcacagagaa ttactatttc gtatcgggag ataatatgga aaattccaag 7 80 

gattcacgtt attggggaat gttgcccgaa tcatatattg taggtagggc atttacaata 840 

tggcggtcgg acgatccttt acgtggaaag attcgttgga accgggtatt taaaagaata 900 

aaatga 9 06 

<210> 751 
<211> 1278 
<212> DNA 
<213> B.fragilis 

<220> 

<221> unsure 

<222> (524) , (1246) , (1269) , (1270) , (1271) 

<223> Identity of nucleotide sequences at the above locations are unknown. 
<400> 751 

ttcaaagata tgttcgacaa tttaagcgaa agactcgaaa ggtcgtttaa gattctgaaa 60 

ggtgaaggca aaatcaccga gatcaacgta gcagaaaccc tgaaagacgt gcgcaaggca 12 0 

ctgctcgatg ccgacgttaa ctataaagta gccaaaggat tcactgatac ggtgaaggaa 180 

aaggcactgg gacagaacgt gctcacagcc gtaaaaccga gccagttgat ggtgaagatt 2 40 

gttcatgacg aactgaccca gctgatgggt ggagaaactg tcgaaatcga caccaaaggt 3 00 

cagccggcag tcatcctgat gtccggtttg caaggttcgg gtaagaccac tttctcgggt 3 60 

aagctggccc gcatgctgaa aaccaagaag aacaaacgcc cgttgctcgt tgcatgtgac 420 

gtttaccgtc cggcagctat cgagcagctt cgcgtattgg ccgaacagat tgacgtaccg 480 

atgtactcgg agatcgacag caaagatccg gtttccatcg ccangaatgc catcaaagaa 540 

gcacgtgcca agggatacga tctggtaatt gtcgatacgg ccggacgtct ggcagtcgac 600 

gaacagatga tgaatgagat cgctgccatc aaagaagcca tccagcccaa cgaaattctg 66 0 

ttcgtggtag actctatgac cggacaagat gcggtcaaca cagccaaaga gttcaacgaa 72 0 

cgcctcgact ttgacggcgt ggtgctgacc aagctcgacg gtgatacccg cggtggtgcg 780 

gccctctcta tccgttcggt agtgaacaaa cctatcaagt ttgtaggtac gggcgagaaa 840 

ctcgatgcca tcgaccagtt ccaccctgcc cgtatggccg accgtatcct gggtatgggt 900 

gacatcgttt cgttggtgga acgcgcacag gaacaatatg acgaagaaga agctaaacgc 960 

ctccaaaaga agattgccaa gaaccagttc gacttcaacg acttcctcag ccagatgtcc 1020 

cagattaaga aaatgggtaa tctgaaagag ctcgcttcaa tgattccggg tgtgggcaag 1080 



300 



gccatcaaag atatcgatat cgacgacaac gctttcaaaa gcatcgaagc catcatctac 

tccatgactc cggaggaacg cagcaatccg ggcatcctga acggttcgcg ccgtacacgt 

atcgccaaag gtagcggtac gactagtctt caccgcgggt tccagnggcg ctctattggc 
atcagtttnn nagatcac 



1140 
1200 
1260 
1278 



<210> 752 
<211> 651 
<212> DNA 
<213> B.fragilis 



<400> 752 

aaaagataca 

gcaagttatg 

ctgctaatga 

agtaataccg 

ggcaaacgta 

ccgcaaggaa 

gatcaccgtt 

gtgctggtca 

ggagctatgg 

ctggccacag 

aaggatactc 



taattatgaa 
ctgccgagcg 
aagcgttatc 
atttatccga 
cagctccgtc 
cttacctgta 
ctgcggttgc 
gtgatctgtc 
atgcagggat 
tcccgcgtgc 
aaatgccgat 



aaagttacag 
gacgattcag 
cgaacgtcat 
tttactctgg 
agccatgaat 
tgatgctaaa 
tggcggtcag 
taagctgggt 
tgtttcgcaa 
atcgatggat 
gatgaatcat 



tttttagtgt 
ttacctaagc 
tctactcgtg 
gctgccaatg 
cgtcaggata 
gggcataagc 
gcctttgtca 
gatgcgaaga 
aacatctcgt 
ctggtacggc 
ccggttggat 



gtttcctgct 
ccgacatgaa 
aatatgcttc 
gcattaaccg 
tcgacatata 
tcaatctgat 
acaatgcgcc 
gtaatcatgt 
tgttctgttc 
tcaaagctgc 
attttaaatg 



tttgtcggtg 
tcgtgccgga 
gaaggctttg 
gtcatctgaa 
tgtggtgctt 
ttctgaggga 
ggtatcgttg 
tcaattgatg 
ggctgcacgt 
tttgaaatta 
a 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

651 



<210> 753 
<211> 600 
<212> DNA 
<213> B.fragilis 



<400> 753 

aggaatttaa 

gcccgttttc 

ctttatcttt 

atggaatcat 

atcatgaatt 

ttatcttcct 

tctgagcagg 

gacacttcac 

tgccaggttg 

gtgcactact 



tacgtacata 
tgtttaatca 
gcatcagatt 
acaaccgatt 
atttgatgga 
ctatgccaac 
gagcggcaca 
aacatgccca 
agatgaaaca 
attataaagg 



ttattgtcat 
aaaaaatatt 
taaaatggta 
actagagcac 
acataagaca 
actttcaaag 
aatgcttact 
ttttctgtgc 
agtggaaggt 
tgtttgtaag 



aagaaacggc 
cgtattttgt 
atgaatacat 
aacatcaaac 
cacccatcgg 
actacagtct 
attgacgagc 
aagcgttgcg 
cttcaaatgg 
aaatgtttaa 



atacgaaagc 
ttgcgaaaat 
ttttggaaag 
cgtccatgca 
ctgatgaaat 
ataatacctt 
ggaacaccaa 
ggcgtattta 
atggacatga 
ataatatacg 



aattttgtat 
aaaaagaact 
aagaaagaaa 
gcgaattgct 
atatacggag 
gaggctgttt 
ttttgatgcg 
tgatttgaaa 
agtaagcgaa 
tattgactaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 



<210> 754 
<211> 1023 
<212> DNA 
<213> B.fragilis 



<400> 754 

acaaagttaa 

tatatcctga 

ggacgtatcg 

atggcacgaa 

gacctggtaa 

ttatgcgaaa 

ggtttcttat 

gtcattattg 

tgtcctattt 

ggagttatcg 

gccggtggtt 

caccaagaag 

tcgatcatcg 



tgaaacaaat 
ctaatgatga 
gaatcaaaga 
aagctgtcaa 
ttgtagctac 
gagtggggtt 
atgcattaga 
ttggtgccga 
tcgctgatgg 
attccgtttt 
cagtttgttc 
gacgtacagt 
aaagaaacca 



caatgcagta 
gatatccaga 
aagacgcatc 
acaactgatg 
caccactccc 
gaaaaatgca 
aaccggggct 
taaaatgtcg 
tgcagcagct 
aagaacagac 
cccttcttat 
atttaaatat 
actgacaaaa 



attacaggag 
attgtagata 
ctgaatgaag 
caacgtactc 
gattatcggc 
tttgccttcg 
aactttatcc 
tctgtgatag 
ttcttgttgg 
ggcaaaggac 
tttacagttg 
gctgtagcca 
gatgaaatag 



ttggaggata 
ccaccgatga 
aaggactcgg 
aaagtaatcc 
ttccttcaac 
acatgcaagc 
gttcgggaaa 
actataccga 
aacccacaac 
ttcctttttt 
acaaccacat 
atatgtcaga 
actgggtcgt 



tgtgcccgat 
atggatcatg 
cacctcatac 
ggatgatatt 
ggcttccatt 
ggtctgcagc 
atacaaaaaa 
tcgtgccacc 
agatcattta 
acacatgaaa 
gcattacctt 
tgcatgtgag 
tcctcaccag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 
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gccaatcaac gtatcatcag tgctgttgcc caacgtctgg atgtaccatt agaaaaggtt 840 

atgatcaata tcgaacacta cggcaatacc agtgcaggta cgcttccatt atgcatttgg 900 

gatttcgaaa ataaactcaa aaaaggtgat aatttgattt tcaccgcttt cggagccgga 960 

tttgcctggg gagctgttta cgttaaatgg ggatatgatg gcaagacaaa taacgcatgt 102 0 

tag 1023 

<210> 755 
<211> 864 
<212> DNA 
<213> B.fragilis 

<400> 755 

gatattgtaa ttatgttaag aatcgcagta caagccaaag ggcgtctttt tgaagaaacg 6 0 

atggcccttc ttgaagaatc agacatcaaa ctgagcacaa ccaaacgtac tttactcgta 120 

caatcgtcca actttccggt tgaggtactt tttctccgtg acgatgatat tccccaatct 180 

gtagctacag gagttgccga cttgggtata gtaggagaaa acgaatttgt agagaggcag 240 

gaagatgccg aaatcattaa gcgtctcggg ttcagcaaat gccgtttgtc tttggctatg 3 00 

cccaaagaca ttgaatatcc cggtttgagt tggtttaacg gaaagaagat agctacttcc 3 60 

tatcccggaa ttttagatgc ttttatgaaa agtaacgggg tgaaggctga agtgcatgtc 420 

attaccggtt ctgtagaagt tgctcccggc atcggattgg cggatgctat tttcgatatt 480 

gtcagttccg gttctactct agtcagcaat cgcctgaaag aagtggaggt cgtaatgaga 540 

tcagaagctt tgctgatagg caacaagaat atgagtaagg agaaaaaaga gatattggac 600 

gaattgcttt tccgcatgga tgctgtgaaa actgctgaag ataaaaagta cgtactgatg 660 

aatgctccta aagataaact ggaagatatt attgctgtgc taccgggtat gaagagtcct 720 

actgtgatgc cgttggcaca agatggttgg tgctctgtac atacagtgct cgatgagaaa 780 

cggttttggg agatcatagg taagctgaaa gcgctgggag cggaaggtat tttggtgttg 840 

cctattgaga agatgattat atag 864 

<210> 756 
<211> 462 
<212> DNA 
<213> B.fragilis 

<400> 756 

gaaaataaaa tgaaactggc tcctataaat ataaagaata aacgtgctac tttcgactat 60 

gagttgatcg atacttatac agcaggtatt gtgttgaccg ggacggagat taagtccatc 12 0 

cgtctgggta aggcaagctt ggtagatacg ttttgctatt ttgcgaaagg cgagttgtgg 180 

gtgaagaata tgcacattgc cgaatatttt tatggctcgt ataataatca tgcggcccga 240 

cgtgaccgta agttgctatt gagcaaaaag gagctgaata aattggaaag agggacgaaa 3 00 

gacgccggat tcaccattgt ccctgtgcgt ttgtttatta atgaaagagg tttggccaaa 3 60 

gtggttgtag ctttggctaa aggtaaaaag caatatgata aacgggaggc tttgaaagaa 42 0 

aaagacgacc gtcgtgatat ggacaggatg tttaaacgat ga 462 

<210> 757 
<211> 477 
<212> DNA 
<213> B.fragilis 

<400> 757 

aaacaaagaa tgaagaaagt attatcatta gtagctttgg ccatgatcag caccattatg 60 

tttgctgtaa acgatggagt caaagcagat caaaacaaaa aagaggcaaa gagcggtgag 120 

gttatcgtga tgaataaaga gatgtttatc aacgatgtct ttgattacca gaattcaaaa 180 

gagtggaaat ataaaggtga taaacctgcc attatcgacc tgtatgcaga ttggtgcggt 240 

ccctgccgca tgacagcccc gattatgaaa tcgcttgcta aagaatatga cggaaaaatc 3 00 

gtaatatata aggtgaacgt ggataaagaa aaggaactgg ctgcactatt caatgcaaca 3 60 

agtattcccc tctttgtatt tatcccaatg gagggcgaac cccaactgtt tcgtggagca 420 

gcagataaag ccacttataa aaaagcaatc gacgagttcc tgttgaaaca gaaatag 477 



<210> 758 



302 



<211> 579 
<212> DNA 
<213> B.fragilis 



<400> 758 

tttaccccga ttatgaaatg gatgattttg atttttttga attttttgtt ttgtgcccaa 60 

cttgttgggc aagtatcacg acccgataga aaccttttgc gtggtgagac gtatgtgatt 12 0 

gaggtgccga aaggatggaa acgtccttct gctgtgcatt cttgcaatga tgaacctttg 180 

aaacgcgtta atgggaaata cgaaactaca aagtttatga gagtatattc aaaacgtaaa 2 40 

gatcgttgtg gtgcggtatt gaccattatg gaaatacaaa aatgtgcatc ttttcaggaa 300 

atatttaagg aagacagtat ttgggcatcg acggatacta cgcaggtgaa ggtgatatat 3 60 

aagtctgtca atagtaagaa tggggttaaa aagatggctt ttacttcgta taaggcagag 42 0 

cgtcatccgg aaactaacga attatctgct ttgcaaaagg ctgaatggta tttgcagggg 4 80 

cgtgaaaatg tatattatat cagttttacg tcttgctcat tgtttttaga actgctaccg 540 

cagattaaag atattgtggc gtcgttaaag gaactttaa 579 



<210> 759 
<211> 1458 
<212> DNA 
<213> B.fragilis 



<400> 759 

atgcattgga taatggaaaa tggtgtaatg atgcagtatt ttgaatggaa tctgccaaat 60 

gacggaaatt tatggaaaca attaaaagaa gatgcgtcac atttacatga gattggtgtg 12 0 

acagcagtat ggattccccc cgcttacaaa gccgacgaac aacaagacga aggttatgca 180 

acctacgatt tgtatgatct cggcgagttc gatcaaaaag gaaccgtaag aacgaaatat 2 40 

ggtacgaaag aagaactgaa agaaatgatc gatgaattac ataaaaatca tatttccgtt 3 00 

tatctggatg tagtactgaa tcataaggca ggaggtgatt tcactgaaaa gttcatagtt 3 60 

gtagaagtcg atcccaatga tagaacccaa gcattaggaa aaccgttcga aatacagggc 42 0 

tggaccggat acagcttcca tggacgtaag gataaatatt cagacttcaa atggcattgg 480 

tatcattttt caggaaccgg ttttgacgat gccaaaaagc ggagtggcat cttccagata 540 

cagggtgaag gcaaagcgtg gagcgaaggg gttgacaatg aaaatggcaa ctacgatttc 600 

ttattatgca atgatataga cctggatcat cctgaagtag tcaccgaatt gaatcgttgg 660 

ggaaaatggg tttccaaaga gctgaacctc gacggaatgc gtctggatgc catcaaacac 720 

atgaaagaca agttcattgc acaattcctg gatgcggtaa gaagcgaaag aggagacaaa 7 80 

ttctacgctg ttggcgaata ttggaatggt gatttgaaca cactcgatgc atacataaaa 840 

tccgtgggtc acaaagtcaa cctatttgat gttccattac attataattt attccaagca 900 

tcacaagaag gcaagaatta tgatctgcag aatatcctaa aaaacacatt agtcgagcac 960 

tactgtgatc tggcagtcac ttttgtcgac aatcacgatt cgcaatcagg cagttccttg 102 0 

gaatcacaaa tagaagactg gttcaaacca ttggcctatg gtctgatatt attaatgaaa 1080 

gacggttatc cttgtttgtt ctacggagat tattatggtg tcaaaggaga aaactcacct 1140 

catacccaaa tcattaatat tcttctggat accagaagaa aatatgctta tggcgatcag 12 00 

attgagtatt tcgatcatcc ttccgccatc ggctttattc gtacgggaga tgaagaacat 12 60 

gtcggttccg gtttagtctt tttaatgtct aatgatgaag ccggcagtaa aaagatggat 132 0 

ttgggcgaag aacataaagg tgaaatatgg catgaaataa ccggaaatat tcagcaagaa 13 80 

atcacattag acgaaaaagg aagtggagaa ttttctgtta atacccgtaa tattgctgtt 1440 

tggataaaaa agaattaa 1458 



<210> 760 
<211> 477 
<212> DNA 
<213> B.fragilis 



<400> 760 

atgcataact tttgtttttt tcgttacgca aagatcccgt ttccggatta caggaggatg 60 

aatgtaaggt tactatatga tgaacaaaat cgggacttct ctgttattat aataaaacaa 12 0 

ataactatat ttgccccctg taaacgaggg ctctatattt atcaaaagaa aggaaaaatt 180 

atggaaaaat ttgaagattt aatacagtca caaagtcccg ttttagtaga ttttttcgca 240 

gaatggtgcg gcccctgtaa agcaatgaaa ccgattcttg aggatctgaa acagcaggta 300 



303 



ggcgagaaag cccgtattgt aaaaatcgat gtggacacac acgaagaact agctgtaaaa 3 60 
tacagaattc aggctgtgcc gacttttatc cttttcaaaa agggagaagc tgtctggcgc 420 
cattccggta tgattcaagc cagcgaactg aaaggagtta ttgaacaata cacataa 477 



<210> 761 
<211> 1014 
<212> DNA 
<213> B.fragilis 



<400> 761 

ataatgaatg ttagatgttt tttatggggg 

gaatctgata ggattatgca ttatgctcaa 

agaatacagg ttccttcggt gttattgtat 

ctgatagtat tcaatgaaaa aatggatact 

acttttcaat atggttttgg aacacagggg 

attacccctg tgaaatatca aaagaacggt 

catattagtg tcaagaaaga caaagctatc 

aattgtttta atgacttgat aagtatttcc 

gagaatgaaa aagaatttag gtttctttat 

tatcctgaaa cagaggaacg tttcggatct 

atgaccgtcg ctaagcctga taagagttgt 

ttcagaattt atggtaaaga tggagaatta 

gggcaagaac gtcctgaagt ggatgattat 

gcaacggaca gttatattta tacattaaat 

cggaaaacta ctcctaacat ccaagtattc 

aaactcgatt gttttattaa cacttttgtc 

gcttttgttg aagacgaaga tcatatttat 



attttgttta 


taactgtatc 


aagttgtata 


60 


tttgagcata 


ccataaattt 


gaaatccgat 


120 


ccacggagtt 


tagttttatg 


tgatagtaat 


180 


atgtttcaat 


gcttccattt 


gccggatttg 


240 


cagggaccga 


atgatttcgt 


tctcccttct 


300 


tttgtcatgt 


tagacggaat 


taacctgaaa 


360 


gtacagactt 


cgactttaaa 


ttatggattt 


420 


gatagcagtt 


attgttgtaa 


tggaggtttt 


480 


cctgacggaa 


atcatgaatc 


atggggagaa 


540 


gttttggaca 


ggaatcaggc 


gtatataaag 


600 


tttgtttcgt 


tctaccaaca 


tatacgccgt 


660 


aaaagagatg 


ttattttaga 


tattcttccc 


720 


ttgagattca 


tacatcctat 


aagtgtctat 


780 


ctggatatga 


caacagagga 


aattgagaat 


840 


gattgggaag 


gaaagccact 


tacacaatat 


900 


gttgatgaag 


ttgcaaataa 


gatttatgga 


960 


gtatttaatt 


taccccgatt 


atga 


1014 



<210> 762 
<211> 1050 
<212> DNA 
<213> B.fragilis 



<400> 762 

aaaggatgtg tcagttgtaa tatttgtgac acatcttttt 

cataagataa ttctatatat tatttctgtt ttaacagtgt 

gatgtacctg ataaggtgag tttacaaccc caggtaatga 

atgcctggtg atttattgct gattgacgat tatttagttt 

aacaaatttc tgcatgtaca tcgttcttcc gatggaaaat 

aaaggagaag gtccacagga atttgtaagt cctttaatca 

tgtattgctg ctcatgatgc taacgggaaa accagaggct 

attgtcggaa aagaaccttt tatgtcttta tcagattttg 

aaattggacg aacaactgta tctgactgaa accgaaaatg 

gtgagttcaa atgggaaaaa atctacattt ggggtttatc 

catatgggta catataaaac ttacgataaa gatcgtggac 

aatttttctt atttggcttt gtataaaaag gaaggggata 

cgcatgcctg aaaaagaaaa ctattctgtt gttgatgggg 

gtgatgggag tgagagatat atgcatgact aaagattata 

cgggaagttg atccgttgga tgaaaggact gtcggacgta 

acggtttttg tgtatgatta tgatggtaaa ttactgaaaa 

gtaatgcgca ttgctgctga cggacgaagt aatgctctgt 

gattttgcat tggcgaaata tgatttatag 



gtaactttat 


taaagttatg 


60 


ttacttcttg 


cacaactact 


120 


atgatagtct 


tttgacaact 


180 


ggtctgatcc 


tttctctgat 


240 


atatcggttc 


tatggggcaa 


300 


atcgtttttc 


cattaatcgc 


360 


atttatctat 


tgacagttta 


420 


atcggaatat 


acgaatggct 


480 


gtgagaacga 


ttattttaaa 


540 


cgattcgtga 


agtgaaacac 


600 


tccttgcttt 


tggtcctttt 


660 


attttaagtt 


attatgggaa 


720 


cgattaggtt 


tgatcgtagc 


780 


ttgttactct 


ggagcgtgac 


840 


atgcaagtaa 


atgtccccgt 


900 


ttgtaaattt 


gggcatgcct 


960 


atgtgatagg 


agttaatcct 


1020 






1050 



<210> 763 
<211> 1797 
<212> DNA 
<213> B.fragilis 



<400> 763 



304 



gtttttttcg tatttttgcg aaaaaactta atgaaaatga acacgcactc actatttggt 60 

taccttttta ttgctttatt tagtctttta gttgtatcat gttattcgac gccggatgga 12 0 

gtcatgtcat ctctgtctca agctgagaaa ataatggaat ctcgcccgga tagtgcaatg 180 

gctattttgc aacatatccc aactccggaa actcttcatg gtaaagcgca ggcggactat 2 40 

agcctattga tgacacaggc tatggataaa aactacataa attttacttc agattcgctg 3 00 

attaaatttg ctgttggtta ttatggaggc catactgaag atcttgtagc taaaggaaaa 3 60 

tctttttatt attatggaag ggtgatggaa agccttgata aagtagagga tgcaatgacg 42 0 

ttttatttaa aggcgaaaga tgtacttcaa agcagtgatc agtttaaatt attgggacta 480 

atatcagagg gaataggaac tcttaatagg aaacagaaat tatttgatac tgcattaaat 540 

agctataagg agtctttaac ttattattct ctagtaccag actctctctg tatgacatat 600 

gctaatagga atattggtag agtgttttta tataaaaata ggcttgatag cgcctactat 660 

tattatgata aagcaattta tatttctaat gcaaataaat atgtagctgt agggtcgttg 72 0 

ttattggaat taggagtgat tcatcgttca gaaaaagatt acattggtgc tgaacgatat 7 80 

tttttgacat ttcttgagaa agaaaaaact ccaaataaat tgtattctgg gtatttggca 840 

ttaggaaatt tgtatttata catgaatcgt tttgaagatg cagaacattt tcttatgtta 900 

tgtttggata gccctgatcc agttgttaag agagatgcgt gtgagtgttt atatgattta 9 60 

gagaaagaat caaataaatt taaagaagct gtgatctata aagatatagc ggattcctta 102 0 

cgaatgatga cacaagatat tgatactcaa aatgccatag cagatttgca gggtagatat 10 80 

aataacgaaa aatggcagag ggaaagtcta caatccagta ttgagaagaa gaatattctt 1140 

ttaataagtt cgtttgtggg ttttattgca gtaatggtta ttatttatat ttattataaa 12 00 

tatagaacca atcaaaaact ggttaaggat atcaatgaaa gaattcgtaa aaatgatgtt 12 60 

gacataaaga tgtatcaaag gcaaatactc aattatcaag atttgcaaaa ggaaacattg 13 2 0 

caggattatc gaaatcagat aggagaattg catgggaaaa tgtctgtcct tgaagatcag 13 80 

aataaagcat tatctcttcg tttaacagag aagaagcatg atataccgga aagtgaagcc 1440 

gatgatctct atgctattta tatgcaagca cttcatatac taataatgtt aagagggaaa 1500 

aatatagaga atacttcagg tcagaaattg cttttggatg ccgattggga taagttattt 1560 

catctatcta atgctataca tggtgatttt attacgcgta ttaagaatga ttttcctact 162 0 

cttaccaaac atgatattga aatttgctgt ctattaagat ttggtattga acatgaggtc 1680 

ttaggaagta tttttctgac ggagactgat tcagtgacaa aagctaaaag acgtatgaaa 1740 

aaacgactga atctatctgc ttcggatgat ttggacgttt ttttgctaaa atattag 1797 



<210> 764 
<211> 312 
<212> DNA 
<213> B.fragilis 



<400> 764 

aataacaacg taatggtaaa acacattgta ttatttaagt taagagacga cgttcctgta 60 

gaagagaaac tcgttgtgat gaatagtttt aaggaggcta ttgaagcatt acctgctaaa 12 0 

atctctgtga tccgcaaaat tgaagtcgga ttgaatatga atccgggaga aacctggaat 18 0 

attgcgttgt atagtgaatt tgataatctg gatgatgtga agttctatgc tacccatccc 240 

gagcatgtgg ctgccggtaa gattttggca gagacaaaag aaagtcgggc ttgtgtagat 3 00 

tatgaatttt ag 312 



<210> 765 
<211> 213 
<212> DNA 
<213> B.fragilis 



<400> 765 

agaaatgtat ttggtagagg attttatata ggacaagagt ttatagcata tcagatgctg 60 

aaactgagaa aaaacttcat tgataatcaa aagagacagg gtacgccgtt tgaacggaag 12 0 

tgtaccctgt ctttttatta ttcttctaca atatccagct ccttgatgat gtgtgaggca 180 

cctgcatact tatcaataat aaatagagta tag 213 



<210> 766 
<211> 864 
<212> DNA 
<213> B.fragilis 



305 



<400> 766 

gtctctgcat ttacttattg tgaattagaa tttatatctt tgcaaactct aaaaaataga 
tttatgaaac aattgaaatt aatggtgttg accttaaccc tgttgatggg tactatgttt 
acttcatgta tggattccgg agaaagcggt cctcagcagt gggccggtgt ggtgaaagtg 180 
aatgatagaa tgggttatgt tacattcaca gatgctgccg gtacagagct gatccctact 240 
aacacgattc ctgtaacttt gaatgcaaga atggcttaca tttattgcca ggttgatgaa 
ggtcaggacc tctcaacaaa tcctaagtca attaaaatta cacttttagc agatcctaca 
ggaattgatg ctacagcaat aaccactccg aaagtagaat caagtgatgt gactactaat 
gcacctgttg gttcgttgag ttttgcatca ggatattcaa ctgtggcccc atttcagttt 
agtgaaaata cgattgtatt accagtactt tatcgtgtga aaaatgtgac tactacagaa 
gatattaaaa atgagcttgc taaacatact tttactcttg tctgctatac agatgatatt 
aaatctggtg ataccatttt gaaactttat ttacgctata aagttgagga tgaacctgct 
gctattgctg agcgtgcaac acgtacttcc agctttaagg cttatgaaat cagccaaatc 
ttaagagaat atactctgaa gagtggacaa actaaacctg ctaaaataac tatagtagca 
cagcaaaatg agtacaacaa taagttggaa gatacttcta ctatagagaa ggtatatgaa 
atagaatata aaactgcgga ataa 

<210> 767 
<211> 393 
<212> DNA 
<213> B.fragilis 



60 
120 



300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
864 



60 

120 

180 

240 

300 

360 

393 



<400> 767 

aatatgaaat tgcgtgttat tttatcgtta attgtggtat tgttcattgg acagtccatg 

tgtgctatgt caactcaaat tcttcgtaga cccattattt tagatggtga aattattgaa 

gaagaagcga gtcgttccat caacccgttg attcctattt ctgcagatat tgatggcact 

actttattta ttgaatttac aaaggttata ggtaatgtgg atattacagt gaaagatgat 

accaaaaaag aagtttattc atcttctgtg gatgtaactg ctgctaatca agctacttcg 

ttctctattg ccgatttagc accgggaact tacctgcttg aatttaccaa ttcgaatggc 

ggttatgtat atggacaatt tattgtagaa taa 

<210> 768 

<211> 714 

<212> DNA 

<213> B.fragilis 

<220> 

<2 21> unsure 
<222> (613) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 768 

atgatggctt cgatgctttt gaaagcgttg tcgtcgatat cgatatcttt gatggccttg 
cccacacccg gaatcattga agcgagctct ttcagattac ccattttctt aatctgggac 
atctggctga ggaagtcgtt gaagtcgaac tggttcttgg caatcttctt ttggaggcgt 180 
ttagcttctt cttcgtcata ttgttcctgt gcgcgttcca ccaacgaaac gatgtcaccc 240 
atacccagga tacggtcggc catacgggca gggtggaact ggtcgatggc atcgagtttc 3 00 
tcgcccgtac ctacaaactt gataggtttg ttcactaccg aacggataga gagggccgca 3 60 
ccaccgcggg tatcaccgtc gagcttggtc agcaccacgc cgtcaaagtc gaggcgttcg 
ttgaactctt tggctgtgtt gaccgcatct tgtccggtca tagagtctac cacgaacaga 
atttcgttgg gctggatggc ttctttgatg gcagcgatct cattcatcat ctgttcgtcg 
actgccagac gtccggccgt atcgacaatt accagatcgt atcccttggc acgtgcttct 
ttgatggcat tcntggcgat ggaaaccgga tctttgctgt cgatctccga gtacatcggt 
acgtcaatct gttcggccaa tacgcgaagc tgctcgatag ctgccggacg gtaa 

<210> 769 
<211> 237 
<212> DNA 



60 
120 



420 
480 
540 
600 
660 
714 



306 



<213> B.fragilis 
<400> 769 

tgcggttttc cgggtgtttt ctttttgttg gcaaagaaga agtgttttgt gtttaaaaga 60 

cttgttcttt gcaaacaaaa cacttgtttt aacactctgt taaatcaagc attttatata 120 

ctgaaagtga agtcgtttaa gttgttgata cagagtggat cgttgttgtt tctagttttt 180 

ctgctatcct gcttgtccgg gtgtgatgat agatggtatt tatcaaaacc acaatga 237 

<210> 770 
<211> 1149 
<212> DNA 
<213> B.fragilis 



<400> 770 

tattcatcgt 

ctgctattgg 

actgtcgaca 

cccacacagg 

ggtgacgaag 

tttgatctgg 

aaaatacgtc 

catctcgata 

gcttccatac 

gcagccttta 

ccctggcacg 

cagtataagg 

ctgctttatg 

tatccgggag 

caatatatgg 

gataagccta 

acacagactc 

aatgcacgtg 

gactttgtga 

tataaatag 



taaataaacg 
caggattggg 
ctctgcgaac 
ggattatgtt 
atcgcagtga 
gccacattga 
aggagacgat 
accccttgac 
tgcccggtgg 
tgaatacttt 
agcataccgg 
ctctttggcg 
cttattcacc 
atgatatcat 
agcaattgga 
tagccattac 
tctatccggt 
aaagggtaaa 
agttctaccg 



aattatgata 
tgcttgttca 
ggcggaaaca 
tggtcatcat 
cgtgaaaagt 
actggaaaga 
taatcaatat 
cggtaaagat 
tgtacatcat 
ggagacggaa 
cagttggttc 
gatgacgcat 
gggatcggaa 
cgatctggtg 
taagtcgctt 
cgaaaccggt 
aatcagcaag 
ccactattat 
tgaaccgaaa 



aagagaataa 
ccttccggaa 
gtgaatttac 
gacgatccgc 
gtgtgtggtg 
gagaaaagtc 
aaaaggggag 
gcgtgggatg 
gcgaaattta 
gaaggtacaa 
tggtggggac 
gatcgtatgc 
cccaaagatt 
ggctttgaca 
gctatcctga 
ttcgaggcta 
tatcctatca 
gctccttatc 
actctgtttg 



agatattagc 
agaaaacagg 
tgaataatct 
tttacggtgt 
attatccggc 
tggataacgt 
gagtggtttc 
tgagtgatac 
taagttggct 
aaataccggt 
aaaatctttg 
atgcccgggg 
cgactgctta 
cctatcagtt 
ctgaagtagg 
ttcccgattc 
gttatgtgtt 
ccggacaggt 
tgtcggacgt 



tacaggtgca 
agcggattcg 
acggaaggtt 
cggctgggaa 
tgtcatgtcg 
gccgtttcgc 
ttttagctgg 
gacggttgta 
cgatgctgtt 
tattttccgt 
cacggccgac 
agtaaagaac 
tctggagcgt 
cgaccggaca 
taaggcgcac 
tgtctggtgg 
ggtgtggcgc 
gtccgccgat 
gaagaacctt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1149 



<210> 771 
<211> 1560 
<212> DNA 
<213> B.fragilis 



<400> 771 

tctcaaagac 

tgccgccgcc 

acagccccgt 

tctatttttt 

ggaatctgcc 

gcgcaacagg 

gtacatctgg 

aagcaattgg 

ataaacggaa 

ctcacagcgt 

gaaaagacca 

gtttgtacgg 

ctgccggtag 

gccaataaac 

cgttatgaca 

actttcgatg 

ccgacaataa 

ccccgcatcg 



ctttggctac 
gaatgtctgt 
tggctgactt 
atcggagatt 
tgctatggct 
catccgaact 
acgataacat 
aacaacaagc 
tgactattta 
tcaatccgga 
ctcccccgac 
ggaatttcaa 
tcattttggt 
cttttcggga 
agcgtaccaa 
aagaatcagt 
atcccgaacg 
cccaacgtag 



aatcgtggac 
tcagtacgtt 
ctatctccct 
tcttaataaa 
attgctggca 
tctcgaccga 
ccggaaaatg 
aggtaagtat 
ttattgcgac 
cggaaaggtg 
gacatcggta 
gcttcccggc 
acatggctcg 
cctcgcgtat 
agtatatgga 
ggatgacgcc 
gatctacatt 
cgataaagtt 



cgacctgatc 
tgcacgcatc 
gctgatttac 
caagttatga 
gtaactcctg 
ctgattgcag 
ctttccgtag 
cagtcgcatg 
gttaagttcg 
aataccattc 
caagataaaa 
acactgactc 
ggggccagcg 
ggactggccg 
gccgacagcg 
ctttcggcaa 
ctcggacata 
ccggcaggga 



atctatctga 
cgtgcggtcg 
ggaatccttt 
caaagaaaaa 
tattgcaagc 
gccaaggaga 
agatgttgaa 
gagagtggaa 
aacgcttacc 
gtttcgtacc 
taaaagagac 
ttcctaaaaa 
accgggacga 
agcgtggaat 
cacctgcagg 
ttaaacttgc 
gcctgggagg 
ttattctgct 



tagtcctgtt 
gagagcaaaa 
tcctcggaac 
tctactcaaa 
ccaggatcgt 
cagtgtgtat 
cggactgttc 
caccgaacca 
attgcgtttt 
tgttccggct 
agacatacag 
cggcaaagat 
aacggtaggg 
agccgtgatc 
caaagaaatt 
ccgttccata 
caccttggct 
tgccggtgca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 



307 



gcccgtccac tcgaagatct gtttataagt caggtgaagt ttctcgcctc tgcactccca 1140 

tcggctaaag atattgaaaa ggaaatagcc gaattacaga aacaagtgga caacgtgaaa 1200 

aggctgggta cagacacatt cgacattaca actcctttgc ccatgaatct ctctcaagct 12 60 

tactggatgc ttgccaatca atataaacct ttggaagtgg tccgaaaact gactctcccc 132 0 

atacttgtcc ttcaaggcga acgtgattat caggtcacca tgcaagattt cgaattatgg 1380 

caatccgccc tggcaaagca tccgaatgcg atatttaaat cttatccccg actcaatcat 1440 

ctgtttcagg aaggagaagg gaagtcaacc cctcttgaat acagccgtcc ctcctctatt 150 0 

ccttcttacg tgacggatga catcgcagct ttcatcaacc gacccaagcc cggtaactga 1560 



<210> 772 
<211> 1569 
<212> DNA 
<213> B.fragilis 



<400> 772 

tatataatta cgatgaataa gaaaataatt atccccctcg cactggctcc attggctgcc 60 

ccggctctgc aagcccagca ccagcagccg aacggacgta cggacacacg ccccaacatc 12 0 

attctcttca tggtagacga catgggctgg caagatacat ccctgccttt ctggacccaa 180 

aagacacact acaacgaggt atacgaaact cctaatatgg agcgccttgc caaacaaggt 240 

atgatgttca cccaagccta tgccagcagc atcagttcgc ccacccgctg tagcctgatt 3 00 

acaggaacta acgccgcgcg tcaccgggtg accaactgga catatcccaa aggccagcaa 3 60 

acagaccgcc cgagcgatgt attcaatgta gcggactgga atgtaaacgg ggtttgccag 42 0 

gttcccaata tcgaccacac gtttcaggca acctcactgg cagaaatcct gaaagacaat 480 

ggctaccaca cgattcattg tgggaaagca catttcggcg ccgtcaacac tccgggagaa 540 

agtccttatc acatgggctt tgaagtcaac atagccggac atgcaggagg cggattggca 600 

agctacctgg gtgaaaataa ttacggaaac cggacggacg gtaaaccgaa tccctggttt 660 

gccgttccgg gattagagaa atactgggga accgatactt tcgtcagtga agctctgacg 72 0 

ctcgaagcta tcaaagcact cgatcatgcc aaagaataca atcagccttt cttcctctac 780 

atggctcact acgctatcca tgttccgatc gataaagaca aacgcttcta tcaaaaatat 840 

atcaataaag gattgactcc caaagaagct gcttatgcgg ccctgatcga aggtatggac 900 

aaaagtctgg gtgacctgat ggactggctg gataaaaacg gagaagcaga caataccatc 960 

gtcatcttta tgagcgacaa cggcggtctg tcgagcgaac cggaatggcg tgacggaaaa 102 0 

ctgcacacgc agaactctcc tctcaacagt gggaaaggat cggcttacga aggcggtgta 108 0 

cgcgaaccga tgatcgtccg ctggccggga gttgtaaaac cggataccaa atgtgataaa 1140 

tatttaatta tcgaggactt ctatccgacc atactcgaga tggcacaaat caaacattat 12 0 0 

aagacggtac agccgatcga tggaattagt tttatgcctc tgctgacaca taccggtgat 12 60 

ccgtccaaag gacgcagcct gcactggaac ttccctaatc attggggaaa cgacggtccc 132 0 

ggcatcggcc cgacctgtac cgtacgcaaa ggtgactgga agttgattta ctactatgac 13 8 0 

agcggtaaaa aagagttgtt caatattccg gaagatatag gagaaaagaa tgacctggca 1440 

gccctacatc cggacattgt gaaaagttta tctaaagagc tgggtgacta tttgcgcaaa 1500 

gtaggcggcc aacgcccttc attcaaagca accggaaagc catgcccatg gccggacgaa 1560 

atcaaataa 1569 



<210> 773 
<211> 321 
<212> DNA 
<213> B.fragilis 



<220> 

<221> unsure 
<222> (304) 

<22 3> Identity of nucleotide sequences at the above locations are unknown. 



<400> 773 

aaacgaatta cgaatggatg ctcctgtgga aagctgcatg gctacccatt cttaatcccc 60 

caatacgcaa tagaaccgca atttgcaatc acaatgaaga taattattgc tggtgccgga 12 0 

gctgtaggca cccatttggc taaattactc tcacgcgaga aacaggacat catcctgatg 180 

gacgatgacg aagagaaact aagtacgttt agttctaact tcgacctgat gactgttacg 2 40 

gcctctcctt cgtccatatc aggactgaaa gaggtaggca tcaaagaggc agacctcttt 3 00 



308 



attngcggtc actcccgatg a 321 

<210> 774 
<211> 1410 
<212> DNA 
<213> B.fragilis 

<400> 774 

aataagaaaa ccaagaatag aatgactgcc atgattacac tgaaagagaa gatcggttac 60 

ggactgggcg atatggcttc gtccatgttc tggaaactgt tcgggtccta tctgatgatt 12 0 

ttttataccg atgtattcgg cttgcctgct gccgtggtag gaaccatgtt tctgattacc 180 

cgggtatggg attcggcttt cgatccgatc gtgggagtga ttgccgatcg cacacagacc 240 

cgctggggga aatttcgtcc ttatctgctg tatcttgccg ttccttttgc actgattggt 3 00 

atttttactt tcaccactcc ggagttgaat gataccggaa aactggtcta tgcctacatc 3 60 

acctattctt tgatgatgat ggtatattcg gctatcaatg tgccttatgc ttcactgctg 42 0 

ggagtgataa gtcctgaccc gaaagaacgg aataccctgt ccacttaccg tatgactttt 480 

gcctatatcg gcagttttat tgctttgctg ctcttcatgc cgatggtcaa cctgtttggt 540 

ggtgcagaag acgagcaacg gggatggatg ttgagtgtgg tggtgattgc tgtgatgtgt 600 

gcggctctgt tttatctttg tttcgccttg acacgtgaac gggtaaaacc gatcagggaa 660 

gtacaaaact ccctgaaaga cgatttgaaa gatttgcttc acaaccgtcc gtggtggatt 72 0 

ctgctcggag ccggagtggc agctttggta ttcaattcta ttcgtgacgg agctacggtt 780 

tattacttca agtactttgt tgtggaagag gattactcca cggtttcctt ctttggcgtt 840 

tcttttgtgc tgagcggcct ttatctggca gtgggacagg ctgccaatat tgtgggagtg 900 

attcttgcag ctccggtcag taaccgtatt ggcaaaaaaa acacatacat gggagccatg 960 

agtctggcta ctctcctttc cgttatcttt tattggtttg ggaaaggaga catcaccctg 102 0 

atttttgttt ttcaggtgct gatcagtatc tgtgccggaa gtattttccc tttgctctgg 1080 

tccatgtacg ccgactgtgc tgactattcg gaactgaaga ccggcaaccg ggctacaggt 1140 

ctgatctttt cgtcttcgtc catgagccag aagttcggtt gggctatcgg aagtgcactg 12 00 

accggttggt tgttggctta cttcggcttt cgtgccaacg aagtgcagag tgtagaggct 1260 

attcatggga ttaagatgtt tctcagctgg ttgcccgctg tcggaacagt tctgtccgtc 132 0 

gttttcatca gtatgtatcc gctgtcggag aagaagatga gagaggtgac ttcagagttg 13 80 

gagaagagaa gaaaggctat tcaatcataa 1410 

<210> 775 
<211> 1995 
<212> DNA 
<213> B.fragilis 

<400> 775 

aaagcaagaa caatgaaaaa gaatctatta tatattttta gtttagcaag tgttttatgc 6 0 

tcttgcaatg actttctcga caaagagcca ctagatgccg tacctaccga caaatatctt 12 0 

ttggcagaaa gcgatttagc agcctattcg gctaatctat atgatcaact tccatcccac 180 

actccaggcc aatacagtat gggagtattt gcaacagaca ataatagtga caaccaagca 2 40 

gcaagtaatc caaacggttc atttgtaaag ggagaaacac gtgtggctca aagtggaggt 3 00 

gcttgggatt ttgggaaaat ccggaatgtc aattatttca tcaataaggt acgtccccga 3 60 

ctggaagccg gcgaacttag tggagtagaa gctaacaata tgcactatct gggagagatg 42 0 

tatttctttc gcgcttatat ttactttact aaattagttg cactcggtga tttccctatc 480 

ttaaaacatt ggatttcgga agattatgaa acagttagag aggcaagtaa acggcgccca 540 

cgcaatgaag tagcacgttt catcatacaa gatttagatt ctgcttacta ttatatgaaa 600 

gcaaccccac caatgagcaa tcgcctaacc aaagactgtg ctgccctcat gaaaagtcgt 6 60 

gtggcattat ttgaaggtac ttgggaaaaa taccacaaag ggaccgcacg tgtaccagga 72 0 

ggtccgggat ggccaggagc aaacaaagat tatttaaagg acttcactat caatattgat 7 80 

tctgaaatta aatacttcct gacagaagct aaaactgccg ctcaaatagt agctgataaa 840 

tacactttat ttaacgatta tccgtcgtta ttcaacagcc aatcattagc taacgcttcg 9 00 

gaagtgttat tgtggagagc ctacgacgcc agtttaactc cggcagtcaa ccattttgtt 9 60 

gtcggttaca tccaacgcaa tggaggtggg aataccggat ggactcgtag tatgatgcaa 102 0 

agttatttga tggaaaatgg cttgccaata tacgcaaaca attctggtta tcaaggagat 1080 

aaaacttatg aagcagttgc aaccaatcgt gatccacgac tgatttataa tactttatta 1140 

cctggagatc tcttatctga aggaggaagt aacattgaat atctagtcaa aggatatggt 12 00 
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tattattatc gtgcaccaat tgtacttgga caggacgaaa acaaatgtcc caccggctat 12 6 0 

tcagtaaaaa agggattagc aacagatgcc gcacaaggac ctacactccc atcaactaca 1320 

gcctgtgtca tattccgtgc agcagaggca tacttgaatt atatggaagc tgattatgaa 13 80 

ctgaataact cgcttgatgc caacagttcc aaatactgga aagctttacg aaatcgagca 1440 

ggaatggata ccgattttca aaaaacaata gacgctacag atctgagtaa agagatcgat 150 0 

tttgcccgct attcaggttc tgaatttgtt tcaaccactt tgtataatat tcgtcgggaa 1560 

cgtcgcatcg aatttgcggc cgaaggatta cgcctaaatg atctgaaacg ctggcgtgca 162 0 

ttagacatga tgcaaggtta tcacgtagaa ggattcgatt tatggagtga aaattatcaa 1680 

cgttacaaaa ctcctagccc aataccagtt gcagacgtca ctctctctgt cattaatctg 1740 

attgaatcag gtaacaataa tgctaatgta tcagctaaat cagaaagtcg gtacttacgt 1800 

ccttaccgga tcaatacaaa caacattgca tacaatggct ataattggaa ccaaaataaa 1860 

tatttaaatc caattgcttt tgaccacttc cgtctgacga cagcagaaga aggatcaacc 192 0 

gactatacaa cctctacgat ttatcaaaat ccaggatgga agatagaaac gagcagtctc 19 8 0 

cctgaaggag attag 199 5 

<210> 776 
<211> 651 
<212> DNA 
<213> B. fragilis 

<400> 776 

atgataaaag ctatgaacaa cctcaatgaa ttatatgaag ccattttggc cggtaaattg 60 

gaacaggcag tcagtgttac ccgggaagct gttgccggag gagcagcacc ccaggaaatc 12 0 

attaatgaat atatgattaa agccatggaa gccattggag cacgttttga atcgggacaa 180 

gtgtttgttc cgaacctctt gatgagtgcc cgtgccatgc gtggtgccct cgatatactc 240 

aaaccactga tgcaagggca ggtcaattcg tatatcggtc ggattgtgat tggtacggta 300 

aaaggggatt tgcatgatat aggtaaaaac ttggttgctt cgatgtttga aggatgtggg 3 60 

tttgaagtca tcaatctggg agtggatgta tcgagtgata aattcatttc tgcggcattg 420 

gaaaataagg cagatattat ttgcatgtcc gcactgctca ccactaccat gaattacatg 480 

aaggaagtga tcgatgccct tgaaacctcc gggttgaggg gaaaagtaaa agtaatggta 540 

ggaggagcac ctgtcagcga tgcctttgcc aaatctatcg gtgccgatgc ctataccagt 600 

aatgccaatg cagccgtaat aatggccaag aagttgataa acgcctgttg a 651 

<210> 777 
<211> 1914 
<212> DNA 
<213> B. fragilis 

<400> 777 

attatggaat atacaattct gatccttctt ctccccttcc tctccttcct ggcattaggg 60 

ataggaggca agtggatgag ccaccgaaca gcgggcacca taggcacgct ggtattggca 12 0 

gcagtgacag tactctcgta cgtcacggcc gtacattact tctcggcacc ccgtctggca 180 

gacggaacgt ttgccacact cattccttat aactttgaat ggcttccgtt cacggaaaca 240 

ctaacgttca acctgggcat tttgctcgac cccatctcgg tgatgatgct gatcgtaatt 3 00 

tctacagtca gcctgatggt acatatctac tctttcggct atatgaaagg cgaacgggga 3 60 

ttccagcgct actacgcatt cttatcctta ttcaccatgt ctatgctcgg actggtagtg 42 0 

gcaaccaaca ttttccagat gtacttattc tgggagttgg taggtgtatc ttcttacctc 480 

ctgatcggtt tctactatac ccgtccggct gctattgccg ccagtaaaaa agcattcatc 540 

gtgactcgct ttgccgacct gggcttcctg atcggtatcc tgatatacgg atactacgga 600 

ggtactttcg gatttacccc cgacacagtt tcaatgttga gcggtggcgc cggtatgttg 660 

cctctggcac tcgggctgat gtttgtcggt ggtgccggca agagtgccat gttcccgctg 720 

catatctggt taccggatgc catggaaggt ccgactcccg tcagtgcact gattcatgcc 7 80 

gctaccatgg tagtagccgg cgtttacctg gtggcacgca tgttcccgct tttcatcgaa 840 

tatgctccgg acgtactcca cctgattggt tgggtaggtg ctttcaccgc tttttatgct 900 

gccagcgtgg cttgcgtgca gagtgacatc aagcgtgtac ttgctttctc gaccatctca 960 

caaatcggat ttatgatcgt ggcactgggt gtttgtacct cttccgatcc gcatcacgga 102 0 

gggttgggat acatggccgg catgttccac cttttcacac acgccatgtt caaggccttg 1080 

ctcttcctgg gtgcaggcag cattatccat gccgttcact ccaacgagat gtcggctatg 1140 

ggaggattac gcaaatacat gccgatcacg catatcacct tcctgatagc ttgtctcgcc 12 00 
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attgcaggta ttcctccgtt ctcgggtttc ttctccaaag atgaaattct ggcagcttgc 1260 

ttccagtata gcccgacgat gggttgggtg atgaccgtca tcgcagctat gaccgccttt 132 0 

tatatgttcc gtctctacta cggcatcttc tggggtggca cagcaccggg gcaaaagtcg 13 80 

acaagcgatg gtacaagcca cgtacatact ccccacgaat ctcccctgac catgactgtt 1440 

ccgttaatct tcctggccgc cgtcacttgc gtggccggtt tcattccttt cggacatttc 1500 

atcagctcca acggtgaatc gtataccatc catcttgaga catcagtagc cgtcacaagt 1560 

gtagtgattg ctgtggcgtc catcgtcctg gccacttgca tgtacctgcg tcagcagcaa 162 0 

cctctggcag ataaacttgc caaacgtttt gccggactgc accgtgcagc ctatcatcgt 1680 

ttctacatcg acgaggtgta tcagttcatc acacaccgga ttatcttccg ttgtatctct 1740 

acaccgatcg cctggttcga ccgccacgtg gtagacggat tcttcaactt catagcctgg 1800 

ggtacccatg ctacaagcga tgagatacgg ggattgcaaa gcggacgtgt acagcaatac 186 0 

gcttatgtat tcctgctcgg agcgctgata cttatcttaa tattaatctt ataa 1914 

<210> 778 
<211> 1320 
<212> DNA 
<213> B.fragilis 

<400> 778 

ataaaatatc tgatgattat gaaaatccta tcgactatcc 
tttggggcgt gcacttctcc tcaggtttct cctgatccct 
cgtctgacgg tgaatggaaa cccctattat tatataggaa 
attttggggt cacagggaca gggaggtaac cgggagagat 
ttgaaggctc ttggtattaa caatttgcgt gttcttgttg 
attccgacga aagctgagcc tgcacttcag gtggaagccg 
tttgacgggc tcgatttctt cctgtcggaa gttgataaac 
ttcctgaata acagctggga gtggtcgggc ggatattccc 
catggtgaag tgcctatgcc gaatgtagcc ggatgggatg 
caatatgcta agtcggaaaa agcacaccat ttgttccggg 
aatcgtgtca atcggtatac tggaaaaaaa tatagtgaag 
cagataggta atgaaccccg ttcgttcggt gaggacaata 
attgccgatt gcgctgctct tattaaatct atggattcta 
tcggaaggaa tggccggttg tgagggggat ttgtcacttt 
gcgaatgttg attatactac gattcatatt tggccgaata 
aaagatattc cgggtaccat cgggcaggca atagaaaaca 
catgtgcagg aagcttttaa gataaacaag ccgctggtac 
agagacagtg tgaagtttac ttcgaatact tccactgttc 
gctgtgtttg atatcgtcga aaagcatgct gccgaaaagg 
ttctgggcat ggggtggatt tgcggaacct caacatctct 
tatatgggag atcccgggca ggaggaacaa gggctgaatt 
acgataaata tgataaagga ggcggtaagt gatattaacc 

<210> 779 
<211> 1191 
<212> DNA 
<213> B.fragilis 

<400> 779 

ttcagacatc atatggacga gattcttaaa caagaaatgc agaaagagct tactacccgt 60 

attcttcctt actggatgga acggatggta gatcaggaga acggtggatt ttacggacgc 12 0 

atcaccggac aggaggaatt aataccccgg gccgataaag gggctattct gaatgcgcgt 180 

attttatgga cctattctgc tgcctatcgt ctgctgggta gagaggagta caaagagatg 240 

gcaaaccgtg ccaaacgata ccttatcgac cacttttatg attccgagtt cggaggggtc 3 00 

tactggtcac tcaattatag aggtgagccg ctggatacca agaaacagat ttatgccatc 3 60 

ggctttgcca tttacggact gagcgagttc catcgggcta ccggagatcc ggaagcattg 420 

atgtatgccg tccgtttatt caatgatata gagtcccaca gctttgatgg gctgaagaac 480 

ggttattgtg aagcgcttac ccgtgaatgg aacgaaatag ccgatatgcg cctcagcgag 540 

aaagatgcga atgaacgcaa gaccatgaat acccatctgc atatcctcga accttacacc 600 

aacctgtacc gggtctggaa agatgcacgg ctggaacgtc agctctacaa cctgatagga 660 



tattaacctt 


gttgattgtg 


60 


ttgtccgtgt 


gtcaaacgga 


120 


ctaatttttg 


gtatggagct 


180 


tacttcgtga 


actggattat 


240 


gagcagacgg 


aaaagatggg 


300 


gtgtgtataa 


tgatactatt 


360 


gggatatgta 


tgccgtactt 


420 


agtatcttta 


ttgggcggga 


480 


ctttttcgaa 


ttatgtggca 


540 


atcatattac 


tcacgttgta 


600 


atcctgcaat 


tatgtcttgg 


660 


aaaagagttt 


tgcagcctgg 


720 


accatctggt 


ttctattgga 


780 


ggacttctat 


ccatgccgat 


840 


attggggatg 


gatcgataag 


900 


cctgctctta 


tatcgatatg 


960 


ttgaagagtt 


tggtttaccg 


1020 


agcgggatcg 


gtattacaga 


1080 


gtgttttcca 


aggatgtaac 


1140 


tttggcaaag 


gggagatgac 


1200 


cggtttatgc 


aacagattcg 


1260 


agataattca 


gaaacaatga 


1320 
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ctttttacag agaagatact ggataaggac acatcccatt tacaactctt tttcgataac 72 0 

gactggcaaa gcaaataccc ggtcgtctct tatggacatg atatcgaagc ctcatggttg 7 80 

ttgcatgaag ccgcccgggt attgggagac gccggactca ttgcggagat agaacctgtt 840 

gtaaagaaga tagctgcggc tgcatccgaa ggacttacct ccgacggagg aatgatatac 900 

gaaaagaatc tcactaccgg acacatcgac ggcgactacc attggtgggt acaggccgaa 960 

accgtagtcg gatactataa cctgttccga tatttcggtg atcgcggggc tttgcaacat 102 0 

tccatcgact gctgggagtt tattaaacga catttgactg acgatgtgca tggcgaatgg 1080 

ttctggagcc ttcgtgccga cggtagcctg aaccgggatg atgataaggc cggcttctgg 1140 

aaatgtcctt atcataacgg acgtatgtgc atcgagctgt tgggcgaata a 1191 



<210> 780 
<211> 1809 
<212> DNA 
<213> B.fragilis 



<220> 

<221> unsure 
<222> (1138) 

<22 3> Identity of nucleotide sequences at the above locations are unknown. 



<400> 780 

attgttttcc aaactttttg tacttttggg cacaatttac cgaatatgaa aaacaagcac 60 

gtcaaaagaa taaacctgct tttgtccatc ttattgggag caggtgtctt cattttcttc 120 

ggagtatact actcctacca tctgcattat caggaacagt tccagatgtt tctctttaca 180 

tccgactatt ttgtcgaaca agtatcccat cccgggggaa tggcggacta tctgggaggc 240 

tttctcaccc aattctatta ttattcgtgg gcgggagccg ctatcttgac cggtgcaata 3 00 

gggggtattc acaggttgat ggtttggatt gcaaatcgcc tgggcggaca cccggcatgg 3 60 

tatccgctta ctctgttacc ttctttatgt ttctttattc tgttctgcga cgaaaatttt 420 

cttctttccg gagccatctc tgtgggaatg gtgctgggag cactcatcgg atatacattt 480 

attgagaata ggcagatacg cctgatttat tggggagtcg gcattccgct gctttatctg 540 

ctggcgggag gatgtgcatg gcttttcatt ccattgatat ggataactga gttttgccgg 600 

tttgccggta ggcgtctgcc ttggtggatt ctggtgggag gtacattggg aattgccgga 660 

gtgacctatt ggatatcctt ggctgtattt ccatatccgg ccgaccgcct gttgtgggcg 72 0 

atcggaagtt atcgttttcc attagtcttt ccacaaatgc aggtgatagc ctggctggct 780 

gtgattctgg taccgttatt ggtagctcgt ttgcccgaga agatgacttg gagatactac 840 

tcgggagcat gggttctgca atttatcctg atgttgtttg ttttgaatct gtatggcaaa 900 

tacggtatcg ggttgaataa agaggaagtg atggggtatg attatcatgt gcgcatgcaa 960 

gaatgggatg aagtgattgc gatggcggaa aagaaagcac ccgatacacc gatgtcggtt 1020 

tcttgtctca atttggcttt agcaatgaaa gggcagctgc gggaacgtat gtttagtttt 1080 

taccagcggg ggaaggaagg actactgatg tcatttgtca acgacttcac cattcttntg 1140 

gtagccggtg aaccttatta ttacctggga ttggtcaatg tggcgcagca gtttgtcttc 12 00 

gaggctatgg aggccgttcc ggattaccgg aagagtgtgc gttgttttaa aaggcttgct 12 60 

gagaccaatt tgattaatgg caggtacgaa gtggcccgga agtatctgcg tatcttgcag 1320 

cataccctct tttataaaga ttgggctact gaaactctgg cttgcctgaa tgatgaagac 13 80 

cgggtcaatg cacatccgga atacgggaga ttgaggaggc ttactccgcg aaccgatttc 1440 

ttctttaacc ctgacctgcc ggagatgaca ttggagttcc tgcttcatgc aaatccccgc 1500 

aaccggatgg cgtatgagta tttgatggct tgtacactct tgaagaagga tgtgggacgt 1560 

tttgtgcatt attatccttt gggagccgat ctgggatatt cttccgttcc caagggctat 162 0 

caggaagcgt tgttgttcta ttggcttatg agcaaacaca ctgcaacaga tacaattccc 1680 

tggaagatag acccgcagac agagaataga ttgagagaat acgcgcagat tttcacatct 1740 

gcccgttcgg cagatgcatt gtctgcccga ttcggagata catattggtt ttatgcagac 1800 

tttagataa 1809 



<210> 781 
<211> 777 
<212> DNA 
<213> B.fragilis 



<400> 781 
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atttacttac aggaaaaatt ccatgttgca tataatcggg gaaatgttat gacgttctgt 60 

cggcattatt ataatccggt gaccggagga aatttttatg atactacgca agtggttcgt 12 0 

catatcttgc cgggaggttc ttatcatgct accttcaaag ccgatttgaa gatcattgct 180 

gattttgcac acaatgcaaa gggcgatgac gcagagttga ttccgatcat attccgtcct 240 

tggcatgagt ttgatggtaa ttggttttgg tgggcaaaaa atcattgttc ggttgaagaa 3 00 

tttaaaaagt tgtatcggtt tacagtcact tatctcagag attctttaga ggtgcataac 360 

tttttatatg cattttctcc ggactgtggt ttcactactg aggccgaata tctgaaacgt 420 

tatccgggag acaaatatgt agatgttgta ggtatggata attattggga ttttcgtccg 480 

tatgggggag atacctccct ggtagttctg aaagcccgta tccttacgca atatgcgcaa 540 

aagcatggaa acctttctgc cattactgag tcaggtacgc agacacgtga ttcattgtgg 600 

tatacacaat tgttatctat tctgcgttcg gaaggggtag ccttgaatta tgtatgcact 660 

tggtcggggt tttctcctta taaaggacat ccggcagcag ccgatttttg tcggtttaag 72 0 

agggacactt tggtgctctt cgctgatgaa attcctaatt tttatacttg gcactga 777 

<210> 782 
<211> 1197 
<212> DNA 
<213> B.fragilis 

<400> 782 

agcaccggac agtcctgctg gcccggtgct ttacctgttt ttcaaactat ggacagatac 60 

agactaaaga ttatcgcatt gacagcactg gtgtgcctga ccggcagcag ctgtacggac 12 0 

gacgagaaca acggacaggg caataacatt atttatggtg agaatatcat cggaaacgga 180 

gaacagacct tcgagataaa agatcatcaa tacctgaaac ggggcactta cctgatgaag 240 

ggatggtgtt acgtcactta cggttcgacg ctgaccattg aagcgggcac cgttatcaaa 30 0 

ggagacaagg aaacccgcgc cgccctgatt gtagaaccgg gcgggaaact gattgccagg 360 

ggaacggtag atgctcccat cgtattcacc tccgaaatgc ccgccgggaa acggaaaccg 42 0 

ggtgactggg gaggattgat tttatgtggt tatgcccgga acaatgaaga catcatgcag 480 

atagagggag gcccgcgtac catgcacgga ggtccgaaca acgcggataa ttcgggcgtc 54 0 

ctgagctacg tccgtgtaga gtttgcggga tatccgttca agaaaaacca ggagatcaac 600 

ggcatcacct tcggttcggt aggaaacggc acgcaaatag accacctgca agtatcgtat 660 

gccaacgacg atgccttcga atggttcggc ggaacggttc atgcggaata tctggtggcc 72 0 

tatcactgct gggacgacga tttcgacata gacaacggct atagcggcac atgccggcat 780 

ctgctgggca tccgccatcc gcgaatagcg gacatcacag gttctcatgc cttcgaatgc 840 

agcaataatg gcacgaacac tcctgcgaca cctaccacag ccgccacctt tgaagatgtc 900 

acgatctatg gccctgcctc aggggatgcc tcattcgtaa atcatcccga ctttatcaat 960 

ggcggcggcc tccggcctga gaatgaaagc atgctcgggc tattcggcgc ggcactgtat 102 0 

atgggcaaca acacatcggt gactttccgg aactgccgga tcagcggata tccgtcggac 1080 

atggagggca caccggcctc agcggataac gtagtattca gcgaaaggga agaaaccggt 1140 

tatccggagt ggacccaagg atggtgcaac ttcaacccgc aagagacaga gtattga 1197 

<210> 783 
<211> 1134 
<212> DNA 
<213> B.fragilis 

<400> 783 

aagatgagta agaatatgga atacagaaaa catagaatag agtatttaag gactactgtt 60 

gaatattccc tttttggggg tgagggagga acgagagagg cacatttgat gtttcatgtc 12 0 

gatccggaag cggggagtta tgaagagcag ctgacggcca ttcgtaaagc ataccatagg 180 

atactgagca ggaaggtgaa aatcagggga atggtacccg tgttttgccg ttacttcctg 2 40 

agtgatgccg ccaatcagtg ggaggctttg caggctgtct tgcagaaaga gccgtcgtgc 3 00 

gctgtctctg tcgtgcaaca gcccccgttg gatggaagta agattgcttt gtgggtctat 3 60 

ctgacctccg aaccgaatgc cgcttacaag cattactgga cagccggtgc gggtgtgtct 420 

tgcggcaaat cggaacggca aatgaaaacc ttgttgaaat cttatgaagc cgatctggta 480 

ggaaaaggct gtacgctggc ttccgattgc atccgtactt ggatctttgt gcagaacgtg 540 

gatgtgaatt atgccggcat tgtcaaggca cgccgtgaaa actttctggg gcagggactg 600 

accgaatcga cccattatgt agccagtacc ggcatagagg gacggcatgc cgatccgaag 660 

atacatgtgc tgtttgatgc ctatgcggta aaaggattgc agccgggaca ggtgacttac 720 
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ctgcatgcgc tgtctcatct cagcccgaca gccttgtatg gagtgacttt cgaacgggga 7 80 

acttccgtgg agtatggcga ccgtcgtcat ctctttatca gcggtacggc gagtatcgat 840 

catcgcggtg aagtggttca tgtaggtgat gtccgggaac agacccggcg gatgtgggaa 900 

aatgtagaga agttgctcga agagggcaaa gccggttttg aggatgtggc gcagatgatt 9 60 

gtttatctgc gggacgcttc ggattatccg gttgtgcgcg ctttgtttgc caaacgcttt 102 0 

cctgatacac cgatacagtt tgttgttgct gctgtatgcc ggcctgcttg gctgatcgag 1080 

atggagtgta tagcgatcgt cgctaacagt aactcctctt atgaatcatt ctga 1134 

<210> 784 
<211> 1197 
<212> DNA 
<213> B.fragilis 

<400> 784 

ataacgataa aaacactgtc cattatgagt ctgtttaatg ataaagttgc taaattgctt 60 

gccgggcatg aagcactgct gatgcgtaag aatgaaccgg tagaagaggg aaacggagtg 12 0 

attacgcgtt accgttaccc tgtactgact gcagcgcata ctcctgtctt ctggcgatac 180 

gacctgaacg aggagacgaa tccttttttg atggaacgta tcggtatgaa tgcgacgttg 2 40 

aatgccggag ccattaagtg ggatgggaag tacctgatgt tggtgagagt ggagggagca 3 00 

gaccgcaaat ctttttttgc tgttgccgaa agcccgaacg gtattgataa tttccgcttc 360 

tgggagtatc cggtgacctt acccgaagat gtggttcctg caaccaatgt atacgatatg 42 0 

cgtctcactg cccatgaaga tgggtggata tatggcatct tttgtgccga acggcacgat 480 

gacaatgctc ccataggtga tttatcgtca gctacagcca ctgccggcat tgcccgtacc 540 

aaagacctga aaaattggga acgtctgccg gatctgaaaa caaagagtca gcaacgtaat 600 

gtggtgctgc atcccgagtt tgtggatgga aagtatgcac tttatacccg tccgcaagac 660 

ggatttatcg ataccggtag cggaggtggt atcggatggg cattgattga cgatataacc 720 

catgccgagg ttggagaaga gaagatcatc gacaaacgat attatcatac catcaaggag 7 80 

gtgaagaacg gtgaaggacc gcatcctatc aagactcctc agggatggct tcatctggca 840 

cacggagtac gcaattgtgc tgccgggctc aggtatgtat tgtatatgta tatgacatcg 9 00 

ttggatgatc ccacccggct gatagcttct ccggcggggt actttatggc tccggtagga 960 

gaagagcgca ttggggatgt gtcgaatgtg cttttttcga atggttggat agccgacgat 1020 

gacggaaaag tatttatcta ctatgcttcg tcggacaccc gtatgcatgt agctacctca 1080 

actatcgaac ggttggtgga ttactgcctg cacactcctc aggacggctt ttcttcctca 1140 

gcttcggtag agatactgaa aaacctgatt gaacgaaatc tgagattgat gaaataa 1197 

<210> 785 
<211> 423 
<212> DNA 
<213> B.fragilis 

<400> 785 

ggcatcaaaa atcattcaaa tagattatta atgaaaagaa ttggaatata tattgttatt 60 

gtagtatgca tcctgtcttg catatcttcg cgaaggaacc tcttgacaga aaccagattg 12 0 

atgttggttg atacgagagc aactgaacat acggcggcat tgttctataa cttgcggcaa 180 

ctgaccggaa aacgggtggt ttatggacaa cataattatg aaatggatgg gttcgattcg 240 

gatagtacac gctgggagga tgaggcaaac cgatgtgatg cgtatgatgt gacgggggct 3 00 

tatcctgcct tggctagttt tgaattcctt cattttacga atcctcgtag ttggggaaac 360 

aaaagaattg aatttactta caggaaaaat tccatgttgc atataatcgg ggaaatgtta 42 0 

tga 423 

<210> 786 
<211> 483 
<212> DNA 
<213> B.fragilis 

<400> 786 

aagatgaaaa atgaagaata tacatatcta ggcggcctga tgcaaggcat cggctccctg 60 

ctgacgggta tgaaaaccac catcaaggta tattttcgaa agaaagtgac cgaacaatac 12 0 

ccggagaacc gcgccgaact caaaatgttc gaccgctttc gcggtacatt gaacatgcct 180 
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cacaacgaaa 
aatgatacca 
atactggcaa 
gcttgcccac 
accaaacttg 
taa 



acaatgagca 
ttaaagtgac 
agtatgaata 
acgacgcaat 
tcctgcaact 



ccgttgtgta gcctgtgggt 
cagcgaaacc attgaaaccg 
cgaccttggt tcgtgtatct 
caccttcgac caggtatttg 
caaccgcgaa ggaagtaaag 



tgtgtcagat 
aagagggcaa 
tctgccagct 
agcatgccgt 
taatcgaaaa 



ggcatgtccc 
aaagaagaaa 
ctgtgtcaac 
attcgaccgg 
gaaaaaagaa 



240 
300 
360 
420 
480 
483 



<210> 787 
<211> 322S 
<212> DNA 
<213> B.fragilis 



<400> 787 

aataaaagaa 

gcagcatcgg 

tatgcggcgg 

gatcatgaag 

ttagtgaagt 

ggatatgtca 

gatgccaatg 

ttctatatga 

attcaagatg 

ccgcaatcgg 

cggatgcgta 

ctgtttcata 

acaggacacg 

gaagtttatg 

aatggtcaga 

ttacgtggtg 

tctgaaccgg 

ccggaattgg 

gcccgttggc 

acccatatag 

gatgtcattt 

ggaaatcgtg 

tactatcctt 

aatatgaacg 

tggtatatca 

tatcgggatt 

atcgatcaaa 

aatcagatgg 

aatggaaagg 

gctccttgtg 

tatccggcag 

tcttcgggtg 

tttgaagtaa 

gggttaaaag 

gctgggaaat 

gatgtacttc 

catatagctt 

gatgaatggg 

atgagtactc 

aaacagcaac 

cagattagtt 

gaagagattg 

gcttcgtata 

gccgattgtc 

ttgacttcaa 

ttaccggaat 



aaatgaaacg 
ctgtcaaagc 
tagaacttca 
aagtaccgga 
cttggagaga 
ttcggacagt 
gattgcttta 
atggagatgt 
aacgaactcc 
ctaccattta 
tgaacttcat 
attttgaata 
gatggggctg 
atgattatga 
tcaaagagaa 
taaaaatagg 
ataacaaaga 
actatctgct 
gaagagtctt 
ctgtatcggg 
gtgcccctat 
aatactgggg 
ataatgtaga 
gattttatgc 
gtaaggctcc 
ttgcacttgc 
acgaaccttt 
tacatactta 
atgtggagat 
atgaaggagg 
ttgattttag 
gtgttgctac 
agaacacgga 
gtgttcatac 
tagctgataa 
aacaattgcg 
tgaatactga 
cgcgt age tt 
agaateggtt 
gggtacaggc 
ggcgtaatga 
atacattggc 
eggtatatge 
tggcegggag 
taatggaggg 
ttgtatccgg 



aatggcatat 
acagattttg 
geggtattat 
caggaaaacg 
caaaggagta 
aaaagaaaaa 
tggtgtatat 
atatcctgat 
gacagtggct 
ttcttgggat 
tatgattcat 
caaagggcat 
teceggatgg 
ttteggggea 
gggagcaact 
tttagggtta 
cttgataaag 
ttgttttcaa 
tgatggattt 
ctgggggcta 
atcttactat 
atgtccctgg 
tettteggaa 
gctgacatgg 
ttggtataat 
caattatgga 
tgctaccgat 
tccgttgatg 
aaaagccacc 
agagtgcgtg 
taatagtccc 
tgtttatctg 
aggttggcaa 
gctttatgtt 
acagctgaaa 
tctctcccga 
ttttgaaaat 
tttgtatcgt 
tgtgaaacag 
tccttcgcat 
agagceggea 
gtctgatgtg 
agtggatatt 
tgetgacegg 
aactccgttg 
gatatttcac 



cacctgctgg 
ttaacagctt 
tatcagctgt 
gaatttgttc 
ttacctctga 
ggcagggaat 
gggttgctgg 
aagaaagagg 
atccgtggat 
gattggcgtt 
aactataacg 
ttatcgcgtg 
aatatcaacg 
gactatggct 
atattcegta 
gatattgatg 
gtacaggtcg 
tccgaaggtc 
tatgaagaga 
actgeggaat 
tcggccgctt 
ctggaacgtg 
acaatceggg 
agattggegg 
catgaagtgc 
gaaaatgcag 
ttcggtgagt 
aatctttatt 
gtatatgcag 
ggatatatta 
gaacgaatgt 
gatcgattgg 
tcctggaaat 
cgttttcaac 
acaattgata 
ttacgagcac 
tatcaatgga 
attgaagata 
aactatgtag 
attatagcta 
gtaagttcct 
aattgttatc 
gaagggcata 
gaagctcegg 
cacatteggt 
tateggagaa 



ctatcttaat 


ttgggggatg 


60 


ctgctacgcc 


gattgaagaa 


120 


ccggacgctt 


gttgtccatc 


180 


tgacaagact 


ggatcatccg 


240 


agtccatgcc 


gggagagcag 


300 


tggttgtcat 


tgcaggtgtt 


360 


aagatcactt 


aggaatgegt 


420 


ttcagacgag 


aataccgttg 


480 


ttcttccctg 


gactaatttt 


540 


acatcataga 


tcaggcagca 


600 


ggttttgcgg 


acataatgaa 


660 


gatggatgee 


tactataaag 


720 


aatatctttt 


eggggcatet 


780 


tgcataatga 


aacgttgacg 


840 


aggtgattgc 


ttatgeacac 


900 


tegttttgee 


ggaatatcag 


960 


cagaaatagc 


tegtgagtat 


1020 


agaaaaatga 


ggctttttat 


1080 


tgaagcggaa 


gtcgccttct 


1140 


eggtgaatag 


tetgectgaa 


1200 


ttgagccggg 


aagtgtttat 


1260 


attttaacag 


ttctgagtat 


1320 


cctttggaga 


tgcttctgcc 


1380 


atgetattte 


tccaaagatg 


1440 


tggactcttc 


ggagaaagtg 


1500 


tggatgecat 


tactgacatt 


1560 


gtcaggaaac 


acceggattt 


1620 


cgatgacttt 


tgggggaaag 


1680 


agaaaaaagg 


tacaaaaaat 


1740 


tggctgatga 


ctggttgcag 


1800 


ecataeggat 


tgettctgea 


1860 


gaggacctgt 


tategctegg 


1920 


cattgaccgt 


tccggtgaaa 


1980 


cttttaatgt 


aatagccaaa 


2040 


gttgtatggc 


cgttacttcg 


2100 


gtattcaggg 


ageagcatge 


2160 


atgatttacc 


ggggaaaatg 


2220 


tttcttccta 


tggaaacatt 


2280 


agaagatcaa 


ccagctacgt 


2340 


aaggtactct 


tgagggagca 


2400 


tcgtggtttg 


tcgtaatgga 


2460 


aggataagtt 


tcatggagca 


2520 


aaagcccttt 


gggaatacct 


2580 


ttattgtgat 


taattccccg 


2640 


tttcagtcgt 


tgaaaategg 


2700 


caggagagaa 


agtgtggaaa 


2760 
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aaaataccat ttaagcaccg gaccagaggt gtcttcacat taaccttacc cgcttctgag 2820 

attacgcgtc agggaataga atactacatt tcggtttcag attctgacaa tgtattttgc 2 880 

tatccgggtt cggctccggc tcggaatcat acggtggtag taactgaggt accgggagat 2 940 

gataaacccg aagttccgat gataaaacca atttgtggta aacgtatgtt ttggagtcgt 3 000 

gtgccaaatg tggaaatgta tcgcatctat cgtagcagaa ctcctgattt taaaatcgga 3060 

gcagatacgt ttgtgacgtt tgtagcggga aatacacaga gttttgccga taatggattt 312 0 

gatttcgacg ggacttctct gaaaggaact tattattatt gcgtgacttc cgtatccttt 3180 

tgggatcatg aaagtgaggc atcaaaaatc attcaaatag attattaa 322 8 

<210> 788 
<211> 1281 
<212> DNA 
<213> B.fragilis 

<400> 788 

aacagtatga atatgaaaac gaattatcta aaactcaact cttgggccgt agcagccctg 60 

atgggaatgt gttcacttgc agcttgtagt gacgacaaca gcggcgaagg cggcggaaac 120 

ggcgacagcg aagaggtgat cgccaacaac ggaacactga aaggaagcgt agacggatcg 180 

aaaaccgtca tcctgaccaa aggctacaac ttctccctcg acggagaata tatcgtcaaa 240 

gccggttcca ccctgaagat cggcgaaggt gtgacaatca gcgccaaaag cgatgatgcc 300 

accatcgact acatcctcgt ggagcaggga gccaagatcg aagcggtagg tactgcctcc 3 60 

gcaccgattg tcatgactgc cgataccaaa gaaccgggag catggggcgg catccacatt 42 0 

tgcggcaaag ccccgatcaa tatcggatcg accggtaaat cggaagtcgg agatgccgct 480 

tacggtggtt ccgatccggc ggacaactcg ggtatcctga agtacattcg cctggaatac 540 

gccggataca agttcactac ggaaaaggag tgtaacggct tcaccttcta tggtgtagga 600 

aacggtacga ccctcgaata cctcgaagca tacaaaggta ccgacgacgg cttcgaatgg 660 

ttcggaggta cggtcaatgc caaatatctg gtatcggtga gcaacagcga cgattcattc 72 0 

gactggacag agggatggag cggaaaaggg caattctttg tcgcctacca ggaagatccc 7 80 

gccactttgg gatatacatg cgactgcctg atcgaggccg acaactatga caagaatatg 840 

gatgccgctc cgatctcatg cccgacactg gccaacctga cactgatagg cgccaacaac 900 

gacgaaggca aaagaggcat ccgcctgcgt gccggaactc aggccaagat ctacaatgca 960 

ctcgttacag gcaaggccaa taacctgact accgaaacag aacagaccga gaaattcctg 102 0 

atcgacggtc cttcggtact gaactacatc gctatcgcga gagatatcaa ggcaagcggt 1080 

gacggcggtt actcttctgc cctgttcaca gccgaaggca atcacaatgc catcaaccag 1140 

actttgagct tcagcaatat ctttatcgga acacaggacg gaggagccga cctgtcagca 12 00 

gacagcttct ttgaaaaagc ggcttataaa ggtgcagtga aagcagacaa tgaatggacc 12 60 

aaaggttgga ccaagttata a 12 81 

<210> 789 
<211> 1218 
<212> DNA 
<213> B.fragilis 

<400> 789 

ataatcatga acagaatcaa caccacccta ctcttacttt tttgctcagt ctattgcttg 60 

gcgcaacagg ctactatccc cgttcccaag ccctttcagt tgaaatggca tcaagcggaa 12 0 

atgggagccg tattccatta tgatctgcat gtgttcgatg gagtacgcta cggacaaggc 180 

aacaaccgca tcaatccgat agaagattac aacatattca accctacgga actaaacaca 240 

gaccagtggg tgctggcagc caaagcagcc ggatgtaagt ttgccgtact gactgccact 3 00 

catgaaaccg gtttcggtct ctggcagagt gacgtaaatc cttattgcct caaagccgta 360 

aaatggagag acggcaaagg ggatatcgtc cgtgactttg tcaactcttg ccgcaaatac 42 0 

ggcttacaac cgggtatcta catcggtatc cggtggaatt ctcttttggg catacataac 480 

tttaaggcag aaggagaagg agaatttgct cacaaccggc aagcatggta caaacgactg 540 

tgtgaaaaga tggtgaccga actttgtacc cgttatggag atctatacat gatttggttt 600 

gacggcggcg ccgatgatcc tcgtggagac ggaccggacg tagagcctat tgtgaataaa 660 

tatcagccta attgcctgtt ctatcataat atagatcgtg cagatttccg ttggggtggt 72 0 

tccgagaccg gtaccgtagg ttatccctgc tggtccacct tccccgctcc ctgttcacat 780 

cacaaacgga tagaaagcaa tgtcgatcaa atcgaactgt tgaagcatgg cgacaaagat 840 

ggaaaatact gggtaccggc catggcagat actcctttac gtggagccaa cggacgtcac 9 00 
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gaatggttct gggaaccgga tgacgaaaac aacatctatc cattgaacga actaatggat 960 

aaatatgaaa aatcagtagg acggaacgct accctgattt taggcctgac acccgacccg 102 0 

aacggattaa tacctacagg agacgaacaa cgcctgaaag aattcggtac agaaatcaat 1080 

cgtcgcttct cttctccatt agcccagata tcgggacagg aaaaaaagtg cgaccttgaa 1140 

actggacaaa aagcgaccgg tgaactactg cgtcattcaa gaagacatac agaacggaga 12 00 

acgtatccgc caatataa 1218 

<210> 790 
<211> 2706 
<212> DNA 
<213> B.fragilis 

<400> 790 

aaagacatga aacaacaaat cgggagactt ctctccactc ttcttctcgc aacattttcc 6 0 

ttaggaatca cagcaggggt catccaggga accatcattg ataaacagac caaagaaccc 120 

ctgaccggag ctaccgtaca gattgccgga accacgaccg gaaccgtagc cgatgtagac 180 

ggtaactaca cactgacgct aagcaacggc acctatacca ttgaagtgaa atatatagga 240 

tataaaacac tccggatgaa tgaagtgaaa gtgaaagcca atgcgacact gaactttgaa 3 00 

ctggaagtag acgcgcaaac gctggacgcc gtcaccgtag tggcccggaa aaacctggaa 3 60 

ggcgaaaagg ctttactgca agagagacag aaagcaacgc ttgccatcga aaacatggga 42 0 

gccaaagaga tgaccctgaa aggtatatcg aacgtacagg acggagtcaa gaaaataacc 480 

ggtatctcca ttgcaagcgc cggacaactg atagtacgcg gactgggtga ccggtacagc 540 

acgaccaccc tgaacggttt gcccatcgcc tcgcccaacc cggacaacaa gttgattccg 600 

ctcgacctct tcccggcctc taccgtaaag aacatcaccg tcagtaaagt atatgccgcc 660 

ggagcctttg ccgactattc gggcgcacat atcgacatca gcaccaagga gaacacggga 72 0 

agtgactttt tctccatcgg cttcaacgta ggcggacgct tcaacactgt cggaaaagat 7 80 

ttctattata gcgaccggaa aggcggactc ttcagtacgg gaaacctcag gaataaagac 840 

cggattctgg ctatgggtaa aagcgagttc cgcgattacg cccgcaacaa tgacccgttc 900 

ggcacaaact tcgctatcag caagcaccgt tcactacccg aattcggtgg taacctggga 960 

ggaggcaaga gctggacact ccccaacgga aaccgtctga gcgtgcttgc ctcggtaggt 1020 

gtcagcaacg aaaaccaaat cttgaaagac gcctacgtga ccactatgac cgctcagggc 1080 

acacacctcg acaagttcaa ttatgacagt tattccagcg cactgaaaat agccggattg 1140 

ggcaacatcg gctactcgtt ccggcaggcg gaccacatca acttcaccgt gttctatgca 12 00 

cgcaatgcca tcaacgatta catgtcccgc gaggggatcg atgccgaaaa gaacaacatc 1260 

acatcgagca acagcgtttt ccatgcctac tcactactga acaaccagtt gctgggacac 132 0 

cacgaactga cttctcagtg ggatgtaaac tggagtgctt cgtacggact gaccaacagt 13 80 

gacgaaccgg atcgccggca ggtggtcttc ttccgtaacg aaggcagcga taagctgaac 1440 

ctctttaaac tcaaccagac taccaaccgc tacttcggag aactgcaaga gaaagagatt 1500 

gtaggagatc tgcgcacctc gtacaaatgg ggagatgcga acctgattcg tgtgggaggt 1560 

acttacaaaa gcaaaaaacg tgactttgaa agcgtgaact tctactacga tatcaatgcc 162 0 

ttgaacgctg acgtcaccaa catttatgat accaacggat atctgaatca ggaaaacata 1680 

gccaacggga cgataaaagc caacatcgat gcacagcccc gttacaacta ctacgccgga 1740 

atggatgtgt gggcagggtt tgcagaaata gagtactacc cgatggaatc tctgctggtc 1800 

aacgtgggac tgcgctacga gcaggccaaa caatgggtac gctattggac ggacggcgga 1860 

caggagaaga aaacgaacct ggacaaaggc gacttcttcc cggcactgaa cctgaagtac 192 0 

agcctgaacg aaaccaacag cctgcgcctc tccgtatcac gcactgtcac ccgcccttca 1980 

tttatcgaaa tggctccgtt cctctaccag gaatcttacg gaagtgccta tatccgcggt 2 040 

aacaacgaac tgaaaaacgc ttataattat aacatcgacc tgcgctatga tttctttccg 2100 

aaacgcaaca acggggatat gttctctgtc acgggttatt tcaaaaaact gaaatcgccg 2160 

attgaacaga ctcaggagtc ttcgggcggc acagtgatcc gctctttccg caacgccgaa 2220 

gatggaatag ccacaggagt ggaaatagaa ttccgcaaag aactgttcaa gaacttccgt 2280 

atcggagcca acggttcata catgtacaca aacgtcgtat tgcccgaagg cggggtatat 2 340 

accgactcgg aacgcgctct gcaaggagcc tctccgttcc tgatcaatgc agatctcagc 2400 

tacactcctc aactgagagg agaaagcgac ctgacactgg cactggttta caatgtgcaa 2460 

ggcccgcgca tcgagacagt aggtatctac ggaacaggta acatcaagca acaaaccctg 2520 

cacacgatgg acttcatagc aagctatgcc atcaacaaac acctgagcct gcgcctgcag 2580 

atgaaagact tgctgaacag taccatccgc ttcaagcagg agctgccggc aacgggacaa 2640 

aaggtggaag tagaatcatt ccgtccggga acccatgcag aaataggagt ctcgtacaga 2700 

ttctaa 2706 
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<210> 791 
<211> 716 
<212> DNA 
<213> B.fragilis 



<220> 

<221> unsure 

<222> (695) , (706) , (707) 

<2 23> Identity of nucleotide sequences at the above locations are unknown. 



<400> 791 

gataaaatga aaaacaaccg attgatcatt acccttattg ccttgtttct cctaggattc 60 

gggctgaaag cccaaactgc ttctactgaa gaaactgctg ctcagaaaga gaaaagaatg 12 0 

gaatggtttg cccaggccaa gttaggaatc tttatccatt ggggaatcta tgccgtgaac 180 

ggagtatcag agagctggtc attcttcaat aactatcttc cttatgaaga gtatatggct 2 40 

caggaaaaag gctttacggc atcggcctat aatcctcagg agtgggtgaa actgattaaa 3 00 

gaaagtggtg cacgttatac ggtcattacc accaagcatc atgatggcgt agctctttgg 3 60 

gatacgaagg cgggtgacct cagtactgtg aaaagtactc ctgccgggcg tgacctgatt 420 

gctccttttg tgaaggaagt acgtaaacaa gggctgaagc tcgggttcta ctattcgctg 480 

cttgactggt cacatccgga ctatcccaac aaaacccgta cggaagtacg ttacaaaaac 540 

gatccggatc gttgggctaa gtttgttaag tttaattttg gacagctttc cgagttaaac 600 

aaaacttgga aacctgatct ttactggttt gacggagact gggaacaaac tgctgaggct 660 

tgggattcga agtcttcacc acggggctgg aagantgcgc gtcatnngga gctcaa 716 



<210> 792 
<211> 840 
<212> DNA 
<213> B.fragilis 



<400> 792 

tatatgtttg actttagtat aataacaagt tggatacacc agacattaac ctccgtcatg 60 

ccggagggat tggctgtatt catagaatgt gtcgttatcg gggtgtgcat tgtggctttg 12 0 

tacgccatac ttgccattct ccttatttat atggaacgca aggtgtgcgg tttcttccag 180 

tgccgactgg gtccgaaccg cgtaggaaag tggggaagca tccaggtgct ctgcgatgtg 2 40 

ctcaagatgc tgaccaaaga gatcatcgaa ctgaagcatt cggacaaatt cctttataac 3 00 

ctggctccgt tcatggtgat tatcgcctca tttctcacct tttcgtgcct gcctatcagt 3 60 

aaagggctgg aagtgctgga cttcaacgta ggtgtcttct tcctgttggc agcttcgagc 42 0 

ataggcgtag tgggtatcct gctggccggc tggggttcga acaataagtt ctcactgatc 480 

ggtgctatgc gaagcggtgc acaaatcatc agttatgaat tgtctgtcgg acttagtatt 540 

ctcacaatgg tggtcctgat gggtaccatg caggtttctg agattgtgga aagtcaggct 60 0 

aacggatggt ttatcttcaa aggacacatc ccggccctga tcgctttcgt tatctatctg 660 

atagccggca acgcagaatg taaccgaggt ccgttcgacc ttcccgaagc ggaaagtgaa 72 0 

ctgacggcag gataccatac cgagtattcg ggtatgcact tcggcttctt ttatctggcc 780 

gaatatctga atatgttcat cgtagctgcc gtagccgcca ccatcttcct gggaggctga 840 



<210> 793 
<211> 2511 
<212> DNA 
<213> B. fragilis 



<400> 793 

ccccgcatcg acaattacga atacctgctg ccaaagaaca aagagttctt ccagaaactg 60 

ggtgtcgact cccttattta cccggaaatg ctggctgcca aggagatcgt atcgtccatg 12 0 

cgtatgagtt gggtgcgcca atggtgggaa ttttgcggag gatcacttat cctgatcggt 180 

acaaagatgc gtgaaaaagc cgaaatactg aatgtcacgc tggccgaact aggtgcgccg 240 

gatattccct atcacgtagt agccatcaaa cggggtaccg aaaccattat cccccgtgga 3 00 

gacgatacca tcaaactgca cgatatcgta tacttcacca ctacccggaa atacatccct 360 

tacatccgaa aaattgccgg aaaggaagaa tatgccgacg tacgcaatgt gatgattatg 42 0 
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ggcggcagcc gcatcgcagt ccgtacagca caatatgtac cggattacat gcaggtaaaa 480 

attgtggaca acgacataaa ccgctgtaac cgcctgacag agttgctcga tgataagacc 540 

atgattatca acggagacgg acgggatatg gacctgttga ttgaagaggg actgaagaac 600 

acggaagctt tcgtagcctt gaccggtaac tctgagacca atatcctggc ctgccttgcc 660 

gccaaacgca tgggagtgag caaaacagtg gcggaagtgg aaaatatcga ttacatcggt 72 0 

atggcggaaa gcctggacat cggcacggta atcaataaaa aaatgattgc cgccagccac 780 

atctaccaga tgatgctcga tgcagacgtt tcgaatgtga agtgcctgac ctttgccaat 840 

gcggacgtag cagaattcac agtacccgaa aacgccaaaa ttaccaaaaa caaagtgaaa 900 

gacctgggac tgcccaaagg gaccactatc ggtggcctga tccgcaacgg agaaggtata 960 

ctggttacgg gtgataccct tattcaggca ggtgaccacg tcgtcctatt ctgcctcagc 1020 

atgatgatca agaagataga aaagtcctta attgatggta tgataaactc taaaatgata 1080 

tatcgcatca caggtttcct cttactgata gagacaggcc tgttactttg ctgtgcaggt 1140 

gtttcgctga tataccgcga ggatgacctg agcagtttcc tgttgtcggc aggattgact 1200 

actttagtcg ccatccttct gctggctctt ggcaaaggag ccgaaaaaca acttaaccgc 12 60 

cgagacggat atgtcattgt cagtgtagca tgggtcgtgt tttccttatt cggaatgctc 132 0 

ccgttctacc tcagccatta tataccaagc ataaccaatg ccttcttcga aacgatgtcc 1380 

ggattcagta gtaccggagc caccattctc gacgatatcg aagcactgcc ccacggactt 1440 

ctcttctggc gaagcatgac acagtggata ggcggattgg gcattgtctt ttttaccatt 1500 

gccgtactgc ccattttcgg tgtgagcggc gtacaactct ttgccgccga agccagcggg 1560 

cctacctacg ataaagtaca tccccgtatc ggtgtgacgg ccaaatggat atggactatc 162 0 

tatgccggac tgacagccat tgaagtgatt ctcctgttat tcggaggcat gggactgttc 1680 

gacagtatct gccactcgtt tgccacgacc ggtacaggag gttattctac caagcaagac 1740 

agcatagcct attacaattc accttatata gaatatgtga taggtgtttt tatgtttctc 1800 

tccggaatca actttacgct acttctcctg ctatttaccg gtaaactgaa gaaagtatct 1860 

caaaatgccg agttaaagtg gtacgtgatg tcagtgattc tctttaccgc attcattgcg 1920 

gcagtgctct accgcaccac cccgatggga gctgaggaat ctttccgcaa agcctttttt 1980 

caagtagctt cgctgcatac ttccaccgga tttgtaacag ccgactacat gcaatgggta 2 040 

ccggtacttt ggggtaccct gactgtcatc atgctgatag gtgcctgtgc cggaagcacc 2100 

acaggaggta tgaaatgcat ccgaatggtg attctggcca aagtgtcacg aaatgaattt 2160 

aaacacatcg tacatccgaa cgccgtactt ccggtgcggg tgaacaaaca ggtcatttct 2220 

cctgccatcc tgtcgacggt actggcattc tcattcatct atgccgtcat catcattgtc 2280 

agtgtactgt tgatgctggc aatgggcgta ggtttcacag aatctatcgg gacggtgatt 2340 

tcaagtatcg gaaatatggg accgggattg gggagctgcg gtccggccta ttcatgggac 2400 

ggactccctg atctggccaa atggttattg tcgttcctga tgttactggg acgtctggaa 2460 

ctattcaccg tcttactttt attcagttct gacttttgga aaaggaatta g 2 511 

<210> 794 
<211> 1878 
<212> DNA 
<213> B.fragilis 

<400> 794 

tgtagggtag agttggtaaa agagggaagt ttttatggaa ctatccctct ttttaactca 60 

gcttgcaata taaatctaac aaaaatgata atgcgaaaaa ttcaatatct gtttattgct 120 

ttgtttatct gcctggaaat ccaggcacaa gacaaattta atatcagggg agtgttgccc 180 

tggcataatt ttctatcggg acctacttcc tggaatctgt cggattaccg gatttatctg 240 

gatgaatgcc ggaagaatgg tatcaatttt attggttttc ataattatac tggtggtgga 3 00 

gaacgctatg ccacctatgt ggaacctatg ataaaaatag aatataaaaa tattcttccg 3 60 

caagcttgtt ttgataattc gatgacagcc cgttggggat atcttccgat ggctgtgaaa 42 0 

gactttgctt ttgatacggg aaagatattt caattgcctg ttggtgcaga agcttttggc 480 

aataatggtt cgataacatc acattcttct cgggaacatt atgaaaaagc tcagtccttg 540 

atgagggatg ttctgaagat ggcacatgaa cggggaatcc gaatggctat gggatttgaa 600 

tttggagtga tcccttccga atatttttct ttgaatgtag ccggagattg tttttattgg 660 

gcaggtgaat cgaatatgat ccccaatccg aaaagtcaga tagctgccga gatacattat 72 0 

gcggcaattg atgatatcct aaacacttat ccggatatcg attatatctg gatgtggctg 780 

aatgaacatt catttatggg tgtcgatgtt cagaaagcac ttagagataa accctttgcc 840 

cgggcttatc aagagaatca ggcactcttt aaagaggctg ccgattcatc ggctcgtttt 900 

gtcggggtat gggcattgga atacatgaaa ttgacttata aacatttgaa atcaaaaggc 960 

tctcgtgcaa agttaatcct tggtggttgg ggagggggac atcagttgcc ttctttattg 1020 
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aagQ'gs-ctgg atagggcctt accccaagat attattttca gttgccttaa tccggattta 1080 

ggaaaaagtc cgcaacctga tttcttggaa gagattgccc gaaaccgtag tgtttgggct 1140 

gtaccctggt tggaggggga tcatcaactc tggcattttc agccgagagt caatatgatg 12 0 0 

cgtgaacagg taaagcttgc tgccgaacaa aatctggacg gagtaattgc tattcactgg 12 60 

cgtacagagg aaccccgttt taattttcga acgtttgccc gttttgcttc ggataagggt 13 2 0 

gctgacgaga gtgtagatca attgtatgac cgatatctga cagaggaatt tggggaagaa 13 80 

gccgcaaaag aaatgactcc tttacttgcc agaatggatc gtgaacagat tcaatggaat 1440 

gtaccgtcac cggaatttta tgcatatact ccggaatggg gattattaga tgaaaataat 1500 

gtgcgaatac gacaagagtt ggtgtcttcg ggagaatcgt tattgaaaaa gttaagagga 1560 

gagaaacggg agaatctgaa acgtttcata gcaatgttcc gttttgagct gttgttgggt 162 0 

gaggtagatc gggctatgat gcctgctttt atcttaaaaa agaaggaagt gcaaggtgag 1680 

aaaataaatg gttcgcagga gtatatggac gcatatcggc tgttagtttc agcccctgtt 1740 

aaagaaatgt ttgatactta tatggaacgt gttcattctc gtggagaatt gggagtactt 180 0 

agctctttga atcaacgtgt gtggcgcgaa tataatgatc ttaaaattta tctggaaaat 1860 

aaaataaaag aaaaatga 187 8 

<210> 795 
<211> 660 
<212> DNA 
<213> B.fragilis 

<400> 795 

aggtataaag atatgaagaa actgataata ttcgatttgg atggtacttt attgaatacc 60 

attgccgatt tggcacatag tacgaatcat gctctgcaaa ctttgggata tccgactcat 12 0 

gaagtcgctt cctataactt catggtgggt aacggcatca acaaattgtt tgagcgtgca 180 

ttgcccgaag gagagaaaac cgaggagaat gtgctccgcg ttcgtaaaga atttcttttg 240 

cattatgacc ggcataatgc cgacgagagt cgcccttatc cgggaattcc ggaattgttg 3 00 

gaaacattgc agcataaagg ttataaattg gccgtggctt ccaataaata tcaggcagcc 360 

accgagaagc tgatagcaca ttatttcccg ggaatccggt ttgttgctgt atttgggcag 42 0 

cgtgagggag tgaaggtgaa gccggatcct gctgtggtgc atgatatttt gcagattgcc 480 

gatgtttcga aagacgaagt gctgtatgtc ggcgattcgg gagtggatat gcagacggct 540 

atcaatagcg gagttacttc ctgtggagtt acgtggggat tccgcccccg tacggagctt 600 

gaatcgttct gtccggatta tatagtagac aaggcggaaa ctattttgtc tattgtttga 660 

<210> 796 
<211> 1497 
<212> DNA 
<213> B. fragilis 

<400> 796 

agcacaaaag atatgaactt tttatcctta ttcgtactca ttcctctgct gatgcttggc 60 

gggttatacc ttgccaaaag cattaaggcc atccgcggag tgatggtagc gggaagtacg 12 0 

gcgttgctga tcctgagtgt tgtgctgacg ttcctctatt tgggcgagcg ccaggcagga 180 

gctacagccg agatgttgtt ccgtgccgat acggtatggt atgcaccgct tcacatagcc 240 

tactcggtgg gagtagacgg aatatcggta gcgatgctgc tgcttagcgc tgtcattgtg 3 00 

ttcaccggca cctttgcctc ctggaagttg cggccgctga caaaagaata tttcctgtgg 360 

ttcaccctcc tgtcgatggg agtattcggt ttctttatct ccatcgactt attcaccatg 42 0 

ttcatgttct acgaaatcgc attgataccg atgtacttac tcatcggcgt atggggttcg 480 

ggacgcaaag aatatgcagc catgaagctg accctgatgc taatgggtgg ttcagcattc 540 

ttgctgatcg gtattctggg tatcttcttc ggtgccggcg gaacaaccat gaacattctt 600 

gaaatagctc aactgcataa cattccgttt gcgcagcaat gcatctggtt tccgctcact 660 

ttcctgggat tcggtgtgct gggagcactc tttcccttcc atacctggag tcctgacggt 720 

catgcctcgg caccgactgc tgtctctatg ctgcatgccg gcgtattgat gaagctcgga 780 

ggctacggtt gtttccgcat cgccatgtac ctgatgccgg aagctgcaaa cgaactgggc 840 

tggatcttcc tgatcctgac aggtatctcc gttgtatacg gtgctttcag tgcttgcgta 900 

cagacagacc tgaagtacat caacgcatac tcttccgtaa gccactgcgg ccttgtgctc 960 

ttcgctatcc tgatgatgaa ccagacagca gctaccggag cggtgcttca gatgctcagc 102 0 

cacggattaa tgacagctct gttcttcgcc ctcatcggta tgatatacgg acgtacccat 108 0 

acccgtgacg tacgcgagct gaacggactg atgaaagtga tgccgtttct cagtgtctgc 1140 
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tatgtgattg ccggacttgc caacctgggt cttccgggac ttagcggctt cgtagccgag 1200 

atgactatct tcgtcggttc attccagaac ttcgatgtat tccatcgtac actgaccatc 1260 

atcgcttgct cgtccatcgt gatcacggca gtctatatcc tccgactggt aggtaagatt 1320 

ctatatggaa cgtgtaccaa caaacatcat ctggcactga cggatgcaac ctgggacgag 13 80 

cgctttgccg tcatctgtct catcatttgt gtcgccggac tgggtatggc tcctttctgg 1440 

gtcagccaca tgattggcga gagtgtattg cccgttgttt cacacttaat accctaa 1497 

<210> 797 
<211> 1596 
<212> DNA 
<213> B.fragilis 

<400> 797 

aatatggaag aaataaaata catagaaccc gcagcactac acgacgaaat gctgcgtctg 60 

cgtaacgaaa aacagatgga cttcctcgaa agcctaacgg gtatggactg gggagtggca 12 0 

gacgaaggtg acgcaccgaa cgtaacccgg ggacttggag tagtctatca tctggaatcg 180 

accgtaaccg gcgaacgcat cgcgataaaa acatccacaa ataaccgcga aactccggaa 240 

ataccttccg tcagtgacat ctggaaagcg gccgacttca acgagcgtga agttttcgac 3 00 

tattacggca ttgtattcat cggacatccc gacatgcgac gtctttatct gcgtaatgac 3 60 

tgggtaggcc atccgatgcg taaagataac aacccggaga aagacaatcc gctacgtatg 42 0 

gacaatgaag agacatatga tacgactcgg gaaatagagc tgaatccgga cggaacgtat 480 

caaactcagg agaatgtgat cttcgatgac cgtgaatacg tagtcaacat cgggccacag 540 

cacccggcaa cccacggagt gatgcgcttc cgcgtctcac ttgaaggcga aaccatcaaa 600 

aagctcgacg ccaactgcgg atatatacac cgtgggatcg agaagatgaa cgaaagcctc 660 

acctatccgc agactttggc actgaccgac cggctcgatt atctgggagc acaccagaac 72 0 

cgccatgcgc tctgcatgtg catcgagaaa gcaatgggta tcgaggtcag cgaacgcgtg 780 

aaatacatcc gtaccatcat ggacgaactt cagcgtatcg actctcacct cctattctac 840 

tcctgtcttg ccatggacct gggcgcattg acagccttct tttacggatt ccgtgaccgt 900 

gaaatgattc tggatatgtt cgaagaaact tgcggtggac gtttgataat gaactacaat 960 

accattggag gcgtacaggc agacctgcac ccgaacttca tcccgagagt aaagaagttc 102 0 

atcccttacc tgcgtggaat catccacgaa tatcacgatg tattcaccgg caatgtcatt 1080 

gcccggcaac gtctgaaagg tgtaggggtg ctgagtcgcg aagatgccat ttctttcgga 1140 

tgtaccggtg gaacaggccg tgccagcggc tgggcatgtg atgtacgcaa acgtatgcct 1200 

tacggcgtat acgataaggt ggattttaaa gaaatcgttt ataccgaagg cgactctttt 1260 

gcccgttaca tggtgcgtat ggacgaaatc atggagagcc tgaacattat cgagcaattg 132 0 

attgacaata ttccggaagg accgatacag gagaaaatga aacccatcat ccgggtaccg 13 80 

gaaggaagtt actataccgc cgttgaaggc agccgcggtg aattcggagt gttcctcgag 1440 

agtcatggcg acaagacacc ttaccgtttg cactaccgtt cgacggggtt gccactggtt 1500 

tcggctgtcg acaccatctg ccggggagct aagattgccg acctgatcgc tatcggcgga 1560 

acgctggatt atgtggtacc ggacatcgac agataa 1596 

<210> 798 
<211> 1611 
<212> DNA 
<213> B.fragilis 

<4QQ> 798 

gcgctgctgg atccactgga tgaaaaggcc tatgattttg tgtcgcctga gcagttgggt 60 

gacagcgaaa gtgcagcttc tcagctcgtg acaggagcct ataatacggt gatcaccagc 12 0 

tttattgctc cgggatctta tctttacctg accaatatgg actgtgacta cgcatcagga 180 

gcttcatggg cattcggtaa tgtgggagcg ggaaacccac aaggtttttg ggggatagac 2 40 

cacatgtggc aaggtagtta tacgttaatc caccgggcaa atctgggtat atccaagatt 300 

tcggcaatga gtaatctgag ccaggagagt aaacaggatg ctttagctca actctgtttc 3 60 

cttaaagcat gggcttattt taatttggtg agaaattacg gtcctgtacc tattttccgg 42 0 

aaatccattt cggaaggaga agctatgagt caacctcgtg catcggtttc ggatgtatat 480 

gcacatatta tcgaattgtt ggaacaggct gaaggtatgt actcaaaaga tgatgcaggt 540 

ttcgtggtgg ggcatgcttc aaatggagct gctaaagcat tgttggcgaa ggtatatgtt 600 

actatggctt ccggggcgat gtccggtgtg cctatcgtgg ttaagggagg aaatccgaat 660 

atatttgaac cacaacctat cacgcacatt gcaaagactg ttgcaggtta tgagtctttt 72 0 
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gatccggcca agtattatgc gttggcacgt gacaaagctt gggaggtgat aaacgaatat 780 

accctgtttg ataattatat ggacgtatgg gccataggaa accgtaataa gggagaacac 840 

atctggatgg cacaggccat cagcggtgat aaggactttg gaaacacgat ctgtcaggat 900 

tatgtgggca ttttcaaaga agacggtacg atggaaggta actggtatgg tatgcgcgat 960 

cactggtatc tgcttttcga agaacaggat acgcgtattg tcgatggagt tattcatcga 102 0 

tatgcgtcgg atggtatatc caatggtaag gttatctata actattatcc ccgttggtat 1080 

gcaaataaag tggacaataa ggaggtatat gacagtcatg gcaatgcttt tgatggtacg 1140 

gaagtctatc atgaacgtca ggggtggaca ttggcgaagt tgacaaaatt tacttttgtt 1200 

acataccgga aacaaaaaaa cagtgatttt cactttacgt tactccgttt gccggatatc 12 60 

atgttgattt atgctgaggc tgtcaatgaa ttgaatggtg ggccggacgc tgaggcctat 132 0 

aatcaggtga accgcattcg tacgcgtgca catgccactc cgttctccgg aatgaatcag 13 80 

gatgaattcc gttcggctgt actggaagag cgtgcccgcg aattggccta tgaagctgac 1440 

cgccgttatg atcttttccg ttggggtatc tatctggatg tgatgaatgc catcgatatg 1500 

gatgagcata atgtgactaa acgtcgtttg gaacgaaatc ttctttatcc gatacctacc 1560 

agtgaagtga actctaatga taagattgat tctaataatc cgggatggta a 1611 

<210> 799 
<211> 1011 
<212> DNA 
<213> B.fragilis 

<400> 799 

aatgttatgg aactatcatt atatataaaa 
atccccatta tgacccatcc gggtattgaa 
acgaacggtg aaattcatta tcacgctatc 
gcggcttgta ccactattat ggatcttacc 
agtatgtccc ccaatgaagt acccagtgtt 
gttgaggctt tgcagattcc ttcggtcgaa 
gaccgcctgg cagcagaagg aatagacaag 
tctctggcag gtcgtctata cgatatgacg 
gatactgttc tgcttttact ggagaaatgc 
atcaaagaga ccggagtggc cggtgttatt 
aatgaagatt gtcaacgcta ttcttcggtc 
gatgattcgt ttgcagttat tctgcataat 
atgctggcta ccggtgctaa agggtatcat 
ctccgggaat gcccttcgga tgtatgggtg 
agggtgctga cgcctgaaga tgtttttgca 
gaatacgcta actttattat atccaccggt 
aatattcagg ccttttatct ggctgtagag 

<210> 800 
<211> 1458 
<212> DNA 
<213> B.fragilis 

<400> 800 

aagaagatgg attattcaca atttctatat atgaaagagg agctgtcact gatagcagtt 60 

atcctcatcc tgtttgttgt cgacctgttt acctgtccgg accaaaaagg tgccgctcct 12 0 

aaggtgaacg tcaggtcgct caccttgccg gctgtgatcc tgatgaccct ccacaccgta 180 

atcaacctct ttcccggaac tccggcagag gctttcggcg gaatgtatca gtacacaccg 240 

atgcaaacca tcatcaaggc agtgctcaac gtaggtacca tcatcgtgct actgatggcc 300 

catgaatggt tgagacgcga agacacccgc atcaagcagg gagagttcta tgtactgaca 360 

ctctctaccc tgctgggtat gtactttatg atttctgcgg gacatttcct gatgttcttc 420 

atcggattgg aaatggcaag tatcccgatg gccgcactgg tggcattcga taaatatcgt 480 

catcactccg cagaagcagg tgccaagtac atcctgaccg cactgttctc aagtgcattg 540 

ctgctattcg gtctttcaat gatatacggt acgtccggta cgctctattt caatgacctc 600 

cccggacaca tcacaggcaa tatgcttcag attatggcat tcgtgttctt ttttgccggc 660 

atgggattca aaatctcatt ggttcctttc cacctctgga cagccgacgt atacgaagga 720 

gcgcctacag ccgtaacctc ttatttaagt gtgatttcca aaggatcggc agctttcgtg 780 



gataaattag 


tatcaggaaa 


acgggtagcc 


60 


ttgttagaga 


aacgggtatt 


ggatgccgta 


120 


cgtgccttga 


acgaatgttt 


tccgcaatcg 


180 


gtggaagcag 


aagcttttgg 


agctcgactt 


240 


tgcggacgtt 


tgctcactgg 


atatgcagat 


300 


tcgggacgca 


tgcctcagta 


tttgctggcc 


360 


cccgtgctgg 


ccggttgtat 


cggtccctat 


420 


gaaattatga 


tggctatcta 


taccgagccc 


480 


acggaattca 


ttctccgtta 


ttgtctggct 


540 


atggcggaac 


cggctgccgg 


acttctttcg 


600 


tatgtgaaac 


gtattatcga 


tgctgtccag 


660 


tgcggtaaca 


ccggacattg 


cactgctgct 


720 


ttcggaaata 


aggcggatat 


gataactgct 


780 


atgggtaatc 


tggaccctgt 


aggagtgttc 


840 


cggacagaag 


aacttctgac 


ctgtaccgga 


900 


tgcgatactc 


cgcccgaagt 


accttttgac 


960 


aagtacaata 


agggtaggtg 


a 


1011 
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ctgatgacca tcctgatgaa agtgttcgca ccgatggtgg cacaatggca ggaagtgttg 840 

ttctgggtaa ccattgcttc catcaccatt gccaacctct ttgctatccg ccaacagaac 900 

ctgaaacgtt tcatggcatt ctccgctatc tcccaagcgg gatacatcat gctgggtgtc 960 

atcggaggca gtgaaatggg aatgactgcc ctggtttatt atgtactggt ttatctggca 102 0 

gcaaacttag gtgtatttgc agtcatctca attgtggaac aacgtagcaa caaagtggag 1080 

atagacgact ataacggact gtacaagacc aatcccaaac tggcttttat catgaccctt 1140 

gccctgttct cgctggccgg tatccctccg tttgcgggtt tcttctcaaa gttcttcatt 1200 

ttcatggctg cattcaacag cggattccat ctattggtat tcattgccct gatcaataca 12 60 

gtcgtatcgc tttactacta cttactgatt gtaaaggcca tgtatatcaa tcccaatgaa 132 0 

gaaccgatcc ccactttccg cagtgataac tacaccaaag tgagtctcgc actttgtact 1380 

ttgggtatca tagctctggg tattgcaagt tgcatctatc agggaattga caagttctca 1440 
ttcggaatgg gaatgtaa 1458 

<210> 801 
<211> 381 
<212> DNA 
<213> B. fragilis 

<400> 801 

acatactatt ggattatgaa ttttacattg ttagttgtcg ttctgctgac cgcaattgcc 60 

tttgt cggtg tggtgatagc cctttcaaac gctatctcgc cgcggtcgta taatgcacaa 12 0 

aagttcgaag cgtatgaatg tggtatccct acgcgcggta aatcatggat gcagttccgt 180 

gtagggtact acctgtttgc cattctgttt ctgatgttcg atgtcgaaac agtatttctg 240 

tttccctggg ccgtcatagc ccgtgacctg ggacctcagg gattgattag tattctcttc 300 

tttttagctg agttgggtct gggcccttgc ctatgcctgg aagaaaggag cactgtaatg 3 60 
gaaataatga aaaagcctta a 3 81 

<210> 802 
<211> 198 
<212> DNA 
<213> B. fragilis 

<400> 802 

gcggaggccc ggtttcgcaa atcctatcat gtggtgaacg gagtagacaa gattctcccg 60 

gtcgatgtat atattcccgg atgccctccc cgcccggaag cattttatta cggtatgatg 12 0 

caactgcaac ggaaagtgaa gatagagaaa ttcttcggag gagtaaaccg gaaagagaaa 180 
aaacctgaag ggaaatga 198 

<210> 803 
<211> 1557 
<212> DNA 
<213> B. fragilis 

<400> 803 

gagaatacgc gcagattttc acatctgccc gttcggcaga tgcattgtct gcccgattcg 60 

gagatacata ttggttttat gcagacttta gataagatgg aaaagataaa atactgtttt 12 0 

agcatgatag ggctgctctt cttgtttgca gcttgtcaag agaaggtgac atcccctgcc 180 

agggtggata cattgccatc gatatttccc gattatgtcg gggttaccat tccctctacc 240 

attgccccgc ttaacttccg ggtgacggac gatggggtag aggcggttga tgtcgtgatt 3 00 

gccggtacga aaggaaagcc tgtacggctg aatggaagat tggtagacat tcccgccaag 3 60 

caatggcacg aacttcttga aagtaataag ggagacagta tcgaggtgaa agtctctgtc 420 

cgccagggga agaagtggaa agagtatcgt ccgtttccga tatatgtcag tcctttcccg 480 

atcgattacg ggttggtata ccgtttgctc gcacccggat atgaagtgta cagcaagatg 540 

gggatctacg aacgtgaact ttcaacattc cgccagactc ctttatttga gaatacgcag 600 

gtgacggccg cctgcatcaa ttgccatgct ttcaaccgga cggagcccac accgtcgagc 660 

gtacatgtaa gaggcgggca tggagccact gtaatcgaca caggagatcg gttggaattt 72 0 

ctcgatacca aagccgacgg gcaattgtcg gcctgtgtct atccgtactg gcatccttcg 7 80 

ggcgaataca tcgcttattc ggtgaacaaa accaatcagg cctttcatct gggaggaaag 840 

aagccgatag aggtattcga ccaggcttcg gatgtggtgg tttatcatcc ccggtcccat 9 00 
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cggatactga 
cccgacggac 
aaagacgtga 
gaccggatcg 
ccttcgttcg 
tggcacaaag 
gaggaggtga 
tttgtgttca 
gacggacagg 
tacgatcagt 
gacaagcggg 



ctactccttt 
gaacgcttta 
aatacagctt 
atacgttgat 
acggaaaata 
aggccgatct 
acagtgacga 
gcagccggag 
ggcgtatagg 
tgatctattc 
agatggccaa 



gctgagtacg 
tttctgttcg 
gtgcagcatt 
ctcagcccgc 
cctgatgttc 
ctggctgctg 
cacggaaagc 
aggtgacggg 
gaaacctttc 
gtataatgtt 
agggctgatg 



gcttcgtttg 
gccgagcaga 
gctttccatc 
acgctggaca 
acgctttccg 
gacttgaaga 
tatcacaact 
ttgtacaccc 
ctgctgcctc 
cccgagtttg 
tcgaaggagc 



aaacttttcc 
aagagatgcc 
ccgaagacgg 
aaagtatctc 
attatgggaa 
ccggaaccta 
ggagcagcaa 
gcctttatat 
agcaagatcc 
tgtctgttcc 
gagttaaagt 



ggcattctcg 
tgtccgatac 
aacgttcggc 
tttccccaaa 
cttctccatc 
tcgcaacttg 
ttcacattgg 
ctcctcggtg 
gtatacgttc 
ggtgcaatgg 
gaaataa 



960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1557 



<210> 804 
<211> 756 
<212> DNA 
<213> B.fragilis 



<400> 804 

tgccaatgca 

atgaattccc 

gatattaatc 

gatgcattgg 

gatgccgagc 

tctgttataa 

gggtacgagt 

tttcttctgg 

aaagtggaat 

tattgcggat 

tgcggggtcc 

attgcttatg 

gattgctata 



gccgtaataa 
atattgttga 
tttcaatggg 
aaacagagat 
cggccggtac 
ctccttatct 
ttgaggcttt 
atgcttacgg 
cccgaatgtt 
ggcacgtcac 
gattgagtga 
gtccatgtat 
aaaacagaaa 



tggccaagaa 
agagtgtctg 
agccggttac 
tgccggaatt 
ttgtgaggtg 
taaagatgct 
tcagcatagg 
ttcggcaatt 
tcctttggga 
gcaacagcag 
ttcttcgctg 
tgtcaaacgg 
taaactaaac 



gttgataaac 
cctttcactt 
gtgcctgatg 
tgtacacccc 
ggagtaaacg 
gcgggctatg 
ataggcagtc 
gctgaagcag 
tacggagtca 
ttgcttttca 
atgtcgccta 
aaatatggat 
agatag 



gcctgttgaa 
cgttgagact 
cggagataca 
gctttctgta 
gtatttctct 
tactctttgt 
agggagacat 
ttgtccgtga 
gtcatcctta 
gctgcttgcc 
ttaaatcggt 
gcgaactatg 



tatgccactg 
cgatccggag 
ggcaatatct 
tgctctgttc 
gaaaacagga 
tgctactgcg 
cttacgcgaa 
agtatgccgg 
cagtcccggt 
tgaatttcct 
cagtggtatt 
cggcaaagcc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

756 



<210> 805 
<211> 345 
<212> DNA 
<213> B . f ragilis 



<400> 805 

caacaaaaag 

aaaacagact 

ggtaatctca 

ctgtttgccg 

caaaaacagc 

ggaactctat 



ggaagaattt 
taatgcgtat 
aagacctttg 
ccgccgaatg 
cccgttggct 
tttttatcgg 



gaacttaaaa 
ttcatatttt 
gctacaatcg 
tctgttcagt 
gacttctatc 
agatttctta 



actgaaaagg 
gtagccatcg 
tggaccgacc 
acgtttgcac 
tccctgctga 
ataaacaagt 



agacgcttat 
tgatattggt 
tgatcatcta 
gcatccgtgc 
tttacggaat 
tatga 



gggatcaaag 
atgtctgata 
tctgatagtc 
ggtcggagag 
ccttttcctc 



60 

120 

180 

240 

300 

345 



<210> 806 
<211> 519 
<212> DNA 
<213> B.fragilis 



<400> 806 

aagaatatgg 

gccatgtcca 

ttcgtgcttt 

gtacagatca 

acgagtggag 

gtcactacga 

ccgacaagcg 

agcagtggta 



gacttacact 
tactgacagt 
tcggcacagc 
tggtctatgc 
aaggcgaccg 
ttataggtgc 
atccggaacc 
aatatggata 



tgaaacagta 
gaccacgcag 
aggtatctac 
cggaggtatc 
ggccgctcac 
aatcctggtg 
tgtagaaatc 
tgtattgcct 



gtattctact 
cgtatcgtgc 
tttctgttag 
gtagtgctct 
ctgaaacgaa 
ctcttcatta 
agtatcaaga 
tttgaagcag 



ttctggcagt 
gttcggccac 
gatacacttt 
atgtattctc 
gtaaattcct 
cactgacaca 
ccatcggaca 
tcagcattct 



gttcatcatt 
ttacctgctg 
cctcggatcg 
catcctgctg 
ggcagggctt 
caaatttgtg 
tgctttgtta 
gttgctggcc 



60 

120 

180 

240 

300 

360 

420 

480 
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tgtatcgtgg gcggattatt aattgcacgt aaaagatag 519 

<210> 807 
<211> 2799 
<212> DNA 
<213> B.fragilis 

<400> 807 

actactttta tggataacga tattgaacga tttcggcaaa tggcgtccat ggcgcaactc 60 

ggatggtggg aagctgattt cacagccggg cattatgtat gctcagaata cctctgcgat 12 0 

ttattgggac ttgaaggaaa taccatatct tttacggact tcaggaaacg ggtgcgtgag 180 

gattatcagg aacagatagt ccgggagttc aatgcttcca tccataggga gttttatgaa 240 

cagactttcc ctattcactc caaagaggga atcgtgtggt tgcacacccg tttgggggag 3 00 

cgtgaagaaa taccgggcag gggagtcgtt tcattcggta tcatgcagcg ggtagaagct 3 60 

cccaatgata cttccgagcg ggttctggag cgtgtcaacg acttgctgta caggcagaac 42 0 

tccatttccc attcactctt acgctttctc aaagatgata gtgtggacct ctgtatcatg 480 

gagatattga aggatatcct cgatcttttt catggaggac gcgtgtatat ctttgaatat 540 

gatgaatatt accgctatca ggactgtacc tacgaggtgg tggccgaagg agtgttgccg 600 

gagatcgata gcctgcaacg tatcccgact gacagtttac cttggtggag gcagcagacc 660 

ctgtcgggta aaccggtgat actggattca ttggaccagc ttccgaaaca tgcaaaagcg 72 0 

gaatatgcga tcctgagccg ccagaacatc aagtcactga tgatcactcc gctgatagcc 7 80 

ggcgaacatg tatgggggta tatggggatc gatctggtga agaattatcg caactggaat 840 

aatgaagact tccaatggtt atcgtctctt gccaatatca tcagcatctg tatcgagctg 900 

cgtaaagcga aagacgaagc tgtgcgcgaa cgttcttttt tgcgtaatct gttccgcttc 960 

atgccgatgg ggtatatacg tatgactatg gtccgggatg ctgccggact accttgtgat 102 0 

taccggatag ccgatgccaa tgatttgagt tcggaactca taggaatgcc tctttccgat 1080 

tacgtgggat gccttgccag cgagttgcat gcggacttta aggccaaggt ggattatctt 1140 

ctcgatgtga tggagggcag tgtgcacaaa gagactgatg tctacttcca tcgcacccag 1200 

cgcagttccc attgcatcgt gtattctccg gaaaaggacg aggtggtcgc tctgtttctg 1260 

gactctacgg agacgattcg tgcccatagg gctttagatc gcagtgagaa gcttttcaag 1320 

aatatctttg ccaatattcc cgcgggagtg gagatttacg ataaggatgg caatttgctc 13 80 

gacttgaaca actgggatat ggaaaccttt ggtgtaaaag ataaagccga tgtaatggga 1440 

gtcaacttct ttgagaatcc gaatgtgcct cttgaaatca gagaacgggt acggaacgaa 1500 

gacctggtcg atttcagact gaactactct tttaataagg cttccgatta ctaccattcc 1560 

gataaaagta atataatcga gctgtataca aaggtcagta aactctttga cagccaaggg 162 0 

aacttcaacg gctatgtgct gatcaacatc gataatacgg agcgtatcga tgccattaac 1680 

cgtatccgtg attttgagaa cttcttcctt ttgatatcgg actatgcaaa agtaggttat 1740 

gccaaactga acctgctgag taaacgtggc tatgccatca aacagtggtt taagaatatg 1800 

ggtgagacgg aggatattcc gctttcatcc gttgtgggcg tttatgataa gatgcatcct 1860 

gaagaccggc agaaggtctt tgacttttac gagaaggtat tggcgggtga agagaaggac 192 0 

ttccgtagcg aaatgcgtat cctgaaaccc ggcgctacca acgagtggaa ctgggtacgg 1980 

atgaatgtgg tagtaaccaa gtttgaaccg gagcatgggg aggtggagat tatcggcatt 2 040 

aattatgata ttacggaact gaaggagacg gaagccatgc ttatcgaggc gaaagagaaa 2100 

gcggaaaaca tggatcggct gaagagcgct ttcctggcga atatgagtca cgagatacgt 2160 

acaccactca atgcgattgt cggtttctcg ggcctcttgg tcgatacgga agacatggag 222 0 

gaacgctgcg aatacatcaa gatcgtacaa gagaataatg acttgctgct gcagctgatc 22 80 

tcggacat cc tggacttatc gaagattgag gccggtacgt ttgagttcac ctacggggag 2340 

acggatgtga atatgctttg tgaagatatc gttcgcagct ctcagataaa ggttccccag 2400 

ggagttgaat tagtattcga tccgcatcct tcggattgca ctgtgataag tgatcggaac 2460 

cggttgcatc aggtcatctc caatttcgtg aacaatgccc tgaagtttac ctcctcgggc 2520 

agcatccatg tgggatatga aaagaaggaa gagggtgtgg agttttatgt aagcgacacg 2 580 

ggaatcggaa tctctaaaga gcaactgacg catatctttg aacgctttgt gaagctgaac 2 640 

agctttatcc acggaaccgg gctcggactc tccatctgta aaagtattgt ggagcagctg 2700 

ggcggcgtca taggagtgga ctcggaagaa gggaaagggt cccgtttctg gttcaccatt 2 760 

ccctatatta acagcgaaca gtcaatcgtt aacgattga 2799 

<210> 808 
<211> 558 
<212> DNA 



325 



<213> B.fragilis 
<400> 808 

ataaagtcta tcccgtatga agacttcatc gacaacgaat cgttggaaaa gatggtcaaa 60 

gaactcaatg aaggcggtgc aaacgtcctt gtgggagtac ttgacgatct tatcaactgg 12 0 

ggacgcagga actcgctggg gccacttact ttcgcaacca gttgttgcgg tatcgaattc 180 

atggcactgg gtgccgcgcg ttacgacagg gcccgcttcg ggtttgaagt agcccgtgcc 240 

agtccgcgcc aagccgacat gatcatggta tgcggcacca ttaccaacaa aatggctccg 3 00 

gtactgaaac gtctgtatga tcagatggca gatcccaaat atgtaattgc cgtaggagga 360 

tgtgcagtaa gcggaggccc ggtttcgcaa atcctatcat gtggtgaacg gagtagacaa 420 

gattctcccg gtcgatgtat atattcccgg atgccctccc cgcccggaag cattttatta 480 

cggtatgatg caactgcaac ggaaagtgaa gatagagaaa ttcttcggag gagtaaaccg 540 

gaaagagaaa aaacctga 55 8 

<210> 809 
<211> 3216 
<212> DNA 
<213> B.fragilis 

<400> 809 

accttatgta taactttaaa taagaaacga atgaaaaaaa tttcaatctt attcatgttg 60 

ttgcttggca ttactacatt atatgcacag caattgaaca ttacgggtac tgtgattgac 12 0 

aaaaagctca atgagccaat catcggtgcc acagtccaag taaaagggac gaacaatgga 180 

tccatcacgg acatggaagg taagttttct ctaaaaaacg ttagcaaagg aggtatactg 240 

actgtttcct acataggtta caccactcag tcaattcctc tcaatggtac acaaacatcc 300 

ttcaggattg agttaagtga agattcaaaa actcttgatg aagtagtggt agtaggcttc 3 60 

ggtactcaga aaaaagtaaa tctgaccgga gcggttacaa gtgtagatac caaagcacta 42 0 

gcatcacgtc cggtatcaca agtcggtcaa gccctgcaag gtgtagttcc aggcttaaat 480 

ctatcgactc ctgatttagg gggacagttg ggacaaacaa tgaacgtaaa catccgcgga 540 

acaggaacca ttggtaaagg atcaagtgcc tcaccactta tactgattga tggaatggaa 600 

ggcaatatga ataatctgaa tccagaagat attgaaaata tctctgtctt gaaagatgct 660 

gcttcttctt ccatctatgg ttcacgtgct gcattcggtg ttatcttaat cactacaaag 720 

aaaggaaaag cgggcaaaat gcaagtgaac tataacaata gtttccgcta ttccggacca 780 

accagccttc ctaatcaact tgattcctat cgttttgcca attatttcaa tgatgcagcc 840 

attaatcaag gaggaagtgt gatctttgat gaagagacca ttgaccgtat ccaaaagtat 900 

atggcaggcg agattacaac caccaccata gctaacggta ccaactggca tttccacgaa 960 

aaagcaaatg ataacgtaaa ctggtggaaa aaacattttc aatgggcctg gtcaaacgaa 102 0 

cataatatca gtttaaatgg aggaacagag aagttacaat actatgtttc agggagctac 1080 

ttaaaccaag atggtaatct tcgttatgga aatgataatt ataaacgtta caacgcaaca 1140 

gcaaaagtca atacccaaat caacaaatat gtagatttca acattaatac caaatttgtt 1200 

cgttttgatc ttgacaatcc agtatatctt gaggaaggtg gacttcttta tcatgacatt 12 60 

gcacgtatgt ggcctatgat gccttttaaa gatccgaacg gttattatat gagaaatgga 132 0 

aaactcaatc aattgactga cggtggacgt gccaaaacac ataatgacaa tatttatctt 13 8 0 

cagggacaat tagttattca tccgctaaaa ggatggaata tctatgcaga agcaggtatg 1440 

agagtcatca accaaaataa gcaaaccaac cttaatccaa tctatgagca cgacgtaaac 150 0 

ggtaatccat tagcattggc tttcagcgga agttactcac caggatcttc atttgcacgt 1560 

tcagcatacc acaatagtaa cttttatacg acaagtgtgt acaccgatta cacattacaa 162 0 

ataaaagatc attatttcaa agctttagtc ggaatgaata ccgaagaata tgtatatcgc 1680 

gaacttgccg cacaacgtcc tgacgtgatt agtagtctca ttccagaaat tagtgcagca 1740 

acgggagaag ataaaatcaa tagttcaaaa tacaatgatt ggtctacagc cgggttcttc 1800 

ggacgtctca actacagtta caaagaccgc tacatggctg aagtaaatgt tcgttacgat 1860 

ggatcatccc gctttttaaa agatcaacgt tggaatgtat ttccttcttt ctctttggga 192 0 

tggaacttag cacgtgaatc attctttgaa ccaattaaca acattattaa tacactaaaa 1980 

ccccgcgtat catggggtat gctcggtaac cagaacacag actcttacta tccgttctat 2 040 

ttaacacaaa gtgtaacagc caatggtggc aattggctaa tggacggcag tagaccaaca 210 0 

acagccggag ttcctggaat ggtcagcagt acactcacat gggaaaaaat ctataatacc 2160 

aatttaggca tcgaccttgg tatgttcaac aatcgtctga acatgacttt tgaatacttc 222 0 

atacgtagaa cgaaagacat ggtaggccct gcagccgaag tcggtgcaat attaggaact 22 80 

gctctgccaa ataccaataa tgctgagttg aaaaataaag gatgggaact acaggccaat 2340 
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tggagagata atattggaaa agttaactat aatataggat ttaacctttc tgacaaccgc 2400 

gccaaagtaa tttcatatcc aaacgcttct aaagccctat gggattctaa tggaaatact 2460 

ctttattaca acggaatgac tatcggggaa atttgggggt atgaaactga aggtattgcc 252 0 

caaacagacg cacagatgac cgaatggctg gctagcaatg atcagagtaa aataggttca 2 580 

gtttggggtg caggtgatat catgtatcga gaccttaatg gtgatggtat agtagacaaa 2 640 

ggaaacagta ctgccacaga ccatggtgat ttaaagaaaa tcggaaatag cactccacgc 2700 

cttcgtttcg gtttaagctt aggagctgac tggaagggtt tcgatattca aatgtttttc 2760 

caaggagtca tgaaacgtga tttatggttg agcggaccaa tgttctgggg agcagatgga 2 820 

ggagaatggc aatcagtagg ttttgacgaa catcttgatt atttccgtcc tgaaaataca 2880 

acttctatat tcggagcaaa tttgaactcc tactatccca aagcctactt aggagacaaa 2940 

ggaaacaaaa acaagcaaac tcaaacgcgt tatctgcaaa atggtgctta catgcgtatg 3 000 

aaaaatctgc aaataggata tacattcccc aaagcttgga tgaataaagc aaaaattgaa 3 060 

aagctccgca tttatgtcag tggagagaat ttattcacaa tcagtggtat tgccgatatg 312 0 

ttcgatccag aagcaacagc cggtaacgga tttagcaacg gaaagactta tccgctgtca 3180 

aagactattt catttggatt aaatattact ctttaa 3216 

<210> 810 
<211> 2085 
<212> DNA 
<213> B.fragilis 

<400> 810 

ttcatacgca tgaaaagata tttcatcata agtttgctta ctttggcaag tacggtcgct 60 

cctttgacgg ctgtatttgc ccaaagttct tttatctacg aaaaaggtaa atcgtttaaa 12 0 

gatgtaaacg cctctccaat gcctcagacc atccgcctgg acagaacggc agaaccggtc 180 

atttatgaga atgcagttcc tgagaatgca accactatat gctaccgcat ccaactgccg 240 

tcttatgtac gggggacatt cttcagtcgg gattcccgcc ccggagatta cgaatggccc 3 00 

aacaatacca atcgtctctt accttggatg ttcaatcatc tgacagacct tacccgggac 3 60 

gactatccgg gtattccttc caacgcacgt ccttctacac tgggagacgc tttattgttg 42 0 

caactgaccg atggaagcta tctattcacc aaagcaatag cgggtgataa cagcctcagc 480 

tggtttcagg taaataccga cggctcgctc aatttatatg tatcgacatt gggaaccgac 540 

cggctcgaac acaaagtacc tgtagcactg gttcaaagtg ccggcaacat ctatcaggta 600 

ttccagcagg cttacgaaac cctgatatcc gaccggaacg tatcggccct gcaaaagcgc 660 

acggaaaaga actattttga ggctctgaac tatctgggat ggtgtacttg ggaacattac 720 

catttcgata ttgatgaaac aaaaatcctg aatgacctgg atgccatcga aacctccgga 7 80 

gttcctgtac gttacgtact gatcgacgat ggtcacctgg ccaacaagaa tcgtcaactg 840 

acaagtttta cccccga.tcc tcaacgtttc ccgaacggat gggctccgat catggcacac 900 

aaaaacaaag ataaaatacg ttggatagga ttgtggtatg ccctctccgg atattggatg 960 

ggaatctccc ccgataatga ttttccaacc catgtaaaaa acagcctcta ttctttcaat 1020 

ggaagtcttt tgcccggtaa aagcaccccg aatatcgaca cgttctacca gtattatgtt 1080 

cactctctga aaacccatgg attcgatttt cttaaagtag acaatcaggc attcacctta 1140 

ccgctttaca tgggctctac tgaagtcgta cgtcaggcga aagagtgtaa tctggcattg 12 0 0 

gaaaagcaaa ctcacgcaca gcaggtggga ctgatgaact gcatggctca aaacgtactt 12 6 0 

aacacggacc acaccctgca tagcggagtt gcccgtgtca gcattgacta taaaaaatac 132 0 

aatgagaaca tggcaaagtc gcatctcttc cagtcataca ccaacacatt actgcaaggg 1380 

caaaccgtat ggccggatca cgatatgttt cattccagcg atacgatctg tggcagtttg 1440 

atggctcgtt ccaaggctat ttcaggcggg ccggtctacc tgtccgattc tccgaaagaa 1500 

tttgtaaaag agaatatttt cccactgatc gataaagagg gcaaaatatt ccgcccggaa 1560 

gcccctgcca ttccgacccc ggaatcggta ctgaccaatc cactgcaaga cggaaaggca 162 0 

taccgggtat tcgctcctac cggtgacgag gctgtatccg tcatttgtta taacctcaac 1680 

acctcaccca aacaccggaa agtaaccgcc gaaatagacc cgaaagatta tctgttacgc 1740 

gaaacactga ccggcaaacc aacacctcaa caaaaacgag tgattctatt cgactggaat 18 0 0 

aatcagacag ccactgaact gaccggtaaa cagactgtag aattggatgg ctttaccgac 186 0 

cgtctattcc atctctgtcc gatccatgac ggatgggcgg ttatcgggat acaggaaaaa 192 0 

tatctgtcac ctgcggccgt ccggattcta tcttcgacac cggacaaatt ggttctcaat 198 0 

gtattgtctc cgggaactct gaaaatatgg acagagaact ccggaaaaca agaactgaga 2 040 

aacattcagg taaaggaaac cggaaaaatg accatcagaa aataa 2 085 



<210> 811 
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<211> 1464 

<212> DNA 

<213> B. fragilis 



<400> 811 

ctttacagca ttatgaaaaa tatcatccct caagcactgc ttaccatgcc tattttgagc 60 

actggactac aagcacaaga aaagcaaccg actcccaatc tagtcttcat catggccgac 12 0 

caatatcgtg gagatgccat cggttgcatc ggtaaagaac ctgtaaagac tcctcacctg 180 

gacaagcttg cctccgaagg aattaacttc accaatgcta tcagtagtta tccggtatca 240 

tcgccggcaa gaggaatgct aatgaccggt atgtatccca ttggcagtaa agtaaccggg 300 

aactgtaact ccgaaaccgc tccttacgga gtggaacttt cccaaaacgc ccgctgttgg 3 60 

agcgatgtgc ttaaagatca gggatacaat atgggataca tcggaaagtg gcatctggat 42 0 

gcaccctaca agccctatgt agacacttac aataatcgcg ggaaagtggc atggaacgaa 480 

tggtgtccac ccgaacgtcg ccacggtttc gaccattgga tagcttatgg aacatatgat 540 

taccatttga aaccgatgta ctggaatacc actgctccac gagacagctt ctattatgtc 600 

aaccaatggg ggccggaata cgaggcaagc aaagctatcg aatacatcaa caaacagaaa 660 

gaccaaaaac aaccgtttgc attggtggta tcgatgaatc ctccacacac gggatatgaa 720 

ttggtgccgg accgatataa agagatatac aaagatctgg atgtagaggc gctttgcaaa 7 80 

ggacgtcccg atatcccggc caaaggtacg gaaatgggag actacttccg aaataacatc 840 

cggaactatt atgcctgcat caccggtgta gacgaaaatg tagggcgaat catcgaggcc 900 

cttaaacaaa ataatttatt tgataatacg atcgtggtct ttacctctga ccatggaatc 960 

tgtatgggtg ctcacgaaaa tgccggaaaa gatatcttct atgaagagtc tatgcgtatc 102 0 

cccatgattc tatcttggcc ggatcaaata aaaccacgta aaagcgaccc gttgatgatt 1080 

gcttttgccg acctataccc cacactcctg tcaatgatgg gattcagtaa agaaatcccg 1140 

gaaacagtac agacattcga cctgtccaat gaagtactga ccggaaaaaa caaaaaagat 1200 

cttgtacaac catactattt cgtaaaattc gataaccatg caacaggtta tcgcggactc 12 60 

cgtaccgacc gatatacata tgccgtacac gcaacagacg gaaagatcga taatgtcatt 1320 

cttttcgacc gtaccaatga tcctcatgaa atgaataaca ttgccagcca acaattgaaa 1380 

cttacccata catttaaccg gcaactgaaa acatggcttg aaaagaccaa tgacccattt 1440 

gcccaatata taaaacttaa ataa 1464 



<210> 812 
<211> 387 
<212> DNA 
<213> B. fragilis 



<400> 812 

gaaataggaa tgaaacaaaa cttaaaatat tatataataa tagtgcttgc cgtgctgctt 60 

cattcggtaa cgatgaaagc ggcaaacacc tcttatataa tagaagatcc ggaccaggaa 12 0 

gaatgtttca tttcgcaagc cactcctgca agccggaata tcctggaacg ctttcatttc 180 

tattgtacca ttatgccctg tgaaatgggg catgcagata tttctcatgt accaacggac 2 40 

aaaagtttta tccgtcctga aatgatcttt cataaataca gaatgagaaa taatcctttt 3 00 

tctgtccatt caaatcactc acatacatac aatccgtctg atccactgac ctactatgtc 3 60 

tacggattaa ggaaaatcat catttaa 3 87 



<210> 813 
<211> 318 
<212> DNA 
<213> B. fragilis 



<400> 813 

gacgatatga tgatacacat ggaatattac ctggtggttt ctaccatcat gatgtttgcg 60 

ggaatatacg ggttcttcac ccgccgcaac acacttgcta tcctcatctc tgtagaactg 120 

atgctgaatg ctacagatat caactttgcc gtatttaacc gtttcctttt tcccggagag 180 

ctcgaagggt atttctttgc cctgttctcc attgccatct cggcagcgga aacggctatc 240 

gctatcgcca tcatgattaa tatttaccgg aatatacgta gtattcaggt aaagaatctg 300 

gatgaattaa agtggtaa 318 



<210> 814 
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<211> 4143 

<212> DNA 

<213> B. fragilis 

<400> 814 

atctcttttg tatgtttagg ctttatttcg ttcttttgca ataaattgat tattaatatg 60 

aagaagctaa atctcttttt attggttttg tttatatgta attgtccggt cgtttctggt 120 

tatgcttttt ttgatagaga tattcgtctg ttaaccatgc aggatgggct ggcggataat 180 

actattacat ctatctacaa agatcgggat ggctttatgt ggtttggtac taataatggc 2 40 

ttgagccgtt atgatggtaa attaataaaa aacttctctt cttcaccagc gtatatgtat 3 00 

gtttccgaaa ttgtagagat gtcagatcga tatttgggag ttatcgctgg aaatacttta 3 60 

tattgttttg ctcggtcgct ggagaaattt ataccgatcg tccatgcaac ggattatagt 420 

tctgtacatg tctctcactt attacctata gataataact ctttttgggg actgtcaggg 480 

aataaattat atctatatac acaggaagaa gttaaaaatg agaaaggaga ggttgttcag 540 

attaaattga aatgtgagaa acagtataaa gatttgattg attctggtga taatttctgt 600 

gcaatgtgtt atactgataa tcatgaaatg ttatgtttgg ttacacagca aggaaatttg 660 

ctattgtttc agcctgaatc ttcggagaaa tctaaaaaga tatctttgtg gaaaaataaa 72 0 

acttgggatg caacttcggt attatatgat aaaggagtgg tatgggtttc tactattgga 780 

cacggtattc tgcgttatta cgtttcttct gggtatatag acagaattac ttataaggaa 840 

aataataaag aaaacagtct atcccataca gatgtttttc aagttattcc aattaataat 900 

aatcgttatc ttgcagtgac ttggagtggg tatactttat tatttcagga taagaatgat 960 

ccgaaaagaa tgatgacaga aatatactat aatacagctt cacaacttca ccgcaactta 102 0 

gaaacaagaa tgatttcagc gtattacgac cccagtggga ttgtttggat aggtactaat 1080 

gggggaggag tgatttattc tgatctacgg tcacaatttt ataaccaatt tcatcaagag 1140 

aggcataatg aaatttgtgg tatagtcatg gataatagaa aatatgtttg gatggctacg 1200 

tttcatcaag ggattatgaa aagtgagcaa ccttttgaac caggaagacg aatgaatttt 1260 

actagggttg gtactccgga tattcaaagt aaaaatacag ttctttgtgc cattaatgat 1320 

aatagaggtt cactttggtt tggaaatagg gatggaacat taacttcata taatgaggca 1380 

acaaaacaat ttcgattaca ttttttacaa gatagaggta aagtgaatac tgtgtcaatt 1440 

tgggcattat attgggatac taatcgaaat ttatgggtag gtactaatga tggagtttgg 1500 

aaattgaata tagattctgg attttgcaaa aaaatcccta ttgagatttt gtttaaggac 1560 

cctactccta tttgtatacg agctattgcc ggcacgaagg acggaactat atggttaggt 1620 

acaagtaatg caggagtttg caaattgaaa attgattcta gaggagagat gtctttagag 1680 

acaggctatg agaagaaagc gaatatcaaa aataattcgg ttcgttcttt gttagtatct 1740 

tctgatggta atgtatatgt aggttatatg gatggtttcg ctattctttc acctaaaaag 1800 

gatgcaatac gtgagtatta tacaactaga aatggattat gtagtaattt tataggatgt 1860 

ctggtcgaag ataaccgagg acatatttgg ttgggaagta attcgggagt ctctcgttac 192 0 

agtaggcatc agcacctttt ttataattat tatataagtg gaagcaatcg ttcggcatta 1980 

cttgctgata atacactatt ttttggcaat aataaatcgc tcacttattt tgatccggat 2040 

gacgtgggtg gtcatttgga tgaagatcag gttcttatta ctggacttga ggtagatggt 2100 

cgtcctgtag ggattgggga taaaataaat gggcagactg tattggcaga aggcatttca 2160 

tatactagtt cgattacttt gaataatgaa aatcgtgact ttgttttatc ttttaataat 222 0 

ctttcttatg cagaggaaca acagaagtac aattaccgct tattaccata tcagacgcat 22 80 

tggttggttt ctaatgatgg agagaaggct acttatatga acttacccga aggggattat 2340 

acatttgaag tgaagaatat ttatcctgac gggaaagatg gaaaggttac atcactccaa 2400 

atacatattc taccgcattg gagtcgtaca ttgcctttcc gattatttat tttactgtta 2460 

ttggccggtg gtgtggctta tttgattcgt cttgtcaaac atcgtcagat gcgtatggaa 252 0 

cgtgaaatgc gcatggaaca tgaacttctg tcagtaaact tagagcgtga gaaagagcga 2580 

caaatccgga tggagcgtga gaactttttt acaagtgcgg cacatgaact acgtacgccg 2 640 

ctaaccttga ttcttgcccc attacaggaa ttattggaac acataaaggc atccgatcca 2700 

ctgtatagca agctatatac catgtataaa aacagctcct cgctacatac actggtcgat 2760 

cagttgctct atgtacaaaa aatagaggcc gggatggtga aactgcgttt gtcagaagcg 2 820 

gatattgtgg agctagtgag agaagtagca gagtcttttc gccaaatggc agggataaaa 2 880 

ggatgtacat ttcaggtaag acttccggaa gatcctgttt tcctatggat agatacggag 2 940 

aaaataactt cgtcggtcgg aaatctacta tctaatgctt ttaaatacac ttctcccaat 3000 

ggagaggtat tgctcactct tacccgtatg gaacaggatg gaaagccttt ttgccagata 3 060 

acagtatcgg atacgggtga gggaataccg gatgagtttc agaagcgcat ttttgactct 3120 

ttcattacgg gtgataattc accggctttc tctactaaag taggcattgg actgcggatt 3180 

gtgaaaaata cgatggatct gcatcatgga caggtcattc ttgatagtga gccgggaaaa 3240 
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ggttctacat 
gaaatagtag 
gaaaaatcgg 
gtagatgtcc 
gctgatggtg 
gatgtaatga 
gagaccgctc 
caaggatctt 
aaagcaaagg 
gcattgatgt 
aaactcattc 
gccgaacaac 
tt atctgtgg 
gagaatcgtt 
tt aaggaagc 
tga 



tcgtattatt 
attatcgcgg 
aagaaggagt 
gtcagtatat 
aggaaggggt 
tgccggttaa 
atattcctat 
atagtggggc 
tagagaacct 
tgaaacgaga 
acgtggttga 
ttcacatgag 
tcgatatgat 
actccattca 
actttacgga 



gataccggaa 
gcatgaaacg 
tccggtcaca 
tcgctctttg 
ccggattgct 
agatgggttt 
cctgatgttg 
ggatgactat 
gattcttcag 
atcggttgaa 
gaagaatctg 
ccaacctact 
ccggagtgtg 
ggagatttcc 
acaatttggt 



ggtaaatctc 
gaaccgcagt 
aagaaaacat 
tttgtgacaa 
accaatgaga 
gcctgttgcc 
acggccaagg 
atgatgaagc 
cgcgaacgtc 
gatgaagagg 
tctaatgaga 
ttataccgga 
cgggtgagta 
gaaaaagtag 
gtgccccctt 



actttactgg 
ttcaacctct 
tgctgattgt 
aatacacggt 
tacccgatct 
gggagatacg 
cagaagatgc 
ctttcaatcc 
tgaaacgcat 
cagatgacga 
acttcaacgt 
aggtaaagca 
aggctgcttc 
gattcagtga 
caaaatatat 



tgatttatat 
atctgtacag 
tgaagataat 
acttgaagcg 
gattatctcg 
tgaacggcaa 
agatgtattg 
ggaagtattg 
ttataccaaa 
atttatacaa 
taagatgttg 
acgcagtgag 
gttgattatg 
tgcccggaca 
ggagaataaa 



3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4143 



<210> 815 
<211> 1266 
<212> DNA 
<213> B.fragilis 



<400> 815 

aatgtaaacg 

gcaggtagta 

gtggaagcgc 

tttcagatgt 

ggtataggtg 

accccttggg 

ggagatgtat 

aaaggatgtt 

gacccggaag 

tgtgctgagg 

acggctctgg 

cgtagtgtgg 

tttgagaaag 

gataagatag 

tgctctatag 

atacatcaac 

ctacccggat 

gacatggaat 

ggggtagata 

atggggcagt 

gttcaggcca 

tcttga 



tcatgaacag 
taccggttga 
taagaaacta 
taggagagat 
gacccaaaga 
ggcaacgggt 
atgtgtatgc 
atttcattaa 
acaatgtaga 
cagacaaggc 
gggatgttgc 
tagaatggta 
agatcgacat 
atgtggtgtt 
ataccttccg 
atactacctg 
tgatcgaagc 
ccagaagatt 
cgcaaaagat 
gtgagatatt 
atgttccggt 



atcaagagat 
tttcgggtct 
ttatggactg 
agatgcggaa 
tatcttcgat 
gttggtgcct 
cggtggggat 
tgctattgag 
agagttcggg 
atatcagacc 
ttttgttccc 
tatgtctact 
tgccattgcc 
gacatgtggg 
tgagctttgg 
gaaaatcttt 
cggatttgat 
gaaagaggaa 
actgcctttc 
gggccgtgac 
agacaatgta 



aaagtgcgtt 
acagcagtca 
gcaccccgtc 
ttggccgaaa 
ttggatacga 
gaagcaatgg 
caaaattatc 
cgtcagcagc 
ctattgacag 
ggcagagctg 
ggtatgggat 
gctatgcggc 
aattatgaaa 
accgatttcg 
ttaccacact 
aagcattcct 
attatcaatc 
ttcggcagtc 
ggtactcccg 
ggaggttttg 
gttgcgatgt 



gtgcactcaa 
cgggtatcca 
cggtgaagat 
agatcggagt 
ctcgtatgca 
atttaactcc 
cccccagtgc 
ccattgaaga 
agaatgatct 
ttgttgccag 
tgaagcagcc 
aggactattt 
aactctgggc 
gttcccagga 
atcgacggat 
gtggagctat 
cggttcagat 
aattgacctt 
atgagatacg 
ttttcaatgc 
tcgatgctct 



tcatcagaat 
ttgccgtatt 
tgtagatgct 
agactgtata 
cgaacagacg 
tgatatgcgg 
cgtgatgccc 
agatcgtttg 
ggcttattac 
tttcggggga 
caaggggatt 
gcatcaggta 
tgcattagga 
atcacagttt 
gaatgattgg 
tatcccgatt 
taatgccaaa 
ttggggcggt 
tcgccatgta 
tgtccataat 
aaaggatatc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1266 



<210> 816 
<211> 1155 
<212> DNA 
<213> B.fragilis 



<400> 816 

tccgggatgg 

agatatattt 

gatgaagtgg 

cagggcattg 

gctgttaatt 

tccggagaaa 

actttaggga 



taatgaatac 
tattcgtgtg 
aatatgctcc 
atcaggctaa 
caattctggt 
ttacttttcc 
gtggaaattc 



attcgtaaca 
ggtgatgctg 
tttggcggtt 
tcttgcacag 
gaatgatgtg 
gattccaaga 
tacagtgaca 



tttaaaattg 
ggctgtgggc 
acaagggtat 
tacatcatag 
caggttgatt 
gtgattccgg 
gctccgatat 



aagatgcaat 
tttttgcatc 
ctaccgtact 
tgcaaggtac 
tgaaagacgc 
gagaaataaa 
cagtgtttat 



gaaaacattt 
atgtgaggac 
tgaccgtgag 
gggactgaat 
atacatcact 
taatctgata 
ccccgaattg 



60 

120 

180 

240 

300 

360 

420 
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gaagtgaatg gaatgttcaa tgaatttaca ccggccggtg atacaatgaa agtagtcggc 480 

gattatttcg atctttatga gataacgacc gaatcgggac aactgttctt tggtggtaaa 540 

gaagtgaaaa ttacaaaatc aacgggcaac agcttgagtt ttgtattgcc ggaagatgct 600 

gtaatgggat caaaaattaa attagtcagt ccggtttgtg gagaggtaac ggttccaggt 660 

aaatatatgg aaaaaggtaa catgctgtgt gactttgatc cgtttaccgg ttggggaggt 72 0 

agtaaatatg tgatagatgg tcctgtgcct gctccgtaca gtggatactt ctcccgtttc 7 80 

aagatcaata aaggggatgc gaacgattgg gactggaacg aggtgactac tattgcacag 840 

tgtgctgtcg aatattctcc ggaggttatt gccgatcaaa ataaatattt gctgaagttt 900 

gaagtaaata caatcaaacc attgactaaa aggcagattc gtttctattt ttcacagatc 960 

aattacgatt gggaaccttt tgcatcggga cttgctctga atacaaatgg agaatggaaa 102 0 

accgtttcta ttgatctggg agagatgtgg aaaggagata ttcctaatga tggagtcctg 1080 

cagattatgg gtaatagttg ggcggaagat acagatatct gttttgataa tttccgtatc 1140 

gttcccaaag attaa 1155 

<210> 817 
<211> 2061 
<212> DNA 
<213> B.fragilis 

<400> 817 

gtccataagg ttgtcagtcc gcctcccttt ccactcttac ttattaaatt tccaatcaaa 60 

aagatgctcc ttccgctttc ttttactaaa tttgcattta aatttgcccg aatcatgaag 12 0 

aatgaaccaa catatagctt gctaaacgcc atcaattatc ccaaagacct gcgccaactg 180 

agcgtagatc aattgccgga ggtatgcgag gaattaaggc aggacatcat taaggaacta 240 

tcgtgcaacc cgggacactt cgctgccagc ctcggtgtgg tagaactgac tgtagcactg 3 00 

cactatgtgt acaacactcc ttatgatcgt attgtctggg atgtgggaca tcaggcctac 3 60 

ggacacaaga tactgaccgg acggcgtgaa gctttctcta ccaaccgtaa actaggcggt 420 

atccgtcctt ttccctcacc ggaagagagt gaatatgaca cattcacttg cggtcatgcc 480 

tccaactcca tctcggcagc gttgggtatg gcagtggcag ccgagagaaa aggagaaaaa 540 

gaccgccatg tagtagccgt tatcggtgac ggatccatga gcggaggact tgctttcgaa 600 

ggattgaaca atgcttcatc gactgcgaac aacctgctga tcatactcaa tgataatgac 660 

atggccatcg accgcagcgt aggcggcatg aaacaatatc tgttcaatct cactacttcg 72 0 

aaccgataca accaactgcg tttcaagaca tcccgcctgt tattcaaaat gggattactc 780 

aatgaagaac gtcggaaggc cttgataaga ttgggaaaca gcctgaaatc tctggcagcc 840 

caacagcaga atatcttcga aggaatgaat atccgatact tcggtcccat cgacggacac 900 

gatgtaaaaa acatagcccg tatcctgcat gatattaaag atatgcaggg accaaagatt 960 

ctacacctcc acaccatcaa aggaaaggga tttggtccgg cagaaaaaca ggctactata 1020 

tggcatgccc cgggtaagtt cgatccggta acaggaaaac gtattgtagc caatacggac 1080 

gggatgcctc ccctgtttca ggatgtattc gggcatacgc tggtagaact ggcggaaaag 1140 

aacaaacgga tcatgggagt cacccctgcc atgccgagcg gctgctccat gaacatgctg 12 00 

atggatcgta tgccggatcg cgcctttgac gtaggcattg ccgaaggaca tgccgtgacc 12 60 

ttctccggag gtatggcaaa agacggatta ctgcccttct gcaacatcta ttcctcgttt 132 0 

atgcagcggg cttacgataa cattatccat gacgtagcga tacaaaaact aaatgtagta 13 80 

ttctgtcttg accgcgccgg actggtaggt gaagacggtc ctacgcacca cggtgtgttc 1440 

gacatggctt atctacgccc gatccccaac ctgactatct cgtcaccgat ggacgaacat 1500 

gagttgcggc gcttgatgta tactgcccaa ttgcccgaca aagggccttt tgccatccgt 1560 

tatccgcgcg ggcggggttc gttggtggac tgggaatgtc cgttggaaga gattccggtg 162 0 

ggaaaaggac ggaaactaaa ggacggaaac gatctggcag taattacaat cggccctatc 168 0 

ggcaagttgg ctgcccgtgc catcgaacgt gctgaagcag ataccggcat ttccgtagcg 1740 

cattatgacc ttcgtttcct caagccgctc gatgaagagc tactgcacga agtcggcaaa 1800 

aagttccgcc atatcgtaac gatagaagat ggaatcatta aaggaggtat gggatgcgcc 1860 

atactcgaat ttatggccga taacggatat tatcccgaaa tcaggcgcat cggtgtaccg 192 0 

gatcagttca ttgaacacgg atcggtgcag caactctacc acttgtgcgg gatggatgaa 1980 

gaaggaattt acaaggtaat tactaaaaac gaattacgaa tggatgctcc tgtggaaagc 2 040 

tgcatggcta cccattctta a 2061 

<210> 818 
<211> 1539 
<212> DNA 
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<213> B.fragilis 
<400> 818 

agcatcaaat taaatagcag ttattattgg ataaaatcaa ataatgtgta tttttggtcc 6 0 

gaaattaaac ctgaaaattt taaatcaaat agtatgagta cactccaaaa tgcaatgggg 120 

aaaatgacaa actacagatg gacgatttgc gccatgttat ttttcgcaac aactataaac 180 

taccttgatc gccaagtact atcgctgacc tgggacgaat ttatcaaacc cgaatttcat 240 

tggaacgagt cacattatgg catcattact gctgtctttt ctattgtata tgccatttgt 3 00 

atgctgtttg ctggccggtt tatcgactgg atgggaacaa agaaaggtta cctttggtcc 360 

atcggtatat ggtcggccgg tgcctgcctt cacgctttct gtggaattat aaccgaagaa 42 0 

tatgtaggaa tgcatagcgc agccgaacta atcgctgcta ccggtgatgt agtagtggta 480 

cttgccacca taagcatgta ttgtttttta gtcgcacgct gtattttagc actcggtgaa 540 

gccggcaatt ttccggctgc cattaaagtt accgccgaat atttcccgaa aaaagaccgg 600 

gcttacgcta cttccatttt taatgccgga gcttctatcg gtgccctgat tgcccctctc 660 

agcattccat tactggctaa agcctgggga tgggaaatgg cattcgtcat catcggtgct 720 

cttggcttcg tgtggatggg attttgggta ttcatgtaca cagctccctc taaaaacaaa 780 

tttgtaaact cagccgaact cgaatatatc gagcaagaca aacatgaaac ctacacagca 840 

actgtaaaag agaacgagga aaagaaaagt atgactttcc ggcaatgttt cacctacaga 900 

caaacctggg catttgcatt cggtaagttt atgacggatg gagtgtggtg gttcttcctt 960 

ttttgggcac cttcttacct gaatacccag ttcgacatca aaacctccga aggattggga 102 0 

agagcattga tctttacact ttacgctata acaatgttat cgatctatgg agggaaactc 1080 

cctacgatca tcattcataa aaccgggcta aacccgtatg ccgcacgtat gagagctatg 1140 

ctgatctttg cattctttcc tctgttggta ttacttgccc agccattagg aaccatctct 1200 

ccctggtttc cggttattat gatcggtatc gggggagctg cccaccaatc atggtcggct 12 60 

aatatttttt ctaccgtagg cgatatgttt cctaaaagcg ccattgccag catcaccggt 1320 

attggcggta tggcaggagg agtaggttct atgattctcc agtattcagc cggcgagctg 13 80 

tttgtacatg ccgacaaaac tcaaatggta tttatgggct ttatcgggaa accggctggt 1440 

tatttcgtta tcttttgtat ctgctcggta gcctacctga ttggatggat cgttatgaag 1500 

gcattagttc ctaaatataa acccattatc ctgaattaa 1539 

<210> 819 
<211> 2463 
<212> DNA 
<213> B.fragilis 

<400> 819 

ccagataatt cagaaacaat gaaaatgaaa ttaatatgct ttttgatgtt gagtgtgttt 60 

tttatttttc cggttcgggc taaaaacaca ttcgggaaga aaaaagacaa agtgacgcgc 12 0 

ttgcattttt atgacctgaa taagaatggg cggatggaca cttatgaaaa cccttctgct 180 

cctgtggagt atcgtgtgga gcatcttttg tcacagatga ctttggagga aaaggtagga 2 40 

cagatgctta cttcattggg gtggcccatg tacgaacggg tgggagagga catccgcctg 3 00 

acccctcagt tggagaaaga aatcggagag taccatatcg gatcgctctg gggggttatg 3 60 

cgggctgatc cgtggacgca acgtacgttg cataccggac tcaatccttc gctggctgcc 42 0 

cgagcgtcca atcgtcttca atcttacgtc atagaacata gccgtttggg tattccgctg 480 

tttctggcgg aagaatgtcc gcatggccac atggcgattg gtgcaacagt atgtccgact 540 

tccatcggtc aggcaagtac ctggaatccg gaactgatcc ggcagatggg acgtgtcatt 600 

gctattgaag caagtgctca gggagcacac atcggctatg gaccggtact cgacttggcc 660 

cgtgatccgc gttggtcgcg tgtagaggaa acttatggag aagatcctta tctgaatggg 72 0 

gtgatgggaa ctgctctggt acgtggtttt cagggagaga cattaaacga cggtaaaagc 780 

gtgatagcga ccctcaaaca ttttgcttcg tatggctgga cggaaggcgg acataacgga 840 

ggtactgccc atataggcga gcgcgaactg gaagaggcta tctttcctcc ttttcgtgag 900 

gcggtaggtg ccggggcatt gtctgtgatg agttcataca atgaaataga cggaaatcca 9 60 

tgtaccggaa gtcgttattt gttaacggat atcctgaaag atcgttggca attcaaaggt 1020 

tttgtcgtgt ccgatttgta tgctgtcgga ggattacggg aacatggtgt tgccggcaat 1080 

gactatgagg cggccataaa ggccgtgaat gccggagtgg atagtgattt gggaacgaat 1140 

gtctatgctg agcagttggt tgctgcggtc aaaagagggg atgttgctgt agcaacgata 12 00 

gataaggcgg tacgtcgcat tttatctctc aaattccaaa tgggattgtt tgatgatcca 12 60 

tttgtagatg aaaagcaggc agtacaactt attgcctctt ccgaacatac cggactggct 132 0 

cgtgaagtag cccgtcagtc aatcgttctg cttaagaata aggacaagct gttgccgttg 13 80 
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aagaaggata ttcgtaccct tgctgttata ggtcccaatg ccgataatgt gtataatatg 1440 

cttggagact atactgctcc tcaagccgat gggactgtag tgacagtctt ggatggaatt 1500 

cgacaaaagg tctctaaaga aactcgtgtg ctgtatgcca aggggtgtgc agtgcgtgat 1560 

tcttcccgta ccggatttaa agatgctata gaaacagccc gtaatgccga tgccgtagta 162 0 

atggtgatgg gaggatcgag tgcccgggat ttttcctcgg aatatgaaga aaccggtgcg 1680 

gcaaaagtca ctataaatca gatcagtgat atggaaagtg gcgaaggcta tgatcgagcc 1740 

acacttcatc ttatgggaag acaactggag ttgttggaag aaatctccag gttgggtaaa 180 0 

ccggtggtat tggtattgat taaagggcgt ccgttattga tggagggagc tattcaagag 186 0 

gcagaggcaa ttgtggatgc ctggtatccg ggcatgcagg gagggaatgc tgtggccgat 192 0 

gtgcttttcg gtgattacaa tcccgcagga cgtctcactc tttctgtgcc acgttcggtc 1980 

ggtcagttgc cggtatacta caatacaaga cggaaaggaa atcgtagccg atatattgaa 2 04 0 

gaaccgggta ctcctcgtta tcctttcggt tatggtctta gttatacaac tttttcctat 2100 

acggatatga aagtgcaggt aactgaagga agtgatgatt gccgggtaga tgtaacagta 2160 

accatacaaa atcagggtac tgcagatggt gatgaaatgg cacaactcta tttccgggat 222 0 

gacgtaagca gttttacgac tcctgccaag cagttacggg cgttcagccg tattcacctg 2280 

aaggctggtg aatcccgaga agtaactttt actcttgata agaagtcatt ggctctgtat 2340 

atgcaagagg gggaatgggt ggtcgaaccg ggacgcttta caataatggt gggaggctct 2400 

tccgaggata ttgcctgccg acaagcattt gagataaacc gaaaatatac ttttaaaatg 2460 

taa 2463 

<210> 820 
<211> 1662 
<212> DNA 
<213> B.fragilis 

<400> 820 

gtcttgttta gggaaatcaa ccggggtccg gatttaccag aagagttcaa agaaaccatt 60 

tacaagacca gcccaataat tccaccggga gggcatttac ccattcagcc aaagaaagga 120 

tattcgaacc ttccccggat tttcagcgtg agaatagaga ccacgtcgcc catatacacc 180 

gacaagggca gcaagacgat cgcatcgacg ttgccgggca accgggcaca ttctttcgat 240 

gggtggatta catccaccgg ccggatatcc gccaatgcgc ccaaagcggg accgccgcgg 3 00 

ccggtggtga ccgacggggt ttaccggcgg acagaaaaac tcaatatcac ctccgtgtca 3 60 

acggagtcgg gcatcgtatg caatatcggg tttgacgaaa gcctgatgta tgaagcctgg 420 

aaaaacgttt cactcaagga acttccggga ctgccggtca tcaaataccc ggaaggcgtc 480 

gcagcccttg cccgtcacct ggaggaagtg atgcgctacc aaaccccggc ggattatcac 540 

gtgttccgca tacaggtggc gtctgaaacc ctggaagaga cggagtatcc ggagttcatc 600 

aaccccatag ggtcggacgg gaagacgtac gccctgctga aggaagcacg gaccgagagg 660 

gttgtcatat cgggccaggc cgtagatgta aaggttcccg caggatacgg gatatcgccg 72 0 

ttcctgaagg tatcccgcat attggagatg atattctcgg catacggctt tacattggtg 780 

gagaatccct ttgccaccga ttaccagctc agcaagatgg tcgtgctcaa caatgtggcg 840 

gacaccattg tcaccggaga gatcgactac aggaatttga tgccggactg taccgtcaac 9 00 

gagttcctgg acgcgctgtt ttgccgtacc ggggccaagg tttacgtaaa tgccggccgg 960 

aaagccgtca tacgcctgct caaggattcg ataggcgcaa cggcatccgc cgactggaca 102 0 

ccgctcaagg cctcggaacc ggaaataaac tacacgcccg caaagcagct caagctctcg 1080 

gcgggcacat cgttcaagga agccgaaccg gcggcggact cctttgagaa attccttaag 1140 

ccttatgggg ggatcattac ggaatttaca ggggaccggg acgtgcccga cgaactgtac 12 0 0 

ataacctacc agccttccac cggaagatat tacaagcggg acatcgtgaa caagaaaaag 1260 

aagtggatat ccagcgactt tttcccatgg gacaagggca cccccggtgt ggaatacctg 132 0 

gagataacgg gaaaggacga atgtgtcccc atggcattta aaacggggct gctgactccc 1380 

ggatatctgg cgggggcggt caacatcaac acaaccctca gaggggccgc caaggagcag 1440 

ggggagaaga agcagacacc cctggctttc tgcttcgcca tggggaaaac caatcagatt 150 0 

ataggggccg gggcccttgt ggaggagtat tatttcggca gctcactctg ccgggagccc 1560 

aaaggcgaat actttcagga ccccgggggg aatgtttaca ggtattcact ggttttcaag 162 0 

ggagaggacg gggcgtttaa ccggttcttt aaggagtacg ac 1662 

<210> 821 
<211> 216 
<212> DNA 
<213> B . f ragilis 
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<400> 821 

accccgtcgg tcaccaccgg ccgcggcggt cccgctttgg gcgcattggc ggatatccgg 60 

ccggtggatg taatccaccc atcgaaagaa tgtgcccggt tgcccggcaa cgtcgatgcg 12 0 

atcgtcttgc tgcccttgtc ggtgtatatg ggcgacgtgg tctctattct cacgctgaaa 180 
atccggggaa ggttcgaata tcctttcttt ggctga 216 

<210> 822 
<211> 534 
<212> DNA 
<213> B.fragilis 

<400> 822 

gtagcatgtt caacagtgat ttctggcttc cggcccgaac tgagcagttc gaagacagaa 60 

gaagactttt tctgggcagg tactaatgcc gctaccgaag accagaactt tgtaatctac 120 

tcttacccgt atacagataa ggataccttt acgaaagagt tctttatcca taagcgagat 180 

tcagtgatga aggctaatat tccgggtgcc aaagagggta tgtacatggc gactgattcg 240 

tctaccgtag aggttcgtcc gattgatatt catggagatt acacaatgga agcacgcgga 300 

ctgtggcgca taaagggcga tttcatgggt ggcccgtttg tttcgcacac ccgtctggat 3 60 

aaagccagcc accgtattat cactacagaa gtatttattt actcacccga taaaatgaaa 42 0 

cgtgacctga tgcgccgatt ggaagcatct ctgtatactt tgcaacttcc taccgagaag 480 

gcgcaggaac agattccgat gggcatagag caggaggaga aaactaacaa ataa 534 

<210> 823 
<211> 246 
<212> DNA 
<213> B.fragilis 

<400> 823 

cttcgtatct ctaaagaatt aatatcttgc aattacgaag ttaatattaa cgtactgtta 60 

atgttaactt cctacagttc tcttattaag atattattgt ttgataatca attgcttata 12 0 

ttctggatgc attctgtcgt aaaattcttc tcttacgttc aagatgtgcg tagttgtctg 180 

tttatcaagt taccgttaag actctccata gcctgcctct ccacttattt ccacattctt 240 
ttataa 246 

<210> 824 
<211> 1155 
<212> DNA 
<213> B.fragilis 

<400> 824 

gagaattact cttatctttg ctatccgaaa ggagaactgc gatgcagaag gattgcagag 60 

atgaatgaac gtaaaattat acatatcgat atggatgcct tttatgcttc tgtggagcaa 120 

agggatcatc ctgaattgcg tggtaaaccg cttgccgtgg ggcatgccga ggagcgggga 180 

gtagtagcgg cagcaagtta tgaagctcgt cgttatggag ttcgttcggc tatgtcgtca 240 

caaaaggcga aacgtctgtg tccgcaattg atttttgttc ccggacggat ggaagtgtat 3 00 

aaatccgttt cccgtcaggt acacgaaata tttcatgagt ataccgatct gattgaacct 3 60 

ctgtcattgg atgaagcgtt tcttgatgtg acggagaata agcaggggat cttgctggct 42 0 

gtggatatag ctaaagctat caagcaacgt atccgtgaag aactgagcct ggtggcatcg 480 

gcaggcgtgt cgtataataa atttctggct aaaatagctt cggactttcg taaacccgac 540 

ggactttgta ctattcatcc tgatcaggca atcgatttca ttgcccgttt gcctattgag 600 

tcattttggg gagtagggcc ggtgactgcc cggaagatgc atttactggg gatacacaat 660 

ggacttcagt tacgggagtg ttcgtctgaa atgctggtac gtcagtttgg taaagtggga 72 0 

ctgctttatt atgattttgc acgtggagtc gatcttcgac cggtagaagc agtgagaata 780 

cgtaaatcaa tcggatgtga gcatacattg gagaaagaca tccatgtaag atcgtctgtg 840 

attatagagc tttatcacgt agctacggag cttgtagagc gattgcagca gaaagagttc 900 

cggggaaata cactaactct gaagatcaag tttcatgatt ttagccagat aacacgaagc 960 

atgacacagg cacaggaact tacgaatctt gagagaatct tgccccttgc caaacaattg 102 0 

ctgaaagagg tggagtatga gcagcatccc attcgcttga tcgggctttc ggtatcgaat 1080 
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cctagagaag aagcggatga acatcgggga gtatgggaac aactgagttt tgaatttagt 1140 

gattggggaa aatag 1155 

<210> 825 
<211> 189 
<212> DNA 
<213> B . fragilis 

<400> 825 

cgatgtttat caccgtcgcg gagttcaagc gcttcgtgtt caccggtcga tgcacccgat 60 

ggaacggatg cacgtcccat aatgcctgat tccaatacta cgtctacttc tactgtaggg 120 

ttacctcttg agtcgagaat ttctcgtcct gtaatttttt ctattttcat tgtttctctt 180 

gttttttag 189 

<210> 826 
<211> 3333 
<212> DNA 
<213> B. fragilis 

<400> 826 

ataatgggat ttaatgaatt tttaagctcg attttcggaa acaaatccac acgagacatg 60 

aaagaaatcc aaccctgggt agacaagatc aaagccgctt acccggaggt tgctaagctt 120 

gacaatgacg gcctccgtgc aaaaacagag gaacttaaag aatacatccg taactcggca 180 

agtaaagaac gcgccaaggc cgatgaactc agagccggca tcgaaaatgt agagctggaa 240 

gaccgcgaag aggtatttgc tcagatcgac aaaatcgaaa aagaaatatt ggaaatatat 3 00 

gaaaaagcac tcgatgaagt attaccggtt gctttctcta ttgtaaaaga atcggccaag 3 60 

cgtttctctg aaaacgaaga aatagtggtg acggccactg actttgaccg gacattggca 420 

gcaaccaagg actttgtccg catcgaaggt gacaaagcca tctggcaaaa ccattggaac 480 

gccggcggca acgacacggt gtggaacatg gttcactatg acgtacagtt gttcggtggc 540 

gtggtact gc acaaaggtaa aattgccgaa atggcaacag gtgaaggtaa aaccttggtg 600 

gctaccctcc ccgtattcct gaatgcactg accggaaacg gcgtacacgt agtaaccgtg 660 

aacgactacc tggcaaaacg tgactccgaa tggatgggac cgctttacat gttccacgga 720 

ctcagcgtag actgcatcga ccgtcatcag cctaattccg atgcacgccg ccaggcctat 780 

ctggcagata tcacattcgg aacgaacaat gaattcggtt tcgactactt gcgtgataac 840 

atggccatca gcccgaagga cctggtacag cgccagcaca attatgctat cgtcgacgag 900 

gtggactcag tattgatcga tgatgcccgt actccgttga ttatctccgg tccggtgcct 960 

aaaggcgaag accaactttt tgatcaactc cgtccattgg tagagcgact cgtggaagca 1020 

caaaaagtat tagcaaccaa atacctctca gaagccaaga aacttatcaa ctcggacgat 1080 

aagaaagagg tggaagaagg attccttgcg ttgttccgca gccacaaggc actgcctaaa 1140 

aacaaggcgt tgattaaatt cctcagtgaa cagggtatca aagccggtat gctgaagacg 12 00 

gaagaggtct acatggaaca aaacaacaag cgcatgcacg aagcaacaga tccattgtac 1260 

ttcgttattg atgaaaagct gaacagcgta gacctgacag acaaaggtgt cgatctgatc 1320 

acaggtaact cggaagatcc gactctattc gttttgccgg acattgccgc tcaactttcc 1380 

gaactggaaa atgaacatgg attgagcgac gaacaaaagc ttgaaaagaa agatgcctta 1440 

ttgaccaatt atgccatcaa gtcagaacgc gtacacacca tcaaccagtt gttgaaggca 1500 

tataccatgt ttgagaaaga cgatgaatat gtagtgatcg acggacaggt gaagattgtt 1560 

gacgagcaaa caggacgtat catggaaggc cgccgttact cggacggact gcaccaggcc 1620 

atcgaagcca aagaaggtgt gaaagtggaa gctgccacac agacatttgc taccatcacg 1680 

ctgcagaact acttccgcat gtaccacaaa ctctcgggta tgaccggtac ggccgaaaca 1740 

gaagccggtg agttgtggga catctacaaa ctggatgtag tagtgattcc gaccaaccgc 1800 

ccgatagccc gtaaggatat gaacgaccgc gtttacaaga cgaaacgtga aaaatataaa 1860 

gccgtaatcg aagagattga acagttggtt caagcaggac gcccggtatt ggtgggtact 1920 

acttcggtag aaatttccga gatgctgagc aaaatgctga caatgcgcaa gatcgaacac 1980 

aacgtactga atgcgaaact ccaccagaag gaagcagaca ttgttgccaa ggccggtttg 2040 

agcggtacag ttactattgc taccaacatg gcgggccgtg gaacggacat caagctgagc 2100 

cccgaagtaa aagcggcagg cggtctggca atcatcggta ccgaacgtca cgagtcacgt 2160 

cgtgtagacc gtcagttgcg tggccgtgca ggacgtcagg gtgacccggg ttcatctgta 2220 

ttcttcgttt cactggaaga tgacctgatg cgtctcttct cttctgaccg catcgccagc 22 80 

gtgatggata aactgggatt ccaggaaggt gaaatgatcg aacataaaat gatttcaaac 2340 
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tccatcgaac gtgcacagaa gaaagtagaa gaaaacaact tcggtatccg taaacgtctg 2400 

ttggaatatg acgatgtgat gaacaaacag cgtacggtgg tttacaccaa acgccgccac 2460 

gcccttatgg gtgagcgtat cggaatggat atcgtcaata tgatctggga ccgttgcgcg 2520 

gccgcaatcg aaaacaatgc agactacgaa gaatgtaaac tggacttgct ccaaacactc 2580 

gcaatggagg ctcctttcac agaagaggag ttccgcaacg agaaaaagga caagctggca 2640 

gacaaaacat tcgatgtggc aatggctaac ttcaagcgca agacagaacg tctggcacaa 27 00 

atagccaacc ctgtcatcaa acaggtgtac gagaatcaag ggcatatgta cgaaaacatc 27 60 

ctgattccga ttacagacgg aaaacgcatg tataacatct cttgcaacct gaaagcggct 2 820 

tacgaaagtg aatcgaaaga agtagtgaaa tcatttgaaa aatcaattct tcttcatgtc 2 880 

attgacgaat cctggaaaga aaatttacgc gaactggatg aactgaaaca ctcggtgcag 2940 

aacgcaagtt atgaacagaa agacccgctg ttgatctaca aactggaatc tgtgactctg 3 000 

tttgacaaca tggtaaacaa gatcaataac cagacagtgt ctatcctgat gcgcggccag 3 060 

attcccgtag ccgagcctac agaggaacag caagaagcag ccagacgcgt agaagtacgt 312 0 

caggcagctc ctgagcaacg ccaggacatg agcaaatatc gcgaacaaaa acaagacctg 3180 

aatgatccga atcagcaggc cgctgcccag caggatactc gcgaagccgt aaaacgcgaa 3240 

ccgatccgcg ctgaaaagac agtgggtcgc aatgatcctt gtccgtgcgg aagtggaaag 33 00 

aagtacaaaa actgccacgg acggaacagt taa 3333 

<210> 827 
<211> 1206 
<212> DNA 
<213> B.fragilis 

<400> 827 

gaagcagagt cactgagttt tattgataac ttgaaactga aagcatcata cgtattcctg 60 

ggtaacaata atatcggtaa ctacccttat cagtccactt acgcacttgg aaaggcgatg 120 

aactatgtat tcggaggtgt gtatacacaa ggagccgcag tgaccactta tgtcgatcct 180 

acactgaaat gggaaaagac ccgtaccacc gatgtcggta ttgaaacagc tttctggaac 240 

aataaattga cattcaacgc tgcttacttc tatcgtaaaa cgacagatat tctctataaa 300 

ccgagtgcaa gttactcttc tatctttggt ctgggacttt cgcaggtcaa tacaggaagc 3 60 

cttgagaaca aaggatggga gtttgagatc ggtcatcaga acaagattgg tgagtttagt 42 0 

tatcatgtga atggaaactt ctcgataatt aaaaacaagg tgatcagcct gggtgtagga 480 

gatgtggaac agaaaagcgg aatgataggt aacggtagcg acctgttcct gggttatccg 540 

atgaatatgt tttatggcta taagacggat ggcgtattcc tgaccgatga cgaagtaaaa 600 

gaatggcacg atcagagcaa gattgctcct aactccaaag ccggtgattt acgctatgtg 660 

gacatctccg gtgacggaaa ggtggacgaa tccgataaaa cttatttagg atcaaagata 720 

cctcagtata cgtttggctt aggactgggt gcggagtata agggatttga tttcaatata 780 

ttgcttcagg gagtggccaa ggtaaaaggc cagttgacca attatgccgg ttatgctttc 840 

ttccaggaag gcaatattca gaaatggcag gcagaagaaa cctggacgaa taatcagtcg 900 

aaccgatatc ctaaatatcc tcgtctcgaa gtgatgtcga atgcaggtag caacaatacg 960 

ctgggctctg atttctggat tttggatgcc tcttatctca aagtgagaaa tatccagtta 1020 

ggatatacat tgcccaaacg tataactcag aagttcggtt cttccaacct tcgtttttat 1080 

atatcacttg ataatccatt ctccatcagc ggatatcgta aaggctggga tccggaaatt 1140 

aatacagacg gtagttatta tcctattctg tcaacttata catttggttt aaccttaaaa 12 00 

ttttga 1206 

<210> 828 
<211> 1050 
<212> DNA 
<213> B.fragilis 

<400> 828 

acagatttgt tgcttactta ctttttactt attatggaaa agaaaacaag aaaaagcttc 60 

atttggctgg ctatcctgct gttgggaaca atttggatac tagcccaacg aaataaacaa 12 0 

ataccttaca acagtatcaa tgggcttgta ttcggcacag tatataatat tacctatcaa 180 

tatgatggca atctgaaagc ggagatcgat gccgaattaa aaaaattcga cggttcactt 240 

tctccattca atgatacatc tgtcattacc cgtgttaatc gtaatgaaga aatcgtcaca 3 00 

gacactttct tccaaacctg ttttaaccga tctatggaga tctcagccga aactcgcgga 360 

gctttcgata tcacagtagc tccattagcc aatgcctggg gattcggttt caaaaaagga 420 
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gccttccccg 
aaactggaaa 
gtagccaaag 
aactatatgg 
gcatggagaa 
caaacaacat 
tactacaaag 
caacataata 
gccacagctt 
aacataggtg 
aagaacatga 



actcgatcat 
acggcaaagt 
gatattccgt 
tagatatagg 
tcggcataaa 
taaaactgac 
atggcaagaa 
ttctttcagc 
ttatggtaat 
cttattttat 
agcaatatct 



gatagatagt 
gatcaaagaa 
agatgtagta 
tggcgaactg 
caagcctgta 
caatgtaggt 
gtatgcccac 
aaccgtagtt 
ggggctggat 
ctacagtgat 
tgacaaatag 



ctactccaaa 
gaccctcggg 
gcccggtatt 
gtggtaaaag 
gacgattcct 
atagcaactt 
accatcgacc 
gccgacgatt 
gaagcggaag 
gaaaaagggg 



tcacaggata 
tgatgctaag 
tggatagcaa 
gggtgaatcc 
tgtcgcttaa 
ccggaaacta 
cacgtaccgg 
gtatgactgc 
cttttacaaa 
aggtgaaaag 



ccaaaaagtt 
ttgtagtgct 
aggtatcaaa 
caaagaggaa 
ccaagagata 
tcgcaacttt 
atatccggtt 
cgacgcatta 
atcacacccc 
ctattttaca 



480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1050 



<210> 829 
<211> 1629 
<212> DNA 
<213> B.fragilis 



<400> 829 

ataagattgt 

aacaaaattg 

gcaggggtgt 

gagaaacaac 

tgttatggcg 

cggtttacgc 

accgggaaat 

attatcgata 

ggttctgtcg 

gtgcatcccg 

gatcgcgttc 

cctctttatg 

gaattgttgc 

cgcattggtt 

ttattcctgg 

tattacggac 

tcaggcatgg 

ctgaaagagt 

aatggtccgg 

aaaatagccg 

ccgttcatgc 

cagatggacc 

gatagcgaaa 

gttattgagg 

tattacaatc 

ttgtataatt 

aagttgggtg 

agattttga 



tggctatctt 
atatgacaaa 
ctttgccggt 
ccaacatcat 
cacaccggat 
agggattttg 
atccctggag 
ctcaaaaaat 
gtaaatggca 
gagccgctga 
cttgcgtctt 
tggattaccg 
gcatgcatcc 
tccagaaagg 
acaaggccag 
ttcaccaacc 
gaccccgtgg 
tggataagtt 
tgctcgatga 
gtccgttgag 
ttcgttggcc 
tattggcctc 
acacactcga 
gaatgttcaa 
cttatagcaa 
tgaagtcaga 
aactgatcaa 



tgtatccgga 
cgaacgaatg 
aagtgctttg 
cctgattgtc 
tcaaactccg 
tactgcagcc 
caatgtggat 
aacgttgcct 
catcggccta 
gatcggttac 
tctggaaaat 
gaaaaacttc 
cagtgtggga 
tgggaaggct 
gcagtttgta 
gcatgtacct 
tgatgtgatt 
gggattggcg 
tggttatcaa 
agggggtaaa 
tgcaaaggtg 
gtttgctttt 
tgcttttctg 
ttatgcctat 
ggaagacggt 
catcggtcaa 
tcgttttgag 



tctgacagac 
aaattactga 
gccgtacctc 
gccgatgact 
ggtatggatc 
acatctaccc 
gctaaaatct 
aaacttatga 
ggagatgggc 
gactattcgt 
ggaagagttg 
cccggtgaac 
catgcaggct 
gcacaatgga 
gatgacaata 
cgtgttccca 
ctggaggcag 
gagaatacca 
gatgatgcgg 
acaagtatgt 
aaacctcagg 
cttttgggac 
ggcaagagta 
cgtcagggag 
gacttcatcg 
caaaagaatc 
tatctgaaag 



cttattggaa 
actgcacatt 
agcctgccca 
tgggatacgg 
gtatagccaa 
ccagccgcta 
taccgggtaa 
agcaggcggg 
atgtggactg 
ttattcaggc 
tcggattgga 
ctaccggtaa 
ctattgtgaa 
aagatgaaga 
aagacaagcc 
atgaacgttt 
actggtgtgt 
ttgtgattct 
tagagttggt 
tcgacggagg 
tgtcagatgt 
agacttatcc 
aaaaagggcg 
attgggcgct 
gtttgggtta 
tggccgagaa 
ctcactccga 



attactcatg 
gggattagtg 
ggatcaaaca 
agatttaagt 
cgaaggtatt 
ttcggtgatg 
cgccgcattg 
atatactacc 
gaataaggag 
agcaaccaat 
tccgaatgac 
agagaacccc 
tggagttccc 
aatggcagga 
tttcttcctt 
tgtaggaaag 
tgatcagttc 
gacgagtgac 
gggtgatcac 
tacccgtata 
atttgtctgt 
ggacaaggta 
taaagagttg 
gattcctcca 
cggttataag 
gtatcctaaa 
caaagtgacg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1629 



<210> 830 
<211> 1626 
<212> DNA 
<213> B.fragilis 



<400> 830 

agtagcatgt 

acttatccct 

gcttacataa 

gtattgatag 

ttgtgcggaa 

gatggtttct 



ttcattcctt 
tccattacac 
ataagcagac 
tccgtacttc 
gcaattcaca 
tcaagataga 



ccagacctcc 
tcctcatccg 
aagatggaaa 
taacggacaa 
ctctttcttt 
agaagaacaa 



attgccggca 
ttatgtgtaa 
gaagaattgg 
acgggatatt 
gtaccgccgg 
atctcggcta 



tcgaattgcc 
tggcagcagg 
acaaaggaaa 
tggctgcttt 
tatacgatct 
tcaatcacca 



ccgcttgttt 
agaagtacag 
aatgttcggc 
ttcgggtaat 
gttgaaaccg 
aatcggacag 



60 

120 

180 

240 

300 

360 
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ttacaaaact gtgaccgata tctggaactc caacaaaaga tggagagaga aacagcttcc 42 0 

tcacagcagg cattgtcaga ggccagaaaa gttctgaaag cagcaaaaga gaaacgggaa 48 0 

cagcgcagac ttcaccgacc gaacgaaaat gaacaagttg ccatgattcg cgaaagtcaa 540 

taccagaaag cagaattcaa gcgtttggaa agatactgga aagaacaaat ttccgaaata 600 

aagacagaac tggaaagttt ctcgtcacag atagaggctc tcaaagccga acgcagaaat 660 

cgttcggcag cattgcaaca aaagctattc caacagttca acttcctgaa tgccaagggg 72 0 

gaaactaaaa atttgtgtgc tatcttcgaa gaaaccgttc aaaaaacgcc acctgccgga 780 

gcaggtgaat gtgctgcccc gaaactattg caatatgctt atctaagcgg attaagcccc 840 

attgccatgg ccgaattctg gtggggggaa tctcctaaga cagagatcag acaccacggt 900 

tattattatc cgtcttgcag aggaaaatgc gaacccattt tgcgacacat gttgcaaggt 960 

ctcaatgtag agccagcacc ctcagaaaga tactctttat cacaaaatat gccggagatt 102 0 

cttttcgaag accaatggct tttagttctt cataaacccg aaggagtact ctccgtaccc 1080 

ggaaagtcag aagaacaatc gatctacagt ctgcttagag cccgctatcc tgaagcgaca 1140 

ggtcccctcg ttgtacatcg attggatatg gccacttcag gattactgct ggctgccaag 12 0 0 

acccaagaag tacaccggca cctacaggcc cagtttgaaa accgaagcat caaaaaacga 12 60 

tatatagctc tattggatgg tatccttccg gaagaagaag gagttatcga tcttcccatc 132 0 

tgtccggatt atcttgacag acccagacaa atggtgaacg aagagctagg aaaaacagct 13 80 

atcacccgat atcaggtgat ggatcggaaa aacggacaga cccgtattgc tttcttcccg 1440 

ctgacgggac ggacacatca gttgcgtgta catgcagctc atccgttggg attaaactgc 1500 

cctatcgtag gagacgagct ttatggacgg aaggcagaac gcctttatct gcatgccgaa 15 60 

tatctggaat tcatccaccc cgtatccggg caaagaatgg tcatcgaaaa gaaagctgaa 162 0 

ttttaa 1626 

<210> 831 
<211> 501 
<212> DNA 
<213> B.fragilis 

<400> 831 

ttctccgctt tttttgtcat ttatccgttt atatccggaa caatcattat aaaatattcg 60 

ttaattagac aacgaattct attttttctc aaatttattg ctacgtttgc agctactatg 12 0 

ataaagtaca tattctgcat attgataggt atcttttttg tgtatggagc cggttatacc 180 

gcttctatag aagaaactgc agaccttccc gccgaagtta ctgccacctt tgtatcacaa 240 

tatgccggag accattcttt attcaatgat gagacggctg aatccaaagt gtgtgatgct 3 00 

attcttcccc atagttcttt ttcacgggaa ctaagttctt ccaaaatttt gaaactcaaa 360 

ttgcagactg ctatccggct gctcaatgcc tcacttttcc atcaatcgga gaggggagat 42 0 

acttatccgg acttcaatca taacttcatt aaatattcca gcgggtatta tgtatactcg 480 

ttagagcata tcctgattta g 501 

<210> 832 
<211> 924 
<212> DNA 
<213> B.fragilis 

<400> 832 

gaatcattgc cttataccca atttgaatac ctaaatatta tggcagacaa ttatatcgaa 60 

agacaatacg aacaatatga agccagaaaa gcggcttggg aaaaagcacg caaatatggc 12 0 

aaaaagaaaa cggggatcac tcaccctgct agaactgaac aaccgggcca aacgacaata 18 0 

gagccccatc attataaaag agtatttgtt acgggaggag ccaatggaat tggtaaagcc 2 40 

attgtagaaa tattctgtaa aagtgggtat cgggtggcat tttgcgacaa agacggaata 3 00 

gcaggaaaac gtactgcaga agaaacagga gccatttttc atcaagttga cataagcgac 3 60 

aaggatatgc ttgaacactg catgcaatcc atcattgagg aatgggatga cattgatatt 42 0 

ttaatcaata acgcaggtat cagtgacttc tctcctatca ctgaaacaag catagaagat 480 

ttcgacagga ttctatccat taatctacgc ccggtattta ttacttcacg cttcatagct 540 

atccaccgtc aatcgcaaac aacatccaat ccgtacggaa gaatcatcaa tatctgctct 600 

accaggtatt taatgagtga atccggcagc gagggatatg cagcttctaa agggggaatc 660 

tattcactga cacacgcgtt agccttgtca cttgcccaat tccatatcac agtcaattct 72 0 

attgcgccgg gctggataca aacccatgac tacgatcgtc tccgtccgaa ggaccatgag 78 0 

caacaccctt cgagaagagt cggtaaaccg gaagatatag cccgcatgtg tagattcctt 840 



338 



tgtgaagaag gaaatgactt tatcaacggt gaaaacatca cgattgacgg agggatgact 900 

aaaaagatga tttacacgga ataa 924 

<210> 833 
<211> 1623 
<212> DNA 
<213> B.fragilis 

<400> 833 

ctttgcgctt tatcaagaaa aacgactatg ctcaaacgga taccccacac atacaccatc 60 

atttcttcgg tcattctact ctgtgcagtg ctttcctgga tcattcctgc cggagaatat 120 

gtgcgagaga caatcgacgt aaacggtatt tcccgcactg tcattgtaga ccattctttc 180 

caccgggtag aacagacacc ccagacctgg caagtgttca gctcccttct tgaaggcttc 240 

gaacgtcagg caggaattat agctttctta ctgattatgg gaggtgcctt tcaaataatg 300 

aatagcagcc gtgctattga taccggcatt ttttcatttc tgaatttcac gaaaggactt 3 60 

gaaaaacacc gactgatcaa aatactggga gtaaacaatg tagtgatatc cttagtcatc 420 

atccttttca gccttttcgg ttccgtattc ggtatgagtg aagagacact ggccttcgtc 480 

atcatcattg tcccacttgc catatcaatg ggatatgact ccatcaccgg gctgtgcatg 540 

gtatacgtag ctgcccatat cggcttttcc ggtgcagtac tgaatccttt tacgatcggc 600 

attgcgcaag gtttgtctga tctcccgttg ttctccggat ttgaataccg tatgttttgt 660 

tggctggtac tgaccaccgc cctgattgtt tgtgtactca gatatgccgc tgtcgtcaaa 72 0 

aagcatccgg aaaaatcacc tatgtatcat gctgacgctt attggcggaa acgggaaaaa 780 

gaaagctgtg gagagatatc ccatgtaacg actcgccaag catggatcgt atacctattg 840 

ttactcgtgt ccttggggtt gttctccatc atctacccga tcagtacttt ttcagtaggt 900 

gaagcatcag tcacctgcta tgcagttccc accttatcta tcttgtttgc agttttcggt 960 

tggctgggtt tacgcaaatc caaccagttc tttatattga ccttactcgc attcactatt 1020 

cttttcctga ttatcggtgt catgggtcat ggctggtatt taccggagat atccgccatc 1080 

tttctggcaa tgggcattct ttcggggttt gccaatagtg aacatgcaga tgctatcatc 1140 

aagcaattca tggatggagc caaagacatg ttgtcggccg ccatagttgt gggactggcc 12 00 

ggagggatta ttcaaatact gcaagacgga catatcatcg accccatttt acattctttg 12 60 

gcttcactga tgggagaagc cggaaaaata gtatctttgg gggtgatgta tctgatacag 132 0 

acactcatta acctgattat cccttccggg tccgccaaag cagcgttaac catgcctatc 13 80 

atggcaccct tttccgatgt catcggactt tcgcggcaag ctacggtaat ggcttatcag 1440 

tttggtgacg gatttaccaa tatgatcacc cctacttctg ctgtattgat gggtgcctta 1500 

ggcattgccc gcatacctta tgagatttgg gtaaaatggt tgtggaagat acttctttta 1560 

ttcattatcc taggaatggt actactgatt cccacggtac ttttcccatt gaatggattt 162 0 

tag 1623 

<210> 834 
<211> 1338 
<212> DNA 
<213> B. fragilis 

<400> 834 

aatttagata caatgaaaat taaaacgctt gtggctgtgt tgtttctttc ggcgggagca 60 

acaactgtgg tagcacagga cgacgctaat tgtaattcga acagtagtat ttctcacgaa 12 0 

gcagtgaaag ctggtaactt taaagatgct tatactccgt ggaaagctgt tttggagaac 180 

tgcccgactc ttcgtttcta taccttcaca gacggttata aaattctgaa agggttgctg 240 

gggcagatca aagacagaaa ctctgcagaa tacaaaaagt attttgatga gttgatgaat 3 00 

acgcacgatt tgcgtatgaa gtatactcag gaattcttgg gaaaaggtgt aaaagtatcg 3 60 

tcggaagatg aagcactggg cattaaagct gtcgattata ttgcatttgc tccgaaggtg 420 

gatgtaaatc aagcttatga ttggttgaaa aaatcggtgg acgctgcgaa agctgagtct 48 0 

gcagctgcta cattgttcta tttcttgcag atgtctcacg ataaactgaa ggaagatccg 540 

gctcacaaag agcagtttat tcaggactat ctggctgcat ccgaatatgc agacgatgct 600 

atagctgctg ctgataaaga gagtgtgaag aaagctttcg gaggtatcaa agataatctg 66 0 

gtagctctgt tcattaacag cggtactgcc gattgcgaat cactgcaagg tatctatgga 720 

cctaaggtag aaacgaatca gactgatttg aattatttga agaaagtcat cagcattatg 780 

aagatgatga agtgtacgga tagcgacgct tatcagcagg cttcattcta tgtatacaag 84 0 

attgagcctt cggctgaggc tgctaccgga tgtgcatacc aggcctataa gaaaggggat 900 
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atcgatggtt ctgtgaagtt ctttgacgaa gcgatcaacc ttgagacaga caatgcaaag 960 

aaagcagaaa aggcttatgc tgctgccagt gttttgacta ctgccaagaa attgtctcag 102 0 

gcaagatctt atgctcagaa agcaatcagc ttcaatgaaa actatggtgc tccttatatc 1080 

cttatcgcca acttgtatgc tatgagtcct aactggagtg atgaatcggc tttgaacaag 1140 

tgtacttatt ttgctgttat cgacaaattg cagaaagcta aatctgtaga tccgagtgta 1200 

acagaagaag ttaacaaaat gatcagcaga tattccgctt atactccgca ggctaaagac 12 60 

ttgtttatgt tgggctacaa agccggcgac cgcatcacta tcggtggttg gattggagag 132 0 

tctacaacga tcagataa 1338 



<210> 835 
<211> 501 
<212> DNA 
<213> B. f ragilis 



<400> 835 

attatcagca cattggatga aagtatacga ccatttcgta gatttgtgaa aacattaaaa 60 

ttgatgcata tgaaaaagta cctgttttac ctcagtatgg ctcttgtagc agtagtgttg 120 

ttttcttgta aaagcggcaa gaaaagtgtg tttactccaa cttccagcgg acgtgcttat 180 

gaagtcctcg ttgtggttga gaagcctgtg tgggagcgtc ctgccggtag agctttgtac 240 

aatgtgctcg atacagatgt gcccggactt ccgcaatcgg aacgttcgtt caggatcatg 3 00 

tctacttctc ccaaagattt tgatgccatc ctgaagttgg tgcgcaacat tattatcgta 3 60 

gatatacagg acatttatac ccaacctaaa ttcaagtatg ccaaggatgt atatgcatct 42 0 

ccccagatga ttttgactat tcaggctccg gacgaggcat cgtttgagaa gtttgtcgaa 480 

gagaacaaac agccgatata a 501 



<210> 836 
<211> 1191 
<212> DNA 
<213> B.fragilis 



<400> 836 

aagcctcgaa cggttaaaaa ctgttcgagg cttttctata ttcaaccaaa atcgtatttt 60 

tgttctttta gaaaagtaaa aaaacaggct atgacaaaat atccgtatat actgttcgta 120 

ttgctcctcg cgtctttcag ttcctgccag actgttgagc aactttccat cgattatatg 180 

ctccccgcag agatcagttt tcctaacgaa ctgaaacgag tggcagtcgt aaacaatgtg 240 

agcgacactc cggataacac cttaccaccc aaggataata caataaaaaa taagaatgaa 3 00 

ctcagccgtg cagtagccta tcacgaggga caacccgcac tcactaccga agcattggcc 3 60 

aaagctattg ccgaacagaa ctatttcaat gaagtcgtaa tctgcgattc ggccctgcgt 42 0 

gcacgtgatt tcacaccccg tgaatcgact ctcagccaag aggaagttca gaccttggca 480 

cagtttctgg acgtggattg catcatctca ctggaaaacc tgcagatgaa atcgacacgg 540 

gttctcagtt acatccccga atggaacact tattacggca cattggatac gaaggtttac 600 

ccaacgctga aaatctatct gccgggacga aaaagcccga tggtaaccat caatacccat 660 

gacagtattt tttgggaaga atatggaaat accgaagggt ttgtccgctc acgcctgccg 720 

gatgaaagac aaatgatacg cgaagcttct gaatttgccg gttccgtgcc ggtaaacaga 78 0 

atattacctt attggaaaac ggccaatcga tattatttca tcaatggctc tgtagctatg 84 0 

cgcgatgctg ccgtttatgt gaaagaaaac gaatgggaaa aagcatccaa actgtgggaa 900 

caggctttta aagcagccaa gaacgacaaa aagaaaatgc gtgcagcctt caacctggct 960 

ctatattacg agatgaaaga cagtgtggaa gaagcacaca aatgggctgt cactgcacag 102 0 

gaactggccc gtaaaataga caaaatcgac acgttgaaga gaaacgatat agacttgagc 10 8 0 

gaaatcccca actactacct gaccagcctt tatgtgaatg aattaaagga aaggagcaac 1140 

ggattgggca aattaaaagg ccaaatgagt agatttaatg aggattttta a 1191 



<210> 837 
<211> 2022 
<212> DNA 
<213> B.fragilis 



<400> 837 

ctttgcagaa caaaaatgaa atcatatatg gaaaagctaa aaatggaatc tgtgagcatc 



60 
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gcagaggaca gcctgaataa aatagccgaa ctcttcccaa acgtagtcac tgaatcgatg 12 0 

ggtaaggacg gacaactaca taaagctatc gactttgata aactgaaatt cctgcttaca 180 

gccaaccaag cagaaatggg agtggtatat gatgacgacg aacgttacga attaacatgg 2 40 

gtgggcaaga agcaggcgat aagagaggtg gcgcatccta tccgaaaaac attgcgtccc 3 00 

tgcccggaag agagcaggaa ttgggaacag acccaaaacc tatacatcga gggagacaat 3 60 

ctggacgcaa tgaaactcct gaaaaagagt tacgcaggaa aggtagacgt tatctatatc 42 0 

gatccgcctt acaacacggg taaagacttt atcttcaatg atacattcgc tctttcacag 480 

gaagagtccg acgagaaaca gggaagatat aatgaagaag ggcaacgatt gtttcagaat 540 

acggaggcta acggaaagtt tcactccgat tggtgtagca tgatgtatgc ccgactgatg 600 

cttgcccgca ctctgttaaa tgataatggc atcattttca tttccattga tgatcacgaa 660 

ttggcaaatc tgatcaaaat aggaaatgaa gtattcaatg cttctaattt catcgatgta 72 0 

tttaattggg ccaagacgga aactccggaa aatctctcaa aaaaaagtaa gcaaatcatc 7 80 

gaatacatcg tctgctatca gaagaagaaa aacgacatga aattccaggg tctgaagaag 840 

gaatcggtca gttcgaacgg tttgttgaat caaccgaatt ccgtcggtat cctgaccttc 9 00 

cccgccaaca aagtagtcac ttccatcccc gacggagtga tcaaagcagg catgtacgga 9 60 

acagatgctt atgatgtgga attactggaa gacaccaccg tacgtggcgg actgtttaca 102 0 

gcccccgtca aactgaaagc caaattcaaa tggagccagg cgaatctgga caaagagata 1080 

caaaaaggaa caacaatcaa aataccgact ctcaagttaa gtccttcata cgaaaagctg 1140 

gaatatgatc cggaagttcc gcccaacctg atcaattaca aagtaggagt cgaaaccaat 1200 

gaacaggccg gtaaccatca actacagttc tttgataaga aagtgttcaa ctttccgaaa 12 60 

cctgtcagcc tgatccaata tttatgtgag tttatcgaca ccaaaaacaa agattgcatc 132 0 

gtgatggatt ttttctcagg aagcggtacg accgccgaag cagttatgcg gatgaacatg 13 80 

aaaccacgta aaaacaaggt gaaatacatc ctcgtgcaac tgccggaaga tgtgactgaa 1440 

acaataaaaa aggccaaaac tcctagtgaa aaagagatta tgcagaatgc aatcgacttc 15 00 

cttacggaaa accataaagc attgaacatc tgcgaactgt ccaaagaacg tattcgacgt 1560 

gccggagaca caattgaggc ggaatgcaac cagcgtaaat taaaggacct cccggacatc 162 0 

ggtttccgtg tttttcggat tgccgacagc aatatgaagg acgtgtacta cagtgcaaag 1680 

gaatattcac agagtgattt attctatttc actgataata tcaaagaaga ccgtaccgga 1740 

ctcgatctgc tttatggttg cctgaccaac ctgggactat ccctgtctct accacatgat 1800 

gaagaggata taaatggata tacggtttat tctgtcgaca agaccgaatt aatggcatgt 18 60 

ttcgcagaac agattcccga aaaagtcttc cgtgaaatag ccggcaggca accacgccgg 192 0 

gttgtcttcc gggacgcctc attccgtgac agtgccgatc gtatcaatat agacgagata 19 80 

ttcaaaacat tatctcccgg tactacgatc gagattcttt aa 2 022 

<210> 838 
<211> 891 
<212> DNA 
<213> B.fragilis 

<400> 838 

gtgaatatga ctataacagt gtttacgccg acatttaatc gggccactct gttacctaga 6 0 

ttatatgaaa gtttagttaa tcaaacattt cttgattttg agtggcttat tgtagatgat 120 

ggaagtactg atgatacatt taactttata gaatcaatta aggaaaacga taagattgat 180 

atacaatatt actatcagaa taatgctggt aaacatgctg ctattaattg gggagtagag 2 40 

ctagctaaag gcgatctttt ttttattgta gatagtgatg aggttatgat tgaatctggg 3 00 

ttacagacta tagtagatgt ttataaacaa gtatctgata atgataactt tgcaggagtg 3 60 

acagggctga aaagtttttt tagtggtaaa actatagggg gagagcttaa ttatacttat 42 0 

ttagattgtt ctgcaataga ttataacctg aaatataaat atggtgggga gatggctgtt 480 

gcatatagga ctaagatttt gcagaaatat ccatttccga tttttgaagg agagaaatat 54 0 

tgtggggaag gacttatttg gtataaaata gctttgcact ataaattacg atattttgca 600 

catccaataa tattgactga atattatcct gatggtctta ctgcattagg agtgaataaa 6 60 

aggatggaaa gtcctaaaac gactttggct acatatagtg agctctctaa aatgaatgtt 72 0 

ccttttaata gtaggatcag gtatattatt aatttttgga gatttttttg ttgcgataaa 78 0 

caaagaggat ttgcatgcaa attgaaattg gttaataagt ctactatttt actattccct 840 

ttaggatact gccttcattt gattgatatt tttaagacaa aaagaagatg a 891 

<210> 839 
<211> 1293 
<212> DNA 
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<213> B.fragilis 
<400> 839 

cgaaaagaga caaataaatt cattgaaccg gccattttat tcccacaaaa gcagtatctt 60 

tgccacatga actcaaagtt gcgacattta ctcctgattg ttttttcaat cttcccgatt 120 

ctgacatggg gaacggaaag tccgtccacc gctgattcca tccggatcag cctgttgaca 180 

tgcgctccgg gtgaagaaat ctattcgctc ttcgggcaca cggctatccg ttacgaagaa 240 

ccggcacgag gcatcgaccg ggtatacaat tatggcttgt tcagtttcaa cacccccaac 3 00 

ttcatcctgc gtttcgcact cggcaagacc gattatcaat tgggggtgga agattaccgc 3 60 

tgttttgccg ccgaatacga atacttcgga cgcagtgtat ggcagcagac gctcaatctg 42 0 

acagtcgaag aacaacaaca gttaatcacc cttctggaag aaaattaccg cccggaaaac 480 

cggatatacc gctataactt tttctacgac aactgtgcta cccgtccacg ggacaaggtg 540 

gaagagagcc tgcaaaaaag cggtagccaa ttgctcttca gcaatgcaca caccgaaaat 60 0 

ggcgaaacga aatcttatcg ggatattgtc catcaataca cgaaaggaca tccttgggca 660 

caattcggaa tcgatttctg cataggcagc caagccgacc accccatcaa cgatagacaa 72 0 

atgatgttcg ctccgtttta tctgatggat gcttttgccg gagcacgcat agccaacact 780 

tcagacaaca aagcactggt ggcttccacc aaaaaaatta ttgactgtga accggctgta 840 

tccggctccg cagaaaatga tatctggaat atgctaaccc ccatccgatt gtcccttctt 900 

gtgtttatcg caatcggaat ggctaccgtc tatggtctac gcaagaaaaa gagtctttgg 960 

ggactggata tcgcagtgtt tgcggcagcg ggtatcgcag gatgcatcat cgctttcctt 102 0 

gctcttttct ccgaacaccc cacggtaggc tccaattact tgctgtttgt cttccatccg 1080 

gggcatctgc tctgcctccc tttctttata aacgatgaac gaaagcgacg caaaagcagg 1140 

tatcatctgc tgaactgcac agttttaaca ctttttatag tgctttttcc ggtaatacca 12 00 

caaaatttcg acttagcagt attacctttg gcactctgtt tgctgatacg ttctgcaagc 12 60 

aatcttattc tgacatacaa aaaagctaaa tga 1293 

<210> 840 
<211> 402 
<212> DNA 
<213> B.fragilis 

<400> 840 

agtgtttatt ttaaaaaaca aaataaatta cccggacaaa taattgggga taaaatgaag 6 0 

ttaatccaat atatttcaac aacttacgtt agagaagaac tttataaccc ggacaatcgg 12 0 

gatcttatta acccaaaaag tccccctgaa tttataagtt caggaggacc tttttgtcat 180 

aaagtggagc tggagggatt cgaaccctcg tccaaacgag gaaatcataa gctttctaca 2 40 

tgcttatctt tgcctaagtt tttcgtgcag gagcagaacc aaagccatca attcctgcct 300 

tatcctttaa agtttcatca gaagcgcaag gccacttctg actatccccg atgtaactgc 3 60 

accactgaac cggaatgctt cggagcaaca gcttccgagt ga 402 

<210> 841 
<211> 795 
<212> DNA 
<213> B.fragilis 

<400> 841 

atcattcttt tctcttactt ttgcagccga tttttcaaaa gagttatggc acagtttacg 60 

gaagaagaga aaaccattcg gcgtatcgaa aagcgtttta acaaaggtat ggttcaatat 12 0 

gggttgattg aagagggtga caaagtgctt gttggccttt caggaggaaa agattccctg 180 

gcattagtcg aattgctggg caaacgttcg catattttca aacctcgttt ttcggtggta 240 

gctgtacatg tggttatgaa gaatattcca taccagagtg attgggatta cctccgtgaa 3 00 

catgctgaaa agaatggtgt tcctttagtt gtttacgaga cttctttcga cccttctacc 3 60 

gatacgcgta aatcaccttg ttttctctgt tcatggaacc ggaggaaagc tctgtttact 420 

gtggctaaag agcagggttg caataaaata gcccttggac accatatgga cgatattttg 480 

gaaactttat taatgaacat tacctatcag ggtgcattca gtacaatgcc accacgtttg 540 

gtaatgaaca aatttgatat gaccattatt cgcccgatgt gcctggtgca tgaagcggat 600 

ttgttggagt tggcgcaaat aaggggatat cgcaagcaag tgaaaaattg tccttatgaa 660 

tcccaatcga gccgtagcga tatgaagggg atactccgac aattggaaaa gatgaatccg 72 0 

gaggctcggt acagtctgtg ggggagcatg acaaatgtac aggaagaatt gttacccaaa 780 
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gaagtggagt tttaa 795 

<210> 842 
<211> 189 
<212> DNA 
<213> B.fragilis 



<400> 842 

tgtgcatatt tctttgcgca aacggccgac cgttatcaat ggataacgcc cgcacagaaa 60 

agatatattc acccggagga aggttcgaaa aacgaattct attatctaca gacggagcac 120 

tccaatccgt atgatcgagt ttccaactgt aagctatatc ctgcttttca gtatagcgaa 180 
tggcagtag 189 



<210> 843 
<211> 1167 
<212> DNA 
<213> B.fragilis 



<400> 843 

ttggttatgg ctgaatcgaa ttttgttgat tacgtaaaga tatactgccg ctcgggtaaa 60 

ggcggaagag gctctacgca catgaggcga gaaaaatata ctcctaacgg tggacctgat 120 

ggaggagatg gcggaagagg aggccatgtt atcctgcgtg gtaaccggaa ttactggaca 180 

ttgcttcact tgagatatga tcgtcatgca atggctggtc atggggagtc gggcagtaag 240 

aaccgtagtt tcggtaaaga cggagcggat aagattattg aagttccctg tggtacggtg 3 00 

gtttacaatg ccgaaacagg tgaatatgta tgtgatgtaa cagaacacgg acaagaggtc 3 60 

at tct tttaa aaggcggacg tggcggattg ggaaactggc acttcaagac ggctacccgt 42 0 

caggctcccc gttttgccca gccgggcgaa ccgatgcagg agatgactgt aatccttgaa 480 

ttgaagttgc tggctgatgt aggtctggta ggtttcccaa atgcaggtaa gtctaccttg 540 

ttatctgcta tttctgctgc aaaaccaaag attgccgatt atccgtttac aacattggag 600 

cctaacctgg gtattgtatc ttatcgtgac ggacagtcgt ttgtgatggc tgatattccg 660 

ggaattatcg aaggtgccag tgaaggtaag ggattgggat tgcgtttctt gcgtcacatt 72 0 

gagcgcaact ctttgttact tttcatgata ccggcggata gcgatgatat ccgtaaagat 780 

tatgaagtgc tgctaaacga actgaaaaca tttaatcctg aaatgctgga taaacaacgg 840 

gtacttgcca tcactaagag tgatatgctg gatcaggagt tgatggatga aatagaaccg 900 

acattgccgg agggaattcc tcatgtattc atttcatctg tatccggttt gggcatttcg 960 

gtgctgaagg acattttatg gacggagttg aataaggaaa gcaataaaat agaagctatt 102 0 

gtgcatcgtc cgaaggatgt cagccgattg cagcaggaac tcaaagatat gggtgaggat 1080 

gaagaactcg actatgaata tgaggatgat ggtgatgagg acgatttgga ttacgaatac 1140 

gaagaagagg attgggaaga taaatga 1167 



<210> 844 
<211> 360 
<212> DNA 
<213> B.fragilis 



<400> 844 

cagtatatgg cagacgtgaa agagaaaata aatcttctgg atgtaattcc tttccgtagt 60 

gaaaatatta cggccgaaaa gggaagcgat ggtaccgtta ccattgcttt cccccggttt 12 0 

aaatacgagt ggatgcggcg attcttgttg cctaaaggaa tgtctgcgga tattcatgtc 180 

cggctggaag atcatggcac tgccgtatgg gagttgattg acggaaagag aaccgtacgc 240 

cggattattg aagagctggc agaacacttc aattatgaag aaaattacga atcacgtatt 3 00 

acggcttata tcactcagtt gcagaaagac ggatttgtga aattagtgat tgagaactga 3 60 



<210> 845 
<211> 1296 
<212> DNA 
<213> B.fragilis 



<400> 845 
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tatgacatgg caaaaataca aattaaatct gagaaactca caccttttgg aggaattttt 60 

tcaatcatgg agaaatttga ctccatgctt tcacccgtta tcgactcaac actgggtcag 12 0 

agatgcagca gtatcttcgg atatcagttc agcgagatag tccgttcgct gatgagcgtt 180 

tatttctgtg gcggctcatg cgtggaagat gtaacgtcac aactgatgcg ccatctctcg 240 

tatcatccta cccttcgtac atgcagctct gataccatcc tcagagccat caaggaactg 300 

acacaggaaa acatctccta tacttccgac caaggcaaga cctatgattt caatactgca 360 

gacaaactca acacattgct tataaacgct ttggtttcta caggcgagtt gaaggaaatt 42 0 

gaggaatacg atgttgactt tgaccatcag ttccttgaaa cggagaagta tgatgcaaaa 48 0 

ccgacctaca aaaagttcct cggctacagg cctggcgtat atgttatcgg tgacaagata 540 

gtctatatcg agaacagcga tggtaacacg aatgtgcgtt ttcatcaggc agacacccat 600 

aagagattct tcgctcttct ggaatcccag aacatccgtg taaatcgctt cagggcagac 660 

tgcggttcct gctcgaagga aatcgtcagt gagatagaga agcattgcaa acatttctac 720 

atccgtgcca accgatgcag ttcgctctac aatgacatct ttgctctgag aggatggaag 780 

acggaggaga ttaacggcat ccagttcgaa ctcaattcca ttctcgttga gaaatgggaa 840 

ggcaagtgct atcgtcttgt catccagaga caaagacgca acagtggcga ccttgacctg 900 

tgggaaggcg aatacactta ccgttgtatt ctgaccaacg attacaagtc atcgacaagg 960 

gacattgttg aattctacaa tctgcgtggc ggcaaggaac gtatctttga cgacatgaac 102 0 

aacggattcg gttggagcag gctccccaag tcattcatgg cggagaatac tgtctttctt 1080 

ctgcttactg cattgataca caatttctac aagaccatca tgagcaggct tgacaccaag 1140 

gcttttgggc tcaagaaaac gagtcgcata aaggcttttg tcttcagatt catctccgta 12 00 

cctgccaagt ggatcatgac tgcaaggcaa tacgtgctga atatctacac agagaaccga 1260 

gcttatgcaa aacccttcaa aacagaattc ggataa 1296 

<210> 846 
<211> 1446 
<212> DNA 
<213> B.fragilis 

<400> 846 

cagattcatt atcaatatct attaataata aacattcata tattcagtaa caggtctgta 60 

acactaccct cctatttttg caccggaaaa caacaagcag atgttatgaa caatgtacag 12 0 

caagtaaaaa cttattcgca gagaaaaatt tctgatttcc ttttcattct ttgggcaggc 180 

ggagcagcgc tgctctccta ttcattggta tatgcactga gaaagcctta tacagcagcc 240 

ggatttgacg gacttgaagc gtttggaatg gactacaaag tggtagttac catcgcgcaa 300 

atattaggat atgtactttc taaattcatc ggaattaaat taatctccga attaaaacgg 360 

gaaaaccgga tgaagtttat tctgatctcc ataattctgg ctgaagcttc gttaatattg 42 0 

ttcggactgt tgcocgca.cc ttataatata ggagccatgt ttctgaacgg actttcactg 480 

ggatgtatgt ggggaatcat ttttagcttt atcgagggaa gacgaatgac ggacattctt 540 

gccagcttac tcggagtcag tatggtcatc agctcgggta ccgccaagtc ggccggtttg 600 

tatgtcatgg acactttgaa catcagcgaa ttctggatgc ctgccctgat aggcggagtt 660 

gcccttcctc tacttgcctt gttgggatat gcactcaacc ggcttccaca gccaacagcc 720 

gaagacattg ccatgaaatc gaaacgggaa acactgaacg gcaagcaacg atgggagcta 780 

tttaagaatt tcatgccttt cctcactctg ctctttatag ccaatgtggt actgactatc 840 

ttgagagata taaaggagga cttcctggta aaaattatcg atgtctctca atactcttcg 900 

tggatgtttg cacaggtaga cagcgtagta accctcatta ttctgataat tttcggatta 960 

atggtgttcg tcagaagcaa cttgaaagca ctgtcgatat tactgggatt gatcattgcc 1020 

agcatggtag tgatggcagt cgtttcgttt ggttacgaac aattgcagct gaacgccatc 1080 

gtctggctat tcatccagag tctgtgtctc tatctggctt ttctcacttt ccagactatc 1140 

ttcttcgacc gttttatcgc ttgcttcaaa attcgaggta acgtgggttt cttcattgct 1200 

atgaatgatt ttctgggcta tacgggaaca gtcatagtat tggctgtcaa agaattcttt 1260 

tcaccggaca ttaactggac agctttctac aatctgatgg caggatatgt gggaataatc 1320 

tgtttcgttg cttttgtatg ctctttcatc tacctgcacc aacgctaccg cagggagaat 1380 

tacggaaaga caggggtatt cagaaaaaaa gaagaagaaa aagaagttcc cgatttcgta 1440 

tattaa 1446 

<210> 847 
<211> 609 
<212> DNA 
<213> B. fragilis 
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<400> 847 

tttataaatt accattttaa cgacatgaca ggattagaga tttggctact tgcaattggt 60 

ttagcgatgg attgcctcgc tgtctctatt gcaagtggta ttattttaag gcgtattcaa 120 

tggcggccta tgctcatcat ggcatttttt ttcggacttt tccaggctat aatgccttta 180 

ttggggtggt taggagcaag cacattcagc caccttatcg aatcggtcga tcactggatt 240 

gcctttgcta ttctggcctt tctaggcgga cgaatgatca aagaatcttt taaagaagaa 3 00 

gattgctgcc aaagatttaa ccctgcaagc ctgaaagtag tgataacaat ggccgttgca 360 

accagcattg atgcattggc cgtaggagta tcctttgctt ttctgggtat caaaagctgt 42 0 

tcgtctatcc tttacccggc aggaatcatc ggatttgttt ctttttttat gtcccttata 480 

ggattaatct tcggcattcg cttcggatgc ggcattgcca gaaaacttcg tgctgaatta 540 

tggggaggaa tcatactgat ccttattgga acgaaaatat taatcgaaca cttatttttt 600 

aataattag 609 

<210> 848 
<211> 1074 
<212> DNA 
<213> B.fragilis 

<400> 848 

aacattagtt attggaactt tatgaaatta ttggtaaccg gtgctgccgg atttataggt 60 

tcgcatgttt gtaagcgtct tttgcaacgt ggggatgaag ttgtgggttt ggataatatc 12 0 

aattcgtatt atgatattaa tttaaagtat ggacgccttt cgagcttagg tgtttctcaa 180 

tctgaactgt catggtataa gttcacacgg agtaatgttt atcctcgatt tagttttgtg 240 

cggatgaacc tcgaggatag gcaggctatg caaatgctgt ttgctaatgg aaattttgat 3 00 

gtagtaatca atttggccgc acaagcggga gtgcgctact ccattgagaa tccatatgct 3 60 

tatgttgaaa gtaatataga cggttttctg aatgttctcg agggttgtcg tcacagtcag 42 0 

gtgaaacatt tggtttatgc cagttccagt agtgtatatg gtttgaatgg acaggttcct 480 

ttttcagaga aagatggcat agcccatccg gtgagtctgt atgccgcaac caagaagtcg 540 

aatgaactta tggcacatac ttatagccat ttatataata taccttctac gggtcttcgt 600 

ttcttcacgg tatatggtcc ctggggtaga ccggatatgt ctccttttct atttgcggat 660 

gctatcttgc atggtcgccc catcaaggtc tttaacaatg gcaacatgct tcgtgatttt 720 

acatatatag atgatattgt ggaaggtgtc ttgagagtgg ctgattctat tccggaaggg 780 

aaccagtgct gggatgctga ggttgcggat ccaagcatgt cctgtgctcc ctataagatt 840 

tataatattg gtaattcccg tcctgtaaaa ttgatggatt ttatacgtgc tatagaaatg 900 

tcaatcggga gggaagctga caagatctat cttccgatgc agcccgggga tgtgtatcag 960 

acctatgcgg atacttcttc tctttcgcgg gaaattggtt ttcaacccaa tacgtccttg 1020 

gaggcgggcg ttaaggaaac aataagttgg tataaagaat tttataatct ataa 1074 

<210> 849 
<211> 1068 
<212> DNA 
<213> B. f ragilis 

<400> 849 

aaaaaaattc taaattataa agggttcgtt tcgggaggtt cttccgagac aatgagttgg 60 

ttatttatac gtaaattaat ctgtttgcat aaaggtaaaa tcaaatttgc ttataaagaa 120 

gaacaaaaac tggtgcttat tttaattttt cctattgtca tccggcaatc ggaaagagtt 180 

tctgaattct ccgggatatt atccgtaccg gagctctcca cagggaatgg tgagttgttg 240 

tcggttaccg gaatgctgcc gacagtgaaa acttctgagc agaaacgtcc ggataagaaa 300 

aaagagaaac attctctggt attggttgaa aggaataagg atttgtgtaa ttatcttgtg 3 60 

caaatcctga tgaaggagta taagattgtg tctgtttgtg atgcggaggc ggcctttgag 42 0 

actgtgtgtg aacaatgtcc ggatgcagta ctggcttctt ctgttcttac ccgtatttcg 480 

ggtgaggaac ttgccgttcg gattaaatcg gatgacagag tagcacatat tccggtaata 540 

ttgttagtga aaccgggaga ggatgaccgg tacattcaac ggaatgccga tctttatgtg 600 

tggatgcctt ttgccatatc atctttaaag actgagattg cagctttgat tgctaaccgg 660 

gaaatgatcc gtaagaggta tattcgtttg gccttggggg gtgaggcctc ggaccctatc 72 0 

gataaagagg tagagtcatc agaaggtgat caggagttta ttcgtcaggt gagaagtctg 780 

attgaagaga ggatgaccga ttccggattt aagattggtg aactgagcga ctgcatgaat 840 
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atgagtcgtt cgagttttta taataagatt aaggagataa ccggacatgc tcctgcggac 900 

tatgttcgta atgtgcggct caacagagca ttggttttgt taatgagcag aaagtatacg 960 

gtggctgagg ttgcggatat gacgggtttc agtgatccta aatacttcgg gatcgtgttt 102 0 

aagaaatatt atgggggctc accgacgaag tatataaaca atttatag 1068 

<210> 850 
<211> 492 
<212> DNA 
<213> B.fragilis 

<400> 850 

atcgtcgcct accaactgac agcgatcccc gatgcgttga gtcaatttct tccagccttc 60 

ccagtcattt tcgctcatac catcctcaat ggaatcaatc ggatattcgt tgataagttt 12 0 

ttccaaatag tcaatttgtt cgtcagctgt acgttttttg cctttttcac cttcaaattt 180 

ggtgtaatcg taaataccgt catgatagaa ttcggaagag gcgcagtcca tgccaatcat 240 

tacatctttg cccggttcgt agcctgcagc tttgatagcg gcaagaatag agttaagtgc 3 00 

atcttctgtt ccttccaggt tgggagcaaa accgccttca tcaccaacag ctgtactcag 3 60 

accacggtct ttcaatactt ttttcaaagc atggaatact tcggcaccca tgcgcaaccc 420 

ttctttaaaa gaacttgcac ctaccggacg gatcataaac tcctggaagg ctatcggagc 480 

atcactgtgt ga 492 

<210> 851 
<211> 960 
<212> DNA 
<213> B. fragilis 

<400> 851 

cttatgtact atctaataat cttagttctg ctattcctgg cagaactttt ttatttccgt 60 

attgcggata agtgtaacat tatcgataag cctaacgaac gtagttcgca tacccggatt 12 0 

acactgcgtg gtggagggat catcttttat tttggggagt tggcttattt tctcacaaac 180 

cactttgaat atccatggtt tatgctggct ttgagtctga taacctttat cagttttata 240 

gatgacatcc gttctacttc gcaaggactt cgtctggtct ttcattttac ggcaatggct 300 

ttgatgtttt atcaatgggg gctgtttagc ttgccttggt ggacgatcct tgttgccctg 360 

atcatttgta ctgggattat caatgcttac aactttatgg atggcattaa tggcataaca 420 

ggcggatatt cattgatcat tctgatagca ttggcctaca taaataggat atatgtccca 480 

tttgttgaac cggaccttat ttatactatg ctttgcgcag tattggtctt taattttttt 540 

aatttccgca aacaagcgag atgttttgcc ggtgatgtcg gttcggtcag tatagctttt 600 

gtaatcctgt tcctgatcgg aagtttaatt atcaaaacag agaattttgg ctggcttata 660 

ttgcttgctg tatatggagt agatagtgtg ttgacgatcg ttcatcgatt gatgcttcac 72 0 

gaaaatattg gtttgcctca ccggaaacat ttatatcaga taatggctaa tgaactgaga 780 

ataccgcacg tagtagtatc gttggtgtat atgattgcgc aaattataat tatcatcgga 840 

tatttatatt gccaaaatta tggttattgg tatttattgg gctgtatcct cttgctgagt 900 

ggaatatata ttgtttttat gcacaaatat tttcacttgc atcttttatc taaaagataa 960 

<210> 852 
<211> 771 
<212> DNA 
<213> B.fragilis 

<400> 852 

attttatctc ttatctttgc gccctcaaac agagagaaag tgaacttaat tatcgatatt 60 

ggaaatacag tagccaaagt agcgcttttc gaccggactt ctatggtaga agttgtttac 120 

gactctaatc agtccctgga ttccttggag gctgtttgta ataagtatga tgttcggaaa 180 

gcaattgttg ctacggttat agacttaaac gagtgtgtgc tggctcagtt gaacaagctt 240 

cctgtccccg tcttatggtt agacagccat acgccgcttc cggtaataaa cttgtatgaa 300 

acccccgaaa ctctcggtta tgaccggatg gctgccgtgg tggcggccca tgatcagttt 360 

ccgggtaaag acattttggt gattgatgcg ggtacttgta tcacttacga atttgttgat 42 0 

tctttgggac agtatcatgg gggcaatatt tcgcccggac tctggatgcg gctgaaagca 480 

ctccatcaat ttaccggacg tttgccgttg gttcatgccg aaggacgcat gccggatatg 540 
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ggaaaagata ctgaaactgc tattcgtgca ggtgtaaaga aagggataga atacgaaatt 600 

acagggtata ttacggctat gaagcataaa tatcctgaac ttttggtttt tttaacgggc 660 

ggagatgatt tttcttttga tacgaaatta aaaagtgtca tctttgcaga tagattttta 72 0 

gtgttgaaag gattaaatag aatattaaac tataataatg gtaggatata a 771 

<210> 853 
<211> 672 
<212> DNA 
<213> B.fragilis 

<400> 853 

tttattatgt ataccattat tgtatcgaaa gagttaaaag aagcatgtcc tgtttttgca 60 

ggggctgcta tatatgctga agtgaaaaac acttcttatt gtgaagggct ttgggaggag 12 0 

attcaatctt tcacagaaat gctcactgca acaacccggt tggaagatat taaaaaacaa 180 

cctgtgatag ctgccacacg tgaagcctac aaacgctgcg ggaaagatcc cgggagatat 240 

cggccgtcgg ccgaagcgtt acgtcgcaga ttgatgcggg ggattgcttt gtatcagatt 3 00 

gatactttgg tcgatttgat taatctggtt tctctccgga ccggacattc gataggtggt 3 60 

tttgatgctg ataagatagc gggcactggc ctggaactgg ggatcggtaa gatcaatgag 42 0 

cctttcgaag ggatcgggcg tggtgtgctg aatatcgagg ggctgccggt gtaccgggat 480 

gctgtggggg gaatagggac tccgaccagt gataacgaac ggacaaagat ggggctggaa 540 

acaacacata tattggctat cgttaatggc tataatggta aagaaggact gcaggaagct 600 

gccgaaatga tacaaacttt attgaaaaaa tacgctgact cggacggagg aacaattact 660 

tactttgaat aa 672 

<210> 854 
<211> 1044 
<212> DNA 
<213> B.fragilis 

<400> 854 

atatacgtgt attggacagt aatgatgaat atattagtaa cagggattca tggctttgtg 60 

ggttctaact tggttgaagc tttaaaagag aattgtatct tttatgggct tgatattgtt 12 0 

tctcctgcta aagagggagt tgtaactact ttttcttggc tagatatcga acccacatct 180 

tttccttttc aaactcttcc ccaattcgat gccattattc atcttgccgg aaaggcccat 240 

gatacgaaaa accaatcagc cgctcagtcc tattttgaca tcaataccgg tctgactcaa 3 00 

aagatattcg acttcttttt ggagtcttct gccaagaaat tcattttctt tagttcggtg 360 

aaagctgctg ccgatagtgt agtaggagac gtgcttaccg aagacgtgat tccgactccg 42 0 

gttggtcctt atggggagag taagattaag gcggaagaat atataaaaaa tcattttatg 480 

tttccaactg tttctattag tgaggatcgg tctttgcggt tggagaaaga gaaggggagg 540 

atacccaaga ataaacaggt gtatattctt cgtccctgca tgattcacgg accgggaaat 600 

aaagggaatc tgaatttatt atataatgta gtgaaaaaag gaatcccttg gcctcttggc 660 

gattttgata atcgccgttc gtttacttca atcgataacc tttgttatgt gattgagggg 720 

cttttgaatc aggatgtgct tacgggcatc taccatatgg gagatgatga agctctttca 7 80 

acgaatgaac tgattggcat catgtgtgaa gcaatgggaa aaaagccgca tatctggaag 840 

atgaacaaaa gggttatgga aggatgtgcc ggcctgggta ctttgatgca tttacctttg 900 

aatacggaaa gacttcgtaa actgacggaa aattatgtgg taagcaacgc taagatcaag 960 

gccgctttgg gtattgataa attacctgta acggcaaaag aaggattgat gaagaccatt 102 0 

cgttcatttg aagaaactaa ataa 1044 

<210> 855 
<211> 1029 
<212> DNA 
<213> B.fragilis 

<400> 855 

tatagagatt tatatagctc atggatagag gctttagcga tttatgtttt ttacgacaaa 60 

cctatcagat gtggcaaagt tacacattgt ctttctaatg ggattctatt ttatttgcag 12 0 

tatatttctg tatatttgca ttcggaatcc aaatggatct ttatggctat tttagaacga 180 

aaaattatac atgagattga tacctcatgc agtcatatta tctatccgca atttgtgtta 240 
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ccccgacaca agcatgcgga atacgaaatt atgctcttca ctcaagggag tggaaaacag 3 00 

tttgtaggag aaggagtcgc ggactttcaa gaaggggata ttgctttgat agggagcaat 3 60 

gtgcctcatc tgcatctttg taattcaaaa ctgaatcctg ttgcgaatac tgtatgcagt 42 0 

gccggagaag ccttacagtt tcttccggac atatttcctg tacatgtaga aaatttgcct 480 

gattatcaag agatttatcg tctactgaga aaaagtcaat atggcgttcg cttttatgat 540 

aaagggttgt atgatgaggt caaagaattg tttcaggaga tggatctctt aaaacatact 600 

aatcgtttga tcactatatt acgtatcttg gggagactga ctgaatgtcg gaatattaaa 660 

ttactttctg atgtagccta caatggttct aataggcttc tggaagtgaa tgaaccggtc 72 0 

aataaagtgt atacctattt atttaatcat tttaaagaga aagttctcct gcaagaagtg 7 80 

gctgattatg taaagcagaa tccttctgca ctttgccgtt actttaaaca acggacagac 840 

aaaagtattt ttcagtgtct ggcagaaatc cggatagaac atgcttgtaa attactgtca 900 

tattcaaact tgtctgtttc gcaaatagct tttgagtccg gtttcaacag cgtaccttat 960 

tttattaagc agtttcagag tatcactgaa aagactcccg gtgagtatag ggaattgata 1020 

ggcagataa 1029 

<210> 856 
<211> 1089 
<212> DNA 
<213> B.fragilis 

<400> 856 

aatatttatc atttaaagca tattaacatg aaggtattgt atatcggata cgataagaag 60 

ttagttaaaa gtggagctga tcaaattgat attaggaatt tagagctatt gtatgatagt 120 

attccatatg taaaagtatt gccggtttta gaaacacgat ctttttataa acgttattta 180 

tttggaattg attctctttt gatccaaaaa gtatttgctg aattacaaac aggtgattat 240 

cagttggtat ttgtcagtca gtctttgatg ggtaggattt caaagcatat taaatgtgat 300 

ttccccaata ttaaaatcat tacatttttt cataatattg aaaagtatta tgctttagaa 360 

cttttgagag tgtcaggctt tactcattat cttttttatt tggccgcaag ttatttcgaa 420 

tttcaatctg tgaaatattc agattatcta atagtcttaa atcagaggga gagtaatttg 480 

ttacaaaaga tatataataa aagtgctgac ttgattttac ctacttcatt taaagaccaa 540 

tgtagtaaaa tagaaaattg taatataaga aaagaatttg tttacttatt tgtcggtgct 600 

gcgttctttg caaatattca aggtataaaa tggtttattt ctaatgtgct ccctgaagta 660 

catggaaaat taattatcgt tggtaaaggg atggatttgt atagagaaga attcgctagt 720 

gaaagagttg aagtttatgg ctatgtacaa aatttatcag agtattattc tatggcttct 780 

gttgttatat ctcctatatt ttctggtggt ggaatgaaaa ctaaagttgc ggaggcattt 840 

atgtatggaa aggttgttgt tggtaccaaa gaagcattta ctggatatgt caattgttct 900 

ggagttatgt atgaatgtaa tgacaagtat gcatttgtga aaatactaaa tgagttattt 960 

gtagataaaa cacatactgt gtttaatagt aaggctcgtg aaatatattt gcaagaatat 102 0 

agttacgaat cttcatatag taaattttca agatggattt ctcctatttt gaaattattg 1080 

aataaatga 1089 

<210> 857 
<211> 1401 
<212> DNA 
<213> B.fragilis 

<400> 857 

ataatgaatc aagacacaat ttgcgccata gcaaccgctc aaggaggagc catcggaagc 60 

attcgtgttt ccggtcctga agctattacc atcaccggcc gtatttttac cccggccaaa 12 0 

tccggaaagc tgctgagtga acagaaacct tatacgctta ctttcggccg aatttataac 180 

ggagaagaaa tgatagatga agttcttgtc agtctcttcc gggctccaca ctcttataca 240 

ggggaagaca gcactgaaat cacctgtcac ggatcatctt atattttaca acaagtgatg 300 

caactactga ttaagaacgg gtgtcgcatg gcgcaaccgg gagaatatac tcaacgagcg 360 

tttcttaatg gtaaaatgga tttaagtcag gccgaagccg ttgccgacct gattgcctct 420 

tcctctgctg ctacccaccg tcttgccttg agtcaaatgc gaggtggctt tagcaaagaa 480 

ttgacaactc tacgtgagaa actgctgaac ttcacttcaa tgattgaact ggagctggac 540 

ttcagtgaag aagatgtaga gtttgcggac cgttccgccc tacgccgact ggctgacgag 600 

atagaagaag tcattgcacg tctggccaat tcgttcagtg tagggaatgt cataaaaaat 660 

ggtgtaccgg tagctattat cggagaaacc aatgcaggaa aatcaactct actgaatgtc 72 0 
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ctgctgaatg aagacaaggc tattgtaagc gatattcacg gcactacacg ggatgtcatc 780 

gaggatactg tgaatatagg tggtatcact ttccgtttta tcgatacagc cggtatccgg 840 

gagaccagtg atacgataga aagcctgggt atcgaacgga cttttcaaaa actcgatcag 900 

gcagagattg tactgtggat gattgattcg gctgacgcaa tttcacagtt aacactgctc 960 

tccgataaga ttcttcctcg ttgtgaacac aaacaattga ttttagtctt taataaggta 102 0 

gaactgataa atgaaactca gaaaaacgaa cttacctcac aattttctga gcatataggt 108 0 

tcggaaatag aatctatttt tatttctgcg aaacaacgtt tgcacacgga tgaactccaa 1140 

cagagactcg tagcagccgc tcatttacca acagtcaccc agaatgatgt cattgtaaca 1200 

aacgtccgcc attacgaagc actaacacgt gcgctggatg caattcaccg ggtacaagaa 1260 

ggattggacg caaatatctc cggagatttt ctgtcacaag acatacgcga atgtattttc 132 0 

catttatccg atatagcagg ggaagtgaca aatgatatgg tgctgcaaaa tatatttgcg 13 80 

catttttgca tcggaaaata a 1401 

<210> 858 
<211> 648 
<212> DNA 
<213> B.fragilis 

<400> 858 

atcagcagac aaatgaataa gagaggcttt gtaagcagga tcttacagaa tttccggaag 60 

cctgaaggtt ttttcggaag aatgatactt tgggggatga atacaggaca tgcatcattg 120 

gcgcaatggg gaatgtcatg tttgcaatgg caaccggaat ggagtgtact cgatatcggt 180 

tgcggtggtg gtgccaattt gctacagata ttgcaacgtt gcccgcaagg gaaagcatat 240 

ggcatagata tttcatcgga gagtgtcacc tttgcgcgta aaaaaaataa aaagtatctc 3 00 

ggtacacgct gctttatcga gcagggagga gtccaccgac ttccctatcc tgattatgcg 360 

ttcgatgcgg tcactgcttt cgagactgtc tacttctggg gtaacctgca gcatgctttt 42 0 

acggaagtgg cgcgtgtgtt aaagcccggt ggatcgtttc ttatctgttg tgagataagc 480 

gatcctgcca ataaggcttg gacgggactt gttgaaggga tggagattca ttcctgtgat 540 

gaactgaagg cgattctttc caaaagtggt tttaccgata cggccatatt ccggacgaaa 600 

aaagaagaac tgtgcctggt aagccatcgg cagactgtgc ggttgtaa 648 

<210> 859 
<211> 1569 
<212> DNA 
<213> B.fragilis 

<400> 859 

aaaatgagac aatatgtatt attggcttgt ctctctccgg tagcatgcct gatggctgct 60 

accggtcaga agggaggaaa agccaagcaa aaaatcaatg atcggcaact tcctaatgtc 120 

gtgtttatct atgccgacga cctcggttat ggcgacttgg agtgttatgg tgcaaagaat 180 

gtgcagactc cgaatgtaaa ccgtttggca gctgaaggta ttcgctttaa caatgcgcat 240 

gctacggctg ctaccagtac tccttcgcgt tactctatgc ttaccggaga atatgcctgg 3 00 

cgtcgtccgg gcactgatat tgcagcaggc aatgcaggga tgattatccg tcccgaacgc 3 60 

tatacgatgg ctgatatgtt taagaatgcc ggttacgcta cggcggccat cggcaaatgg 420 

catttggggt tgggcgataa ggatggagaa caggattgga atgctcctct gccgactgct 480 

ttaggagata taggttttga ttattcttat ataatggctg caacagccga tcgtgttccg 540 

tgtgtcttta tagaaaatgg taaagtggcc aattatgacc cttctgctcc gattgaagtc 600 

agctatcgta agccgatcga gggggaaccg ttgggaaaag atcacccgga attgctgttc 660 

aatctgaaat cgagccatgg acacgacatg gccatcgtca atggtatctg ccgtatcgga 72 0 

tatatgaaag ggggcggcaa ggctttgcgg aaagatgaaa atattgccga ttcaatcact 780 

tcacatgcca tcggctttat ccgtgagcat aatgacgaac ctttctttat gtatttggct 840 

acaaacgatg tacatgttcc ccgtttcccg cacgaccgtt ttcgtggaaa gaacccgatg 900 

ggattgcgtg gagatgccat cgtgcagttc gactggagtg taggccagat catggaaacc 960 

cttgataaac tgggactgtc agaaaatacg ctaattattc tgtccagtga caatggtccg 102 0 

gttgtcgatg acggctatca ggatcgtgcg gaagaattgc tgaacggtca tagtcccgca 1080 

ggaccgttgc gtggtaataa gtacagtgct tttgaagggg gaactcgtat tcctgccatt 1140 

gtaagatggc cgaagggagc tgcttcatca caggtttcca acgctttggt ctcgcagatc 1200 

gactggtttg cctctttggc ttcattggta ggagccgggc tgccgaaggg agcggcaccc 12 60 

gatagcttta actacctcga tacttggttg ggcaaaaacc agtccgaccg atcctgggtg 1320 
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atagagcagg cttccaatca tacattatca gtccgcacca aggactggaa gtacattgaa 13 80 

cccaatgacg gaccggccat gattacctgg ggaccgaaga tagaaaccgg aaatctgagt 1440 

acaccgcagt tatatcacgt ggtagacgat gtggcagaac agaagaatgt agcttctctc 1500 

catccggatc tggtttttga actccagaat atattaagac atgtccggat gaaaaacctg 1560 

aagccctaa 1569 

<210> 860 
<211> 252 
<212> DNA 
<213> B.fragilis 

<400> 860 

gttcctgagc aacaaaaagt tgcccaggat tttgccatgt cagaattttc acttatctta 60 

gtgttgcaaa aagaaaacaa gcaaaactct aatatgacat ggcaaaaata caaattaaat 12 0 

ctgagaaact cacacctttt ggaggaattt tttcaatcat ggagaaattt gactccatgc 180 

tttcacccgt tatcgactca acactgggtc agagatgcag cagtatcttc ggatatcagt 240 

tcagcgagat ag 252 

<210> 861 
<211> 375 
<212> DNA 
<213> B.fragilis 

<400> 861 

gcctccatta atgatgttca tcattggtac aggcaataca tacgtattcg tacctccgat 60 

gtatctgtaa agaggaatat cgagatagtt ggcagcagct ttagctacgg caagcgaaac 12 0 

acccagaata gcgttggcac ccaatttggc ttttgtcttt gttccatcca atgccaacat 180 

ggcatgatca atgcctattt ggtcgagggc cgacataccg atcagatgcg gagcaatgac 2 40 

tttattgacg ttctctactg ctttctgtac acccttgccg ccataacgat gtttatcacc 300 

gtcgcggagt tcaagcgctt cgtgttcacc ggtcgatgca cccgatggaa cggatgcacg 3 60 

tcccataatg cctga 375 

<210> 862 
<211> 552 
<212> DNA 
<213> B.fragilis 

<4O0> 862 

cgaacgatta atgaaacctg tacgatgaaa aaattaataa aactggtact cttcctgatg 60 

gtagcctatc cactaacggg tgctatcctt tcggcttgct cggaagagag tgattgctcc 12 0 

atgaccggac gcccgatggt ctacgccaaa atgtatatca tcaatccgga aaccaaggct 180 

gtactgaatg acaccctcga ttcattgagt gtgacagcat tcggaactga ttcaataatc 240 

atcaataacc agaaaaaggt acatgatatc gctctcccac tacgctatac aagtgactcg 300 

actattcttg tgtttcatta cacccggttg ttaagagaca caatggtgat cctgcaaacc 3 60 

aatactcctt actttcagtc gatggattgc ggatacagta tgaaacaaaa tatcatcagt 420 

attcatccga ttgattatac ggaaaccaat aaaaagaaat atcatagcat agactctcta 480 

tatatcaaat caaatgcagc taacattaat ggaacagaaa atctcaaaat attctaccgc 540 

tacaatcgtt ag 552 

<210> 863 
<211> 246 
<212> DNA 
<213> B. fragilis 

<400> 863 

gctacagata aaaaaataga caacattgct atcaatccaa aatcagctgg agaaactaat 60 

ctagctataa caatggtcaa tataaagcgt aatgcttgcc ccgacatttt ctcaacagca 120 

ttccacatta aactatttaa tgctgccaat tttaaatttt taatcatcat atacaatcat 180 

catctccaag ttattaacaa tataaaagca ttgatcattc aaaaatacaa tcaatattta 240 



350 



gtatag 2 46 

<210> 864 
<211> 966 
<212> DNA 
<213> B. fragilis 

<400> 864 

tttataagta aaatggatat atctgttgtc gtaccattgt tcaatgaaga agaatccatt 60 

ccggagcttt ttgcctggat tgaaagagtg atgaaggcca acggcttttc atacgaagtt 120 

atctttgtaa atgatggtag taccgaccgt tcttgggaaa ttatcgaaga gcttcagaaa 180 

cagtcgtcca ctgtgaaagg gatcaaattc cgacgaaact acggaaaatc cccggctctg 240 

tactgtggct ttgaacgtgc cgaaggaaat gtggtgatca cgatggatgc cgacctacag 3 00 

gatagtcccg atgaaatacc ggaattatac cgtatgatta ctgaagacgg atatgacctt 3 60 

gtttcaggct ataaacagaa aagatacgac ccgctgtcga aaactctacc taccaaacta 42 0 

tttaatgcca cggcacgtaa agtttcaggg attcataatc tgcacgactt taattgcgga 480 

ttgaaagctt atcgcaaagc tgttgtaaaa aacatcgaag tatacggaga gatgcatcgc 540 

tacatcccgt atctggctaa gaatgccgga ttccagaaaa taggcgaaaa ggtggtgcac 600 

catcaagcac gtaaattcgg aaaaactaaa tttggaggat ggaatcgctt ctttaacgga 660 

tatctcgatt taatctctct ttggttcctc tcaaagtttg gaattaaacc aatgcacttt 720 

ttcggtttat taggctcatt gatgtttata ctgggattca tttcagtggt tattgtcgga 780 

gccagtaaat tatatagtat gaatcacggt atgccttatc ggctggtaac agattctccc 840 

tatttctatc tgtcgttgac tgccatgatt attggaacac aactcttttt ggcaggattt 900 

cttggcgaac tgatttcacg caacgccccg gaacgcaata attatcagat agaaaaaata 960 
atataa 966 

<210> 865 
<211> 798 
<212> DNA 
<213> B. fragilis 

<400> 865 

agtgattgca aaaccatatc ctctggagaa gctgaaagaa acgatcgaaa cttatttata 60 

ggagggagga atagtcggtc tttcaatgca acaatttatc atatagtatg ttctgtagta 120 

aataatgcta taaagataaa ttggattatg aagaaagtag tactaatcgg ggccagcggc 180 

ttcgtcggtt cggctattct gaatgaagct ttgaaccgtg gattccatgt gacggcggta 2 40 

gttcgtcatc ctgaaaagat caagatagag aatgaaaatc tggaagtgaa gagagctgat 300 

gtttcttcat tggatgaagt atgtaaggtt tgtaaaggtg ctgatgccgt gatcagtgct 3 60 

ttcaacccgg ggtggaataa tcccgatata tacaaggaaa ccattgaggt ttatctgacg 42 0 

attatcgatg gtgtaaaaaa ggctggagtt aatcgttttt tgatggtggg tggtgccggt 480 

tcactgttta ttgctcccgg catccgactg gtcgattcgg gagaagttcc cgaaaagata 540 

ttgcctggtg tgagagcctt gagtgatttt tatcttgatt ttctgaagaa agaaaaagag 600 

gttgactggg ttttcttctc gccggcggca gatatggctc ctggagtacg tacaggcaga 660 

tatcgcctgg ggaaagatga gatgattgtg gatatggtag gtaacagtca tatatctgtg 720 

gaagattatg cggctgccat gattgatgag cttgagaagc cggagcatca tcaggagcgt 7 80 
ttcaccatag ggtactga 798 

<210> 866 
<211> 876 
<212> DNA 
<213> B. fragilis 

<400> 866 

agtgacggca aaattcaata tcggactttc aatgacaaca acctcaagaa cggcaaggtc 60 

tatgatgtcg attttgaagc cgcacagcaa acgcaggcac caaccggaac gctcgtagcc 12 0 

cgctaccggc cgatcccttc gttgagtgat ccaaaatact attcacactt caccctctca 180 

aagttccgca atggaacctt ccaactcctc aactacgacg aaggtgacgt agatatgggt 240 
ggaggagcca cctggtcgaa cttgctgaag aatggtgcac gcctggacac aggatactat 3 00 
atgatggtaa ccggtactcg catggcaagc ggagctgtat tggctaatgt gactttcttc 360 
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accattgaag agggaaagac aacaactgtc gatctggtca tgcgcgaaag caaagaccag 42 0 

gtacaagtaa ttggtaattt taattccgaa tcgacttatc tgcctatagg aacctccgaa 480 

ccgcaaagta ttcttcagac ttgtggccgg ggatactacg ttgtagcagt gctgggagcc 540 

ggacaagaac ccactaacca tgcccttcgg gatattgcag ctttaagcgg tgaatttgaa 600 

aaatggggac gcaaaatggt gttgctcttc cctagtgaag aacagtacaa aaagttccgc 660 

ccgtcagaat tccccggatt gccttcaacc attacctatg gtatcgacgt agatggagcg 72 0 

atccagaaac aaattgccga atcgatgaag ttgccaaaca gcaccatcct gcccatgttt 7 80 

attatcggtg atacattcaa ccgggtagtc ttcgtgtcac aaggttatac catcggattg 840 

ggcgaacagt taatgaaagt aatccatgga ttatag 87 6 

<210> 867 
<211> 717 
<212> DNA 
<213> B.fragilis 

<400> 867 

ataaagttaa attatagtgt tgcggaatta aggataacaa acgaatcata tatgaagcca 60 

acaatcaaaa aagtacaacc cgtcaaagtc gtagctccgt tccttaacag tcagtccgaa 120 

agtccggtcc cactggatgc acttaccgac caagagaaag tttccgattt gtacttcctt 180 

aagggaaccg tacatcaaat agctaaacct tacctaagta ttaataattg cactttcaaa 240 

caacaaatat tcagcgaatg tcagtttaaa tcagctcaac tgacagacgt acgttttgaa 300 

aattgcgatt tatccaacgt ttcgtttgcc ggaactactt tctaccgggt agaatttata 3 60 

tcttgcaaat tgctgggaac cggtttcccg gaagccaccc tcaatcatgt tttaatggat 42 0 

cattgctacg gacaatacat caatctctcc atggtaaaaa tgcgaacagc ccgtttcagc 480 

cattgcaatt tccgaaacgg cagcctgaat gacagcaaac tgatgccggc agcttttgat 540 

acttgcgaat tgttagaagc cgacttttcg cacacttcac tcaaaggtat cgacctgaga 600 

aactctagaa tagcaggtat tcaactcaat atagccgatc tgaaaggagc catagtcagt 660 

tcgttacaag caatagatct gttacctcta ctaggggtca aaatagaaga cgattga 717 

<210> 868 
<211> 462 
<212> DNA 
<213> B.fragilis 

<400> 868 

aaagagctaa aagatatgaa acttagtcag caatcacaag ccattatcga atctgcgatt 60 

caaaaagcaa tcaacaaata tacctgtgga tgcgaacaga ccatcgtcac agatatccat 12 0 

attcaaccga atcagaattc cggtgaactc tttatctatg acgatgaaga tgaagaacta 180 

tccagtgtaa ccatcgatga atggacaacc tacgaagggg acgactttta cgaagatgct 240 

gaaagaattt tccgtaccgt gctttgccgc atgaaagaga acgggagctt cgataagtta 3 00 

accatcctca aaccctactc ctttgtgttg gtagatgaag acaaagagac gatctcagag 3 60 

cttctgcttg tagatgacga cacactgttg gtgaacgatg aactattgaa gggactggac 420 

aaagaattgg acgacttcct gaaagacctg ttggagaaat aa 462 

<210> 869 
<211> 1236 
<212> DNA 
<213> B.fragilis 

<400> 869 

aaaacaataa atcttgatag tatggaaagt atagactttg gaaccctgtt tcagggattt 60 

ggaacaatga tagccagcgg atggtttctg gccagtgccc gtatgttttt aatagctttg 120 

gggtttctgc tcatttattt aggctggaaa ggggtactcg agccaatggt gatgattccg 180 

atgggcctgg gaatggtagc tattaattgt ggaacactga ttatgcccga cggaacattg 240 

gggaatcttt ttttagatcc gatgctgtcg gataccgacg cattgatgaa cacgatgcag 300 

attgactttc tacaaccggt atacacattg acctttagta acggattgat agcctgcttt 3 60 

gtatttatgg gaatcggtac attgcttgat gtgggattcc tattgcagaa accgtttgcc 42 0 

agcatttttc ttgctttatg tgctgaattg ggtacattct tgacagtgcc tattgcttcc 480 

ggtctgggac tgtctttaaa agaaagtgct tcagtggcaa tggtaggcgg agctgatggt 540 
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ccgatggttt tgttcacatc gcttgctttg gccaaacact tgtttgtacc tattacggtg 600 

gtggcttatc tttatctggg attgacttac gggggatatc cttatttggt gaaattgctg 660 

attcctaaac gtctgcgtgc tatcaagatg gtagaaaaga aagctcctaa aaattatgat 720 

gcgaaagtga agctggcttt ttctgcaatc ctgtgtgcag tattgtgttt cttgtttccg 780 

gttgcttcac cattgttctt ttcgctattc ctgggagtgg cagtacgtga atccggtatg 840 

aagcatatat atgattttgt gagcggtccg ttgctctatg gttctacttt tatgttagga 900 

ttattattgg gtgtactttg cgacgcacat ttgttactcg atccgaagat tcttaaactg 960 

ttagtattag gtatgcttgc tttgttactg tcgggtatcg gaggcatcat gggagggtac 102 0 

attatgtatt tcattaagaa agggaactat aatccggtga tcggcattgc agccgtaagc 1080 

tgtgtaccca ctacggcaaa agtggctcaa aagttggtaa gtaaagataa tccgaattct 1140 

tttattttgg gtgatgcatt aggagccaac atttcaggag taatcacttc ggccatcatt 1200 

acaggcattt atataacgat tataccttat ttataa 1236 



<210> 870 
<211> 1533 
<212> DNA 
<213> B.fragilis 



<400> 870 

actaagcaaa aatctctaat gaaaaatttc tggaagaaat accataaatg ggtaggttta 60 

ttctttagct tttttatcct gatgttctgc ttttccggta ttgtactcaa tcatcgtaca 120 

ctcttttcaa aagctgaagt cagcagaaac tggatgccgg aaagctatca ctacaaaaat 18 0 

tggaataatg gaatcataaa gggaacacta cgcctacccg atgggaaaat tctggcatat 240 

ggtaatgcag gagtctggaa aacagactcc tgctttgcta catttgccga tttcaaccga 3 00 

ggtctggcca aaggaatcga caatcgtaaa ataagtaata tcgtccgtgt agccaataac 3 60 

gatatctggt gtgccggatt atattctatc tatctcctgg accatgacag ttggaaagaa 42 0 

tatccgatag ccggcaatga cgaacgaatc tcagatatca ctcaacgtgg ggatacctta 480 

gtcatattga cacgctctta tctttatacg ggtgtttctc cttatgacga attccggaaa 540 

acagaattga aaacaccgga aaactatt cc ccaaagacct ctttgttccg gaccatctgg 60 0 

ctgctgcata gcggagagtt attcggtacc cccggcaaac tggcagtcga ttttctggga 6 60 

gtagtattaa tcgttctcag tgctacagga atcatataca cccttcttcc cccattcatt 72 0 

cgccggagac acagaaaaag acttcctgtc aagacacagg caaaggctct gaaaacttca 780 

ctgaactggc acaataaatt gggtacatgg ttgatcggac tgaccctatt gctatctgtc 840 

acaggcatgt gtctgcgacc tccattaatg ataccttttg ttctggtcaa tacccggcct 900 

gtccccggga gtacactcga ttcggataat ccctggcacg acaagcttcg cagtattcgt 9 60 

tgggacgcat cccggaatgt ctggctgtta tcttcgtcaa tgggattcta ccggataaac 102 0 

gatttacaac ttccaccggt taagttaaaa caaactccac cggtaagccc tatgggagta 10 80 

aatgtatttc atccccaaag tccggacgaa tggctgatcg gatctttcag tggcctcttc 1140 

gtctggaatc cttccaccgg caccgtcctc gattattata cgggacaacc tcctgcagcc 12 00 

gttcacggac gaccactcgg cggcagtctc gtcaacggat tcactgacga tttagttacc 12 60 

cgtgaagtaa tcttcgaata cgacaacgga gcacgcaata aagagaacaa tttagtatta 132 0 

ccggcaatgc cggaccttat aaaacagcaa cccatgtcgt tatggaattt ctgtttggaa 13 8 0 

cttcatgtcg gtcgttgtta ttccccattc ttaggtgttt tttcagatct attcgttttt 1440 

atttccggcc ttctgctaac gttaatcctt atttcgggat atatcgtata taaaagacac 1500 

cataaacgaa gcaaaaaaat aaggatgcat taa 153 3 



<210> 871 
<211> 1929 
<212> DNA 
<213> B.fragilis 

<220> 

<221> unsure 
<222> (1889) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 871 

agataccgaa agattaaagt agaaaacagt aattccggaa agatgaaaca aaaagagaaa 60 
gaaatcaaaa ctaaatatat gaaccaaaag cattcactcc cccctctcct taagagagga 12 0 
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acgggggtga ggccttactt aacaacaaat tgtatgaaaa ccctaacccg ctttctaatc 180 

attccgatgt tgtcggttgc gttcttttca tgcagtgagt cccatttcct gaaagatgtg 240 

gcttaccgaa accaagtgac gcaagatttt gaaatgaaga agcagcaact tcccaatgga 300 

gagttgtttg cagtgtttaa tgaaaagctc accattcccg aacaagaggc tttaatgttc 3 60 

ctctatgcct acatgcctac aggtgacgta acagattata caggcgacta ctatctggaa 420 

aatgtcagac tctccgatca ggcacgtcgg gaaatgcctt ggggaaaaga aatccccgat 480 

gacgtattcc gtcattttgt gctccccatt cgggtaaaca atgaaaacct ggatgactca 540 

cgccgggtgt tttacaatga actgaaagac cgtgtgaaga atctgtcatt acacgacgct 600 

gtactcgagg taaaccactg gtgccatgaa aaggtaatct acaccccaag tgatgcccgt 660 

acgagttctc cactcgcttc cgtaaaaacc gcttatggtc gctgtggtga agaatccacc 720 

ttcaccgtag cagctcttcg ctcggtaggt attccggcac gacaggtata tactccgcgt 780 

tgggcacata ccgatgacaa ccatgcatgg gtagaagcct gggtagatgg caaatggtat 840 

ttctttggtg cctgcgaacc ggaaccggtt ttgaacctgg gttggtttaa tgctcccgcc 9 00 

agtcgtggta tgctgatgca tacaaaggta ttcggacgct ataccggaca agaagaaatc 960 

atgtacgaaa ctccgaatta tacagagatc aacgtgattg acaattatgc tccgactgcc 1020 

aaaggttccg tactcgtgac agatgccgaa ggtcagccgg tagccgatgc taccgtagag 1080 

ttcaaggtct ataactacgc tgaattttat acggtggcca ccaaacatac agaccggagc 1140 

gggcatgcat cattgactgc cggcaagggt gatatgttgg tatgggcctc caaagacgga 12 00 

cggttcggtt attctaaact atcattcggc aaagacaatg aactgaagat cacactggac 12 60 

aaaaacgccg gtgaaaccta ttcgcttcca ctggatatcg ttcctcctgc cgaaggcgcc 132 0 

aacctgcccg aagtgactcc ggaacaacgt actgaaaatg atcggcgcat ggcacaggaa 13 80 

gattctatcc gcaacgcata cgtagccact ttcattaccg aagagcaagc tcggactttt 1440 

gccaaagaga ataagctgga tgaaaccgaa acagtacgct tactgatagc ttccagaggt 1500 

aaccaccaga ctttaaccga tttcctttct gatgctgtaa aagccgataa ggccggtcag 1560 

gctatcagcc tgctgaaagt agtctcggcc aaagacctga gagacgtaag cccggaagta 162 0 

ttgaacgacc atctgaataa ctccggcctg cccgcttctg aagatttctg tagcaacgta 1680 

ctgaacccgc gtgtagccaa tgaaatgatc actccttaca aagcattctt ccggaaagag 1740 

attccggcaa gtgaagcaga agccttccgc aaaaatccac aagctttggt agaatggtgt 1800 

aaaaaagaga tcacaatcat taacgaatta aactcacaac gtataccaat gtcaccattg 1860 

ggtgtatgga aaagcccggg tagcaaacna aaaaatcccg taccatttct ttgtatccat 1920 

ggcccgtag 192 9 

<210> 872 
<211> 1296 
<212> DNA 
<213> B.fragilis 

<400> 872 

agatgccgaa aaggcaaaaa cagattctgt aaactaacta tgagcatcat tatttacctg 60 

ttgattacga tggcattttc tgctttcttt tcgggaatgg agattgcttt tgtttcggta 12 0 

gacaaacttc gttttgaaat ggaccggaaa ggaggggtat catcacgtat cctttcgcta 180 

ttcttccgga atcccaatga ttttatttcg accatgctgg tcgggaataa catcgctctg 240 

gttatctatg gtatattgat ggcacagatt atcggcgaca atttgctggc cggatggatc 3 00 

accaatcatt ttgtaatggt attggtacag accgtgatct ccacactgat catcttggtg 3 60 

acaggagagt ttctgcccaa gacgcttttt aagatcaatc ccaatctggc cctgaatgtc 42 0 

tgtgcagttc ttcttttcat ctgttatgtt gttctctatc ctatatctaa attttcgtcg 480 

ggagtctctt acctctttct tcgcctgttt gggatgaaag tgaacaagga agcctctgcg 540 

aaagcctttg gtaaggtaga tctggattac tttgtccagt cgagtataga caatgctgaa 600 

agtgaggaaa ctctggacac ggaagtgaaa atctttcaga atgcactcga cttctcggcg 660 

gtcaagatac gcgactgtat cgttccacgg acagaagtgg tagctgttgc gctggataca 72 0 

tcccttgaag aacttaaagg ccggtttgtt gagtccggta tatcaaaaat aattgtctac 780 

gatggtaata tagataatgt ggtgggatac attcattcgt ccgaaatgtt tcgtagcccg 840 

aaagattggc gcgatcatgt gaaagaagtt cccattgtgc ctgaaacgat ggcggcccat 900 

aaactgatga agctgtttat gcaacagaaa aagaccattg ccgtggtagt ggatgaattt 960 

ggaggaactt cgggtattgt cagccttgaa gaccttgttg aagaaatttt cggtgacatc 1020 

gaagacgaac acgacaacac ttcctatatt tgtaagcaga tcggtgagca tgaatacgtg 1080 

ctttcagccc gtttggaaat agaaaaagta aacgaaactt ttaatctgga gcttcccgaa 1140 

tccgatgact atctaacggt gggaggatta atcttgaatc aataccagag ctttccgaaa 1200 

ttgcacgaat tggtctctgt cggtaaatat cagtttaaga taattaaggt tacagcaacg 1260 
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aaaatcgaac ttgtccgact gaaagtaatg gaataa 129 6 

<210> 873 
<211> 1500 
<212> DNA 
<213> B.fragilis 

<400> 873 

aagaagagaa aaaaggttta tttccaatta atttgtattt ttgaacttct gaataataac 60 

ccaagaccat ttatgaagaa aaaaaatatc ctattctttc tattatgctt tctcctgaca 120 

agcctatcgg cacaaacttt ggaacaagca agaggcatgt atggcagagg gcaatacgct 180 

gaagccaaac ctgtttttca aaaatatgtc aaatcgcaac cggcaaacgg taattacaac 2 40 

ctatggtacg gtgtgtgttg cctcaaaaca ggtaatgctg ccgaggccct aaaatacctg 3 00 

gagacggcag taaagaaacg cattccgagc ggacaactat atctggctca gacttataat 3 60 

gatttatacc gctttcaaga tgcagtagat tgctacgaag aatacattgc agacttgtct 42 0 

aaacgcaaaa aaccgacaga agaagccgag cagcttttag aaaaggctaa aggaaacctt 480 

cgcatgctga aaggggtgga agacgtatgt gttattgaca gttttgtaat agacaaagcc 540 

aatttcctca aagcttataa aatcagtgaa gaatccggaa agctcttcac ttacaatgac 600 

tatttcaaga cgaaaggcta tcatccggga acagtttacg aaacagaaat cggaaaccgt 660 

atttactaca gcgagcaggg agaagagagt ctgaatattc tgtctaaaac caagatgctg 72 0 

gacgagtgga gtcagggaaa accacttcca ggaagtatca acgcctccgg aaatgccaat 7 80 

tatccgtatg tcctgtcgga tggagtgacc atttattatg cctcggatgg tgatggctcc 840 

atgggaggat atgacatttt tgtaacccga tataacacaa acactgatac ctatctggta 9 00 

ccggaaaacg tgggtatgcc tttcaactca ccttataacg actacatgta tgtcattgat 9 60 

gaatataata atttaggatg gtttgcttct gataggtatc aacctgaaga taaagtttgt 102 0 

atctacgtat tcgtacccaa tgattctaaa cgaacttaca actacgaagc tatggaaccg 1080 

gaaaaaatga ttgaactagc ccagctccat tctctagaga gcacttggaa agactctaaa 1140 

atagtggatg atgcccgtca acgacttgaa gcggttatca accataaacc ggctgtagaa 12 00 

caaaactttg attttgaatt tatcattgat gaccactcta cctaccatca cttaacggat 1260 

ttcaagtctc caaaagccaa acaactgtac ctgaaatatg aacagatgga aaaagattac 132 0 

cgccagcaaa ccgataaact gaagagccag cgtgaaggat tcgcacggtc taataaagac 1380 

gaacaaagta aaatggcacc ggctatccgc gatcttgaga agagggtact tcagatgtca 1440 

gaagaactgg ataaacaggc tattgaagtc cggaatgcag aaaaacaaaa cttaaaataa 1500 

<210> 874 
<211> 552 
<212> DNA 
<213> B.fragilis 

<400> 874 

tcattactga aactttttct tacgaacgta gtcttggaga caacagatag cgataaacat 60 

tggtatgtcg tactgacacg aactaactca gaacgtaaag ttcgggatta ttttcaattg 12 0 

caggaggtag atacctttct tcctgtacaa aaccgtgtca tagagcgtga aggaaaacgc 180 

attgagcgtg agcgcctttt gttgccccgt atggttttcg ttcatatctc ccgtcaggaa 240 

atggctgccg tccgaagtac actgaatgta tacgatttcc ttcgagatcg ttctaccggg 3 00 

gctcctacct gtatccctga tgcgcaaatg gctgattttc gctatatgct cgattactcc 3 60 

caggatcagg tgatcctgac aggagagtcc attcccaaag gtactcgtgt agtagttgcc 42 0 

aagggcgatt tacaaggctt gcggggagaa ttggtccgct acaataataa atatcatatt 480 

ttagtacgta tcgatatgtt cggtagcgct atggttacaa ttccggctag ctacgtccgg 540 

aaagagaaat aa 552 

<210> 875 
<211> 1497 
<212> DNA 
<213> B.fragilis 

<400> 875 

gatacaatga aagcaaatga tttactatcc caatttggtg atcatcgcca aagagttcaa 60 

agcgccatta cctccgtttg cgaaggaaga gggattcttc tggtagatga cgaaaaccga 12 0 
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gaaaatgaag gagacctgat cttctctgcc caaagtatga cagaaacaga catggccata 180 

atgatacggc attgtagtgg cattgtttgt ctgtgcatca cggaggagaa agcccgacaa 2 40 

ctgaacttac cgttaatggt ggagcaaaac accagtaaat atggtaccgc attcaccatt 3 00 

tcgatcgaag cagcagaggg agtgaccacc ggagtatcag cagccgaccg catacagact 3 60 

atccggacgg ccattgcccc caatgccact cccgaatctc ttcatcatcc cggacatata 420 

ttcccattga tagcccgttc cggcggaatc aaagaacgga gcggtcacac cgaaggcagc 480 

atagatctga tgaaactggc aggacttgca ccctgtgctg tactttgtga attgaccaat 540 

gatgacggaa ctatggcgcg cttacccgaa atcatagaat tcgggctaga acacaaatat 600 

ccggtagtga caatcaatga tttaaaagaa tatcagacag cccccgactt ccttcccaag 660 

ctggtaggct ccttctcttg tccggcagcc gaaaatccga ctactgccat tatggaagca 720 

gcattccgtc accactatat gcactacaga tatatcaact gtgaagtcgg tcccgaaaat 780 

ctggcggctg ccatacaggg tgcaaaagcc atgggatgga gaggattcaa ctgctctctg 840 

cctaacaagg tggaaatcat ccgatacttg gacgaattag gagaatcggc aaaaatcatc 900 

ggtgcagtga atacagttgt cattggagac gatcagaggg ctaccggaga aaacacagat 960 

ggaaaaggtt ttgtaaaagc catttcggaa attat caeca tccaagacaa gaagattgee 1020 

ttgttaggtg ctggcggtgc tgcccgtgcc ategctgteg aaatggctct ggcgggagtg 1080 

aaagagatca ctatcttgaa cagaaacaga gaeaaaggee aggcattggc agatttactg 1140 

aacagecaaa eggttgeaac tgccagattt gtactttggg accatcctta teggttttec 12 00 

acagatatcg acattgtcat caatgccacc tcagtagggc tttatcccaa cgtagaccaa 1260 

cgcttggaca tagacacaga taccttattg ccacatatgg tagtagcega ctgcatcccc 132 0 

aacceggtat acacccaact gttgagagat gcggtccgcc gcggttgttg tcatgtattg 13 80 

cccggaatga aaatgctggt etatcaageg gtcattgeca taaaatactg gtcaggagtt 1440 

gatgtcgatc eggatataat gctggaaaaa ttaaaagaag tagtaaaacc tgcttag 1497 

<210> 876 
<211> 327 
<212> DNA 
<213> B.fragilis 

<400> 876 

tgtaagatga agagtttaaa atatatagta gcattggcat tggeggcagg actgtttcaa 60 

gcttgtgatt tagagegcta tccactgaca gacttgtccg aagagacttt ttggaatagc 12 0 

gaatcgaatg eggaattgge attgacttct ctgtatagag gaagectgae agaeggegta 180 

gagtataacc etteggattg gtggtcctat caeggaatga ttatgatgga gcatctttcg 240 

gataacgett ttgacegteg gggagagaac aatcctttct ttaagatttc gagtggaaac 3 00 

cctgactgca gaeaatgett ttattaa 327 

<210> 877 
<211> 921 
<212> DNA 
<213> B.fragilis 

<400> 877 

tttgtggtga acatatttat taaactaaaa cctttcaata acatgaaaaa gtattttcct 60 

tcctccgaat taattatcaa egaagaeggt teggtattec atttgeatgt aaagccggaa 12 0 

tggttggcag acaaagtaat attggtaggt gatcccggac gggtggcact egtagcttet 180 

cacttcgaaa ataaagaatg tgaagtggaa agecgegaat ttaaaacggt taceggaact 240 

tacaaaggca aacggataac tgtcgtttct aceggtateg gttgtgacaa tatcgatatc 3 00 

gtggtcaatg aactggatgc tttggcaaat atcgacttcc agacteggga agaaaaagag 3 60 

catctccgct ctttagagtt agttcgcatc ggtacatgcg gaggattgea acccaacaca 420 

ccggtcggca cattegtctg ttctgaaaag teaategget ttgaeggact gttgaacttc 480 

tatgeeggae geaatgetgt ttgtgacctt ccctttgaac gggcatttct gaatcacatg 540 

ggctggtccg gtaacatgtg tgctcctgca ccttatgtta ttgatgccaa tgcagaatta 600 

atagacegta ttgegcaaga agatatggtg cgcggtgtta etattgeage cggtggtttc 660 

ttcggaccgc aaggacgega actccgtgtt cccttggcgg accctaagca gaatgataaa 72 0 

atcgaaaagt ttgaatataa aggttacaaa ataaccaact tcgaaatgga gagttccgcc 7 80 

cttgccggcc tcagcaagct gatgggacac aaagccatga ccgtttgtat ggttatagct 840 

aacegcttga tcaaagaagc gaacacaggc tataagaata ccatcgatac attaattaaa 900 

actgttctcg atcgaatctg a 921 
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<210> 878 
<211> 1161 
<212> DNA 
<213> B.fragilis 



<400> 878 

gacaaaaaga 

gctattgaca 

gtaatattaa 

gatgatagag 

aagttcttat 

aaaaagaaaa 

gttataattg 

tcctttaaat 

ccgaagtttt 

gttaataaat 

catattattt 

tctgagataa 

tcattaaaga 

cctaaaatgg 

gctttagctc 

aaaatatgtg 

ccttattact 

atagccgaac 

aactcttatt 

gaatttttaa 



agatgaatat 
caagattact 
atcgttattc 
ttagtgtaat 
taatttatac 
tagatgttct 
cacgttgtat 
caagaaatgt 
gggacggtgc 
ttattaaaat 
gtgaaaaaga 
tttatatgat 
tggtaataaa 
agattcttca 
tatttatacc 
aatatacagc 
ttgaggataa 
agttagactg 
tattggggcg 
aagagttgta 



agtttttgta 
ctcatttgca 
ggcattgaaa 
agaactttgt 
aataataatt 
ccatgtggct 
aggtgctaag 
ataccatcgg 
tatttgtatt 
aattcaaatc 
agcctctcct 
aatagacgct 
tggtaatcct 
aaatttaaaa 
actgagaaat 
gtgtcatggc 
aataaatgca 
gctatatgac 
tagagttttc 

g 



ggactttctg 
aacttgttta 
cctaattggg 
aatttgaagg 
gaattctgga 
tcaggacatt 
gttgtttatc 
attaatggta 
agtcattttt 
cctcccatct 
tatttgcttt 
tataataaat 
gtagtgattt 
tatacagatt 
accattcaag 
gtagtggtga 
ttgattgcag 
aatatggatc 
aatttaattt 



gagttccata 


tgcgcgtaga 


60 


cttcatcgaa 


tcacgatgtt 


120 


aatatgatgc 


t caatttatt 


180 


gaatacccaa 


atgtatgtca 


240 


aacttatatt 


cttgaataag 


300 


ttatagatat 


tttctattat 


360 


attattgtga 


atatagatct 


420 


2 ~? f" 4. — /~f -~> 4— 3 23 

aaUCyaLCaa 


4— 4— /-fy-if- -a 4- rT /— I 4— 


A P ft 
you 


tggtatctaa 


gacaaaagaa 


540 


gtgattatga 


ttattttgat 


600 


tttgtggcta 


tacaagctat 


660 


ctaggattaa 


agaagtggct 


720 


ctgagattag 


gagatattgt 


780 


tgatttctat 


gtttaagggt 


840 


atattgcccg 


atttcctaat 


900 


caactcggta 


tggggaaata 


960 


atgattttaa 


cgtggcatca 


1020 


agacagcacg 


aattaaaaag 


1080 


cttataaaga 


ttctgtgaca 


1140 
1161 



<210> 879 
<211> 210 
<212> DNA 
<213> B.fragilis 



<400> 879 

aaagccaacg aatcagctgc agaggcaatc aactttatca ccccggcaac gaaaaaacca 60 

atgacgaagg taagccccat catggcaaaa gtggataaca ataagctaaa cataacacag 12 0 

tatgatttta gattgaaaaa taaaatgcaa tgtataccga ccgccggaaa atacaagacg 180 

cacctccccg gcatttatcg gaatggataa 210 



<210> 880 
<211> 903 
<212> DNA 
<213> B.fragilis 



<400> 880 

atcatgatag aacaaccttc cccaaaagta tatgatgaac ttctttccat ctgggaagaa 60 

gctgtccgaa gcacacacca tttcctgacc gaagcagaca tacaatttta taagccgctg 120 

atccgacatg aatatcttgc cgcagtccga ttgtacatca ttcgcgaaga ttcaggaact 180 

attgcagcct tcatgggatt aagtaatgat tgcatagaaa tgttgtttgt ccgtccgaat 240 

gcccatggac atggctacgg tagtcggctg gttgaatttg ccattcggaa aaaacgaatc 3 00 

tataaagtag acgtaaacga acaaaatgca gcagcactgg gattctattt acatatggga 3 60 

tttgagacta ccggtcgcga tgcattggat gcaacaggta agtcattccc cattttacac 420 

ctgcaaattc ctcccatccg actccgaaaa gcaactcttg aggatataga tctgcttaga 480 

accctattta cacaaagcgt gcagaatacc tgttcggctg actataacag gttacaaatc 540 

caagcatgga ccggacgggg aactctacaa cgttggcatg aactgtttca aagcgaccta 600 

tactttctgt tggcagaaga cagcagaaag tctcaagtgg caggattcac atctgtcaac 660 

tctaaaggat atctgcatag catgttcgta catcccgact atcaacgcca gggaatagct 720 

tcgcgcctat tgctaaaagc agaagagtat gtccgtatcc ggcaaggcgt atctgtctat 780 

tcggaagtga gcatcactgc ccgcccattt tttgagaaac acggctatag tatcgaaaaa 840 
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gaacaaacag tatctgttgg tgacatagaa atgactaatt tcttgatgta taaacgaatt 9 00 

taa 903 

<210> 881 
<211> 192 
<212> DNA 
<213> B.fragilis 

<400> 881 

gaatggtcag agtgtgaacg gagttttatg cgtttgttta accgtatcca agcaagtata 60 

catgatatct attgggaaaa tcagtgtaca gaccatctca tgacagatga attgcgaaga 12 0 

gaggaggggg aacctttact gctatctgtg gcagatgtta tggaaatttt agttcggaaa 180 

attttgaaat aa 192 

<210> 882 
<211> 1305 
<212> DNA 
<213> B. fragilis 

<400> 882 

aaaacaagag aaacaatgaa aatagaaaaa attacaggac gagaaattct cgactcaaga 60 

ggtaacccta cagtagaagt agacgtagta ttggaatcag gcattatggg acgtgcatcc 12 0 

gttccatcgg gtgcatcgac cggtgaacac gaagcgcttg aactccgcga cggtgataaa 180 

catcgttatg gcggcaaggg tgtacagaaa gcagtagaga acgtcaataa agtcattgct 2 40 

ccgcatctga tcggtatgtc ggccctcgac caaataggca ttgatcatgc catgttggca 3 00 

ttggatggaa caaagacaaa agccaaattg ggtgccaacg ctattctggg tgtttcgctt 3 60 

gccgtagcta aagctgctgc caactatctc gatattcctc tttacagata catcggaggt 42 0 

acgaatacgt atgtattgcc tgtaccaatg atgaacatca ttaatggagg ctcacacagt 480 

gatgctccga tagccttcca ggagtttatg atccgtccgg taggtgcaag ttcttttaaa 540 

gaagggttgc gcatgggtgc cgaagtattc catgctttga aaaaagtatt gaaagaccgt 600 

ggtctgagta cagctgttgg tgatgaaggc ggttttgctc ccaacctgga aggaacagaa 660 

gatgcactta actctattct tgccgctatc aaagctgcag gctacgaacc gggcaaagat 72 0 

gtaatgattg gcatggactg cgcctcttcc gaattctatc atgacggtat ttacgattac 7 80 

accaaatttg aaggtgaaaa aggcaaaaaa cgtacagctg acgaacaaat tgactatttg 840 

gaaaaactta tcaacgaata tccgattgat tccattgagg atggtatgag cgaaaatgac 9 00 

tgggaaggct ggaagaaatt gactcaacgc atcggggatc gctgtcagtt ggtaggcgac 9 60 

gatttattcg taactaacgt tgacttcctg gcaaaaggta ttgaaaaggg ttgcgctaac 102 0 

tctatcctga tcaaggttaa tcaaatcggt tcactgacag agacactgaa cgctattgaa 1080 

atggcacacc gccatggata tacgacggtc acttcacacc gctcaggcga aacagaagat 1140 

gcaaccattg cagatattgc cgtagcaacc aacagcggac aaatcaagac cggttctcta 12 00 

agtcgttcgg accgtatggc aaaatacaat cagctgcttc gtattgaaga agagttggga 12 60 

gaccgcgctg tatacggata taaacgaatt gtagtaaaag gctaa 13 05 

<210> 883 
<211> 543 
<212> DNA 
<213> B.fragilis 

<400> 883 

ccggttatgg ataccattca gataaaagat aaactattca ctgtttctat cagggaacaa 60 

gagattcaga aagaagtgat tcgcgtggcg aacgaaatta atcgtgattt ggcaggtaag 12 0 

aacccgttgt tcctcagtgt gttgaatggc tcgtttatgt ttactgccga cttgctgaaa 180 

cacattacga tcccttgcga gatctctttt gtgaagctgg cttcttatca gggagtatca 240 

tctaccggtt ccattaagga agtgatcggt attaatgaag acatagcggg acgtacgatc 3 00 

gttattgtag aagatattgt ggatacggga ctgactatgc agcgtctgct ggaaacattg 360 

ggaacacgcg gaccaaaaga aattcatatt gcttcgttgc tggtgaaacc ggataaactg 42 0 

aaggtggact tgaatattga atatgtggca atgaatattc ccaatgattt cattgtagga 4 80 

tatggtctcg attatgatgg tttcggccgt aactatccgg atatttatac agttgtagac 540 

taa 543 



358 



<210> 884 
<211> 477 
<212> DNA 
<213> B.fragilis 



<400> 884 

caaactatgg atgtactgat catcattgca ctgatagccg ccgcagtaat actcttttta 60 

gttgaactgt tcgtaattcc gggtatcagc ctcgccggta tttcagcttt ggtctgcatt 12 0 

atctatgcaa actattatgc ttttgctaac ctgggaacag gtgcagggtt tataacactt 180 

attatatcgg gaattgcctg tatcggttcg cttgtctggt tcatgcggtc gaaaaccttg 2 40 

gataaattgg cattgaagaa agacataaca tccaaaatag accgaagcgc tgccgaaaaa 3 00 

gtaaaagttg gcgatacagg tatcacgatt acccgactgg ctcaaattgg caatgctgaa 3 60 

atcaatggca atatcataga ggtcaagtca atggacggat tactgaatga aaaaactccg 42 0 

attgttgtca atcggatcac tgatggaata atctttgtcg aaaaattaaa atcctaa 477 



<210> 885 
<211> 528 
<212> DNA 
<213> B.fragilis 



<400> 885 

aagaatatgg attggaataa aaagataatg cgaatttcac tgctggtttt cacactggta 60 

gtaggaattt cgtgtactgt ttcttataag tttaatggtg gtaatatcaa ttacgataag 12 0 

gtaaagacta tctctattgc cgactttcct attaagtcgg actatgttta tgcaccgtta 180 

ggcactaagt tcaacgagga cctgaaagac attttccttc gtcagacccg tctgaaactg 2 40 

gtgaataaca atgccgacct cgagattgat ggagagatta ccggatataa ccagtataac 3 00 

caggctgttt cggccgacgg atactcttct gaaaccaagc tgaccatcac agtgaatgtt 360 

cgttttgtga acaatacgaa tcatgaacag gacttcgagc aacagttctc ggctttccgt 42 0 

gtttatgatt cgagggagtt gctaacagcc gttcaggacg gactgattgc ggagatgact 480 

aaagagatta cagatcaaat atttaacgca acggtagcaa actggtaa 52 8 



<210> 886 
<211> 1068 
<212> DNA 
<213> B.fragilis 



<400> 886 

aataaaccgg aaatgaaaaa gctaattata ctgacagggc tgttactctc tacctcggct 60 

tatgcccaga ccgaagttac agcgggagtt acccggggaa aagattacgg tgtaacctat 12 0 

gcacttccta aaacagcaat caatattgaa gtcaaagtca ataaagtgac atatactccg 180 

ggagaattca gcaagtatgc cgaccgttat ctccggttga ccgatgtgtc gggtgagcct 240 

caggaatatt gggaactggt cagcgtcaaa gcaaaatctg tcggtatccc cgatagcgaa 3 00 

catacctatt ttgtcaagct gaaagataaa acagtagctc cgctaataga attgaccgaa 3 60 

gatggtatcg taaaatcaat caacgtaccg ctatctccta aaaaatcggc tccgatgcaa 420 

cccgccacga cacagaaaaa gaagataaat ccacgtgatt ttctgaccga agagattctg 480 

atggcaggtt ctacggctaa aatggcggag ttggttgcca aagagattta taacattcgt 540 

gaaagtaaaa atgccctggt acgcggacag gcagacaaca tgcccaaaga tggggagcaa 600 

ctgaagatta tgctcgccaa cctggaagag caagaggctg ccatgaccga aatgttctcg 660 

ggtaccttga ataaagacga aaagatattc aacatccgcc tcactccgga taaggaaatg 720 

gacaacgaag tagctttccg cttttcgaag aagctgggca tagttgccaa taacgatctt 780 

gcaggagagc cggtttatat cacgctgaag aatctgaaaa ccgtcaacgt accggaagac 840 

gatggcaaaa agaaggtgga cggcattgcc tataatgtgc ccggcaaagc acaagtaaca 900 

ctaacggagg ggaaaaagca atggtttaac ggagaacttc ctgtcacaca attcggtacc 960 

atcgaatatc tggctccggc gcttttcaat aagaaatcga ctgttcaggt tactttcaac 1020 

ccggatacag gaggcttgat caaggtagat agagaagaag gagaataa 1068 



<210> 887 
<211> 3054 
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<212> DNA 

<213> B.fragilis 

<400> 887 

acagagaagg aagatacaga aatgaaatta aaattcaaac atcagaagtt tcaggaagac 60 

gcagcaaaag cggtatgtga tgtctttggc gggcagccat acaagacgtt cgactatcaa 12 0 

gtagagaccc ggaagaaaga cggacagacc agctttgaaa agtttacagg attccgcaac 180 

caccctatcg tacctcaact cacagatgag atcgttctga aacacatccg ggatatccag 240 

cgtgcccaac aaatcaaacc gtcggaagcg ctggaaggga aatacaatct caccatcgaa 3 00 

atggagacgg gtgtaggtaa aacgtatacc tacatcaaaa ccatctttga actgaacaaa 3 60 

cgctacggtt ggtgcaagtt catcattgtc gtacccagtg ttgccatccg cgaaggagtc 42 0 

cacaaaagcc tggagattat gaaggaacac tttgcctcgg attacagcac ccctctgtct 480 

tatttcatct acgactccaa acagttgggt gaattgaacg catttgtcac agacagcaaa 540 

atccatgtaa tgatcatcaa ttcacagaag ttcaatgcaa cgaataaaga tgcccgccgc 600 

atctacatga agctggatga ttttggcgga aactgtccca tcgatgtgat tgcgcagatg 660 

aatccgatac tgattatcga cgaacctcag tcagtagaag gagccaaaac aaaagaggga 720 

ttgaaacgat tcaatcccct gttcacactg cgttattcgg ctacacaccg cgaactctat 780 

aatctggtct atcgcctgga cgcaatggaa gcttacaacc tgcaactggt taagaagatc 840 

gctgtcaagg gtatctctat cagtgggaca acagctactg aaggattcgt ttatctggaa 900 

ggtttgaacc tgtatccgga caaaaacccg actgccaata tcggattcga aataaaaaga 960 

accaaagcag tgaatcaggt agtacgagct ctgaagataa atgatgactt gtatgctaaa 1020 

tcaaaccatc tggaagaata ccggaacgac tatgtaatta cagatatcaa cggcgttgaa 1080 

gactccgtca ccttccggaa cggcatcaaa ctttatgcag gtgacgtagc gggtagcgtc 1140 

aacgaaactc aactacgacg tatccagata cgggaaacca tcttgtcaca catagaaaaa 12 00 

gaacaggaac tgtttgagaa agacatcaaa gttctctccc ttttcttcat cgatgaagta 1260 

gccaaatacc gccggtataa cccggacgga aagggagaat atgccgagat tttcgaacag 132 0 

gaatataccg atatcataaa gcacctggat ccttcgttat tcaatcagcc ggaatatatc 13 80 

gattacctga aatcgactgt ggcatcgaaa gctcacgaag gatacttctc caaagataaa 1440 

aaaggaaaac tgattgacag taaaaccgag cggggaacca aagaatcggc agatgaagat 1500 

gcttacgatt tgattatgaa gaataaagaa cgtctgcttg accggaaaga gccgatccgc 1560 

tttattttct cacattccgc tctgcgggaa ggatgggaca acccgaatgt ctttcagatc 162 0 

tgtaccctga aacaaagttc ggcagaggta cgcaagcgtc aggaagtggg acgagggctg 1680 

cgcctctgtg taaacggaca gggagatcgc atggacgcca acgttttagg cgaagaagtg 1740 

catcgtgtca acctactgac cgtgatagcc agcgaatcgt acgaatcgtt tgccaaaggc 1800 

ttacagacag aaatggcgga agccatagcc gaccgtccac agaaagtaac catccaatta 1860 

ttcaaggacc agtcgctccg attagctaac ggtgaaacca tcatagccac cgaagatata 192 0 

gcacaaagta tctacgactc tttacttgaa aacaagtaca tcaagaaagg agaactgaca 19 80 

gacaaattct atgaagaccg taaacaggga gaagtgattt tcgacgacga gctcaccgat 2040 

tataaggcgt ctatcatgac catcctggcc tctatctata atccaaggga gatgcagccg 2100 

aacgatgcaa ggaaaagtaa gataaatctt cggttgtcaa aagataaact tgaaaacagc 2160 

aaacttcagg aactgttaaa actgctatgc agtaagtcaa cctacaccgt aaagtttgac 222 0 

gaaaaagaat tggtagagag agcgatcgaa agtttaaatg aaaagttaag agtatcccag 22 80 

ctctatcttt ctgtcattac aggccaaatg gaaaaaatca agtccaaagc agctttaatt 2340 

tcgggagagg catttaaggt agatgccaat caggcgcact atgaaaagat agatgccatg 2 400 

gcaaacgatc aagtaaaata tgacttgctc ggtaaactca cagacgccac caatctgacc 2460 

cgacaggcag ttgctcagat tctctcccgg ataaaaccga atgtattcgg ccaattcaaa 2520 

aacaatcccg aggattttat tatcaaggct tcggaactga ttaatgaaga aaaagcatgt 2580 

ctgatagtaa aacatatcga atatacccca atcgaccagt actatgatgt atcggtcttt 2640 

acccgggcaa ctattcaggg gcgtttggga gtaaacacaa taaaagcaga taaacatctg 2 700 

tacgatcatg tgagattcga ctcccaaaat gaaaaaacat tcatggaaag actggaagaa 2760 

aatgacgaaa tagaagctta tgtaaaacta cccggcaatt tctatatccc tactccgatg 2820 

ggaaaatacc atccggactg ggccatcgtc ttcaaacaaa agttatcgaa gtatccttat 2880 

tttattgccg aaaccaaagc cagcgattcc tccctacaag atcggagaat agaagaggca 2940 

aagatcgaat gtgccaaaaa acattttgcg aagacaaacg gtgggaagct taaatataat 3 000 

aaagtaagct ccttcgaaga actcttgaaa atcgtcacac aagaatccgt ttaa 3 054 



<210> 888 
<211> 1251 
<212> DNA 
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<213> B.fragilis 
<400> 888 

tcaggaaggg agtttagtgc gaatatgaca aaagcggaaa tacaacaggt aaaactaagg 60 

ttcggtatta ttggtaacac tgaagctttg acgcgtgcga tagatgttgc catacaggtg 120 

gcacctaccg atttgtccgt gctaataacc ggagagagtg gtgttggtaa ggaaagtttc 180 

cctcagatca ttcaccaata cagtcgccga aagcatggac agtatattgc tgtcaactgt 240 

ggtgctattc ctgaaggaac catcgattcg gaactgttcg gtcatgaaaa aggggctttt 30 0 

acgggagcca ttggtgagcg aaagggctat tttggtgaag ccgacggcgg aactattttt 3 60 

ctggatgaag tcggagaatt gcctttgccc acgcaggcac gtttgcttcg tgtactcgag 42 0 

agtggggagt ttataaaagt aggctcctcc aaagtacaga aaacggatgt ccgcattgtg 48 0 

gctgctacca atgtcaattt gacccaggcc attgcagagg gacgtttccg tgaggattta 540 

tactatcgtc tcaatacggt gcccatccag atccctcctt tgcgggagcg tggagaagat 60 0 

gtgctgttac tgttccgtaa gtttgcaagt gactttgcag agaagtatcg tatgcccgcc 660 

atacagctga ccgaagatgc caaacgggtt ttgctgtctt attcctggcc gggtaacgtg 72 0 

cgtcagttga agaatatcac ggagcaaatc tctataattg agaccaaccg tgagattaat 78 0 

gcccctatct tgcaatctta tctgcctgcc cagagtacgc agcgattgcc tgccctgttt 840 

ggtgtaaaga cagggaagag cttcgaaagt gaacgtgaaa tcttatatca ggtccttttt 900 

gacatgcgac aagatgtgac cgaactgaaa aagcttgtac acgaaattat gtccgagcgc 960 

ggagcggtaa cctccaatgt cggtacgttt tatacgccgg ctccggtagt agcccctacg 1020 

ccctcagtgc ctgccatcat tcatccggtc aagcccaatt gtcccgatga cgatgacata 1080 

caagataccg aagagtatgt ggaagagtcg ctttcgttgg acgaagtcga gaaagaaatg 1140 

atacgtaaag cccttgaaaa gcatcatggc aagcgaaaaa gcgcggcaaa ggatcttaat 12 00 

atatccgagc gtacccttta ccgaaaaata aaagaatatg gattggaata a 1251 

<210> 889 
<211> 1410 
<212> DNA 
<213> B.fragilis 

<400> 889 

tttctgagtt taaatattac aatgacagaa caattgaaaa acaaattgag tgactccaaa 60 

acacttcgtt ggagtgtgct cgctctggtc gcgtttacta tgctttgcgg ctatttcctc 12 0 

accgatgtaa tgtccccttt aaagcctatg ctcgagaaag agcttctctg ggatagtttg 180 

gactacggat tctttaccag tgcttacgga tggttcaatg tattcctgct catgttgatt 240 

ttcggtggta ttattctcga taagatggga gttcgtttca ccggtatggg agcttgtata 3 00 

ctgatggtgt tgggttgtgg actaaaatat tatgctatct ctactacttt ccctgaaggt 3 60 

gctttgatta tgggtttcaa gactcaggtc tttctggcgg ctttaggata cgctatcttt 42 0 

ggtgtcggcg tagagattgc cggtatcact gtctctaaga ttatcgtgaa atggtttaaa 480 

ggcaaagaga tggctttggc tatgggactc gagatggcta ccgcacgtat cggtaccact 540 

ttggctatgg tgcttaccgt tcccattgcc gattatttcg gctatacgga tgaaagcggc 600 

agtttccata ccaatattcc gatgcctatt ttgttgtgcc tgatcatgct gtgcatcggt 660 

actatcgcct ttttcattta taccttttat gataagaaac ttgacgcttc tttagatgct 72 0 

cagggagaag aaccggaaga accgttccgt atgaaggacg ttatgctgat tgtcaccaat 7 80 

aaaggcttct ggctgattgc tttattgtgt gtactattct attctgctgt tttccccttt 840 

attaaatatg caaccgacct gatggtgcag aagtataacg tagaccctaa actggccgga 900 

aatattccgg gattactacc gataggtacc atcttcctga ctccgttgtt tggtactctt 960 

tatgaccgta tcggtaaggg agcgacgttg atgattatcg gtgccgtcat gctgattggt 102 0 

gtgcatactt tgtttgcgct tcccattctg aacgtatggt ggtttgccac tgtgattatg 1080 

attgttctgg gtattgcttt ttcactggtg ccttcggcca tgtggccttc tgttccgaaa 1140 

attattccgg agaaacaact gggtactgcc tatgctttga ttttctgggt gcagaactgg 12 00 

ggattgatgg gggtacctct gttgatcgga tgggtgttga atacctattg caaaggtcct 12 60 

gttgtggatg gagcgcagac ttatgactat actttgccta tggctatctt tgcttgtttc 13 2 0 

ggtgttttgg ctctgattgt agctttaatg ctgaaagcgg aagacaagaa gaagggatac 13 80 

ggactgcagg aagcaaatat caaaaaataa 1410 

<210> 890 
<211> 813 
<212> DNA 



361 



<213> B . f ragilis 



<400> 890 

atgatttcac 

aacatttctc 

tttaattgtt 

ttgtttcaat 

acgaaaatac 

atgctaaacg 

acggcagact 

catgccgggt 

gctgtttttg 

caatcattcg 

tcgcgtatct 

aaccggcaac 

tgtacttaca 

cgtattttgt 



tcactgacga 
attttgtaac 
cacctttttc 
cactatcgca 
ttccggtcga 
ggattgatgc 
gtattccggt 
ggagggggac 
gaacggaagg 
aggtggggga 
ctttcaggca 
agcttttgga 
tccggcatga 
cgggcattat 



tagaaaaatg 
gacccgtcac 
gggagatgaa 
agctcctagg 
tgaaaaattt 
gttgattacc 
attgttgtat 
agtggagtat 
acaagatgta 
tgaagtttat 
ttcggttaca 
ttttggagta 
ggatttcttt 
gataaatagc 



ttggggtatg 
ggcggttata 
cttgaaaggg 
catttgatta 
cttggagctt 
actgagccgg 
gatagagtac 
attgttggac 
atcgcatgta 
gaagcttttc 
cataagtacc 
ccgggagtac 
tcagcgcgaa 
tga 



ggctgttggg 
gtgagggggc 
tagagaagaa 
ttccttttca 
ctgggcagca 
gatgctgcat 
atcatgctgt 
atacgcttga 
tcggtccggg 
gtttgaatgg 
atattgactt 
aaattgaaat 
gattgggcat 



cgcatacccc 
gtatgcttct 
tcagacgttg 
gacacacgga 
gcaacaggaa 
ttgtatttcc 
agcggctgtg 
gaagatgcgg 
tatctctcta 
ttttgatatg 
atgggaagcc 
agcggatatt 
aaagtccgga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

813 



<210> 891 
<211> 1263 
<212> DNA 
<213> B. fragilis 



<400> 891 

tgcgaaattt 

aataccatat 

tttctagttc 

ttagtgatct 

gatactgata 

ttattctcac 

gttcttttag 

atatactatt 

tctcctataa 

caagtaacag 

tgttttatat 

atacatagtt 

atattaatat 

ttgttttcta 

acttattcta 

atgttgttgt 

atgaacataa 

tatatccgat 

cgtgataaaa 

aatttccaaa 

aattctattt 

taa 



gggtttactt 
ttattatatt 
ttttattatt 
ctatgtcatt 
ttataagata 
taatgtttgt 
tgtgtacttt 
tttattttct 
atatattatt 
atactataaa 
gtaatgaaaa 
caattctgat 
tactttttat 
ttattcttcc 
ttgttggtga 
ctgcaatata 
tatttatata 
ttgctaattt 
agagatatag 
tgacatattc 
ttcagatttt 



ctgctttatg 
aatattgtta 
ctgtaaacga 
tgctttatta 
ctataacgct 
attagagaat 
tgctaatgtt 
atctttactt 
agttactttt 
aaatgctgct 
taaacttaag 
gttgcttcct 
actagctgta 
agatgttggg 
tgctcaatct 
tctttcaata 
tttaattatt 
tgcccatttc 
tctagtgatt 
taggacattg 
attctctaat 



tatactaaaa 
ttttggttat 
ctttccatat 
gcatatactc 
tattatccat 
aatcttacat 
cagataattt 
aagttatttg 
atttctattt 
tcatttgcta 
attattttat 
ttgtttttat 
ttaatatctt 
tttggaagtt 
tctatcagat 
aataaattgt 
atgtacttaa 
ctttttttat 
tttttattca 
tctggtgggt 
gtggtagaat 



aatttcttgt 
ttgctccatt 
gtcaatacaa 
aaaagtctct 
ttatcgatca 
tttcgtttaa 
ctattttttg 
agcatgaagg 
ttggatttat 
ttttctttta 
tatacattat 
ataaaaaaat 
ctcgcataaa 
tattgttgaa 
atattggtat 
ttaatgtaag 
attataataa 
ttgaatttat 
tagttgtttt 
attgttcgag 
atttatcatt 



ttttgataaa 
tttgacgata 
gtgcatgttt 
gttttatctt 
gtcttttgat 
tttaattaat 
ggttttttgt 
catatcaatt 
cctttttact 
tgcatttatc 
aggggttggc 
aaatacccaa 
tataatgagt 
gaaagcggaa 
ttcttgcgta 
taataaatat 
tcctgatgga 
ccaattgtta 
tattgttaca 
ctatatgaat 
taaagcatat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1263 



<210> 892 
<211> 1191 
<212> DNA 
<213> B. fragilis 



<400> 892 

tgctttctct 

agaatattaa 

ggaaaatatc 

atagtgaatt 

ccaaatttgt 

cttaccttgg 



gtctgaggag 
taacaggaga 
agtctgtttt 
tagagtgtcc 
cttcgtcttt 
ctaataatca 



tactttctat 
ttattgtcct 
tgaagatatt 
tgttgttgag 
gagagctgta 
tttttatgat 



gatatattgg 


acaatatttt 


tattatgatt 


60 


agaaatagga 


ttgatgatct 


gattaatcta 


120 


attcctatag 


tgaagggaca 


tgattattct 


180 


catgacgatt 


gtgctattaa 


gaagcaaggt 


240 


gaaattttga 


aattattgga 


ttttaatctt 


300 


tatggagacg 


gaggtgttaa 


gcatacactt 


360 



362 



gaatgttgca aaaatttgga tttagatttt gttggtggtg gtgagtcttt atctgcagct 42 0 

cgagctatta aatttaaaaa tttgttcgga aaacgttttg catttatcaa tgtttgtgaa 480 

catgaattct ctatagcaac acaaacgact ggtggctcaa acccattgaa tcctatatct 540 

aattattatg atatacaaaa agctagagca acagctgatt atgttattat catagtgcat 60 0 

ggaggacatg aacattatca attgcctagt ttgcgtatgc aagagacata tcgctttttt 6 60 

atagatgctg gagcggatgt tgtggtaaac catcatcaac attgttttag tggttatgag 72 0 

atttataaca ataaatatat tttttatggg ttgggtaatt tttgctttga taatcctgtt 780 

aaaagaaata gtatttggaa tgaaggatat atgttaagtc ttaatttttc tgactatgga 840 

aagattgatt tctctcttat accatatata caatgtgatc agttgcctaa ggttcgttta 900 

ttgaaggaaa gtgaaaaagc tgtttttttt gataaaattt cttctttaaa taaaattatc 9 60 

cagagtccgg atatgttgaa agactccttt tatgctttct gtatgaccaa gaggcgttta 102 0 

tatctgtctt tatttgaacc ttatccgggg cgttatctca agtatattta tcgtatgggg 1080 

tatttaccat cttttttatt ttctaaaaca aggttattta tccaaaactt tatggactgt 1140 

gaatctcatc atgatattgt gaaagaagtg ataaagataa atcgaaaatg a 1191 

<210> 893 
<211> 183 
<212> DNA 
<213> B.fragilis 

<400> 893 

gaaatattat gggggctcac cgacgaagta tataaacaat ttatagaccg acagaagacc 6 0 

ggtaagggag ccaccttgcc ggtcttcatt ttattaacta atccggtcga ctggattgag 12 0 

agaattactc ttatctttgc tatccgaaag gagaactgcg atgcagaagg attgcagaga 180 

tga 183 

<210> 894 
<211> 1575 
<212> DNA 
<213> B.fragilis 

<400> 894 

atgaaaggac tattaacctc catactgacc gtacttacct ttaccggact gcaagcccag 60 

ccacttccat ctaccccgaa attagtggta ggtctcacca tagaccagtt acgtacggac 12 0 

tatctcgaag ctttttcaac actgtatggc gacaggggat tcagaaggct ctggaaagaa 180 

ggacgtgtgt tccggaatgc cgaatatact ttcagtggca cggaccgcgc atcagccata 2 40 

gccgctattt atacaggcac cactccttcg gtcaacggca ttatcggcaa acgatggatg 3 00 

gatgtatcga cactgcgtac tgtgagttgc gtcgacgacc ccgctttcat gggcaattat 3 60 

acaaacgaaa gctcttcgcc ttcccatctc ctgacctcta cgatagccga tgaactgaag 420 

atagccaccc gtaacgaggg attggtatat gccatcgctc cattccgcga cgctgccatt 480 

cttgcagcag gacatgccgg aaatggcgca ttctggctca acaacacaac cggaaaatgg 540 

tgtggaacga cctattatag cgagtttcca tggtgggtaa gccagtataa cgaccggaat 600 

gccatcgact tccgcattgc tgatatgaca tggactcctg tccatccggt acaaagctac 660 

agtttccttc ccgaatggag agatgctgct tttaaataca aatttgacga cgatcgtgtc 72 0 

aataaataca aacgactgat tacaagccct tttatcaacg acgaaatcaa tacgctgaca 780 

gaagaactgc tggataagag cacgatgggc aaagatcatg tccccgacat gctggcactg 840 

acctactatg caggcaacta cgcccataag agcgtacagg aatgtgccat ggagatgcag 900 

gatacatacg tacgactcga tcggagcatc gcctctttac tggacatcat tgacaagaaa 9 60 

gtgggtctgc agaatgttgt tttctttatt acctccaccg gatataccga taccgaatca 1020 

cccgacctgg gactctaccg ggttccgacc agcgaatttc acctgaaccg ctgcgcagct 1080 

ttgctgaaca tgtatctgat ggctacctac gggcagggac agtatgtgga agcgtactac 1140 

gatcagcaga tttatctgaa tcacaaactg atcgaagaaa aacaactgaa tctggcggat 1200 

atacaggaaa aagccgccga atttttgatc caattcagcg gagtgaatga agtatattcc 12 60 

ggcaaacgcc tgttattggg gtcctggaca ccggacatct cgatgatacg caacagtttc 1320 

caccgtaaac gctcgggcga cctgctgatt gacgtattgc cgggctggag catcgtcaac 13 80 

gaaaatacat ccgaccataa ggtggtgcgg aaagcgcata ttccgtctcc ccttattttt 1440 

atgggcagcg gcgtaaaacc agccgtaatc aacacgcccg taaccattga ccacatagct 1500 

cccaccgtag ctcacatatt gagaatacga tctcccaatg cctgttcggc aactccgatt 1560 

accgacatcc ggtaa 1575 
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<210> 895 
<211> 549 
<212> DNA 
<213> B.fragilis 



<400> 895 

aactgttctc gatcgaatct gatgaattta tcatttgccg ccattgactt tgaaaccgcc 60 

acaggataca tggaaagtgc ttgtgcggta ggtatcgtta ccgttacaga cggagagatt 120 

acagacgaat attacagcct gattcaacca ccggagaatg aatattggcg tgcaaatatg 180 

cttgtacatg gaataacgcc gggaatgaca gagtcactcc cgggatttca tgccatctat 240 

cccgaagtca aaaagcgttt acaaggcaac gtagtagttg cgcacaatga acaattcgac 3 00 

cgcaatgtac tgaagaatac catgcggatg tacggactgg attatgatga gttatcgctt 3 60 

ccggaacgtt gggaatgtac ctgccgcatc tatcgttctt taggatacaa gccggtcaac 42 0 

ctaagcgcct gttgcgaacg ggaaggcatc gaacttaaac accacgaagc actttccgat 480 

gcccggggat gtgcaaagct atatctcaat ttccttgaaa aataccgtcc gctcagtacc 540 

ctatggtga 549 



<210> 896 
<211> 408 
<212> DNA 
<213> B.fragilis 



<400> 896 

ttattaacgc taaatcaaaa taacaaaatg tacttattat tagttatctt aatggttatt 60 

gcagccatac tgatgtgctt cattgtgttg attcagaact caaaaggcgg tggtcttgct 12 0 

tcagggttct catcatctaa ccagattatg ggtgtacgca aaactacaga ctttctggaa 180 

aaagcaactt ggggcttagc tgcatttatg gttgtgatga gcattgctac tgcgtatgtc 240 

gttccgactt cttcttctaa aacacaagat gtcattatgg aacaggcaca gcaggaagag 3 00 

cagaccaacc cttataacct gcccgtaggt actactgcac cgaagacaga cgctgctgct 360 

ccggttgaag cacctgccac agaaactccg gctactccgg caaactaa 408 



<210> 897 
<211> 1266 
<212> DNA 
<213> B.fragilis 



<400> 897 

ttttacccta gaataaccga aaggaagacg tttttttatt atttttgcat cactttatta 60 

accctaagag atttcacaaa tatgaaaaga cacgtcttcc ttttggtaac cttgtttacc 12 0 

atgagcactg ttgcagctca acaacaacca attatttccc ccaaagactc tatcccctct 180 

gtgatcgaac gcgtcaccgg aaaagagaac aaaggatttt ccgctcacat gaatctccaa 2 40 

ttatatactt catgtgctgc ctcttttact gaaaatgagt tagatgaagt tgctttcaag 30 0 

ttaaaccggt ttaagctgga aatcatagga aatatcaacc ggaagttctc ttaccatttc 3 60 

cggcaatctt ttaataaata cagcaacccc tttgctctgg ataatctgtc ctcttccgta 42 0 

gagtatgctt atctgaccta tcacctttcc gatcgctttt ccatcacggc cggaaagcaa 480 

tttcttatgc tgggaggcta tgagtactat gtcaatccga ttaaagtacg tgaattcagc 540 

gagtttaata attatgtaaa ctgctttctg gcgggagtat ctgccacttg gaatgtgact 600 

ccgactcaag aactcaattt tcagatagtc aacaaccgta acggtggaga cgcagatact 660 

taccttcacg gcctgccgac agatgtcgaa gctaccaaag tacctctgat atcgaccatt 720 

aactggaaca gttattatct ggacaaagcc attcagttga gatacgccgc ttcatgggga 780 

cagcaggcca aaggaagaaa tataatgtat cttaccgcag gcaatgttta cgaaaaaggt 840 

ccatggatcg cttatatgga tttcatgtac tcccgacaag gaatagataa taaaggcatt 900 

atcagcgcct tacctcgcat agacttggaa aacccgcaga cagcccaaca taccgagtat 960 

tttaccacga ttgccaatgt agactaccgc ttccacccta attggaatgc ttacctgaaa 102 0 

ggtatttacg aatccggaaa aatttataaa gctaacggta tctttgaaaa aggtacctat 1080 

cgccggacat ggtgcggaca agtttgtgtg gaatactatc caatgaggaa cagcgaacta 1140 

ttgatcttct tgcactatca atacaaacgg aataaactat tgaaacccgc ccgcaattta 12 00 

gatgctatag acccgaatac gcagcggatc tcgctagggc tggtatattc cataccggtt 12 60 
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ttttaa 1266 

<210> 898 
<211> 2697 
<212> DNA 
<213> B.fragilis 

<400> 898 

caaacaaata gaatgaaata tattatatac ttcatgatga tgttgtatgg ttcattatgt 60 

catgccatcg tttgcaaaca cattgtagaa agaagtgaaa cgaatactcg taaagtgtat 12 0 

cagatccaaa gggatgctct gggttatatg tggtttatga accatgccgg aatcagtcgg 180 

tttgatggga ccaagctaaa acactataaa ctgccggccg aagggcgaac catggattat 2 40 

tatatgggca attgccggtt gcttacagat aatcggaatg ggttgtgggt agtcacccgt 3 00 

aatggatatt tatggatgta caatccatca ttggataaat tcgaatgcag gaatcatctg 3 60 

gttattccga atgatgtttc ccttcatttt ctctgcgtag ataacagtag tcatatctgg 42 0 

ttttctgtcg gaaaccggtt gatagcctat caaatactat ctaatacttt tcatcgggtg 480 

gatcatagcc tggcagcgat ttcctgtatg gtagaggtgg ctccgggaga gtattttgta 540 

ggttcggatg aagggctgtt cggaattaca ataaagaatt atgccgtcga caggcaaacc 600 

ggagaattgt ccggtaaaag atatagccgg atacatgaaa tactttttca tccttatacc 660 

caaagattgg ttatgtttga ttattcggaa ggattagggg tatgggatat gaagtcggag 72 0 

caattggttg gtacttggaa ccgattgttg aatagtcggg tcagtggtct gaggatatgg 7 80 

gatgaccgga ctgttttggt agctacagat ggtgagggaa tatttcgtat ggatatcgtt 840 

aatccggata ttacatcttt tatacaaact gattttgaga atgataattc aattcgtacc 900 

aaccggattg ctgacgtgtt tgtagatgat cagaaactta tctgggtggc ggattatccg 9 60 

gaaggcgtat caatgatcga tgtggaatct cctgatgatt ataaatggta cagggcacgg 102 0 

tccggggaca gtcattcact gaccaacaat cgggtaaatg cggttctgca tgattcggat 1080 

ggggatgtct ggtttgcaac ggatcacggt atcagttgtt ttcatccgtc gacaggctta 1140 

tggaaccgga ttgttacgcc tcttccttgt cagatgtata ctgcattgtg tgaagtaaag 12 00 

ccgggagaga tatgtgccgg gaactatgta cacggtttgt tcttcatccg aaagaaaagt 1260 

aattattcgg ttacaccgta tgtacgtatt tcgggagtaa acgcattgtg tcgtaaggat 1320 

aaagacggtt tttggattgg aacggatgaa ggggtgtttt tttattgtcc ggaaaacgat 13 80 

agtatcgtgg aggtaaaacg cttgtccggt ttacacattc atgctttgca tcagtccgat 1440 

gattgtcttt atattgggac tgaaggaaac ggattaatgg tctatcatcc ggagcatgag 1500 

cagatggata cggttgctgc tttaggaacc ggtaatgtat atgctgtttg gtcggatgat 1560 

agcaggcggt taatgggaag cagtgatggg tttgctttct cgctcgatct tgtacaacat 162 0 

tcatattaca ggtttctgag taaaggaatt cggattacat cgggtacttt cttaggtaat 1680 

ggaagataca ttttgggcac ctatcaaggc gcaattgagt atgataaaca aaaggctcga 1740 

ccgctgcgta aagcttgttt gggattttac ttggacgaac ttcgggtttt ggacaaagag 180 0 

gtgaccgtcg aaacggagaa ttctccattg aagaaagctc tgaactgtac agctacgtta 1860 

cagttggagc acaatgaaaa tactttttcg tttacggcta ctgccattcg ctatactgaa 192 0 

aagcaggata tagcttacag ttggaaactc gatcatacgg attggagtgc tccgtctgta 19 8 0 

gataatagaa ttcgtttttc gaaccttcct ccgggtgaat atatcttttc tgtgcgggcg 2 040 

ttatccattg ataacggtcg gccgtttgcg caaagaaata tgcacatcat cattcgtcaa 2100 

cccctttgga agacaggtgg agctttcctt tgttacggtc ttttggcact tatgttgggc 2160 

tctctggccg tacgttcatg gttcgtatgg caagacagaa acctttcaag agaacaagta 222 0 

cggttgtttg cgaatacgac acgtaacctt tgtctaccac ttacactgat aaaagttcct 2280 

ttggaatatc tttatgaaaa gtcatcttcc gaacttgtta gtaacgtatt gcaacagata 2340 

aagggagtga acaatttatt ggctgagctg gaaaatatca gtcgtgtttc tgctgctccg 2 400 

gggcgtctgt cgcttgccga ctatgagtta tccatattct tgaaagagac agtagcccga 2 460 

attagagatt atatcagcga gaaggacatt atgctccgtt ggacggagga gcctgccttt 2 520 

gctaccgtat gcctcgataa ggataagatg tctgccattc ttagaaacct gttaatggct 2 580 

tttacagaca gtatggatcg aggtgacgaa attcttctga gtacttcgtg taacaatcaa 2 640 

aagtgggagt tgaggctgga atctgaggat aacggctttc ttaaaaaaaa attctaa 2 697 

<210> 899 
<211> 783 
<212> DNA 
<213> B.fragilis 
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<400> 899 

cgcaacggta gcaaactggt aattagaatg atttctgcta acttacaaca atggattcag 60 

catccggaaa cgctgaataa agatactttg tacgagttgc gaacgcttgt cacacgctat 12 0 

ccttattttc agtcactgcg attactctat cttaaaaatc tatatttgtt gcacgatatc 180 

tctttcggtg ccgagcttcg taaagccata ttgcatgtgg ctgatcgccg gaagctgttt 240 

tatctgattg agggtgaacg atatattttg aaacctcgga aaaagaacgc acttcccgaa 3 00 

acagaagttt tagaggaaga gcccagcctc gatcgtacgc tttccctgat cgatgctttt 3 60 

ctggccaccg tgcccgaaga ggtttcagcc cagacaagcc tggactatgc aacggactat 42 0 

accacctatt tgctgcaaga agacgataca ccggaactgg aagaaactcc caaacttcgc 480 

ggtcatgaat tgattgacgg ctttatcgaa agaagtgaag aagaaacatc catccgtttg 540 

caaccggcag atgaaaataa agctatctcc gaagaggaag agagcgagac gcatcatgaa 600 

gaagatgaag atgatagctg tttcaccgaa acattggcca aaatatacgt caaacagcat 660 

cgatattcca aggcacttga aattattaaa aaattaagtt tgaaatatcc aaaaaaaaat 72 0 

gcttactttg cagaccaaat cagattttta gagaaattga ttattaacgc taaatcaaaa 7 80 

taa 783 



<210> 900 
<211> 252 
<212> DNA 
<213> B.fragilis 



<400> 900 

aattcaggca gtattttcag taatagcagt tgttgtaagc gggctaatcg gtatttatat 60 

ggcgtatcac caatttggtg tatgggcttt agtcgtacag tccttagtat ctgcttttat 12 0 

ctcaacagtt tccttttgga tatattcaag atggatgcca ttatggactt tctctataca 180 

atcatttcag gagttattct cttttggatc aaaattatta ttagctggag ttttacatac 240 

aatctattct aa 252 



<210> 901 
<211> 936 
<212> DNA 
<213> B.fragilis 



<400> 901 

tattgtgaaa gaagtgataa agataaatcg aaaatgaatt ataaaagaat tttgaagaac 60 

cagacaacgc gtcttgcgat gttaagagct ttgtctttta ttccagatgc tattatgtta 12 0 

agattgcaat attggataaa aacagggcat aaattgaatc taaataaacc tcaacgttat 180 

actgaaaaaa tacaatctta taaatgcttc tatagaaacc ctttactgaa ggtctgttct 2 40 

gataaatata tggtcagaga ctatgtagct tcaaaaggaa tggctaaata cctcaatgaa 3 00 

ttgtatggca tatatgactc tgctgaagat atctgttttg atagtttacc taatgagttt 3 60 

gtaataaaat ccacggatgg aggaggaagc aataatatta ttatatgtaa gaataaagat 42 0 

gaattaaata tatttgaaac aattaagaca gtgaactcat ggctaaaatt aaatagaaaa 48 0 

gttaatccgg gaagagagtg gggatatttg ggaggaaggc caagagttat tattgaaaaa 540 

cttattaaaa atattaattc ggaaacttca cttacagatt ataaaatgta ttgtttttgt 60 0 

ggacatgtcc atagtttatt tgttctaaca gatagggata aaggtgctaa gataaatttc 660 

tttgatcgga attggaatcc tttgaatgta aaatcagata gttatcctac ttctaatcaa 72 0 

ttaatattga agcctaaaaa ttttgatcgt atgatagaaa tagcagaggt tttatcagag 780 

gattttccac atgttcgtat tgatctatat aatattgatg gtaatattat ttttggtgag 840 

atgacttttt attcaggaag tgggtattgg ggattcgtcc cagattcttt tgattttgaa 900 

cttggtcaac agtttgatat ttcatctttt atttaa 93 6 



<210> 902 
<211> 435 
<212> DNA 
<213> B.fragilis 



<400> 902 

ggaactatga gcttgcataa atattcgatt gttttattgg cattattggc gttactctgc 
agttgccatg atgaagataa aggagatatc ccacagtccg atgagcgaac cgcagatttt 



60 
120 
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attgtgaaat ataaggatga tttcggaata catacggatt ataaagctaa ggtatatatc 180 

tattatggaa tatattcaat ggatattgta ggctttcatt atcttccgga cggggtgctg 24 0 

gatcatgaag ggaaagaaat aactcctgac atccgtctat ctgctgatgg aaaagaagat 3 00 

ataaccttgt tattggataa tgctgaaaag gtaacggtta ttgttgaaag ctcctattat 36 0 

gagggaagag tgggaataac aagttactct tcgggcgaca cacctataaa agggaatatt 42 0 

acgtttgggg aatag 435 



<210> 903 
<211> 912 
<212> DNA 
<213> B.fragilis 



<400> 903 

gatttaatga gagtatctgt ggtaattccc tcatataata gggctaagtt gttattggag 60 

acgattccta catatttgca agaggacgta attgaagtta ttatagtaga tgacgcatca 12 0 

gttgataata cagctgaagt tgtaaagaag attcaggaaa aatatccaca agtaaaatat 180 

atacgcaatg cggtaaataa gaaacaaacc tattctaaga atataggaat taaaatatca 240 

aagggggact atatttattt gggtgatgat gatagtattt taatgcctaa ttctatccgt 300 

tatttaaaag aaacaatgta taaatataat gcggatatct gtggtgcaaa agctctttat 3 60 

cttccaatgg aatatgttaa taaaatagat gaatatgttc aacttaatga tattcaatta 420 

gttgataaga atgagattgt tgatataaaa aagataaaag cttcatttaa ttactctact 480 

gcattaccta tagttgttcc tttttgtcaa gcttgcgctt tagtcaaaaa agagttagcg 54 0 

attcagatct tattcgatga aaactttaca ggtaatgctt atcgagaaga aacagatttc 600 

ttcataagat gtactttaca aggagcaaag gtgatgtatg attcacgtgc tgtacaggtt 660 

aacttacctc gtcaagtagc aacaggggga gcgcatagta gaggacgcat taaatggtac 72 0 

ttatcgacaa ttgctaataa ttggtacttt cttaaaaaga attggaagaa tattcaaagt 7 80 

tactataaat tctcggataa tatttataaa agacaattaa tgtttgtatt gaaaaatatt 840 

tgttttgcct caaaagcagt agttaaaata ctaatgcgaa atttgggttt acttctgctt 90 0 

tatgtatact aa 912 



<210> 904 
<211> 192 
<212> DNA 
<213> B.fragilis 



<400> 904 

tgcattagga gccaacattt caggagtaat cacttcggcc atcattacag gcatttatat 60 

aacgattata ccttatttat aaatcataga gttggtgaca agataatttt ggggatgact 12 0 

atagagaatc attcggtggt tattggtaac ccggaaagat tgaaatctat agtcatccat 180 
gtgtttctat ag 192 



<210> 905 
<211> 240 
<212> DNA 
<213> B.fragilis 



<400> 905 

cggatgagga gtgtaatgga agggataagt aaacaagcgg ggcaattcga tgccggcaat 60 

ggaggtctgg aaggaatgaa acatgctact ttattggatt taaacggatt cttgtgtgac 12 0 

gattttcaag agttcttcga aggagcttac tttattatat ttaagcttcc caccgtttgt 180 

cttcgcaaaa tgttttttgg cacattcgat ctttgcctct tctattctcc gatcttgtag 240 



<210> 906 
<211> 1128 
<212> DNA 
<213> B.fragilis 



<400> 906 

agcaggagga gaaaactaac aaataacaaa atggaagata acaaaataaa aattggcatc 60 
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actcagggag acataaatgg ggtaggatac gaagtcattt taaaaacgtt tgccgacccc 120 

gtcatgttgg aactctgtac accggtcatt tacggctctc cgaaagtggc tgcatatcac 180 

cgcaagtcgc ttgatttgcc tactaacttc agtattgtca ataccgctgc agaagctgcc 240 

cacaatcgcc tgagcgtggt caactgtacg gatgacgagg tgaaagtaga gttctcaaaa 3 00 

cccgatccgg aagccggtaa agcagctttg ggagcacttg agaaggcgat agaggagttc 3 60 

agggaaggct tgatcgatgt catagtgacg gctcctatca ataagcatac gattcagt cc 420 

gaaggatttg cttttcccgg acatacggaa tacatcgaac aacgtctggg gaatggttca 480 

aaatcactga tgatcctgat gaaagaggat ttccgggtag ctttggtaac aggacatatt 540 

ccggttcgcg agatagcctc ttcaataacc aaggaactga ttcaagagaa acttgccata 600 

ttcaaccggt cgttgaaaca ggatttcggg attggtgcac cgcgcatcgc agtgttggca 660 

ctgaatccgc atgccggaga cgacggattg ctcggtacag aagaacagga aatcatttct 72 0 

cctgctattc aggaaatggc tgccaaggga atcttgtgct atggccctta tccggctgac 780 

ggatttatgg gatcgggcaa tttcacccat tttgacggag tactggccat gtatcacgat 840 

cagggattgg ctcctttcaa ggcattggcc atggatgaag gtgtgaacta cacggcgggt 900 

ttgccggtga tacgcacttc tcccgcgcat ggcacagcct atgatattgc aggaaaaggc 960 

gttgcttgcg aagattcatt ccgtcaggct atttatgtag cgatcgacgt attccgtaac 1020 

cgtcaacgtg agaaggaagc acatgccaat ccgttacgta aacagtatta cgagaaacga 1080 

gacgacagtg ataaactgaa gctcgataca gtagatgatg atatttaa 112 8 

<210> 907 
<211> 519 
<212> DNA 
<213> B.fragilis 

<400> 907 

atgccgggga ggtgcgtctt gtattttccg gcggtcggta tacattgcat tttatttttc 60 

aatctaaaat catactgtgt tatgtttagc ttattgttat ccacttttgc catgatgggg 120 

cttaccttcg tcattggttt tttcgttgcc ggggtgataa agttgattgc ctctgcagct 180 

gattcgttgg ctttttatag ttcgcaccag gaagaattgg cccggctgaa gcgtattcgg 240 

aaactgcatc agaaagtagc tacgttaata actgaaagtg ctctgagtga tgaggagtat 3 00 

ggcagtgatg ggcgtgaaga cttcagcagg ggggtcacaa aacatcccgg agataatcgt 3 60 

gggttttatc atggtgtcag tcccggtgaa tcggagagag gtttaatgga ttatttttat 420 

ccggaagaca caagaacgat gtttcttcgg aaagaagaac agatgttaca gcatgataaa 480 

aaaaataata agacatcctc aaccaataaa aaacaataa 519 

<210> 908 
<211> 372 
<212> DNA 
<213> B.fragilis 

<400> 908 

aatatgaaag gtatttatgc tatttcgttg ttggtcgttt ccaacatttt tatgacattt 60 

gcctggtacg ggcatttgaa gctacaggaa acaaaaataa tcagtaattg gcctttgtat 12 0 

ggcgtggttt tgttttcatg ggtgattgcg ttggctgagt attcttgtca ggttcctgcc 180 

aaccggctgg ggttcagcgg aaacggaggg ccgttttcat tgatgcaact taaaattatc 240 

caagaggtga tcacactgat tatatttacc gttttttcta ccttattatt taaaggggag 300 

tcactgcatt ggaatcatgt ggcagctttt gtctgcttga tagcagcggt atatttcgtg 360 

tttatgaggt ag 372 

<210> 909 
<211> 1323 
<212> DNA 
<213> B.fragilis 

<400> 909 

tctataatta tgaagattgc tatagttggt acaggttacg ttggtttggt tacaggtacc 60 

tgtttctctg agatgggagt agacgtcaca tgcgttgatg tgattgaatc taaaattgat 120 

aatcttaaaa aaggcataat tccgatctat gagccgggac ttgaagacat ggtgcaccgc 180 

aattacaatg cggggcgttt gaagttcact acttccttag cctcatgttt ggatgatgtt 240 
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gaggttgtgt ttagtgcagt tggtactcct cctgatgaag atggcagtgc ggatttaaag 300 

tatgtgcttg aagttgcccg tacgattggt aaaacgatga accattatgt actggtggta 3 60 

acgaagagta ctgttcctgt cggcacagcg caacaggtga aagctacgat ccggggtgaa 42 0 

ttggataaac gcggtttgaa tcttgaattt gacgtggcct ccaatcctga atttctgaaa 480 

gagggagatg ccgttgatga cttcatgaag ccggatcggg tagttgtggg agttgagtct 540 

gagagagcca aatctatcat ggagcgtttg tataaaccgt tcatgatgaa taattatcgt 60 0 

ttgatcttta ctgatatccc ttctgccgag atgataaaat atgcggcaaa ttctatgttg 660 

gccactcgta ttagttttat gaacgatatt gcgaatcttt gtgagttggt tggtgcaaat 72 0 

gtaaatatgg tccgtaaggg gatcggtgct gattcccgaa taggtagtaa attcctatat 780 

ccgggttgtg gatatggtgg ttcttgtttt cctaaggatg taaaagccct gataaaaacg 840 

gccgataaga atgggtattc catgcgtgtg cttaaggctg ttgaagaagt taataattct 900 

cagaaaagta ttctttttaa taagttgatc agatattttg atggaaatct ttcgggaaag 960 

cgtattgcgt tgtgggggct gtcttttaaa ccggaaacgg atgacatgcg cgaggctcct 1020 

gctctggttc taatagataa gatattgtct tgtggcggtc ttgttaaagc ttatgatcct 1080 

attgctgttg aagaatgtaa acggcggata ggggacagca tagagtatgc caatgatatg 114 0 

tatgatgcgg ttcttgatgc ggatgccttg ttactggtta cagagtggaa agagtttcgt 12 00 

atgcctagct ggggtgtatt gaaaaagacg atgaatcgag ctttgattat cgatggcagg 1260 

aatatttatg ataagaaaga actgcatgat atgggttttg aatatacgtg tattggacag 1320 

taa 1323 

<210> 910 
<211> 2100 
<212> DNA 
<213> B.fragilis 

<400> 910 

ggaccctcta tctttgctgt ctcaaaaaca aagattatga tgaaagtaaa attagcatta 60 

ctacttactc ttataggaac acttccttta gcagcacaga atgtacggca agaacaggac 12 0 

acagtctctt atatgaacga tgatcctttc aatcttgaac aaattgtggt tacggcaacc 180 

cgaacagaaa agaagattaa gaacacaccg gtcatcactc agataatcac ctctaagcaa 240 

atagaagaaa gaggaaccgg taacattcag gaccttctga ctcaagaggt tcccggactt 300 

aactttcagg aggttggcta tggaaccagc atcgatatac agggattagg ttccaaacac 3 60 

atccttttcc tgatagacgg cgaacgtata gcgggcgaaa acggtggcaa catcgactat 42 0 

tcgcgaatca atctttataa tatcgaccat atcgaaatag tcaaaggagc ttcttcggcc 480 

ctctatggtt ctcaagcgat gggcggagtt atcaacatca ttacgcgtaa agccaaaaag 54 0 

aaattcgagg cttccgcagg catacgctat gcaggaagaa accagcaaaa ctataaagat 6 00 

actcccaaag atcattcgca atacaaatat cggattcatc tggataaacc caatctgaac 660 

accaatctgt ctcttggatt gaacctgggc aagttcacca tgaacaccga cgtactttac 72 0 

aaaagtttcg atggatacca attattcgat aaaaaacctc tcgtgaaata ttttccggcc 7 80 

tataacacca caattaccga agaactcagt aaaaccccga ccagtatatc gggatacgaa 840 

gacgtacaag tagcccataa aatggactat cgtttcagca aacggctcaa agtccagtta 9 00 

aaaggaagct attatatgct gaacaaatat gattttcaag cagataatat attcgagaaa 960 

tcagaggact atacctatgg cggaagcata gattacacga tttccgacaa atcctctttg 1020 

gtagcctctg ttcataccga tcactacaac cgatatgata aatacgaact gaagagcggt 1080 

cgtcgtctcg aatataaaaa caatattatc cagccccgta tcgtatatag cactacggcg 1140 

ctcgataaac agaccattac gggaggattg gaatattaca gagaatcatt attcagtgat 12 00 

aaatttgaaa ccggtgtgaa agaaaacaaa agccaatggt atgccaccgc tttcctccag 12 60 

gatgactgga gcatcaacaa gcaattctcc gtaatagcgg gactccgctg cgactatcac 13 2 0 

gagaaatacg gtaccaacct cactcccaaa gcttccgtga tgtataagat ctttccattc 13 80 

actgtccgct ttaactatgc acgcggctac cgttcaccca gcattaaaga gttgtacatg 1440 

aactgggacc atctgggcat gttctggata tatggcaaca gtaaactgaa acccgaaact 1500 

aacaattata tctctctttc gggagaatat gtgaacagtt ggatcaatat caatgccaac 1560 

gtttatagca actggttccg aaacaaaata gaaggaatgt ggagcaatga ccaaacggaa 162 0 

ctccattata tcaatatagg aaaaagccgc ctggcaggag tagagaccat gtgcaaaata 16 80 

caaataaaca gacatatcaa tgtgcatgga gcatacaatt atctgtacac aagcaaagat 17 40 

gcggatggag tccgattgag ctcttccagt ccacattccg gtaatattcg tgctgaatat 180 0 

aacacacgca tcccacgcta tgccaccgtt gtcaacctgt ccgggaatat tatggggaaa 18 60 

aagaaattcg atgtgttgga tgaactggaa atagacggaa agaaggtaga agcctactat 192 0 

caggctaaag taaaccctta ttgtctttgg gatctgacag tatctcaata tatcatgcag 1980 
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aatctgagaa tcacagcagg aataaccaat ttattcgatt atacttcaga tcgagtgact 2 040 
ttcaatactt ccacttcacc gggaagaaac tattttatcg catgtaatta tacactttaa 2100 



<210> 911 
<211> 1179 
<212> DNA 
<213> B. fragilis 



<400> 911 

ttcatatcta ttgcaaaaga aatgactatt gcatttgttt atcgaaattt tccttcattg 60 

ggaggggttg aaagggttat agtcattttg gcaaatgaaa tggtaaagca agggagtaaa 12 0 

gtagttattt attctttaga gcaaggactt aatgcttatt ccctggattc ttctattgaa 180 

ataatttgtt tacctcaaaa aaaaatattg gagtcaaaag aaaatgtgaa ttttctaata 2 40 

acacatttat gtgaatataa aatcgaattg ctgtttaacc atgattctgt gaaagatagt 3 00 

attgaactct gcagaagagt gaagaaaaag ataaatattc ctgtagtaac acttcatcat 3 60 

ggacaaatat atttgccatg gaagtcacaa tgggctattt tgaaagataa atatagtttg 42 0 

cgtgtatgtc ttaaaaaaat attttttcct ttttttgtgc ttgctactaa agtaaggaat 480 

aatttgcatc atcgatataa tatcaaggta tgtgatgtat atgttttttt ggcagagtgt 540 

tataaagatc aattaggaat agataaaaaa gtaatggcta ttccaaaccc attatcttct 6 00 

tctttctttt ttgaggatga ttgctatcag agcaaatgta atactgttgt gatggtaggg 660 

cgtattagtg attttcataa acgcattata ttagctttga gaatctggaa ggagatagaa 72 0 

aactgtgaac aatttgattc ttggaatttt gatatagctg gagatggtcc cgacttttat 7 80 

ttaattcagg atactatttg ttccttaggg cttaaaagag ttcgtttact tgggcaagtc 840 

aattcatttg atgtttataa aaaagctaaa atattacttc ttacaagtgc ttttgaagga 9 00 

tttcctctgg ttttaaatga agctaaacag tgtgcatgtg taccaattgc aatggatagt 960 

tttgaatctg ttcatgaact aattaataat ggtgaggacg gattgattat ttcaaataat 102 0 

gatttaaata cttttttgga gggattaaaa tatttgatgt cacataatga tattcttcgt 1080 

gaaatgtcaa aaaaatcagt gctaaacact cgtaaatatg aggtctccag attatgcaat 1140 

atctggatgg atttatttaa gtcaatagtt aataactaa 1179 



<210> 912 
<211> 789 
<212> DNA 
<213> B. fragilis 



<400> 912 

actctctata tatcaaatca aatgcagcta acattaatgg aacagaaaat ctcaaaatat 60 

tctaccgcta caatcgttag cctgctgtgc ctcttattca gccttccgct ccaagcacaa 12 0 

cagcaaagac ccggtgcacg tcctgctgtc aagcagaaag caaaagagga gataaaagcg 180 

gatacgattc ccttttacaa tggaacgtat gtcggtgtgg acttattcgg attgggcagt 2 40 

aaactactcg gaggagattt tctaagttct gaggtaaatg tgagagtaaa cttaaaaaag 3 00 

aaatttattc ctacagtaga aatcggtttc ggacaaacag atacctggag tgataccggt 3 60 

atccattata aaagtgccgc tccttatttt cgcgttggag ctgactataa tgttgttaaa 42 0 

gaatatttgt atgtaggact acgttatgga tttagcagtt tcaagtacga catctcaagt 480 

acaccttttt ctgaccctat ttatggaggc agtatggcta atcccggatt gatagacggc 540 

atttggggag gaagcgtacc ttatcattac aacggactga aatctaacat gcaatggctt 600 

gagctggtgg ccggagtcaa tgttcaaatc tataaaagct tctatatggg atggacctta 660 

cgctttaaat ttaaaacagc gggctcgatc agcgaacatg gaaatccatg gtatgtaccg 72 0 

ggttttggtg aatatgattc ctcaaacata ggtatcacat atacactgat ttataaatta 7 80 

ccattttaa - 789 



<210> 913 
<211> 1035 
<212> DNA 
<213> B. fragilis 



<400> 913 

aatcctaata ttcacttaaa atcaagcaca attatgaatg tcgaacctat gtatctgact 
atcttcttga tagcgggagg tattatcttc ctggttcttt tctttcatta tgtacctttt 



60 
120 
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tttctatggc tatcagccaa agtatcagga gttaatatct ctttggtaca actttttctg 180 

atgcgtatcc gtaatgttcc gccatacatc atcgtaccgg gtatgattga agcacataaa 240 

gcaggtctga gcaacatcac ccgtgatgaa cttgaagcac actatctggc aggcggacac 3 00 

gtagaacggg tagtccatgc attggtatct gcatcgaagg ccaatatcga acttccattc 3 60 

caaatagcta ctgcaattga tcttgcaggt cgcgatgtct tcgaagccgt gcagatgtcg 420 

gttaatccta aagttatcga cacaccaccc gtaacagctg ttgcgaaaga cggtatccag 480 

ctgatagcca aagcacgtgt gacggtacgt gccaatattc gccaattggt gggtggtgcc 540 

ggcgaagata caatcctggc acgtgtaggt gaaggtatcg tttcgtcaat cggttcctct 600 

gaaaaccata agtcagtact tgagaatcct gattccatat caaaactagt gctgcgcaaa 660 

ggactcgatg ccggtactgc atttgaaatt ctctctattg atatcgctga tattgatata 72 0 

ggtaagaata ttggtgctgc cctgcaaata gaccaggcaa atgccgacaa gaatatcgcg 7 80 

caggcaaaag cggaagaacg ccgcgcaatg gctgtggcta ccgaacaaga aatgaaagcc 840 

aaagcggaag aggcccgtgc taatgtaatt caggcagaag cggaagttcc aaaggccatg 9 00 

gctgaagctt tccgtagtgg aaatctcggt attatggatt attataaaat gaaaaatatt 960 

caagctgata catcaatgcg tgaaaacata gctaaaccta tcggtggagc taccagtaaa 102 0 

ccgttgagcg attag 1035 

<210> 914 
<211> 738 
<212> DNA 
<213> B. fragilis 

<400> 914 

tcagcagata ttccgcttat actccgcagg ctaaagactt gtttatgttg ggctacaaag 60 

ccggcgaccg catcactatc ggtggttgga ttggagagtc tacaacgatc agataaatct 12 0 

atgttgcgac aacaaagtaa cagtttatta aataaaagtt tgagcataac aattgtcttc 180 

ggggcaattg ttatgcttct tttattttct tcctgtggtg ggagaaataa ggcgatggcc 2 40 

gatgccatta ccgagcggga ttcactgcct gttatggata cacggggggt aacgaccctt 3 00 

atatccgatt ccggtgtcac acgttaccgg gtcaacactg aagaatggtt gatctttgat 3 60 

aagaagaaac cctcgtattg ggcttttgag aagggcattt atctggaaca gttcgattca 42 0 

ctctttcata tagatgcgag tataaaggcg gatacggctt attattatga tcgtgaccgg 480 

ctttggaaac ttattggaaa tgtagatatt aagagtctga agggcgatca tgtgaccacc 540 

gagttgttat attggaatga agccaccaag aaagtgtata ccgataagtt tgtccggatg 600 

gaaaaaccgg atcagattat gaccggatat ggctttgagt cagacgatca gtttatgaag 660 

ccggttgttc ataacatatc cggtatagta tatatcgatg aagatgccga aaaggcaaaa 72 0 

acagattctg taaactaa 738 

<210> 915 
<211> 747 
<212> DNA 
<213> B. fragilis 

<400> 915 

ataacggata tccccatgaa aaacatattc actttactga ttttatctgt atgctttttg 60 

tgtgccaaca tatcgggtag ggcacagaac aaattttcgg atatggaggt caatcatgtc 12 0 

cgggtggcta caccgggact tttttccaag gagaattgtg tcatgctgga tctgaagtcc 18 0 

ctgtcacgga attactcttt ccctttgccg ggaggcaaag tcatttcggg ctatggaaca 2 40 

cgtggaggcc atagcggtga cgacataaaa acttgtgccc gcgatacgat tcgtgcagct 3 00 

tttgacgggg tggtacgtat ggctaaacct tatggtgcgt atggcaatgt gattgtgata 3 60 

cgacatccca atgggttgga gacggtatac agtcataatg tgaagaatct ggtaaagagt 42 0 

ggggatgtgg tgaaagccgg aatggctatt ggcctgaccg gacgtaccgg acgggctact 48 0 

accgagcatc tgcattttga gacgcggatt aacggacaac actttaatcc cggtcttatt 54 0 

tttgatatga agaagggaac cttgcgtact gattatttgc aatgtacgaa gaaaggtaag 60 0 

ggaattgttg ttaaagcttt gaaaagcgaa aaagtccttc ctaaatataa aactctttcg 660 

cctttcctat atgaactgcc cgggattaaa aaaccggtat ggaatatacc agccctagcg 72 0 

agatccgctg cgtattcggg tctatag 747 



<210> 916 
<211> 204 
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60 
120 
180 
204 



<212> DNA 
<213> B.fragilis 

<400> 916 

tccgtatgta ttccgaaatc atccttatat ttcacaataa aatctgcggt tcgctcatcg 

gactgtggga tatctccttt atcttcatca tggcaactgc agagtaacgc caataatgcc 

aataaaacaa tcgaatattt atgcaagctc atagttcctt ataatttatt agttctcaat 

atgatattac ttccaaatat ataa 

<210> 917 
<211> 1158 
<212> DNA 
<213> B.fragilis 

<400> 917 

tatttcatct tttatttaat aatattaatg tataaccttt tcatcgtagt taacgttgat 
tggttttttc tctctcatcg caaagatatt gcactaactg ctcaaaagtc tggttacaat 
gtcactatcg taaccaagga taccggaaaa aagaaagata ttgagtcact tggcctgaag 
gtgatcgatt tacccatgaa tcgttcggga caaaacctgt tagaggagct gcatacttgt 
tggttccttt accatcttta tcgtcgtgag aatccggata ttgtgcatca tgtcggtttg 
aaaacgattc tttggggtac tttggctgcg aaattggcta atatccatgg gatcgttaat 3 60 
gccgttagcg gtttaggtat atttttttca gagggaaacc ggtctattat ttcgaaatta 42 0 
cttcctaaag tacttcgttt ttctcattat cgtaataatg tcgctgttat ttttcaaaat 
gatgaagaca agtcattgtt tttaaaacat cagatcataa aggaatctca agcttataaa 
attaaggggt ccggggttga cttgaaacag tataattata ctcctgaacc ggaggagggg 
aagattaaag ttctgttaac agctcgtatg attgtagaga aaggtatctt tatcctgaca 
gattccgcta taaaacttag gaagcaatac cagggtaagg ttcagttctt attgtgtggc 720 
ggacttgatg ataatcccat ggcaataaaa gaaagtgaat tacaagcggt atgtgacggg 
aagtatatca agtggttagg ttatcggacg gatgttttgg atttgttaaa ggactgccat 
attgtcgctt tcccttctta ctataaggag ggactgccta aatccttgat tgaggcaacc 
gctataggac ggcctattat aactactaat tcgatcggat gtaaagagac tgtaattgat 
ggttataatg gatatctgat tcccataaaa gacagtgata tgttggcttc ccgattaagt 
tttttatttg aaaataaaga tgtaagacag agtatgggac gtaattcccg gaagctggct 
gagaaggact tttctattga tgacgtaata aagaaacatt tggatattta tagaacatta 1140 
gttattggaa ctttatga 1158 

<210> 918 
<211> 1422 
<212> DNA 
<213> B.fragilis 



60 

120 

180 

240 

300 



480 
540 
600 
660 



780 

840 

900 

960 

1020 

1080 



<400> 918 

tggagcatct ttcggataac gcttttgacc gtcggggaga gaacaatcct ttctttaaga 60 
tttcgagtgg aaaccctgac tgcagacaat gcttttatta aaaggtattg ggagacgtct 12 0 
tataagcgga tcgggtattg taaccgtttc ttggtcggta tccagaataa ctcggaatcg 18 0 
gaaaagaaaa cacggatgat tgcggaagcc cgtttcctgc gtgcgacaca gtatttttac 240 
cttgccagct atttcaaaaa tgttcctttg gtagagaatg tgctgacggg tgaagaagcc 300 
aacaatgtga caaagacctc acaggccgat atcctgaaat ggtgtgtaac cgaatttaca 3 60 
gcagctgcgg ccgatttacc ccgtttctcc gccattccgg cgggagaagc cggacgtgct 42 0 
tgtaagcagg ccgctcttgc ttttctcgga cgtacctgca tgttgcagaa agactggaaa 
agtggagcaa aggctttcca cgatattatg gaattgggag ataatgcgat aaacgccaac 
tatcaggagc tgttttatcc ttctaccgga acttcgaaca aggagaatat tttctacatc 
cagtatttgg aaaactatct gggtaccggt ctgccgcagc atgcactttc tgctaaagac 
gggggatgga gcctggtcaa tccggctgct gatttatacg aatcgtatga atttaaggat 72 0 
ggaactcctt tcagctatga tgatccgaga tatgacccgt ctaatttagg aaaggatcgc 780 
gatccgcgtc tggattatac aatttactat aacggtgcca tctttatggg tacagagtat 
aagatgagtc ctgactacag tgcagccaag aaggagaagc tcgattatac gagcgaggct 
tccagaactg gctttatgat gaggaaatat tttgaagaat cgacacctat aaacgatgta 
cagagcgcaa acggactgac tccggttatt cgttatgccg aagtgttgtt gggctatctg 102 0 



480 
540 
600 
660 



840 
900 
960 
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gaatgcctgg ttgaagataa tcaaacgatc actcaaggaa tattggacga gactatcaat 108 0 

gcagtgagag gacgtgcaag tgtgaacatg cctccggtaa ccgaggtaac tcctgcgaag 1140 

cttcgtgaaa tcgtgcgtca cgaacgccgc atcgagttgg ctatggaagg tatccgttac 12 00 

tgggatatca tgagatgggg aattgcacac gaagtattgt cccagaaaat ttggggtgcg 1260 

ccttacccgg gttcgactca gtatgcgact acgaccaaag aggttgaccc gacaggaaac 132 0 

taccgctggt atgtgggcaa acgtgctttc cgtaatccga cggattatac atggccgatc 13 80 

cctcagtccg agcaaaatat taacccgaat ttacgtgact aa 142 2 

<210> 919 
<211> 2868 
<212> DNA 
<213> B.fragilis 

<400> 919 

aatcagctca tgttaaaatc aaaatatctg tttctttcac ttatttgcct gctgacatcg 60 

ttccgactcc atgctcaatt tatggattac ggctcggatc ctgctaaatt caaatggaat 120 

atcgcgagat taccccacta caatctggtt tatccgcaag gaaacgattc catggcttac 180 

cgttatgccc tctttctcga gaatgtttat ccacacatgt caaagaccat tggaaaaccg 240 

atcaaagcta agtttccggt cattcttcat ccgggcaaca tgcaatccaa cggaatggta 3 00 

tcctgggctc cccgacgaat ggaacttatt acaacgcctt cttcggatct gaataaccaa 3 60 

agttgggata agcatctggt actgcacgag tcacgccatg ttttccagac aggaaaggta 42 0 

atgcacggca ttttcaaacc gctctattat ataataggtg aacaggcagc cggagtagcc 480 

tcttttttct tgccggtatg gtttcttgaa ggagatgccg taagtacgga gactgccatg 540 

tctaacggtg gtcgtggacg actaccggaa tttaacatgg tttaccgtgc ccaaatgtta 600 

ggaggaaaaa agaactattc cttcgacaag tggctaatgg gatcttacaa aaactatact 660 

ggtacctact atgcactggg gtttgatatg acctcttatg ctcgtcaacg ctacggagcc 720 

gatatttggg ataaaagcac cagtagatac attcggaacc tactgttcga aggttcattt 7 80 

aagcattata cgggcagtag ttttaagcgt ctccaacatg atacgttcga cttcctgcgt 840 

gcagagtggg agaaacagga tacttgtaca cagtccccgc aatatctatc acctacaaaa 9 00 

gagacttata cctcctatcg atacccacaa cccatcaacg attctatagt gattaccgta 960 

aagtcgggat tgaaagatat caactcttta gtgatcatca ataatggcag agaaaaacat 1020 

ctggactata taggtagtat taatagccgt ttgagctatc gaaacggccg ggtttactgg 1080 

agcgaactag tacccggact acgttggaca caccagaatt actcaattat aaagtactat 1140 

gatctggata agaaaaacat aaaagccctc actccccgac aacgttattt atccccggcc 1200 

attgacgagc aaggacagca cattgccgtt tcacgtccta cagtcgaagg taaaaaccaa 1260 

ctcgtgctga tacaagcaga aaaaggtaat gaactcgctg ctttcgacgt tcccgataat 132 0 

gcatttatca aagaactgac atttgcagga ggcgacacaa ttatctcgat agctgtcgca 1380 

gattccggta tccgcctgtt acaattcaac ttcggaaacg gaatatggaa agaactgcta 1440 

aaaacagctt ccgccaatat cacttctcct gtttggaaag atggaaaaat ctttttcgaa 1500 

tcgggagcca acggcatcaa caacatctac agcctcaatc cggcagacgg acaagtccgc 1560 

cgaatgacag ctgcccgctt cggagctttc gatccttctt ttggatcgtc agacggacgt 162 0 

ttgttcttct ctgattacca agccgatgga tatcgcattg cctcactccc gactgacagt 1680 

atgctctttg aaaaggcaga tctcaaccgg ccggcttcca tgccatttgt tgaaacactt 1740 

gccgctcaag agcaattcaa cctggactcg gcacgtctga catcagtcga tttcaatccg 1800 

aaacgttata gaaaagcgga acatacgttc aaaattcaca gctgggcccc tttctattat 1860 

gatgtggctg aggcaatgaa ctcaggtgcc agcgatctga gtacaatagt aaaacccgga 192 0 

gcaaccctga tgtcccaaaa taccctgaac acagccatca tgcaggccgg atggtatata 1980 

gacaaaggct atcatcatgg taaactgtca tttatctatc aaggctggtt ccccgttatc 2 040 

aatctgtcgg tagactatgg tgataaagct ttcaatgtag actggacaca gaatgacaaa 2100 

gggcaagaca ttacacaggg ccattatacc caacgaaatc tggtggaagc agaagcacgt 2160 

gtctatctcc cttttaactt aacacacaac caacgaatac gaggcataca accggctctg 222 0 

acttattatt ttaccaataa taaatatcag gaatatcaca gtcggaaatt ccataacttc 2280 

caatatatcc taccggaaat tctattctat gattacagac gaaaagctca gcgagacatt 2340 

ctcccccgca caggctatca attacgtttg caatacctga agactccatt caattctgaa 2400 

aattacggaa gcctgtatgc cgcccgcctg actacttact ggccgggaat catcaggaat 2460 

catgggctga tgatccgtgt cggctatcag tatcaggatc ttgacaacaa agcattatac 2 52 0 

cttcccaaac atcttttaga aaaaccccga ggataccatt tccagtatca aacccgccaa 2580 

caatgggcct tcaaaacaga ttatgcttta cccctgctgt cacccgattg gagcatcggc 2 640 

tcacttattt acatccgtcg gttgcgtgca aacctctttt atgatctatc gcgcaatcaa 2700 
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gccagttcta aaagtaggtg gagtaaccaa agttcatacg gaggcgatct gattttcgac 2760 
tggaatgtac tacgaatgag ttatccgctt acaacaggca tacgcttgat acagccgatc 2 820 
gattatggca aatttcaagt agaggcactg ttttcaatca gtttctga 2 868 

<210> 920 
<211> 249 
<212> DNA 
<213> B. fragilis 

<400> 920 

ttattcatca tgaacggttt atacaaacgc tccatgatag atttggctct ctcagactca 60 
actcccacaa ctacccgatc cggcttcatg aagtcatcaa cggcatctcc ctctttcaga 120 
aattcaggat tggaggccac gtcaaattca agattcaaac cgcgtttatc caattcaccc 
cggatcgtag ctttcacctg ttgcgctgtg ccgacaggaa cagtactctt cgttaccacc 
agtacataa 249 

<210> 921 
<211> 1521 
<212> DNA 
<213> B. fragilis 



180 
240 



60 



600 
660 



<400> 921 

aagttcacca tgaagatttt tccaagtagc agcatcaaga aactggatgc ttacaccata 

gaacatgaac cgattgcatc gatcgacctg atggagcggg ccgcacaggc actgaccaaa 120 

gccatcaccg aacgctggga catcacaact cccgtcacgg tatttgccgg accgggcaac 180 

aatggcggag atgcccttgc cgtggcccga atgttggcgg aaaaggaata caaggtcgaa 240 

gcctatctgt ttaacccgaa aggggaactg tctgccgact gccagaccaa caaggagctg 3 00 

gtagagacga tggataatgt gaagttcagc gaagtaagca cacagtttgt acctcctgcc 3 60 

ctgacaatgg atcatctggt agtggacgga ctttttggtt cgggacttaa taagccgcta 42 0 

agtggcggtt ttgcggcagt agtgaaatat atcaatgcat cgcctgccac cgtagtcgcc 480 

atcgatatcc cctcgggact gatgggggaa gagaacacat ttaatgtaaa agccaatatc 540 
atccgtgccc aattgacatt gagcctgcaa ttgccgaaac tggctttcct ctttgccgag 
aattccgaat tcgtaggcga atggaaactg ttggatatca acctcagtcg tgaagcgatt 

gaagaaacgg aaagcaatta tgccttattg gaagcggaag aaatacacgc tctgatcaaa 720 

ccccgtaaca ctttctcaca caaaggaaac tttgggcatg ctctgctgat tgccggttcg 780 

tacggcatgg caggcgcgtc gatactggca gcccgtgcct gtatgcgttc gggtgtaggc 840 

ttactgacag ttcatgcacc tatacgcaac aatgatatcc tgcagatttc ggttccggag 9 00 

gcaattatcg aatcggatgc cagcgatacc tactttgcct gccctacaga tacggatgac 9 60 

tatcaggctg taggaatcgg tccgggcatc ggacgctcgg aagagaccga ggctgcactg 102 0 

cttgaacaac tcagtggttg ccagacacct ctggtactgg atgccgatgc actaaacata 1080 

ttggccaacc accgccacgc actgaccaca ttgcccaaag gctctattct gactccccat 1140 

cccaaagaac tggaacgcat ggtgggcaaa tgccagaact catacgaacg actgatgaag 12 00 

gcctgtgaac tggcccgaac cgccaaagta catatcatat taaaaggagc ctattcggca 12 60 

attatcaccc cctcgggcaa gtgctatttc aactctacgg gtaatccggg tatggcaaca 1320 

gccggaagcg gagatgtatt gacaggtgtc gtgctggctt tgctcgcaca gggatatccg 13 80 

gctgaagaag ctgccaaaat cggtacttat gtacatggtc tggcaggtga tttcgcacgc 1440 

aaaaagcaag gcgttatcag catgacggca ggagacatta tcagtaatct gccattggct 1500 

tggcgtctgg taagcgaata a 1521 

<210> 922 
<211> 2154 
<212> DNA 
<213> B. fragilis 

<400> 922 

atcgtattaa tcaaaatggc aacattacaa aacattagat ccaaaggacc cctgttggtg 60 

atcgttattg gtttggcttt gtttgctttc attgcgggcg atgcctggaa agttctccag 12 0 

ccacaccaat cgcatgatgt aggcgaagtc aatggagaaa ctctttctgc tcaggactac 18 0 

cagaacatgg tagaagaata taccgaggtt atcaagttct caagcggaat gagttcattg 2 40 
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aatgatgaac agaccaatca ggtgaaagac gaagtatggc gtagctatgt gaacaataaa 3 00 

ctgattgaaa aagaagcgaa gaagctcggt attactgttt cgaaggctga aattcaatca 3 60 

atcattaacg aaggtgtgaa tccgttgctg cagcagactc cgttccgcaa tcctcaaacg 420 

ggcgctttcg ataaagatat gttgaagaaa ttcttggctg actactctaa aatggacaag 480 

accaagatgc cgtctcaata tgtggaatac tatgaaggaa tgcacaaact ttggtcattt 540 

gtagaaaaga cactgatcca gagccgtttg gcggaaaaat accaggcact ggtgactaaa 600 

gctcttttct ctaatccggt tgaggcacag gatgcattcg acgcaagagt aaaccagtcg 660 

gatgttctgt tggctgctgt tccttattct tctattgtag actctactat cacagtgaaa 72 0 

gagtctgaac tgaaagatct ctataacaag aagaaagaac agttcaaaca atatgttgaa 780 

acacgcaaca tcaaatacat cgatgtacag gtgacagcca gtgcagaaga cagagctgct 840 

atccagcagg aagtgactga ttatacaaac caactggcta ctgccaatgg tgattatact 900 

actttcatcc gttctaccgg atcggaatat ccgtatgttg atttgtacta taccaagaaa 960 

gctttcccgt cagatgtagt tgcacgcatg gattcagctt cgattggaca agtatatggc 102 0 

ccttactaca atgcaggcga caatactatc aattcgttca aggtgttgtc taaagtggct 1080 

gctgccgatt ctgtgcagtt ccgtcagatt caggtttaca cagaagacgc tgctaaaaca 1140 

aaagctttgg ctgacagcat ctatactgct attaagggtg gggccgactt tacagctttg 12 00 

gctaagaagt acggacaaac aggtgaatcc aactggattt cgtctgctaa ctacgaaaat 12 60 

gcacaggttg atggcgataa cttgaaattt atcagcacta tcaacaatct gggagtaaac 13 2 0 

gaactctcta acgtagcatt gggacaaggc aatatcattt tgcaggtgac tgataagaaa 13 8 0 

gctgtgaaag ataaatataa agttgccgtt atcaagcgtg cggttgagtt cagcaaagaa 1440 

acttataata aagcttataa tgaattcagc cagtttattg cagctaaccc gacagtagac 1500 

aaggttgcgg ccaatgctga agaatcaggc tataaattgc tcgaaagaaa tgatctgtat 1560 

agctcagaac acggaatcgg tggtatcaga gggactaaag aagcactgaa atgggccttt 1620 

gctgcaaaac cgggtgaagt ttccggctta tatgaatgtg gcgaaagcga ccgcatgttg 1680 

gttgttggtc tggttagcgt gatcgaagaa ggttatcgtc ctttggccca ggttcaggat 1740 

cagttgagag ctgaaatcat tcgtgataag aaagctgaga agatcatggc cgacatgaag 1800 

gctgccaatg caactacaat tgcccagtac acatcgatgg ccaatgcagt aagtgattct 1860 

gtaaaacacg taacatttgc agcgcctgct tatgtagccg ctttgcgtag tagtgagccg 192 0 

ctggtaggcg catacgcttc ggtttcggat atcaataagc tgagcgctcc tatcaagggt 1980 

aatggcggtg tgtttgtgtt gcaggtatat gccaaagata agctgaacga aacattcgat 2040 

gcccaatcag aagaggctac attggaaaac atgcatgccc gtctggcaag tcgttttatg 2100 

aacgatcttt atctgaaagg cgatgtaaaa gataaacgat acctgttctt ctaa 2154 

<210> 923 
<211> 1284 
<212> DNA 
<213> B. fragilis 

<400> 923 

actataataa tggtaggata taaacagaca ctttgtgcgc ttctgctcac gatattatta 60 

cccggagtgg caatcgctca aaataataca aactctcctt atacacgata tggctatggt 12 0 

cagttggctg atcagtcatt tgcaaacagt aaagcaatgg gagggatcgc ttacggattg 180 

cgcgatggat cacatatcaa tccgttgaat cctgcttctt atacggctat tgattcgttg 240 

acctttcttt ttgacggagg gttttcgatg caaaatacaa actttagtag tgaaggcacc 300 

aagttgaatg cgaaaaattc aagttttgac tacatagcga tgcagtttcg tctacaccag 360 

cgcgtggcca tgagtatcgg tctgctgccc tactcgagtg taggctataa tatggccaag 42 0 

gcgaacaacg atgttgcatc ggaagaagcg cggagtgtca cttcatttgc cggagacgga 48 0 

ggcttgcatc agctttacgt aggtttggga gtgaaggtgc tgaaaaacct ttcagtcggc 540 

gccaacgtat cgtacttttg gggggagatc acgcgtcagg cgcgtattac ttttccttat 600 

aatgacaacg cttttgcttt tcagcatgta gactatttgt ctgtgcgcga ttataagctg 660 

gacttcggcg cgcaatacac acagcagctg ggtaggaagc atgcggttac attaggtgta 72 0 

gtgttctcgc ctaaaaaaga tttgcataac gaagcttatg tacaaagatc gacgcttacg 780 

aactccaaca gcacgcaggc cgtcactacg aatacggtcg atacggtggc tacctttgga 840 

atgcccaata gctttggggt gggacttacg tacgagtatg acaaacgtct gatcgtggga 900 

gctgatttta atttgcagaa gtggggcgac gtgacctata tgaatcagcc gaatgctttt 960 

tgtgatgcga tgaaaatttc agtgggtgcc gagtatatgc cgagtcgttt ctcgcgtagt 1020 

tatctggcgc atatcaaata ccgcgtcgga ggatattatt cggaacctta ttataaaata 1080 

ggaggggaga gagcctctcg tgagtatgga gtaacggccg gtttgggatt acctcttccg 114 0 

ggttcacgct cgctaatcaa cgtttcggct caatatatta aagtacatgg tctgaaagcc 1200 
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ggtatggtag atgaaaatac attgcgtttg agcatcggaa tcacgttcaa tgaaggctgg 12 60 
ttcttcaaac gtaaagttaa ataa 1284 



<210> 924 
<211> 657 
<212> DNA 
<213> B. f ragilis 



<400> 924 

actaaaacaa taatgaacaa caaaaacaaa ttcagattcg ccattctatt atttggcgta 60 

ctttcagcgt ttatcaccac cgcttgctca gacaacaata gtcccgacga ccctgcacaa 12 0 

ggagaaaaca cgttgccggt aaaacaagta agcctgagtc ggaaaacagc atacggaaac 180 

gactggatct attattcact tgaaaaagga aaagaagtaa gcgtcagtga agaatcccat 2 40 

gccgaaaata cagactggga catcgcattc aatcgttaca atgtgcgtac caacagcggt 3 00 

gcatccggca aaggaaaagg tggagcatta ctcactaaca ttaaagattt ggcagcctgt 3 60 

acgacagttc cgcagggaac atttactgtc gacgcagcct ataccatcac tgctcccggc 42 0 

acaggtttcc ctcctcctac catggagtcc accgctaatg aggttctctg taaagcaatc 480 

acttttgccg gccctcctcc cacttacacc ccaagcgatt acgtatttat cgttcgcaca 540 

gccagtggga aatatgccaa gttgaaagcc aagagttttt atgatgacga aggcaaaagc 600 

ggtatttatt catttgaata tgccattcag ccggatggca gtacaaattt aaactaa 657 



<210> 925 
<211> 1458 
<212> DNA 
<213> B.fragilis 



60 



300 
360 



<400> 925 

ttgtatatga tgattaaaaa tttaaaattg gcagcattaa atagtttaat gtggaatgct 

gttgagaaaa tgtcggggca agcattacgc tttatattga ccattgttat agctagatta 12 0 

gtttctccag ctgattttgg attgatagca atgttgtcta tttttttatc tgtagctcaa 180 

tctttcattg attgtggttt ttacaacgct ctggttcaga aacaggatcg gacagaagtt 240 
gactattcta ctatgtttta ttcaaatgtg ttgattagtg ttgttgtata tttttttctt 
tattggagcg ctccatatat tgccagtttt tattctcagc ctgaacttaa acagatcact 

agggtaatgg gggttagttt aattatatct gcgttcagaa ttgttcagca ggctaaatta 42 0 

gtaatagcac tcaattttag aattcaggca gtattttcag taatagcagt tgttgtaagc 48 0 

gggctaatcg gtatttatat ggcgtatcac caatttggtg tatgggcttt agtcgtacag 540 

tccttagtat ctgcttttat ctcaacagtt tccttttgga tatattcaag atggatgcca 600 

ttatggactt tctctataca atcatttcag gagttattct cttttggatc aaaattatta 660 

ttagctggag ttttacatac aatctattct aatctgtata caatagtaat tggtagaaaa 72 0 

ttttcatctg ttgatcttgg cttttttagt cgtggacaaa ctatggccta ttttgtacct 780 

tctaatatga caaatattgt aacaatggct atgtatccaa tattatgttc tattcaggat 840 

gattatgtta aattgaaaaa gacatttaag gtgtatattc gattggtttg ttttattttt 900 

tttcctataa tgataattct tgctgtatta tctgaaccaa taattaaaat tgtattaacc 960 

gataaatggt taccatcggt tttttatgtt caaatattgt gtattgctta tatgtgggat 102 0 

1080 



ccattaatga gaataaatgc taatatttta agtgttgttg gccgaacaga ttattcgttg 

aaaagtgaaa taattaaaaa ggttatctcg gttattgtac tatttattac tatacctttt 1140 

ggaatagatg ttatgtgtat cgggttagct ttatattgta ttatagattt attggtttca 12 00 

acatattatg tgaaaaggat tattggactt ggtttctggg atgaaatgag aaatatttat 12 60 

gcattcttta ttctgtcttt agttattgga ggagtagtgt ttgttgttaa tatatttgtg 132 0 

gagtctgatc ttctcaaaat ttttatagga actttggttg gaatagggtt gtatatatcc 13 80 

atgtgtatca tattccgaat taaagaggta tttgactttt ggagtattat taattcatat 1440 

ctattgcaaa agaaatga 1458 



<210> 926 
<211> 579 
<212> DNA 
<213> B.fragilis 



<400> 926 
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attaacaaga tgctgaacat tgtaattttc ggtgctcccg gttcaggaaa gggaacacaa 60 

agcgaacgta ttgttgagaa atacggaatt aatcacattt caacaggaga tgtattgcgt 12 0 

gcagaaatta aaaacggcac agaactgggt aaaacagcta aaggctacat tgatcaggga 180 

cagttgattc cggatgaatt gatggtagac attctggcaa gtgtgtttga tagtttcaaa 240 

gatagcaaag gggttatttt tgacggtttc ccaagaacta ttccacaggc tgaggcgttg 3 00 

aaagtgatgt tgaaagaacg tggtcaggac atctctgtga tgttggatct ggatgttccg 360 

gaagaagaac tgatgactcg tctgattaaa cgtggtaagg aatcgggccg tgcagatgat 420 

aatgaagaga ccatcaaaaa acgtttggtt gtatataata cacagacttc accgttgaaa 480 

gaatattata aaggcgaagg caaataccag catatcaatg gtcttggaac catggaaggt 540 

atcttcgaag atatttgtaa agcggtagat acattataa 579 

<210> 927 
<211> 474 
<212> DNA 
<213> B.fragilis 

<400> 927 

ttctattgga tattattact cttctataca aataaaaacc gaaaagaatc gacgaatcat 60 

aattcattcc tttttaaaat gaaaaattct ctgtgtaatt ctgtgtcctc cgtagtgaag 120 

aaactcttgg agtatggcga ggacggtact cctgtctatg tgaatgaact tactgcgtta 180 

aatcaagaac tccgtaactt gtgtgctgat cttcttcttc agaaaggaga atctcccgaa 240 

gaagaggctg aaatacttgt tactttgttc aaaggttacg ataccatgct gtttaatttt 3 00 

tcctctgaga atgaacaggt tattcaagaa ttattggatc gttcaatgac tgttttagaa 360 

aaactaccag cctctgtatt gaaatgtcag ttgctgctgg agtgctttga gcagacggga 420 

gatgaggaac tgataagaga agtgaaattt acttttgaga gttgtggtat ttga 474 

<210> 928 
<211> 1965 
<212> DNA 
<213> B.fragilis 

<400> 928 

aagttatact ttattatttt atctttgttc gcgtacagca agacaaactt tatggctatg 60 

gcacacacac tggattcatt cccggatacc ggagaccttg agaatttgaa agataattat 12 0 

cagaaaatca cttctgtcct ggcaggacat cagattgcat tttgggaata tgacattcct 180 

acaggagagt gtaatttcac agatgaatat ttccatattt tagggttgaa ggaggccgga 240 

atcatattca gagatattaa tgacttttat cggtttgccc atccggagga tgttatctct 3 00 

taccaaacga cttttgcgcg gatgcttgaa tcggaaacca aaatctccca aattgtggta 3 60 

cgttgtgtag ggaggcaagg agaaacaatt tggcttgaag ataattttat tgcttataag 420 

aagaataagg agaatggctc tgataaaatt atagcatata ctgccaatat cacttcacgt 480 

tgtgaaaaag aagtccagat caggcagctt gaggaacgaa accggaaaat tattgaagca 540 

ctaccggagt tcatatttat ttttgatgat aattttttta ttacggatgt attgatggca 600 

cccgatacag agttgttgca tccggtggaa gtgttaacag gagcagatgg gcgatctatt 660 

tattcttctg aggtcagtga cttgtttatt agcagtattc atgaatgcct aaaaagtggg 720 

aaattaaaag aaatagagta tcctgtggat gtcgaagccg gcagacattt ttttcaggca 780 

cgcattgctc cgtttgaggg aaataaggtg ctggccttga ttcatgatat tggtgatcgg 840 

atgcgacgtt cgcaagagct acttgaagcc aagcaacggg cagaagaggc tgatcggatg 900 

aaatcagtat ttctggccaa tatgagtcat gagatacgta ctcctttaaa tgctattgtg 960 

ggcttttcgg aaattatagc tttgactgag gatgaaaagg agaaagaaga gtatttaggg 102 0 

atcattcagc agaatagcaa tctactgtta caactgatta atgatattct cgatttgtca 1080 

cgaatcgagt cgggtaagtc ggaaatgcat tgtcagttga cggaaatgag cggattggta 1140 

gatgaagtgg ataaagtaca tcgtcttaaa atgaaaaaag gagtcaagct gaatgtgatt 1200 

cgtccatcag aggaaatttg gatttcgaca gataggaatc gggtgacgca ggtattgttc 1260 

aatttcttgt cgaatgcaat taaaaatacc attgagggta gcattacttt cggacttgta 1320 

aaagaggaag aatgggttaa actttatgta acagataccg gctgcggtat ttccaaagag 13 80 

aaattacctt tgatatttac ccgttttgag aagttgaatg attttgtaca aggaacaggg 1440 

ctgggattac ctatctgtaa gagtattgta gagcggttgg gtggtcggat tgaagtggaa 1500 

tccgagcttg ggcaggggag tactttcatt ctttatttgc ccaataggca agtacaggaa 1560 

gttgtggttg gcgaaagaga aaacgcagca ggtaatatgg gagtggagaa ccggcagaag 1620 
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aagatactga 
aaagaatata 
gagaagcccg 
acagcaaaaa 
ttttgtccgg 
tatcctctgg 



tagcggaaga 
cgattctttg 
acttgatttt 
ttcgtgctat 
aaggagagcg 
agaagctgaa 



tgtggagtcc 
ggtgcctaat 
gatggatatc 
ctcgcaagag 
agctcttgaa 
agaaacgatc 



agttatctgc 
ggagaagaag 
cgaatgccgg 
ataccgatta 
gcagggtgta 
gaaacttatt 



agattaatgc 
ctgtgaagag 
tgatgaatgg 
tagcaattac 
atgaagtgat 
tatag 



ctttctgaaa 
tttcatacgc 
tattcaggca 
agcatatgcc 
tgcaaaacca 



1680 
1740 
1800 
1860 
1920 
1965 



<210> 929 
<211> 633 
<212> DNA 
<213> B.fragilis 



<400> 929 

ccggatagct 

ctttgtgcgc 

atgcattttg 

ggattgtcca 

atgggatatt 

ctgcaagcat 

gcccactaca 

atgtttgacg 

aaagaagtga 

tcacagaatg 

aagcccaaat 



ttattaactt 
tatttattaa 
gcacctatat 
ttccgtttct 
attatgcacg 

gggtattcat 

tttatttccg 
aactgaccaa 
tggaaatgat 
tgttctatgg 
caccggaggt 



taacaggcat 
taaaatgaca 
gggagtatac 
tttattcctg 
tacctatcgt 
cgtttttatg 
gttcatcgac 
taaagaagta 
cagtagattg 
cagcatattg 
gcaacctcta 



aaattccata 
gaaagcagaa 
tggatactta 
tttttcgggc 
gacaaagtat 
tatatgtttg 
catggtttca 
ccgggaatag 
acaccgatag 
gctgtcccca 
tag 



ccaaacaata 
gtaacttaca 
agttcattct 
ttactttagg 
gtggcggctc 
cggcactcct 
ttgtaaacac 
aagggtacat 
acattactat 
ctgccttgtt 



ttatttgtac 
aaaatatgcc 
attcccattg 
agttccgttc 
gatccgcttc 
cacggcagtg 
ttacatggga 
cagccaactg 
gcaactgatg 
tgtgatgaga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

633 



<210> 930 
<211> 3885 
<212> DNA 
<213> B.fragilis 



<400> 930 

aattttatag 

ctgttcgccc 

tccgaaatcg 

aatccgacaa 

ggggaaaaca 

gcaaaacctg 

ttgaatgatg 

aagaatactt 

gtagaacccg 

gaatattttt 

aatattgtaa 

tggtcgacca 

ggtgcagacg 

tatatggtta 

ccggtgttgc 

tattggaaag 

cagaaagata 

tcggcccgtg 

ccgaatgatg 

aatctgaaaa 

caatcggatt 

gaagttagag 

tattcgttcg 

gatgcacgtc 

atctggtcgg 

tatatcggtt 

ggcggcaaga 

cagttgaata 



taaaaatcat 
tcgctttgcc 
ttagagaagt 
cgattgaagt 
tcttcagagt 
aagcacaaat 
gcagcaatct 
ctttgatgaa 
tactgttcga 
atggtggcgg 
acgagaatag 
atggctatgg 
agaagggaaa 
gcgacggagc 
tgcccaagtt 
aagacgaaaa 
atggcggaat 
cagtgattga 
gatatggtgc 
gtctgggtga 
tacatccgaa 
atgccggtgt 
gattgaatgg 
cgttcattat 
gtgaccagac 
caggcctgtc 
atatgattgt 
tggacggatg 



gaaaaacgca 
tctgatgttt 
gacacaacaa 
attattctct 
atttcaggac 
tctggtaaac 
catctctatc 
agtgattgac 
taagggaaaa 
tgtgcagaac 
ttggactgac 
tatgatgtgg 
agtgaaactt 
tgtgggcgtg 
cggtttctat 
aggaatcttg 
caaagaatca 
ccgttacaag 
cggatacgga 
ctatgcccgt 
agaaggcgtc 
gcgtgtgttg 
agtggcagat 
ttcacttgac 
aggcggtgta 
aggccaacct 
caacaccaga 
gggatctaat 



attggtaatt 
tccggttcgc 
aacttgaaga 
aataatcaga 
aatgccggag 
aatcccagaa 
actacgggga 
ttggaaaaga 
gtgactgtaa 
ggacgttttt 
ggtggagtag 
tataccttca 
acacatgatt 
ttgaatgatt 
cagggacatt 
ttcgaagatg 
ttgaacggtg 
aatcacgata 
cagacggaga 
aagaatggcg 
agtgcattgc 
aaaacagacg 
gtgggtcaca 
ggttgggccg 
tgggagtaca 
aatatttctt 
gacttccagt 
gaaaagtatc 



gtaggcagcg 
ctgcgcaggc 
tcgttagtgc 
gaatgacatt 
gaatcatccg 
acacagtttc 
aaatcaaagt 
atactgttgc 
ccctgaaaga 
cacacaaagg 
cttctcctgc 
aaccgggtaa 
ctccatatct 
tctatcagtt 
tgaatgctta 
gaaagcgtta 
aaaagaataa 
tgccgttggg 
cactcgacgg 
ttgaaattgg 
tgcaaagaga 
tagcatgggt 
ttatgcctta 
gtacgcaacg 
tccgtttcca 
cggatatgga 
ggaaaacttt 
ctcacgctct 



aaaagcagtg 
aatgcacagg 
aaagaaaatc 
cgatttctac 
tgatccggaa 
tacactcaac 
ggaaatcgac 
ctttgaagag 
aaatcccaat 
aaaagccatt 
gccattctat 
atatgatttc 
cgatctcttt 
gaccggtaat 
caatcgcgac 
taaagaaagt 
ttatcagttc 
ctggctcctg 
aaacattcag 
attgtggaca 
tatcgtgaaa 
cggttgggga 
ttacggtaac 
atatgccgga 
tatcccgact 
tggcatcttt 
cactccgatg 
gggcgaacct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 



378 



gccacttcta 
agttttgcaa 
ccgaatgctt 
ttagtagccc 
ggcatttatt 
aactgtgttc 
ggagctatca 
cgtatctacg 
atatctgaag 
tcaaagaata 
aaagagaaag 
cagataggta 
ggtgaaaacg 
agtgaatttg 
accgatatta 
gataattaca 
gatattgagg 
gaaatagaat 
ggactggctg 
tcggattggg 
ggaatcaagg 
gattttgccg 
gatatgatta 
accgatggtg 
aactggaccg 
tttactgaac 
tatggttcgg 
ggtgatatca 
accggtctga 
cagaatggtt 
gtaagtgaag 
aattacagta 
aatgcgttga 
cctctgaacc 
aaagcactct 
gatctgttta 
gatggcgttt 



tcaatcgctg 
aagaggctgt 
acaccttggg 
ccatttacaa 
tgccagaagg 
tcaacaattt 
tcccgatgac 
aaatctatcc 
catataaaga 
atgtgaagat 
ccacagaatt 
aaggcaaagt 
tatacttcta 
agaagaaggt 
ctaaaaatca 
gagtaacctc 
cttatacctt 
tcaatggcat 
ctgaaacaga 
ctgaattcgg 
gtgaaacaac 
agctgggtga 
tcgatctgcg 
gtaacggaac 
aggccggagc 
gtccgactgc 
gtagggaatt 
ataacgatgg 
gaaaaggtga 
tgattgacgc 
agccgattga 
agggtgatgt 
gttttgcgtt 
tcaaagctat 
atcctacatt 
ttctgaagct 
tggtcgacaa 



gtatctgaaa 
aaccggtatg 
aacagctacg 
agctaccaaa 
agagtggatc 
tgccgctcct 
caatcctaat 
gtataagcac 
gggtaaagga 
ttctattcgt 
cagagtaaat 
gaagttgacc 
tgacgctgct 
tatcactaag 
agttgtaatg 
cggttcgttg 
gaagccgaca 
gctgtataca 
ctatacattt 
tgctaagaca 
tgcgaagaat 
tatgtggcat 
aacagtgaac 
gatcctgaaa 
aatcgactgg 
acgctatatc 
gtatgtattt 
taagatcgac 
ttccgactat 
ttatgatatt 
gaaactggat 
tgttgaagtg 
gccatacaat 
ggaaaatctg 
tgttaacctc 
gaaagcgaaa 
gaacctgaac 



ctgaaatcgg 
ccgcttatcc 
cagtatcagt 
gcagatgcta 
gattatttta 
ctctggaagc 
aataacgttg 
atgatgaccg 
accactactt 
cctacacagg 
gttactgcta 
gaagtgtctt 
cctaatttga 
aatcctcaag 
gatatcgaag 
accgctcctg 
tggaacaaag 
acaataaagg 
aagattcgtg 
aaagctaatc 
caggaaggat 
acgaagtatg 
cagctggata 
ggtactgtat 
aaacgtaatg 
aaactggctg 
aaagttcccg 
aataacgatt 
gaaggctata 
tctgtagtgg 
ggtaccattg 
cttgtaaaag 
cagcaggatt 
acttatgaca 
ggagctaaag 
cgcgctgtga 
acacgtaagt 



aattattgcc 
gtgccatgtt 
ttatgtatgg 
aaggcaatga 
ccggagagaa 
ttccggtatt 
ctgaaattaa 
ttgaatatga 
tcattgaatc 
gtgatttcga 
agccgaagaa 
ctatggatga 
ataagtttgc 
ttcttgtgaa 
gcttccaata 
ctgcccgaat 
tgccgaatgc 
atactgagtt 
ccgtaaacaa 
cgctggaatt 
ttgatatcga 
gagcaaaagc 
aattcgaata 
attatagtat 
gtgatgtgaa 
taacagaagg 
gaaccgagag 
taacctcgta 
tcagtgtagg 
caacacaatt 
aaatcagtac 
gtgtcaacct 
atgaattcgt 
gactccatac 
aagccctcga 
agtttgatct 
tttaa 



ttacacttat 
cctggaatat 
taccgatttt 
tatccgtgat 
atatcagggt 
cgtaaagaac 
taagggactt 
cgatgacggt 
gaatgttgat 
cggctttgta 
ggtttctgct 
tttccggaaa 
tacaaagggc 
actggccgct 
tgcacctgcc 
tgctgccgaa 
tgatttttat 
gctgtttgac 
ggatggctat 
tgctcttcac 
tcgtttgttc 
actgccttat 
cctgccacgt 
ggataaggaa 
agtatttaca 
tgtaaacaac 
ccgtttgcag 
taccaactat 
cgacattgat 
ggaagatggt 
tgctaaacgg 
ccgctctgta 
gggtgtagaa 
caatggtaca 
aggaacaaac 
gaaagctatt 



1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3885 



<210> 931 
<211> 1050 
<212> DNA 
<213> B.fragilis 



<400> 931 

atatttgtag 

tctgtcacca 

tacgataaaa 

ttgctcaagg 

gatggtacgg 

attccggacg 

tgtaagttct 

ttgaatcaaa 

atgggcgagc 

tcgtatggat 

ggacttcaac 

tttccttcac 

gtcgatctgt 

gtttttaagg 

gggcttgact 



taaatatgcc 
aagatttggg 
aagtgacttc 
gagagtatga 
tgaagtatct 
aggatcgggc 
gcatgacggg 
tcgcagcatt 
ctatcgataa 
acggatggag 
gctttattga 
aacgttctga 
taaaaaacta 
gcgtcaatga 
gccgggtgaa 



caaatatccg 
gatgcctgct 
tattgatgaa 
tttggggata 
ttatcaggtg 
gacattgtgc 
taaacaagga 
gccagagcgg 
tttggatgaa 
ccccaagcgt 
agaaagcgaa 
gttgatgccg 
tgattttagt 
ttcgctgatt 
cttaatccgg 



cttttaggaa 
tttgcagcca 
atgaccaatc 
tccgcgcctg 
agtgacaatc 
gtgtcttctc 
tttaccgcaa 
gataagttga 
gtattaaaag 
attactttgt 
tgtcatctgg 
gccgaaaggg 
aaacagcgca 
tatgctaagg 
tttcatgcca 



tgacccttac 
aacagatcgc 
tgtcgttgaa 
ttgatgagat 
attttgttga 
aggtaggctg 
gtctgacagc 
ctaatgtcgt 
cactgcatat 
cgtctgtagg 
ctatcagtct 
ctttctcgat 
gactttcgtt 
aactgttgaa 
ttcctggggt 



cgaattgcaa 
ttcctggtta 
gcatagagag 
gcgttccgta 
agcggtgtat 
taaaatgaac 
caatcagatc 
gatgatgggg 
cctgaccgct 
attgcggaaa 
gcattctcct 
taaagaaatg 
tgaatacatt 
attgctgcgt 
agacctcgag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 



379 



ggtgccggta tggagactat gacgtcattt cgtgactacc tgacctcaca tggactgttc 960 
actaccattc gggcttcccg gggagaagat atttttgccg cttgcgggat gttatcgacg 102 0 
gctaaacagg aggagagtaa caagaattaa 1050 

<210> 932 
<211> 228 
<212> DNA 
<213> B. fragilis 

<400> 932 

aactgtgcag ttcagcagat gatacctgct tttgcgtcgc tttcgttcat cgtttataaa 60 

gaaagggagg cagagcagat gccccggatg gaagacaaac agcaagtaat tggagcctac 12 0 

cgtggggtgt tcggagaaaa gagcaaggaa agcgatgatg catcctgcga tacccgctgc 180 

cgcaaacact gcgatatcca gtccccaaag actctttttc ttgcgtag 22 8 

<210> 933 
<211> 207 
<212> DNA 
<213> B. fragilis 

<400> 933 

aatgaggaag tggtaaccac ttatgtaggt gcaaagatag ataatattcg tcatatagaa 60 

agaaaaagag ggtggatttt agatatttta ttcattccgg aaagaaacac aggagattat 12 0 

actaactttg tattttcaga atgtaacaaa gacaaccaac aaaatctcaa aacaacatct 180 

attcattatg gtcaagaaaa tcattag 2 07 



<210> 934 
<211> 198 
<212> DNA 
<213> B. fragilis 

<400> 934 

cgcacaaagg tacaaataat attgtttggt atggaattta tgcctgttaa agttaataaa 
gctatccggt cagaaaaaaa acattttttt ttcgatttta aatatctgaa aagtaggagc 
ttaccggtaa agcatgcaaa aaaataccga tcccctattg caggtttaat aaaaatacct 
acctttgcac ccgggtaa 



60 
120 
180 
198 



<210> 935 
<211> 183 
<212> DNA 
<213> B. fragilis 

<400> 935 

ttttttctat tttcattgtt tctcttgttt tttagtctct gcaaatttac tcacttatgg 
gaagcggaag aatcactatt aataggtatt ccgttcgggt tccttcccta taaatcagta 
tcattacttt tatattataa aacaaagaaa agagaagaaa ggttcaatcg tcttctattt 
tga 

<210> 936 
<211> 192 
<212> DNA 
<213> B. fragilis 



60 
120 
180 
183 



<400> 936 

agagaacaat ttagtattac cggcaatgcc ggaccttata aaacagcaac ccatgtcgtt 60 

atggaatttc tgtttggaac ttcatgtcgg tcgttgttat tccccattct taggtgtttt 120 

ttcagatcta ttcgttttta tttccggcct tctgctaacg ttaatcctta tttcgggata 180 

tatcgtatat aa 192 



380 



<210> 937 
<211> 198 
<212> DNA 
<213> B.fragilis 



<400> 937 

gaagtgtttt tcacttcagc atatatagca gcccctgcaa aaacaggaca tgcttctttt 60 

aactctttcg atacaataat ggtatacata ataaatcaaa attttatcta tcagctattt 120 

atcataatgc ccgacaaaat acgtccggac tttatgccca atcttcgcgc tgaaaagaaa 180 

tcctcatgcc ggatgtaa 198 



<210> 938 
<211> 1260 
<212> DNA 
<213> B.fragilis 



<400> 938 

atattttatt cattccggaa agaaacacag gagattatac taactttgta ttttcagaat 60 

gtaacaaaga caaccaacaa aatctcaaaa caacatctat tcattatggt caagaaaatc 12 0 

attagtatct gtgctgccgg tatgattgta gccagttgct cccccaaaaa gacaacagct 180 

cagccgacag atccgtcaac cactgacagt gaattaacaa tgctggtcgg aacttacact 240 

tccggcaaca gcaaaggcat ctatactttc cgattcaacg aagaaaccgg agaatcgctc 3 00 

ccactgagtg atgcggaagt agcaaaccct tcatacctca ttccatcagc ggacggaaag 3 60 

tttgtctact ccgtcaatga atttagcaaa gaccaggccg cagtcagcgc ctttgccttc 42 0 

gacaaagaaa aaggaactct acacttattg aatacacaaa aaacaatggg agccgatccg 480 

tgctatctga ccaccaacgg aaagaacatc gtcacagcca attatagcgg tggaagtatt 540 

accgtctttc ctatcggaca agacggagca ttgctacccg cctcagacgt aatcgaattt 600 

aaaggttccg gtccggacaa agaacggcag acgatgcctc acctacactg tgtacgtatt 660 

acccccgacg gtaaatattt actggcagac gacttaggta ccgatcagat acataaattc 72 0 

aatatcaacc ctaatgccaa tgccgataac aaagagaaat tcctcacaaa aggtaccccg 7 80 

gaagctttta aagttgctcc cggttccggc ccccggcatc tgatattcaa ttcagacggt 840 

aagtttgcct accttattaa tgaaatcgga gggacggtaa tcgcttttcg atatgctgac 900 

ggaatgttgg acgaaattca aactgttgcg gctgacactg taaacgcaca gggaagcggt 960 

gacatccacc ttagcccgga cggaaaatat ctctatgcca gcaaccgctt gaaagcagac 102 0 

ggagtagcta tctttaaagt tgatgagacc aacggtaccc taaccaaggt aggttatcag 1080 

ttaacgggaa tccatccacg caactttatc atcactccca acggcaaata cttattggta 1140 

gcttgccgcg acaccaatgt cattcaaata tttgaaagag atcaggctac cggattatta 1200 

actgatatca agaaagatat aaaagtagat aaacctgttt gcctgaaatt tgtagactga 1260 



<210> 939 
<211> 1797 
<212> DNA 
<213> B.fragilis 



<400> 939 

actatgtgtt caaaaataaa acatatatta ctgactgcgt gctgtttcac aggcgcagga 60 

ctgatgacaa gttgtaatga cgggtttatg gatcgctttc cggaaacgag tattacagag 120 

aaagtctttt tttcttctcc tgctgatttg gagacttata ccaatggcat gtacggctat 180 

atcggtgcaa gctattcgga tactccttcc gacaatatgc tttacccaga agataccgat 240 

atttataaaa tgatgcgcgg cgaatatcgg gcggataata taggtaaatg gagctggagc 3 00 

aacattcgta cagtcaattt tatgttggct cggacaggtc gtgtagaagg agatcgcggt 3 60 

gagattgatc attatattgg gttggcacgt atgtttcgtg cactggtcta ttattcaaag 420 

gtgaaagatt attcggatgt accttggtat agccatgacc tgcaaacgac ggacattgat 480 

ttattgtata agccgcagga ccctcgagca ttggtggtag actctattat ggcagatctt 540 

gactttgccg taactcatat gaaaacgact aaaagcacga ctcgtattta tcgtgatgcg 600 

gctttggctg tacaggcacg gattgctttg catgaaggaa cgttccgtaa atatcatccg 660 

gaactgaagc tgaatgacgg cgaccgattc ttgaaaatag cggtagaggc atgccagaag 72 0 

attatggaca caaaaagtta cagtttgtct acaaccaaag agagtggttt accggcctat 7 80 

cagtcacttt tttgcagtac ggatcttaca cagaatccgg aaatgattct ggtagctgac 840 



381 



tacgacaagg 
ctttcccgta 
catcaagtgg 
cgcctggaac 
cgtacgaaat 
cagattgact 
ttgatgtatg 
acaattaacc 
gctaatatcg 
gctgttttgg 
ggcgatttga 
attccgggga 
gaaaagaaag 
gtttatgctt 
ttggttgccc 
gctaccaaag 



cgttaggacg 
gcctgatgga 
aaggctataa 
aaacatttat 
tgaacttggg 
ggggaaaatc 
cagaggcaaa 
tgatcaggca 
atccggtaca 
aagtgcgccg 
tgcgttgggg 
tgggatacta 
cagatataga 
tggaaggtaa 
aacataataa 
atataactgt 



tctgcacaat 
agattattta 
aacgaagaca 
gaaaccgggt 
aggatatcct 
gtatacagat 
agccgagctg 
gcgtgcaggc 
ggatgaacgt 
tgaacggaga 
atgtggaaag 
tgatgtgaca 
taaaattccc 
taccatcgga 
gtatactttt 
taacgagaac 



gctcaggctc 
gttgttaagg 
gtacttgaag 
gttttgaatg 
cagattaagt 
ttgcctatta 
ggtatactca 
atgccggatg 
tactccaatg 
atcgagttgg 
ttgtttgaag 
ggtgacggtc 
gaagaagaca 
cttaccgaag 
gtatctccaa 
ctctatcaga 



agtttgacta 
atggacatac 
tctttgaaaa 
tgggaaccac 
tccgtccgct 
tccgttatgc 
cacaagatga 
cttcgttgga 
tacagtcagc 
catgcgaagg 
cagctcccga 
aaccggatgt 
agcaaaagta 
gaacaaaagg 
aatattatta 
atccattctg 



caacaccggt 
cgagtatttt 
cagagatccc 
tgaacctcat 
gacattcgat 
cgaggttttg 
tgtcaaccag 
tgattggctg 
acagaaaggt 
gttcagatat 
aggagcttat 
cgctatagta 
taaactgaca 
ctatatctat 
ctatccggta 
ggaatag 



900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1797 



<210> 940 
<211> 396 
<212> DNA 
<213> B.fragilis 



<400> 940 

tattccgaat 

tgctccccct 

ccatccttac 

tcaccatctt 

tctacgccat 

ccggtatctc 

gtataccccg 



cagcgggatc 
tattcccgtt 
cgtccgtacc 
ttccgtccgt 
ccttaccgtc 
ctttcttccc 
caccatccgc 



ctgcataatg 
ggtaccgtcg 
attcgtacca 
accgtcggta 
cgtaccgtct 
gttacggata 
cagttcctcg 



ccaaccaccg 
acaccatctt 
tctgcaccat 
ccgtccacac 
atgccgtctt 
gtaatcggag 
atatga 



gattctgtcc 
tgccatcggt 
ctttaccatc 
cgtctttacc 
tcccggcatc 
cacttttgga 



ggggtcacct 
accatccacg 
ggtaccgttc 
atccgtacca 
tccatccgga 
gaaggagata 



60 

120 

180 

240 

300 

360 

396 



<210> 941 
<211> 204 
<212> DNA 
<213> B.fragilis 

<400> 941 

ggtataggac cttcttattt agtaaactca ttcaaatttg caagtccgaa ctatccgttt 60 

ttatacatta gatcaaatat tttttttaaa cctcttagag atttcacctc acaatgccct 12 0 

ccctaccgta tagatacctt ctgttattta agcatattgg aaagttacac ctgctggaat 180 

atgttcatac cgaaaaaaat gtaa 204 

<210> 942 
<211> 891 
<212> DNA 
<213> B.fragilis 



<400> 942 

gatattggta 

aatttgactt 

atcttaaaag 

ggtactatga 

tttttctggg 

ttatttgcat 

atttcggata 

tttttacagc 

gcgaagcagc 

attttgttca 

tatctcagtg 



catttttgtt 
ctgatttgct 
ggaactcctt 
aagtttcctc 
ataaagaaga 
ttggtgattt 
gttctgtttc 
ttcttgcaca 
gggaactgtt 
gttccttgac 
caaaaaatgt 



tttttataat 
acccaatcag 
attgaagttg 
tgctcagcaa 
tgactatacc 
aatagtacat 
aaaagatgtg 
gtacatggag 
ctatatcctg 
ggaacagtcg 
gggtgagttg 



atacatcaac 
atttttcaaa 
gatagcaatg 
gagttggcta 
tgcgagatgc 
gatttattga 
ggacttaaat 
atgaatttat 
aattctgttt 
tccaggttta 
gccagcctgt 



ctatgaaaac 
ccggatttag 
ttttgatttt 
ccgtcagaga 
tttcggactc 
cattccgtcc 
ttgccgagcc 
atgatctctc 
ataatgagca 
aagaacagat 
taggctatgg 



cttaattgaa 
ctttttaatc 
tattatgagt 
gcggcatatc 
acaggtcatc 
tttcggagcc 
tttgaattct 
cttgtatata 
ggaacttgcc 
actggagaac 
cgtcactaat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 



382 



tttcgggcaa agtttaaaga acagttcgga gtgtcggtct accgttggct tctcaatcgg 72 0 

aaatcgcaac atatcattta ccgtattacc gtatatggtg atgagtttag ccagattatc 780 

gatgacttcg gattttcatc tccttcacac tttaataaat tttgcaggtc acaatatgga 840 

ctgacacctt gcgaacttcg caagaaattg aaaacaaaca ataattctta a 891 

<210> 943 
<211> 993 
<212> DNA 
<213> B.fragilis 



<400> 943 

caggaggtaa 

aagtttatat 

cgtgcaaaag 

aaaaattatc 

gatgttattt 

tatgattatg 

acatatcttt 

gatggcacac 

gtacctaccg 

aatgaaatac 

cccatgatta 

aaagacatgt 

aatgcaggtt 

actacatgga 

tattaccaag 

aagcctttga 

actactttta 



cttctgtcat 
atatttttgt 
agaaagaatt 
ctgaaaaagg 
tgatgattga 
tattgttatc 
tccctgactc 
cggtccgttt 
aacaatcgga 
gccgcatact 
tgggtggaga 
atcatcatgg 
tcaaagacag 
tctatgataa 
gtaagaccat 
aatttatggg 
agatatcacc 



atctccatgt 
tgcaattgtt 
aaaagttttg 
atgtgaaggt 
gacttatggt 
agacaatctg 
catttctaca 
attcgatacc 
aacagatatt 
ctctgtcttg 
tttcaatgtg 
tggtgcagtt 
cttccgcgaa 
tgaagataaa 
tcgtgcaatt 
agaggagttc 
attagaaaaa 



attattttca 
gcaataagct 
tcatggaatg 
actatcggga 
gcggctccaa 
tgtatttata 
ttcaattttg 
tggttgcatt 
ctcgcatggg 
cagccgatga 
cattcccatc 
gtggagtgga 
atacatcccg 
ccactccgat 
acctccgaat 
ttttatcctt 
tag 



ataatagaat 
ttgcttcctg 
tatggcatgc 
ttctgagaaa 
tggttgcaga 
gccgctatcc 
gaggagtcga 
atttacctga 
atgatgccgg 
ttcgtcagac 
ttgattggac 
ctgtatctaa 
aaccggaaaa 
ccgaccgtat 
cttataatca 
ccgatcacgg 



aggtatgaat 
tacgcaagaa 
gggacatgct 
gagtcaggcc 
ctctttggga 
gattaaaaaa 
aatagacatg 
tatgcgcttg 
tacccgggat 
tgacagtatt 
tgatgcaacg 
agagatgcaa 
aaatatagga 
tgattttatc 
ggagttgacc 
ctttgtgatg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

993 



<210> 944 
<211> 1296 
<212> DNA 
<213> B.fragilis 



<400> 944 

tatgacatgg 

tcaatcatgg 

agatgcagca 

tatttctgtg 

tatcatccta 

acacaggaaa 

gacaaactca 

gaggaatacg 

ccgacctaca 

gtctatatcg 

aagagattct 

tgcggttcct 

atccgtgcca 

acggaggaga 

ggcaagtgct 

tgggaaggcg 

gacattgttg 

aacggattcg 

ctgcttactg 

gcttttgggc 

cctgccaagt 

gcttatgcaa 



caaaaataca 
agaaatttga 
gtatcttcgg 
gcggctcatg 
cccttcgtac 
acatctccta 
acacattgct 
atgttgactt 
aaaagttcct 
agaacagcga 
tcgctcttct 
gctcgaagga 
accgatgcag 
ttaacggcat 
atcgtcttgt 
aatacactta 
aattctacaa 
gttggagcag 
cattgataca 
tcaagaaaac 
ggatcatgac 
aacccttcaa 



aattaaatct 
ctccatgctt 
atatcagttc 
cgtggaagat 
atgcagctct 
tacttccgac 
tataaacgct 
tgaccatcag 
cggctacagg 
tggtaacacg 
ggaatcccag 
aatcgtcagt 
ttcgctctac 
ccagttcgaa 
catccagaga 
ccgttgtatt 
tctgcgtggc 
gctccccaag 
caatttctac 
gagtcgcata 
tgcaaggcaa 
aacagaattc 



gagaaactca 
tcacccgtta 
agcgagatag 
gtaacgtcac 
gataccatcc 
caaggcaaga 
ttggtttcta 
ttccttgaaa 
cctggcgtat 
aatgtgcgtt 
aacatccgtg 
gagatagaga 
aatgacatct 
ctcaattcca 
caaagacgca 
ctgaccaacg 
ggcaaggaac 
tcattcatgg 
aagaccatca 
aaggcttttg 
tacgtgctga 
ggataa 



caccttttgg 
tcgactcaac 
tccgttcgct 
aactgatgcg 
tcagagccat 
cctatgattt 
caggcgagtt 
cggagaagta 
atgttatcgg 
ttcatcaggc 
taaatcgctt 
agcattgcaa 
ttgctctgag 
ttctcgttga 
acagtggcga 
attacaagtc 
gtatctttga 
cggagaatac 
tgagcaggct 
tcttcagatt 
atatctacac 



aggaattttt 
actgggtcag 
gatgagcgtt 
ccatctctcg 
caaggaactg 
caatactgca 
gaaggaaatt 
tgatgcaaaa 
tgacaagata 
agacacccat 
cagggcagac 
acatttctac 
aggatggaag 
gaaatgggaa 
ccttgacctg 
atcgacaagg 
cgacatgaac 
tgtctttctt 
tgacaccaag 
catctccgta 
agagaaccga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1296 



<210> 945 



383 



<211> 252 
<212> DNA 
<213> B.fragilis 



<400> 945 

gttcctgagc aacaaaaagt tgcccaggat tttgccatgt cagaattttc acttatctta 60 

gtgttgcaaa aagaaaacaa gcaaaactct aatatgacat ggcaaaaata caaattaaat 12 0 

ctgagaaact cacacctttt ggaggaattt tttcaatcat ggagaaattt gactccatgc 180 

tttcacccgt tatcgactca acactgggtc agagatgcag cagtatcttc ggatatcagt 240 

tcagcgagat ag 2 52 



<210> 946 
<211> 540 
<212> DNA 
<213> B.fragilis 



<400> 946 

cgaactatta 

cttctgctga 

tgggatttca 

ctggacccac 

aaatcctttc 

ttgagaaaag 

ccggaacatc 

gtggtaaaat 

ctttacttga 



atgctgtata 
ccggatgtta 
taaactataa 
aggttgcgtc 
cattggaaaa 
aagtgctggg 
aatataaggg 
tcgacttata 
atgacaaaga 



tatgaaaaaa 
tggagaagaa 
tatttatttt 
gaatatattg 
ttctgtggat 
ggaagcgaaa 
agagaccttt 
tatcacctgg 
atacagtaag 



cattttgtat 
gataaagaga 
tctgtgaagg 
ggtaatgaga 
acccgcttta 
gagcgtgtgc 
acgattcatt 
aagaaacaga 
gattctttcc 



ggttgctttt 
aggaccaggg 
atgccgccgg 
ttactgtgga 
atatgccccg 
tttcttttgg 
ggggagatgg 
accctacaat 
tgataaagat 



attatttccg 
agattatatt 
caataacttg 
atatggggat 
cccgttggga 
tgagttttct 
aacgaaagat 
acataaaagg 
cgtgaaatag 



60 

120 

180 

240 

300 

360 

420 

480 

540 



<210> 947 
<211> 279 
<212> DNA 
<213> B.fragilis 



<40O> 947 

aatctttcta atatttctgc ctttcatttt attaccttac aaaaccaaaa actcttcttt 60 

ataatcaaga tatcacccaa tcacaagacg cgtgaattac actatcgcaa aatccacaca 12 0 

tttttattca gcccatctcc ttcggaccgt aaaaaagagt ttctacacat tttcaaaata 180 

caaaatgaca ggattgtaag tgatcatgaa aaaacaagtt gtgataccta ttatgtgcgt 240 

tttctcttac catccgggat tctgtaccag tttttttga 279 



<210> 948 
<211> 2136 
<212> DNA 
<213> B.fragilis 



<400> 948 

atcttttaca 

gaaagaaaca 

ggagtttatg 

gtgcttaaat 

gacatgaacc 

aaaattcttc 

gcggagaaac 

gttacagaca 

accggtgtta 

cttaagtttt 

atagacattg 

gctgttcaga 

gaggaccgtc 

gtatcggtcg 



aagatatgaa 
ttcgtttagt 
ctcagcaaac 
cgatcgaatc 
gtaaagttac 
cggactgcaa 
aaaatacccc 
cacgaggcga 
tcactaatat 
catacgtcgg 
tattggagga 
agaaagttaa 
cggtattgaa 
gtgatggtga 



taaatttaaa 
tagactaatg 
ccgcatcaat 
gaagagtgaa 
ggtacaagcc 
atgtgtagtg 
aaatgataat 
aacgttgata 
cgatgggaaa 
ctatatcgcc 
aaacagtaaa 
tctttcgggt 
tatgggacaa 
agcagatgat 



tggagaagct 
tcttttattt 
cttcatgtga 
tacactttct 
aataacgaac 
gagaatagaa 
actgcgaaaa 
ggtgtaaatg 
tattcgttga 
cagactgtaa 
gcgctggatg 
tcggtagcaa 
gcgctgcaag 
tctccttctt 



ttttaagctt 
tgtttctatt 
aacaagttcc 
tctacaatga 
gtattgatgt 
agattatttt 
cgaaagagat 
taacggtatt 
aggttccggc 
aggtaggtga 
aagtcgtggt 
ctgtttctac 
gtgctgttgc 
ataatattcg 



tttcttaaca 
tgtttttcag 
cctaaagcaa 
tgccgaaatt 
gattttatcc 
ggttcccggt 
aaccggtacg 
gggaactact 
aggtaagtca 
taaatcagtg 
agttggctat 
caaagcgatc 
caacctcaat 
tggtaccacc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 



384 



tcattgaatg gaggttctcc gttggttgtc atcgacggtg tggtctctac cagtgatcaa 9 00 

ctgaatcgta tgaatcctgt tgatatagca aatatttctg tattgaagga tgctgcgtca 960 

tctgctatat atggttcacg tgctgcgttt ggtgtcatcc tggtgacaac taaggatggt 1020 

agcaatgaaa aacttaccgt caattataac aacaattttg tattacgtac caatacccgc 1080 

atgccggaaa ttataacaga tccttatctg gtggcaacca ctcgaaatac gatggcatat 1140 

ccatggtata atctttataa cgaggagcaa ctggcctatg cgaagaaatg ttctgaggat 12 00 

ccttctactt ctccttattt cttgaatccg gatggatttt atacttactt tggtcgaaca 12 60 

aactgggttg acgaggctta caacgatgta ggtttttcaa ctatccacaa cattgatatt 1320 

tccggaaaaa cagatcgtat ttcctattac ttttcgggag gatacaatcg gcagaacggt 13 80 

atgtttaagt atggtaatga catttataac cgatataacc tgcgtaccaa attacagttt 1440 

aaactgacag actggtggag cttgaacagt aatgtcagcc tgacgacttc cgattatgat 1500 

tatgcgaatg ccatgaccaa cacttataaa cagatgtatc gtaagaatcc gatggatatg 1560 

gttaagaatc ctgatggaac ttggacagat gccagtgtcg gtacattggg agcattggcc 162 0 

gaaggtggtc gtgctaccga ctggaaaaca aatacaaata tcaacttgtc gactaagata 1680 

gatgtgatca aagatgtctt ttttgtacaa ggaacatttg ccttttcaaa tacaaaaacc 1740 

agaagtaatt ggtataattt gcctgtgact taccgtaacg gaccggaatt acctgttttg 1800 

acatttaatc cgatttcgac cgtatccgat gcttcaagca gtaactccga tacgaaacat 1860 

attctatttg atgtatatgg taccttccaa aaaacatttg cgaagaagca tgctgtcact 1920 

gctgttgtgg gtttcaatca ggaagagtat aaatatgatt acgtaaaagc aaatcgtaaa 1980 

gaactgattt caagttcact gcctactatt aatctggcta caggtgatat gaatatgtcc 2 040 

cagagtataa cgacctgggc tctgagaggt gcttttgcgc gtttgggata tatttataac 2100 

gacaaatata tttttgaatt caacggacgc ctatga 213 6 

<210> 949 
<211> 1536 
<212> DNA 
<213> B.fragilis 

<400> 949 

ttatttatga agaaaaattt attgtattta ttcgcactga tctgttcggt gagtttattg 60 

gttgcatgta acgatgatga tccagaatat attcaggatg gtgaatttga tggtgtctat 12 0 

ttaggtacct tggatgtaga tgctgcagga gttataaaag ttgatgatat tcctcaaaaa 180 

gtttacataa caaaaacagg cgagaatcag tttaagatgg aactgaagaa ctttagtttt 240 

caaacaatgg agttaggaaa tatctcagtt gataacatcg cagttattaa aaagggtaat 3 00 

agttgtactt ttagtggtaa agcgaattta actttagcag ttggagcatg cgatgttact 3 60 

gtatcgggta ctattgagga taataaattg gatatggaca ttgcagtggt tgctgctggt 420 

acattaaatg ttgcagttga ttttgaggga actaaattag ctgcagataa aagttcagaa 480 

gctaagatct taacttttac ttttgcaaat gaatttgtta cttctcaacc tgttattgat 540 

tctgaaaata aaacaataac ttttgttgta tcagatcaga tgcctgaaga gcaattgaaa 600 

gcgttaattc cagaatttac tatctcggaa ggagcttctg ttgacaagaa gagtggtgta 660 

gctcaagatt tctcgcagcc tgtaacatat actgtaacat ctgaggatgg tattgttaaa 72 0 

atggtttata ctgtttctgt ttcaggaaaa gaaaaatatt taagctttaa tgaatgggaa 780 

acaattaaat cttccacgag tggttctttg gaacaatatc agaacccgaa aggtacttat 840 

ggtacaagta atccgggggt gatgactatt aatgaaatgt ttgggcaagt tggtattcct 90 0 

tcttttgagt attgtgttgc tcctgttgat ggcagggtgg gaaaagctgc tcaattaaaa 960 

acattgcata ctgcgattgt cgctaatggg atagattata atgcagcttt tggaggccta 102 0 

atcccttata ttactgctgg ttctttattt actggtacgt ttaaaacaga tatgtttaat 1080 

ccgttgaata gtacaaaatt tggggtacca ttcgttggag aacctgtaac atttacagga 1140 

tggtataaat atgctccggg tgagatttat tatgataata ctaataaaat tgtagaggga 12 00 

cagactgata aatgttctat ttatgcggtt ttatatgaag aatctttgga tagcaaaggt 12 60 

aataatattc cattgactgg agattataaa aataaagaag tatatatcgg gtcttcaagc 13 2 0 

cgagttgtga tgaaagctga attgtctgat gggtcggcaa aagctgaatg gacgcaattc 13 8 0 

tctgttcctt ttaaacctgt tggagataat aaatatgatg caaataaaaa gtattatgtt 1440 

gctgtgatat gctcatctag cttcgaagga gattactata aaggtgctcc gggaagtact 150 0 

ttaattgtag atgatttttc tatcctttca aaataa 153 6 

<210> 950 
<211> 804 
<212> DNA 



385 



<213> B.fragilis 



<400> 950 

ttattgaaga aacaaatgaa aaaatataaa ttctttatag ctatattgac tttctcaatt 60 

ctacacagca tctcagtaaa ggcacaagaa gagagaaata aaggtatcat atggtcttct 120 

cttagaggat tagaatatga agtaaaagca ggatttagta ttggcggcac ttccccatta 180 

ccattgccca aagaaatacg ttctatagat agctacaatc ctaatatggc catagccatt 240 

gaagggaatg caaccaagtg gtttggttct gataaaaaat ggggaatgct attagggctc 3 00 

cgtttggaaa ataaaagcat gacaactaaa gctacagtga aaaactataa tatggaaatc 3 60 

atcggggatg gaggggaaaa agttagtggt gtatggactg gaggtgtgaa aacgaaagtc 42 0 

aaaaattcct accttacaat acccattctt gcaaagtata aattaactaa gcgatggaat 480 

ctaacagtag gtccttattt ttcatatatg cttgaaggag atttttctgg taatgtatat 540 

gaaggttatc tacgtaaaac agatccaacg ggacctaaag tggaattcac agatggtaaa 600 

gtcgcaactt acgatttctc caatgacctt cgtaaatttc aatggggtat gcaactagga 660 

ggagaatgga aagcttttaa acacttaaat gtatatgcag atctctcatg gggattaaat 72 0 

gacatcttta aaaaagactt caaaacaatt acatttgcta tgtatccaat ctatcttaat 780 

ttaggatttg gatatgcatt ttaa 804 



<210> 951 
<211> 1248 
<212> DNA 
<213> B. fragilis 



<220> 

<221> unsure 

<222> (8) , (16) , (29) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 951 

aaagtctntg 

agtttaggaa 

atcagtcaga 

ggtaacttaa 

gataaccgtt 

cccggagtta 

ttaaagacgg 

aaaccgttct 

tacgaaaatc 

tggggtgtag 

cagtcatggt 

aaagatgaaa 

gattataaaa 

cagtggaatg 

ccgggtacgg 

aaaggtaata 

aaagcttatg 

cagaatgcag 

ttgttgaata 

gaattttcgg 

tatcctcttc 



agcctntgac 
accaggatgt 
tactggataa 
cctgggaaaa 
taagcataac 
cgcttccaag 
aaggatggga 
attatgatgt 
ccaagggatt 
aaactttagg 
gtacttctta 
acaaggatgg 
ttataggtaa 
gattcgatct 
gtgacctcta 
tgtatgacca 
tggcggaaaa 
cttatatgcg 
agataggaat 
gtttatacaa 
agcgttctta 



cgagtggtnc 
ggatgcttat 
gcagcaaccg 
ggtaactact 
aggtgaggta 
tgttttggga 
attgactgtg 
aaacttcaat 
gctgggcgat 
gttctttact 
tccgggtacc 
taaaattact 
tagccgtgct 
cagcttgttt 
tttctggggt 
ttggacggag 
tacggataga 
tctgaaaaac 
tgagcgcttg 
gcattacaaa 
ttcattcggt 



agttttctga 
gcatatcttg 
gtatatgtgg 
accaatcttg 
tatgtacgtc 
accgacgtgc 
ggttggaaag 
ctggcggata 
tattatgtag 
tccgaagaag 
cgtcctttgg 
gacggagcat 
cgttatacat 
gcccagggag 
atctatgcac 
gaaaatccgg 
gaatgtggag 
ctcacagtag 
cgcattttct 
gtagatccgg 
ttaaatgtta 



agttgagagg 
ccacgatggg 
gagctcccgg 
ccttggatgc 
gtacaaaaga 
ccaaacagaa 
atcagttcaa 
gccgtgctta 
gaaaagagat 
acattaagaa 
caccgggtga 
ggactttgga 
ttggtttatc 
taggtaagaa 
agccttggac 
atgcgtactt 
tggtacagac 
gttatacact 
tctccggtga 
aaagcttagg 
cattctaa 



ttcctatggt 
atcaggaaaa 
attggttgcc 
caatttcttt 
tatgttaacc 
tgcagccgat 
gttggccgga 
tattacaaag 
tggtgagata 
tcatgccgat 
tctgaagttt 
agaccatggc 
ggccaatgca 
ggattattat 
caatatcacc 
cccccgtatg 
cagatattta 
tcctaaggta 
taatttgtgt 
tgacattgtc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1248 



<210> 952 
<211> 606 
<212> DNA 
<213> B.fragilis 



<400> 952 

cattataaag aacagatgga aacagcagaa aacaagtaca tcactgtagc ttacaaactg 60 
tatacaacag aagatggtaa aagagactta gtagaagaaa cagcagccga acatcctttc 12 0 



386 



caattcattt caggtttggg cactacgctc gaagcttttg aatcacagat agtaaacctt 180 

cataaaggag acaaatttga atttactatt ccttttgccg aagcttatgg tgaatatgac 240 

gaagaacatg taatcgatct tcccaaaaac atctttgaga ttgacggaaa attcgataac 3 00 

gaacatatct atccgggaaa catcattcct ttgatgaact cagaaggcca gcgcctgaat 3 60 

ggtagtgtag ttgaagtaaa agccgatacg gtagtgatgg atatgaacca tccgttggcc 42 0 

ggtgaagatt tgactttcgt gggcgaggtt accgagagcc gtccggctac aaacgaagaa 480 

attcaggaaa tgattaaaat gatgaccggc gaaggcggat gcagttgcgg aagctgtggc 540 

gacggttgcg gtgacgactg cggagacagt tgtggagaca gctgcggttg tggacattgc 60 0 

cattaa 606 

<210> 953 
<211> 1383 
<212> DNA 
<213> B.fragilis 

<400> 953 

aaccattcac tcatccggat aggatattct ttggttaaat ggatggttta taagagtata 60 

tttcctactt ttgtgcaaaa tataaacctt ggacagatga ctgaatcaga aagaaaacag 12 0 

ataatcgctt taatacagcg ggaggtgatt ccggctatcg gatgtacaga gccgattgca 180 

gttgcattgt gtgtagcaaa agctacagag actttggggg ccaaaccgga gaaaataaag 240 

gtattgttga gtgctaatat cttgaaaaat gcaatgggag taggaatccc cggtacggga 3 00 

atgatcgggc tgcccatcgc tatggctttg ggtgcgttga tcgggaagtc cgattatcag 3 60 

ctcgaggtgc tgaaagacag tactccggaa gctgtagaag agggaaagaa actgattgat 42 0 

gaaaagcgta tctgtatttc gttgaaggaa gacattacgg agaaacttta tatagaagtg 480 

acgtgcgaag ccggtggtga acaggcgacg gctatcattt ccggtgggca taccaccttt 54 0 

gtttacgtgg caaagggaga tgaagtactg ttgaataaac agcagacttc cggggaggaa 600 

gaggaagaag agactctgga acttacttta cggaaggtgt atgattttgc gttgactgct 66 0 

ccgttggatg aaatccgctt tattcttgag acggcacggc ttaataaaaa agcggcagaa 72 0 

cagtcttttc agggtgatta cggacatgcg ttgggtaaga tgcttcgggg cacttatgaa 780 

cataaaatta tgggggatag cgttttctca catattcttt cgtatacgtc ggcagcatgt 84 0 

gatgcccgta tggcgggagc catgattcct gttatgagta attcgggcag tggtaaccag 900 

gggatttctg cgactttgcc cgtagtggta tatgccgaag aaaacggaaa gtcggaagaa 960 

gaattaattc gtgctttgat gatgagtcat ttgactgtga tttatattaa gcagagtctg 102 0 

ggacgcctat ccgccctctg tggctgtgta gtggcggcaa ccgggtcgag ttgtggcatt 1080 

acttggttga tgggaggctc ttataagcag gtggcttttg ccgttcaaaa catgatagcc 1140 

aatctgaccg gaatgatttg tgacggagct aaacccagtt gtgctctgaa ggtgacgaca 12 0 0 

ggagtgtcga ctgctgtgtt gtcggcggta atggcaatgg agaatcgttg tgttacttcg 12 60 

gtcgagggaa ttattgacga ggatgtcgat caaagcatcc gaaatctgac gcggattgga 132 0 

tcacagggta tgaatgaaac agacagggtg gtgctcgaca tcatgacaca taaagggtgc 13 80 

taa 1383 

<210> 954 
<211> 1065 
<212> DNA 
<213> B.fragilis 

<400> 954 

atacgggggc ttatcgcggt aagcgggctt tgggagcacc tgtattctca agcgtggcag 60 

ttagtggtaa ctgtctgttt gttgccgatt ttggcggtaa tatttataat tttaagttac 12 0 

acaattaaat gtatatacaa aatgaaaaag atatctatac tatttatatt ttccttgatt 18 0 

cttggtttat ttgtcagtga agtaagcgcg gccggcccac gtttgaagca acgtcccaag 240 

catgtggtat tggttgcttt tgacggattg agtgctgttg ctatccgtaa tcatcccatg 3 00 

cccaatttca atcggttgat gaaagaagga gcttctacat tgaataaccg ttctatcctc 3 60 

ccttcatcga gtgctcctaa ctgggcctcc atgtttaccg gagtaggacc ggaacttcac 42 0 

ggttacacta cttggggaag caaaacaccg gaaattcctc cttttatcac caaccaatat 480 

ggccgttttc cgggactgta cggattgttg cgcgatacac atcctaaagc ggaactcggt 540 

tatatttacg aatgggatgg catgaagtat ttagtcgatt cgcttgccat caatcatttc 600 

gtacatgctc cacagacaaa agatcatccc aagggagcga cacaattcgc cgtcaattat 660 

ctgaaagaga aaaagccgat gtattgtgct gttatatttg aatatcctga tcataccgga 72 0 
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catacctata 
ttaggtgaaa 
ttgaccgcag 
gagactccac 
acgatggtta 
gcttggttgg 



aatgggaatc 
tagtagcagc 
accatggggg 
ttgtctttta 
ttgacgtacc 
gcaatccggt 



taaagagtat 
tattgaagaa 
tatcggtacc 
tggaaagggg 
tgccacagaa 
aacaactgca 



tatgaaaagt 
gccgatatga 
aatcatggag 
gtgaaaaaga 
gcctggttgt 
ttttttacta 



tagatgaatt 
tggacgaaac 
gaaaaacgct 
attataagat 
tgggtgtaga 
aataa 



ggatgggtat 
agtgattatt 
caatgaaatg 
tactgaaagc 
gcctcacgaa 



780 

840 

900 

960 

1020 

1065 



<210> 955 
<211> 192 
<212> DNA 
<213> B.fragilis 



<400> 955 

ttagtaattt ttaattttca ttctaaaacg gagcaaagta acagctttat ttccatccaa 60 

acaaggtcta tcttacgtat tttatgttct aaatttaaga ttaagttggt tatcgagcga 12 0 

aatcgtatta aaaaaaaatt aatctatgat tcgaatcgag caaatataat atatctccat 180 

catcatcttt aa 192 



<210> 956 
<211> 1680 
<212> DNA 
<213> B.fragilis 



<400> 956 

agcggctcct tgcaatcgag cggcgaatac tcaaacatca ttataacgag tgcttatgcc 60 

gatctgaccg gtttggacaa tccctggccg gctgtttctg atgaattgaa atctgcacag 12 0 

aacaatgtga agacgattcc gacaatcggt tatcatgccg gatcggcaac tttgtcacgc 18 0 

tggagtcttt ataagcagat acgacaggcg aatgagttca ttgcctatgc ccacgttatt 240 

ccgcagaatg gcgatgtggc tgactttatt gatgaaaaag aactggctct tttgaaaaat 3 00 

gaagcacgtt tcttacgtgc ctattatcat tacctgttgt ttgagttata tggcccaatt 3 60 

cctattatga ccgaaattgc tgatccctcg gccgctgatt tggactatta cagaaatttg 42 0 

gtagatgaag tagtcgcttt tatcgataaa gaacttaatg aatgctatga cctacttccg 480 

gagaaagaac tgaatccgga tggcaccatc aataatgagc gtgcggcagc gccaaccaaa 540 

ggggcggcat tggctatctt ggctaaattg catgtatatg cggcaagtcc gttgttcaat 600 

ggtggttatc ccgaagctat tgctttgaag gataatcaag gcaagcaact tttcccggca 660 

aaagatgaca cgaaatggaa gactgcattg gatgctttac aacgttttat cgattattca 72 0 

aagggacgct actctttata ccaagtaatg aaaaatggtg aaatcgatcc ggctgagtca 7 80 

ctttatcagt tgtttcaggt aagcgttaac aattccgaag ctgtttggca aagtagtaag 840 

aactcctggg gaggcgtaaa tggtgagggt cgtgagcgta gatgtacacc gcgtgcaatt 900 

tttagcggat tcagttgtgt cggagtcctc caggaagcca tcgatgactt tttgatgagt 960 

gatggcaaga gcattgaaga atcgggtttg tataaagaag agggcattgg tgaagacggt 102 0 

ataccgaata tgtataagaa ccgtgaacct cgtttttacc aggatataac ttattccggc 108 0 

aaagtatggc aaaaaacaga taagaaaatt tatttttata aaggaatgcc tgacgataat 1140 

tctaaagcag atatgagtta ttcgggatac ttactttata aaggtatgaa ccgtgacttg 12 0 0 

ttgaatcagg gaaacaatcc gaaatccaaa tatcgcgcag gtatgttgtt ccgtttggcc 12 60 

gatttctatt tgttatatgc agaagctttg aatcatgtaa atccgggtga tgcacgcatc 1320 

attcagtatg tggacagtgt tcgttataga gccggtattc ctttgctgaa agatattaag 13 8 0 

ccgggaatta tcggtaaccg ggagttgcag gaaaaagcga tccgtcacga gcgtcgtatc 1440 

gaattgtttg ccgaaggaca acgctatttt gatgtgcgcc gttggatgtg tgctgaagag 15 0 0 

gagggttata aacaaggtgg tccggttcat ggtatggata tgaatgctac cgatcttgaa 15 60 

ggtttcatga aacgtactgc ttttgaaact cgtatttttg aaaaacgtat gtatctgtat 162 0 

cccattccgt tggcagagat acaaaagtca aaaaaactgg tacagaatcc cggatggtaa 1680 



<210> 957 
<211> 1137 
<212> DNA 
<213> B.fragilis 



<400> 957 
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aaaagagtaa tgaataaagg atttatacta aaagccctgt ttctgttatg ctttcttttc 60 

tcacaaacag cccaaggaca atctttcact cccggagaga tatggcccga taatcaccag 12 0 

gtacatatca atgcgcatgg cggaggcatt ttatatgaaa acggaaccta ttattggttt 180 

ggcgaacaca aaacagaagg tgaagccgga aatttagcca atgtaggggt gcattgctac 240 

tcgtccgatg acttatatca ttggaaagac tgtggtattg cactatcagt gatagaaaat 3 00 

gatcccggac atcccatttc taaagggtgt attctcgaac gtcctaaagt tatatacaac 3 60 

cctctcacta agaaatatgt catgtggttt catcttgaac ccaaaggtgc aggatattcg 420 

ggagcactaa gcggaattgc ccttagcgac cgggttacag gcccttacac ctttctaaaa 480 

gctgtccgtc ccaatgccgg ttcatggccc atcaacgtac tgcccattca taaaaccacc 540 

cgcagacctt ctgcagaaga agaacgtcaa tgcaccggag gtagtcttcc tgcccatccg 600 

gacagcctca acatattggg ccgcgacatg gagcaggggc aaatggcgcg tgatatgaat 660 

ctgtttgtcg atgatgacgg taaagcttac catatctact cctccgaaga gaatagtacg 72 0 

ttgcacattg ccgaactgga tccgacctat acaggctata caggcaaata tatccgggcc 780 

tttattaacc ggttcatgga agctccggcc atgttcaaga aagatggtaa ctattacctt 840 

attatgtccg gctgtagcgg atggaatccc aatgccgcac gctctgccat agcttcttcc 900 

atttggggag aatggaaaga gttaggaaat ccttgtatag gtcaggatgc agaccttact 960 

tttcattctc aaagcactta tatcctgccg gtacaaggta agaaaaatca gtttatatac 102 0 

atgggtgacc gctggactcc acaaaacgct attgacgggc gctacatctg gcttcccatt 10 8 0 

cattttgaag gtcccaaacc gattatcgaa tggaaagatt catggacttt agactaa 1137 

<210> 958 
<211> 1359 
<212> DNA 
<213> B.fragilis 

<400> 958 

aaacaaacaa taattcttaa attgaagctt atgttaaaaa atcattgttg tatggggaca 60 

ggatgcagaa gagccatcct tttccttctt ttttttccgc tgtttggtac cgggtttgcc 12 0 

caaacggaaa cgccggttaa ggtcgataaa cctcttaaat cgtattcaca ttggtatttt 180 

ggagcagaat atggtgtccc tttcctgttt ggcgatttta cttcattctc ggcggataaa 240 

aactatgtcg gatctcagtt tggagggttt gccggttatc aggttaattc gtggataggc 3 00 

atagaggcat ctgcgcgaac cggatatacc agaatggggg cgaaatctta cgcgggtgat 3 60 

taccttatga atgccgacgg gatgacttat tatacgaatc aggattttaa tacctggaag 42 0 

tacaaggatg tgttttcgaa agtacacttt acgaatattg gtttgcagat gaatctgaat 480 

gtgaataatt tctttggccc taaccgggga aatcgtcgct ggacggtatt actgagtcct 54 0 

gccgtctatg cacaacattt ctctacagag ttaataaata aggcagataa aagcccatta 600 

tcaggaaaaa agacggataa gtggaacatc gggatagggg gagatgtgtc tttgcgttat 660 

aagatttccc gggcttttga tgtacagttg cgtacgggaa ttatatgggt caataacaat 72 0 

aagatggatg gtatatctac gctgattaag tcgaaagacc actttatgac cagtgccgga 780 

ctctctttga tatggaaggt gggtaaaaag aaagaagaca atgttctcta tgcctcaaga 84 0 

cgagcggcgg atgtggagat cagatatata gaggagcgtg cggtcagctt gcctacccct 900 

gcttgttgcg ttgaagattc gatagagaag gagcggatga aacgggagat tgcttcttta 960 

aatatgcagt tgcaacaggc gcacacggta gtgaaggaga agaccggttc tgatccgata 102 0 

ctgggattca acgaattgcc tccggtctat tttaagagag gatcggctta tctgaatgta 1080 

gccttgtaca aaaatgaatt atgccgcatc gtgcaaaccc tgaaaaagta tcctgagctg 1140 

aaagttattc tttcagggca tgccgaccat accgggaatc cggatattaa tcaaaaaatc 12 00 

tctttacagc gtgccgaagc actggcagcc tatcttgaaa agaaggggat agatggcaaa 12 60 

cgtatcgccg tgaaaggaga gtgcatagat atgcttactt ccgatccgaa taattacagc 132 0 

gtacttgcca ggcgagttat tgttgaaatc caaaaatga 13 5 9 

<210> 959 
<211> 582 
<212> DNA 
<213> B.fragilis 

<400> 959 

tactttaaaa tgaaggaaaa cacaatgtat tcgaatcttg aatcggtaga acaattattt 60 

cggcaatatt ataaggtgtt gcgggtatat gcgttccgtt ttgtgaatga ttgggatatt 12 0 

gcagaagacg tagtgcagga tgtttttgtt gctttatgga ataaacgtac agatattgaa 180 
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tttgatggcg 
ctttccagta 
gcactgcaaa 
gaaattgaaa 
agaagctatg 
gtagaaaagt 
ttaatgagtc 



cggtaaaagc 
agaaatatac 
ttctggagaa 
cttttatcga 
gtctgaaaat 
atttgacccg 
ttttattctt 



ctatcttttt 
cgaagaagaa 
taatcaggaa 
aacacttcct 
aaaggaaatc 
tgctttgttg 
gctttatctc 



aaggctgtct 
tccgtagaac 
aactcgttgt 
acgcaggtaa 
tccgttcagc 
gaactacgta 
tgcttgaaat 



ataataaatc 
aattttccga 
tcatgaaaga 
aaaaggtatt 
tcgatctttc 
ctcatcttaa 
aa 



gcttaatatc 
tcaaatcgaa 
actccagagt 
tatattaagt 
tccaaaaacc 
aaacaaggat 



240 
300 
360 
420 
480 
540 
582 



<210x 960 
<211> 1131 
<212> DNA 
<213> B.fragilis 



<400> 960 

aatccaaaaa 

ttgctgcatg 

aggaaagaat 

cttcaggtga 

gcggatggtg 

gggaagaaag 

gacggtaagg 

acggacggaa 

ggtacggacg 

aacgggaata 

gatcccgctg 

ctggccgaca 

cctcaactgg 

acgcaaaaga 

gccaatgcaa 

gtggaattta 

attgaggttc 

gctttcagtt 

agagcgacag 



tgaaaactat 
gctgtaaaga 
tgcaagagtt 
ttgtcaatac 
cggggtatac 
gagataccgg 
atggcgtaga 
aagatggtga 
gtaaggatgg 
agggggagca 
attcggaata 
atgatggaaa 
gagtgaagca 
ttggaaccgg 
aaaatgctgt 
ctttaagcgg 
cggagggacg 
gtaagggtat 
tcgattctga 



gattagaaag 
gtccgatttc 
gacagatcta 
cgacaggata 
tatctccttc 
tccggatgga 
tggtacggat 
gaacggtacc 
cgtggatggt 
aggtgacccc 
ttactggact 
cagaataaag 
atgggaaacg 
tcccgagaca 
gtccgtgttt 
tggagcaacc 
taaactgttt 
ctcaaaagag 
agcggtagct 



aaaatatatc 
aacgatttac 
tgtaaaaaac 
ggagataaca 
tccaaaagtg 
gatgccggga 
ggtaaagacg 
gatggtaaag 
accgatggca 
ggacagaatc 
ataaaaatag 
gcaacctcta 
tccgccgggg 
tggattgaag 
gagaaggtgg 
cggttcagat 
tttttcaacc 
caattgtcgg 
agtcttcacc 



tgttgttgtt 
ttgacagaca 
tgaatgagga 
ttactcatat 
ctccgattac 
aagacggcat 
gtgtggacgg 
atggtgcaga 
aagatggtgt 
cggtggttgg 
gttccggtga 
cggtacacga 
gcgacgataa 
cggatggaaa 
acttgaagga 
tacctatagg 
ggggcgggtc 
tagatgttcc 
gacggggctg 



attgatacct 
gggtgatcag 
tatttataat 
cgaggaactg 
tatccgtaac 
agacggtacg 
taccgacggt 
tggtacgaat 
cgacggtacc 
cattatgcag 
accttattat 
tggacaaacc 
ctattactgg 
gaaaatagtt 
gccggattat 
aagaccggtt 
tcaggcgatt 
gaaggggtgg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1131 



<210> 961 
<211> 1137 
<212> DNA 
<213> B.fragilis 



<400> 961 

actaaaagga 

ttctttctgg 

gcaattgacg 

aagagaataa 

gtattaccac 

ccacaaaagt 

ttcgtaaaat 

aacaaatatg 

gtacaatggg 

gattatccaa 

aaaaacactg 

attgggaaat 

ttgccttttt 

atattcacag 

attttctacg 

cccaaattga 

tttaatttgc 

ggcaaatata 

gcagttggaa 



ttattaatca 
gctttataat 
catgcaaagg 
gtatatatgt 
aaggagcaac 
atactgttac 
cagaactccc 
atatatttta 
ctagtggcaa 
cagtacaagt 
gaagttttgg 
ttgatgtatc 
ataaagttcc 
aaaatggtaa 
aaacagataa 
tatctgttgc 
ctttcataat 
aactaagtat 
gcacccttta 



aaaacagaat 
ggtttcttgc 
atccgatgat 
atacaaaact 
aataaatccc 
atcagaagat 
tacaaattat 
cgaatttcag 
cattggatac 
aaatgatgga 
ggcaggagta 
taacgcactt 
tcaatcctta 
acctgttgaa 
taaccatgaa 
acgtattgaa 
gaggccagga 
cgtattctcc 
tatcgatgaa 



cgaatgaaat 
atacaagacg 
gtcgtcttga 
gctgacatca 
cctagtggaa 
gggaaatggt 
catttcgaaa 
gagggtactt 
gatttgacag 
aaaattggga 
aatatgccaa 
gctgatgcac 
aaaggatatt 
ggcaaaaaag 
atgttagatg 
gatcctaaag 
aaaactatag 
tctagcattg 
gtagaactta 



taaagaatct 
aagcacctaa 
cagatattaa 
caaaacaaaa 
tagaaaatga 
ctgctgtcta 
ctctcacaga 
ccacagaacc 
gaatggctaa 
aatgtctgaa 
tagctgctgg 
taaaagctac 
ataaattcaa 
accaatgtga 
gctataatgc 
aaacagatca 
atcaggctaa 
aaggagatca 
ttttcaaaga 



gatagcctgt 
tgcagaagct 
tgcagaagca 
attaaatttt 
ttttaccacc 
tactgtagaa 
atcatctgca 
atctaaaata 
agaatccaat 
attagagaca 
taaccttttt 
tcaatttggt 
agcaggagat 
catttatgct 
actgacctct 
atggaccgaa 
gcttaaagct 
ttttcgtggt 
aaactaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1137 
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<210> 962 
<211> 1059 
<212> DNA 
<213> B.fragilis 

<400> 962 

atttcacttg tattactaat agaggagcta aaaatgaaga atgaaattaa agacataaat 6 0 

gaagttatca tacgttttct ggatggtacg gctaccggtg aagagaaagt ttttctgttc 12 0 

aactggctga aacaatcaga aaagaaccgg aatgaatttt ccgaagtccg tgatttatgg 18 0 

cttttaggca acacgatagc taccgacgat ctggaaacag agatagcgct agagcgattt 240 

aaaaatcgga tacagtcaac agaatccggt ttacgtaaaa acagattcgt tttccggaaa 3 00 

cacttcgttc cgttcttgcg tgtggcagct gtctttttga tgttatttac tgtatggtct 360 

gtcttttatt attggggtag cagttcggtc ccgaaacagc cggatgtcat gaatcgtttg 42 0 

ttgactgcca atggaagtaa gggacgattt gttttgccgg atagtacggt tgtatggctt 480 

aattccaata gtttattgga gtatcctgaa acgtttagtt cgtcagcccg tgaagtcagt 540 

ttatccggtg aagcctattt tgaggtacgg aggaatgaga agcttccttt ccgtgtgcaa 60 0 

gccggagaga tgaaagtaga ggtattgggg actcgtttta ttgttgacaa ctatcgacgg 66 0 

aaatccgggg ttgaagcagt attggtagaa ggtagtgtga agattgccgg ttgtaagatg 72 0 

aatcattcgg tagtgttgac tcccgggcag ttgattaatt atgataagaa gagtgaacgt 780 

acgaaagtac aaatggtgaa tacggatgat tatatcagtt ggattcaaaa tgaactgact 840 

tttgataatg ataagttggc tgatattatt attaatttaa ataagtggta tggggtggat 900 

attgaatgtc cgtcagagtt tgctgaaaaa gtatttatgt cgttctctgt caggaatgga 9 60 

gagaatctgg atgaaattct gaaagcgatg actttggttg ctccaataag atattactgg 102 0 

gagaatggta tcttacatat tcttcccaga aagcgatag 1059 

<210> 963 
<211> 2475 
<212> DNA 
<213> B.fragilis 

<400> 963 

tatgataaat taataaatat gttgaaatta tttataagca tggctgcttt attcctggga 60 

gtcggtagct ctgtgttcgc acaaatagaa ggaaaagttt atatcgatgc gaacggcaat 12 0 

ggcatttgtg atgcgggaga gagggggcta aaaggagtct gcgtacaaga tggtctcaat 18 0 

gtggtgaaaa ccacagatga tggtcatttc atacttccgg gacataaaga tacacgtttt 240 

gtaactttga ctgttcctga tgggtatcag gcatcaacct cccattacct atcttttgac 3 00 

ggaaccggaa aaaagtatga attgggtatc tgcaagacct cggtaaatac cgggaatgga 3 60 

tattcgtttg tacaaattac agatacggaa acttctctat acggtgattg gatcgataac 42 0 

ttgaaagagt acgtgaaaac caatccgact gcttttatta tccataccgg tgatatttgt 480 

tatgaagctc atcaggattt tcatggacat tatcttcgtt ccgtagattt gggaattccg 540 

acctactatt gtgtggggaa tcatgatctg cgtgccggaa aatacggtga agagttgtgg 600 

caaagtcatt ttggtccttc atggtattcg tttgatgtcg gtaatgtaca ttatgtagta 660 

actccgatgc tgggtggtga tcatgcacct tcgtacaggc gttccgacat catccgttgg 72 0 

ctgaagaatg atcttgcaca aacggataaa gggaaaagaa ttgttttatt taatcacgac 7 80 

ttatggtttt ggggagacga tttgctcttc aaagataaaa atggcgaaca gatagacttt 840 

gctgattaca atctggatgc catgatttac ggacactggc acaatcatta ttataagcag 900 

ttgaagtcag gacttcatac ttactgctca tccactccgg acaaaggagg aatagaccat 9 60 

ggaacttctt gtttcagaat ttacaatgct gataccaaag gtaaattaag ttcagcaact 102 0 

cgttatactt atatagacgg aatattgact tctgcctatc cggcggaagg tgaaactgtt 10 80 

tcagttcctg acggaaaaat gacggtccgg atcaatgctt accgtactat atcggatgcg 1140 

aagaaggtga cggcttctgt tgaacgaaac ggaaggcttg tatcgactgt gacgcttatg 12 00 

cctgaaacag actggggatg gagcggggca gtccgggtat ccggcggtaa gcaacgcctg 12 60 

ttggtgactg ccgagtttga agatggaact cgtttgacga agagagtaga ctatactgta 13 2 0 

actaagcagc cgttatcggt cattgcgaca tctgatgtct gggcagggct tcgtggaaat 13 80 

gccgcacaca accaactggt gaatgacagt gtatctttac ccttgcaaac caactggatt 1440 

cagaatgtcg gcagcaacat ttacatgtgc tcgccgattg ttgcgcagaa caaagtcttt 1500 

atcggaacca ttgatgatga caaagcgaag aaatgctatg taaaagccta tgatgcgacc 1560 

acaggacatc tttgctggac ctttgtcact tccaattcga taaagaatac cattgcctat 162 0 
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gaagatggcc 
aaaggaacag 
ggattggccg 
cgggctgtcg 
acttctacct 
ttcggacatg 
ttt cgagacg 
ctgtatgtaa 
tttaattcgg 
aaaggtgtgg 
acaagtcttt 
cctgtccttg 
gacctgaata 
gtggcagtta 
aagttacaca 



gtatattcgc 
cttgctggca 
tagccgatgg 
acggaaaaat 
ttaccgtggg 
acatcagtaa 
gatcagctac 
tcaatccccg 
cttgcgctcc 
tcgcttttga 
tctataccgt 
taggttcgac 
cgggggctta 

gtggtaactg 
attaa 



ttcggatgct 
aacgcaattg 
tgtggtatat 
tttatggcag 
tgcgggagtg 
tggtgctttg 
tttctatgat 
ttcgggagat 
gcttgtgacg 
ccgcctcact 
tccgtattca 
tgttctcttt 
tcgcggtaag 
tctgtttgtt 



tcgggaatgc 
ccggtttctt 
gccggacatg 
aataaagcct 
ttagttgcat 
ttgtggaaga 
ggtaatttct 
attctgaaaa 
gataaatatc 
tttaaggaag 
cataatcagg 
ggcgccagtg 
cgggctttgg 
gccgattttg 



tttatgccat 
tgctgccgct 
caaaaggtac 
gggacggagg 
cggcccattg 
agcgtgatag 
atttagcttc 
tggcagaaac 
tgattgtttc 
tctggaatta 
aatgtacggt 
atggatattt 
gagcacctgt 
gcggtaatat 



agatgctgaa 
tctcgacgaa 
ttgtgctgtg 
cgagggaacc 
gaacgggctt 
taaaattcgt 
atgtgagaat 
ttcttatgaa 
tacttccaat 
tcgtaccggg 
cgaggtttcg 
gcatgccgtt 
attctcaagc 
ttataatttt 



1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2475 



<210> 964 
<211> 894 
<212> DNA 
<213> B. fragilis 



<400> 964 

gaaaatggat 

atttgcgcaa 

gatagacgac 

acaatcaccg 

gtagtcgtta 

ctgaccggaa 

tatacgatat 

cgtatgtcga 

ttcaatgccg 

ttgccgatgt 

atgggacctg 

cccgatgaag 

cattctttgc 

aagtggctgg 

gaagcaaaag 



ttaaagaacg 
tcaccgtaat 
agcaacctat 
atgtagatgg 
catccatcgg 
aacgtaagaa 
atggttccga 
agcgttgggc 
aaagatatgg 
tgttcattct 
ttttttcgta 
agtatgatat 
tgcatccttt 
ccggtatttc 
agcacaatct 



aaaacagatg 
ggctcaggtt 
tgtgggagca 
caaattcctc 
gatggagacc 
ggtctcgttt 
ctttaaagtg 
ttttcagcct 
cgtgaaatat 
tcgttgtccg 
tggttttgca 
ttatagcagt 
cagctttggt 
aggaaagagc 
ggtacttacc 



aaaagaataa 
cagacacacg 
ttggtcaccg 
ttgcaggagg 
cgggaagttg 
gtggcacatg 
ggatatgaat 
accttgcaaa 
caggagactt 
atagcccgca 
gggaaagtga 
gagtatgaat 
gtagcctatg 
atgtgtctgg 
ttgggagtca 



taataggaat 
acataaaagg 
ccaaaggaac 
ttcctctttc 
acttgaatgt 
ccggtcttag 
tcggtctggg 
tctgtaatca 
ggaatccggt 
aaatgaatct 
aagctagcga 
acgactattc 
gcataggggt 
ggcaggatga 
ctaaccgtaa 



gattgcctgc 
agtagtattc 
caatatcagt 
ggtaaagaaa 
accggtacag 
tatgagtaaa 
tatcgaggtg 
cggagctgag 
atcactggat 
ggcattttct 
gacaggcaaa 
cggtgggaaa 
ggaatacaaa 
tgaaggcttt 
ctga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

894 



<210> 965 
<211> 258 
<212> DNA 
<213> B. fragilis 

<400> 965 

acggaaaaat cgttctgtag ggagttatgg gtttgtctta tggacggtga ttccgctcgc 60 

tttctgatag aatgtcacaa gataggtgcg attgcttcgc tgaagtattt ccttgaagga 12 0 

aagctgtgtc tttctgttgt tcctttaagt gcttcggaca cttcacaacc ggaacggaaa 18 0 

acgcaagagt caggggagaa aaacaggaca atagacctga tggacaaaaa caataggaga 2 40 

tatttgaaaa agaagtaa 258 

<210> 966 
<211> 1980 
<212> DNA 
<213> B. fragilis 



<400> 966 

aacggaactc cgtatccgga acaacctgaa tgcctggctg cgggatacca cggtgcttat 60 

catcacccaa cgcatataca ccatgcaatc ggccaaccgg gtcattctgc tggacgacgg 12 0 

ggagatagaa tccatcggca ccccggaaga gttgttggaa cggtcggaaa tgtatcggga 18 0 



392 



aatatattac tcacagcaaa tcgttatctg accatggcac acggagatca tctgaaatac 240 
agcggaaagc ctaaggcggg aaagaaaaca tttctacgcc tgatatcgta cgtggcctgt 3 00 
gaccggcggt tactgattgt gatcggtgtg ctgatcgtga tcagcattgc ggccaacctc 3 60 
accggatcgt acatgcttcg cccgattatc aacgactaca tcctgccggg cgactttcag 42 0 
ggactggtgc gcatcctgct gttcctggca gctatctacc tgacaggagt ggcggctact 480 
tacatcgaat atatcctgct gaacaagata ggacaacgca ccgtgacccg gatgcgtgag 54 0 
gaactgttcg gcaagatgga acgtctgcct gtcagatact tcgacaccca tcagcacggt 
gatgtcatga gccggtacac caacgatatt gaccgcatca gtgacgcatt gaccgacagc 
ctgtccgata tgctgtccag tgcactgacg gttatcggta ttttctgcct gatgatcttt 72 0 

780 
840 
900 



600 
660 



atcagcccga cactgacagc ggtaacgctg attactgtcc ccctgatgtt cctcagtgcc 
aaaggcattg tgaaacggag ccggaaatac ttcaaagcgc agcaggaagc actgggaatg 
atgaacggct atgcggaaga aatgatcagc ggacagaaag tggtgaaagt attcggacac 

gaacagaagg tggaaacaga cttcgggata ctgaaccaaa gcctgaagga caaatcgttg 960 

aaagcacaat tctactcggg gctaatgatg cctgtcatgc aaaacctcaa tacgctgaac 102 0 

tatgtgatca tcaccattgt gggggcttta ctggccatct tccgcggatt cgacgtaggc 1080 

ggactggcag ctttcctgca atattcacgg cagttcggcc gcccgatcaa tgaactggca 1140 

agcctttaca acagcataca ggctgccata gccggagccg aacgtatctt cgaaatcata 1200 

gatgaagcgc ccgagaaagc ggatgttccg gaagccgtca cactgaaaaa tataaaagga 12 60 

gacgtagccc tgaagaatgt gtatttcggc taccgtccgg agaaaaccat cttgaaagga 132 0 

gtgtccctgc atgcaccggc aggaaagaaa atagccctgg taggcgccac cggagccgga 13 8 0 

aagacaacga tattaaacct gcttccccgc tttttcgata ttcagtcggg agagatcacc 1440 

atcgacaatc acccgatcga ccggatcgag cgcaacagcc ttcgccgttc aatggctatc 1500 

gtgttacagg acacccatct cttcacaggt acggtacggg agaacatccg cttcggacgc 1560 

ctcagtgcga cggatgacga ggtagtggca gccgcccgcc tgaccgctgc ccattcgttc 1620 

atcaaacgct tgccgcaagg gtacgatact ttgctcgaaa acgacggagc caacctgagc 1680 

caagggcaac ggcagctatt gaacattgcc cgtgccgcag tggccgatcc ggccatcctg 1740 

ttgctggacg aagcaacgag caacatagat acacgcagtg aaatcttgat ccagcgggga 180 0 

ttggacctgt tgatgcaagg acgcaccagc ctgatcatcg cccaccgcct gtctacgatc 1860 

cgcaatgcag ataccatcct ggtactggag cacggagaaa tcatcgagca aggcagtcat 192 0 

caggaattac ttgcattgaa gggaaaatat tattcgctga atgaagagca attcaaataa 1980 

<210> 967 
<211> 195 
<212> DNA 
<213> B.fragilis 

<400> 967 

agaatcagcg ccgcattgag tgtggcaagt accagtccgg ttgccacaaa gggcttccgg 60 

gaccagtgca accatcggtt aaacgggcgt ccgatgatgg aacagatcgt gaaaatcagt 12 0 

ttcggtgaag caaataggaa aaagacaacg gcaaaacgcc ctatccataa cgtatgttgc 180 

tcggcgtatt catag 195 

<210> 968 
<211> 1725 
<212> DNA 
<213> B.fragilis 



<400> 968 

atgaaaaaat actggcaaat actgaagaag tacaaaataa gcctgctggc atgcccgttg 

ctggtactcg tgtcggtgat gtgcgaaacc gttcagccga tgtacatggc ggatattata 12 0 

gacaacggag tgatgcaaag agacctctcc gtcatcactg ccgtgggcgg aaagatgata 18 0 

ctgatctcca ttgtcggact gattttcagc attgccaatg tctacgtatc ttcccatgca 240 

tccattggtt tcggaaccga tctgcgcacc ggccttttcg gcaagataca gcaactctct 300 

ttcttcgaca tcgaccggtt cagtacggct tcgcttatta cccgcctgac cagtgacatc 360 

agccgcatcc agcaagtcat catgatgtcg atgcgcctga tgctgcgctc tccgctgatg 42 0 

cttgtcatgg cggtgttttt cgttgtacgc atcaatctcg aactggcggg tgtcctgctg 480 

gctgccatcc ctatattggg tttcagcgta ttctttattc tccggaaagg tttccccttc 540 

ttcctgaagg ttcagcagaa ggtggatcaa ctgaatgagg tagtacgcga aaacctgatt 600 

aacatccggg tggtaaagtc atttgtacga gaggacttcg aagcacataa gttcaaagac 66 0 



60 



393 



aagagcgaaa gcctgcgtga tacggtgatt catgcttcca acatcattgt ctccatcttt 72 0 

ccggtaatgc aactggtgat gaacctgtct atcatcgcta tcctctggat gggaggccac 780 

aaggtgatga ccggagagct gaaggtaggc gaactgatat cgtttgtcaa ctacctggga 840 

caggtgttga tgtcattaat gatgctttcc atgatcatca tgtcttatgc ccgtgcttct 900 

gcctcgtcga aacgtatttt agaggtactg gacacacaac cttcgctgac cgacacaccc 960 

gaaggcatgc agagcacccg agagattgaa aaaggagaga tcgccttcga gaaggtcagc 102 0 

ttccgttatg gcggcggaga gacggacgta ttacgaaaca tcagtttcca catccgcccg 1080 

ggcgagacag tggccatagc aggtgctacc ggatcggcaa aaagttcact cgtgcaactg 1140 

atcccccgcc tctatgatgt cagcgccgga gaaatacgca ttgacggcat ccctgtacaa 1200 

gactataacc tgcgcgaact ccatgcccgc atcggaatgg tgctgcaaaa gaacgaactc 1260 

tttacgggaa ccatcgccga aaaccttcgc tggggaaaac cggacgccac gcaagaagaa 132 0 

ctcgaagtgg cagcccgtgc cgccgaagcc catgagttca tctgctcgtt gcctgccgga 13 80 

tacgacacac tgctgggacg gggtggaatc aacctttccg gcggacagaa gcaacgcatc 1440 

tgcatcgcca gggccttgct gcgtaaaccc aagatattga tactggacga cagtaccagt 1500 

gccgtagact ctgaaacgga actccgtatc cggaacaacc tgaatgcctg gctgcgggat 1560 

accacggtgc ttatcatcac ccaacgcata tacaccatgc aatcggccaa ccgggtcatt 162 0 

ctgctggacg acggggagat agaatccatc ggcaccccgg aagagttgtt ggaacggtcg 1680 

gaaatgtatc gggaaatata ttactcacag caaatcgtta tctga 1725 

<210> 969 
<211> 1266 
<212> DNA 
<213> B.fragilis 

<400> 969 

ataattgcta actttgtttc aaatttcaat agcatgatac gtatactaca tacagccgat 60 

tggcatttgg gacaaacctt tttcgggtat gaccgcacgc aggaacacga acattttctg 12 0 

gactggctgg ccggtgtcct cactaagaac aagattgatg tactgattgt tgccggagat 180 

gtctttgatg tttccaatcc gtctgctgct tcccagcgga tgttctatcg tttcattcac 240 

agggtgacga ctgagaatcc gcgattgcag ttggttgttg tggccggcaa tcacgattcg 3 00 

gctgcccggc tggaatctcc tctgcctttg ttgcaggaga tgcgtacgga gattaaagga 3 60 

attgtccgta aacagaatgg caaaatagat tatgagcatt tactggtaga attgaagaat 42 0 

gcggcggggg aggtagaagc cctatgcctg gcggtacctt tcttgcgaca gggagactat 480 

ccggtggtag agactgaagg caatccgtat gcggaagggg tgaaggaact gtatgcccgt 540 

ttgttgaaat atgcgttgaa gaagcggact gacggacagg cattggtggc tgtcggacac 600 

ctgctggcaa ccggttcgga gattgccgag aaagatcata gtgagcgcat catcatcggt 660 

ggtctggaga gtgtatcgcc cgagtctttt cccgaacaga ttgtttatac ggctttaggg 72 0 

catatccaca aggctcagcg cgtatcgggc agggagaata tccgttatgc cggcagtccc 7 80 

ttacctatgt cgtttgccga gaagcattat caccacggag tggtaaaagt gaccctggat 840 

gaaggttggg cggttgagat agagaaactt gaatatactc cgttagtgcg tttgctaagt 900 

atccctgcca cagaagctgc ggctccggac gaggtgctgg atgaattgcg cgggctggaa 960 

ctaccggaag atgaaccgat gccctatctg gaagtcaagg tgaaactaag cgaaccggag 102 0 

ccgatgttgc ggcagcaagt ggaagaaata ctggaaggca agccggtccg gctggcccgt 1080 

atcgtttctt tctatcggca ggcggcagag gggagcgtgg aagaagaaac cctgaccgcc 1140 

ggattgcagg agatgaatcc cttacagatt gtgaaagcaa cctttgagaa tagttaccag 12 00 

gcggagatgc cggaagaact ggtaaatttg ttccaggagg cttgccggac catcaattta 12 60 

gaatga 12 6 6 

<210> 970 
<211> 1143 
<212> DNA 
<213> B.fragilis 

<400> 970 

acaaagatga acaagagaaa attactaggc ttgctctgtc tgatgacatt gctggctacc 60 

tcctgtgata ataaaggaga ttattggggg gctatggaat cttctaaagc aacattaacg 12 0 

ttggagcgga tttgtgatat ggctacgctt tcacaagatt ccgtggaatt gctgtccaat 180 

attctgggga tgaatacaga agaactgtat cggacagacg tggtcatgat agggaaagtg 240 

acaagtgaag aaaccggatt ctaccagtat cccagatttc tgatagctaa agatcgagag 3 00 
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atgaaagaag tgcttacgga agcttatgta catcgtgata cagaagggac tttttatgct 360 



420 



600 
660 



ttcctggaat cgaatatgct ccccgtgggc gaaacttatt attgcgccat ggttgattat 

aattatggct ataacggacg tccgggattg ctggaccatg tattgggcgg aaatacgcgt 480 

ggcgaacgtt acagtgaagt gaagccattc cgcctttcag ggttgccaag attggttgtg 540 

cacgatgcac attttacagg atacagtttc tatttatctg cggaggtccg tttcaaaagc 

aatggcggaa ttatagaaca gggcgcttgt tacagttcta ctaaaagaat ccctacggtt 

gatgatcaga agacattggc acgggaaacc cggaattatg actattcgtt tttagaagta 720 

gaggtgaccg atctgcttcc caatacacat tattatatac gtccctatgt gactactgag 780 

gaagggacag gctacggacc ggtagttgag tttacaaccg aaccgggtac ggaaccgatt 840 

attgattatt tcacgatgta tatagatacc gataggtcgg taaacctgta tgccactttc 900 

tatatagata attatcagat tacgcactac ggatacagct atggcattta ttctccagaa 960 

acgggaacgg tgacggatga acagaagata gaggttcctc ttgaggataa tcacggacaa 102 0 

cagcttagta aagtcattac cggattgcgt ccgggaatac tttatgcttt ccgtgtttat 1080 

gcggaaaatg gagtaggggt tacttacagt ggttaccaga ccgttaagat tcccgtagaa 1140 



taa H43 

<210> 971 
<211> 2991 
<212> DNA 
<213> B . fragilis 

<400> 971 

tctgtggtga aagcaaaccg aataaccatg aagatactaa ccatacgatt aaaaaatctg 60 
gcttccattg aaggaacttt tgaaattgat ttccaagccg agcctttacg ctctgccggg 12 0 
atttttgcaa tatccggacc tacaggagcc ggaaaatcta cgatactgga tgcgctctgt 180 
ctggcgctgt atgataagac accccgcttc tccgcttcgg tcgagagtct ttatatgtca 240 
gatatcggtg agagtcgggt gaatcaggcc gatgtgaaga atatccttcg cagagggacg 3 00 
ggagagggat ttgccgaagt tgatttttta ggagcatccg gacactgtta tcgttctcgt 360 
tggtcggtac gccggacagg aagccgggct aacggtgctt tgcggtcgca gacgatacaa 42 0 

480 
540 
600 



780 
840 
900 
960 



gtgaccgatc tgactgccaa tcaggaacta cagggaacga ggaaagagtt gttggcacaa 
ttggtgactt tggtcggttt gacttacgaa cagtttaccc gtacggtgct gcttgcccag 
aatgacttcg ctactttctt gaaatcccgt gagtctgcca aagccgagtt gctcgagaag 

ctgaccggaa cggagatata ctcccgtatt tccagtgaaa tctatctgcg tagcaaaaca 660 

gccgatgcag aactgaatca gctgaagagc aatgccactc tgatcgaact gctttccgaa 72 0 
gaggaaatca ctcttctacg gaccgaaaaa gagagtctga ccaaccttcg tgaacaggga 
agcaaagcat tgatagacct gaatgcacag ctatcggtgc tgcatacttt gaaattgcag 
caggagcagc gtgataagaa agtgcaggat atgcggctgg atgaagaaaa aagtaagaag 
ctccgggaag aatatacacg ccagtccgac tctctcattc gcttcagggg acagtgtgaa 

gctgtgcaac ccgatcttag ccgggcacgt gaactggatg ttcagatcca atcactggtc 1020 

agccaatcta agcaggtaga ggagatactg caaggtgctg aaaaggcagc gaatgcacaa 1080 

gccaataaat tgcaatctgt gcagggagcc ctgcatactt cctgccattc gttgaagaat 1140 

ctgacgggag agatcgagct accggttacg gaagagaccg ggctatttct tgaatctgtc 12 00 

cggaacaggc tgaaagagca ggaagaccaa cttgccattc ttcaggaaaa gaacgaagcg 12 60 

cgtgtgaacc gtctgaatgc ttttgggatt gaagcggtga ctgacgagca agcccgatgg 132 0 

atgcaggaac aaacccgttt gcagaatgcc cgccagcaga tgttggaatg gagaaaagcg 1380 

gggacagagg ccgaacgtct gaaagcacaa caggaagaga tggggcacaa acaagaacag 1440 

atgcggaaag agataaccct tctgaccacc cggttgtcag agaaggaggc tgaactgaaa 1500 

gtgctgcaac gcctttttga gaatgcccgt atcgcgatgg ggaaagacgt tcggaccttg 1560 

cggctgaatt tgcgtgagaa tgaaccgtgt ccggtatgtg ggggcactga ccatccttat 162 0 

aggaatgagg aacaggtagt ccatagtctg tatcagaaca tcgaacagga atatcaaacc 1680 

gcatctgctg agtatcagca actaaataac cggaatattg ccttgaaaca ggatttgctt 1740 

catctgtcgg aattgtccgg agagataacc gtacagttgc aggcgttcct ccaagaggct 1800 

gagcagaaac gtccttcgtc tgaagaggag caaaatccgg actattttga gaaacaattg 1860 

cataccgtgc aagggaagct aaatctgctc gcggagaaaa tgcaccaata ccatcagctc 192 0 

tacaaggaat ggcaacagca tgaggggcag atcaggacgg ttcgctcggc ttgtgaggct 1980 

ttgcgcgaag gggtggcccg ttgccatctg ttgatgcagc aagttctggc tgctaaagag 2040 

caatttgaac tactgaaaac ggcggaaacg actgctcggg agcagttccg agtggtcagt 2100 

gaacagttga taactctccg tcaggaacgg gctcctttgt tgaaaggcaa atctgttgag 2160 

gatgcggaag ccgctattcg gaaaaaagag aaacaattaa acgattccgt ggaacaggtg 2220 
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cgcaaggagg gagaagaagt ccagtcgcgt atttccggta tgcagggaga gattaggcaa 22 80 

ctgaacagct ccatcgacga attgatgctc cggaaagaac agatagccga tccggaacat 2 340 

ttgccggaga cgatcgcccg ccagcaagcc accaatcagg agaccgaacg gcgtctgtcg 2400 

accgtcgaag cacgtctttt gcaacaggag caaaaccgga agaagctgaa gcaactggag 2460 

caggaactga ccgaaaaaca ggagacagcc aaccgatggg gaaaactcaa taaactgatt 2520 

ggcagtgcgg acgggacaaa gttcaaggtg attgcccaaa gctatacgtt gaatctgttg 2580 

ctgatgcatg ccaacaagca cctgtcttat ctatcgaaac gttaccggtt gcagcaggtg 2 640 

ccgggaacgc tcgccctgca agtgatcgac tgcgatatgt gtgacgaggt gcgtaccgtc 2 7 00 

tattctcttt ccggcggaga atcttttctg atctccctgg cgttggctct cggcctgtcg 2760 

tctctttcga gcaataacct gaaagtggag tcacttttca tcgacgaagg tttcggttcg 2 82 0 

ctcgatgccg acagtttgcg cacggtgatg gaagctctcg agcaattgca gatgcaggga 2880 

cggaagatcg gagtcatttc ccatgtacag gagatgagtg agcgcatcgc cgtgcaagtg 2940 

caactccatc gtgcggcgaa tgggaagagt gctatcactt tgacaaattg a 2991 



<210> 972 
<211> 297 
<212> DNA 
<213> B.fragilis 



<400> 972 

cgtgtgaacg atttcaatat tcaacagaat atgatcacca gtgccgaaga agcactagac 



60 



ctctctatcc tggcttataa cgagactcgt cagcggttta tcatcgggaa agcagatatc 12 0 

aacagtctga cgctgtctct gaaccgtcag caagaggcac aacagaatta catttcagcc 

ttgcaaaact attggctgaa ctattataag atacgtaaac tgacgttaca tgactttgct 

accggaatct cgctgactga caagtttgac tatgcgggag gacaattggt gcgatag 2 97 



180 
240 



<210> 973 
<211> 1092 
<212> DNA 
<213> B . fragilis 



<400> 973 

agaaaaagtt taataaccat tatgaaaacc cagtatccct cttatacgct ttgtctggca 60 
ttgacaatgc tgacagcttg ttcggtgaga aagaaagaga gtgcctctga aaaaggagtg 12 0 
gaaccgtgtt tgcccgacac gaaaaacgag gtgtctgtca tgacgctcaa aaagcagata 180 
ttcaatcatg aattggtgag taatggaaaa atttctgccc ggggaatggc tgacctgaga 240 
tttgaaagtg gtgaagtgat agcccatatt tgggtaaaga acggagaccg ggtacgaaag 3 00 
gggcagaagt tggcggagtt ggacaaattc aaacttgaca accagttgtc gcaatcggaa 3 60 
gacgctttaa aaaagtcgga attggaattg agggatgtac ttatcagtca gggctatccg 42 0 
gcagacgaca ttagtcaggt acccgaagag acaatgaagt tggcaaaggt gaagagtggt 
tatgatcaga gcaaatcaca atatgaaatg tcgaaataca atgcagagca cgctactttg 
accgcacctt ttgacggagt agttgctaac ctgttttcga acccctacaa tctggccagt 
acttcggatg tattctgtac ggtgatcgat atgcagggta tggaagtaga ttttactgta 
cttgaaagtg agttgccatt aataaagaac ggagataagg tagtgatcaa gccctattcg 72 0 
gatgccgcaa cagtacacga aggaagtatt tcggaaatca accctttggt agatgataaa 
ggaatggtga aggtgaaagc ccgggtgaac ggggccggta agctgtttag cgggatgaat 
gtacgtgtca gtgtacatcg ttcgttggga gagcagttgg tgattccgaa aagcgcagtc 
gtacttcgtt cgggcaagca ggttgtattt accctgaaag atgggaagat ggcccaatgg 
aactatattc ataccgcttt ggaaaatgca gacagttata gtgtggccga cggactgaca 
gaaggagata cggtcatcgt aagcggaaac attaacctgg cacacgaagc tccggtcaca 
atcattgaat aa 1092 



480 
540 
600 
660 



780 

840 

900 

960 

1020 

1080 



<210> 974 
<211> 588 
<212> DNA 
<213> B.fragilis 



<400> 974 

gatatggaac tggatgactt gaagaaatcg tggaatgctc tggatgaaca cctgaaaaac 



60 



396 



aaggagttca ttgaagaaaa agagatagca caactgctgg gacgtgcccg taacaagatg 120 

aacagcatcg accggttcaa caggaaactg cgttttgcct cgatcggcat actgacatta 180 

gcggtgctct tctggatatg cgccgacaca cttacagacc ttttttattg gatagccctc 240 

tcactgtgca tcccggctct ttgctgggat ttgtactccg cccattacct gagccggacc 3 00 

cggatcgatg agatgcctct ggtcacagtc atctcccgca tcaatcggta ccatagatgg 3 60 

atggttcgcg aatggatcat aggtatcctc tatctgcttg cgatggctac ttttttcttt 420 

ttccacaggc aagtctggca atatggtgct gcgggaatta tcgtcagcct gatcgtctgg 480 

gccatcgggc tcggaatctg cctatgggta tatcgccgga acataagaca tataaaagaa 540 

ataaagaaga acctcaacga gttaaaagaa ttaaatcata cagcttaa 588 

<210> 975 
<211> 1038 
<212> DNA 
<213> B. fragilis 

<400> 975 

aacatccgct gggaagcagc agacggattg gaaacatcaa agacatctcc ggcaacaatc 6 0 

agtacatcaa tcttgttctt agtgaggaca ccggccagcc agtccagaaa atgttcgtgt 120 

tcctgcgtgc ggtcataccc gaaaaaggtt tgtcccaaat gccaatcggc tgtatgtagt 180 

atacgtatca tgctattgaa atttgaaaca aagttagcaa ttattcagga cagcaatatt 240 

attttgttta tttttgcaca agctaaaacc aaaatcatta tgaagatact atactatatt 300 

tatcaaatct gcattgcatt gcccattttg ttagtattga ctatcctcac ggcggttgtc 360 

acaatcgttg gttcattgct gggaggagcc cacatctggg gatattatcc ggggaaaata 42 0 

tggtcacaac tgatctgcct ttttctgttg atcccggtca aagtgcatgg gcgcgaaaag 480 

ctacatgaaa gaacttctta catctttgtc cccaatcatc agggctcatt cgatatcttt 540 

ctgatttatg gttttctggg acgtaacttt aaatggatga tgaaaaaaag ccttcgcaaa 600 

attcctttcg tcggaaaagc atgcgaaagc gcaggacata tctttgtaga tcgctcggga 660 

ccgaaaaagg tacttgaaac cattcgtcaa gccaaagact ccctgaagga cggagtatca 72 0 
ttagtggtct tcccggaagg agcccgttct ttcaccggac acatgggata ttttaaaaaa 
ggagcttttc aattggcaga tgacttacag cttgccgtag ttcccgtaac catagacggc 

tctttcgaaa tcctgccacg caccggcaaa tggattcacc gtcatcgcat gattctgacc 900 

attcatgatc ccatcccccc caaaggacaa ggagcagata atatgaaagc tactatggcc 960 

gaggcttaca cagctgtaga aagtgcactt cccgataaat tcaaaggaat ggtgaagaac 102 0 

gaagatcagg atcgatag 103 8 

<210> 976 
<211> 1173 
<212> DNA 
<213> B. fragilis 

<400> 976 

gaaaatatgc tgcaacgagt tttaggcttt ctgatagtaa tccttgtact gccggacatt 60 

tatatttacc ggacatttat caaacaactg actctaagtc ttttctggcg gattctgtac 120 

ttcttcccca ctcttttcct gatggcagga gtcgtgtcac tggctttctt tgccaactat 180 
gaatacgccg agcaacatac gttatggata gggcgttttg ccgttgtctt tttcctattt 
gcttcaccga aactgatttt cacgatctgt tccatcatcg gacgcccgtt taaccgatgg 

ttgcactggt cccggaagcc ctttgtggca accggactgg tacttgccac actcaatgcg 3 60 

gcgctgattc tttacggatc gatggtcggc aaagaccgtt tcgaagtaaa ggaggtcact 42 0 

ttccggtctt cccgtctacc cgaagccttc aacggatacc gcattgtcca gttgtccgat 480 

atccacatcg ggagttggca gggaaacgcc aagagcctgc aacggatggt ggacctggtg 540 

aatgcacaaa aaccggactt aatcgtattc acgggtgacc tggtgaacaa ccgggctgcg 600 

gaattggacg gatttgaaga gatactgtct caactgcatg ccacagacgg cgtctactcc 660 

atattaggga accatgacta cggaccttac tatcgctgga aaagcaagcg tgaccaggta 72 0 

aacaacctga acgacctgaa gaaaagacag gccgacatgg gctggatact gctgaacaac 780 

gagcacaccc tgctacaccg gggaaatgac agcattgccc taatcggggt agaaaacgaa 840 

ggagaacctc ctttctccca gcacggcgac ctgcccaagg cacaggcagg aacaaacggg 900 

ctattcaagc tgttactaag tcataaccct acccactgga ggcgtgaagt gttacctcaa 960 

tcggacatcg atctgatgtt ggcgggacat actcatgcca tgcaactggc catcggacat 1020 

cactcgcctg cctcctggat ttatccggaa tggggaggta tgtacatgga ggacaaccgg 1080 



780 
840 



240 
300 



397 



gggctgtacg taaacgtcgg catgggattc gtaggtctgc ctttccgctt cggagcatgg 
ccggagatta ccgtgataac actggataaa tga 



1140 
1173 



<210> 977 
<211> 543 
<212> DNA 
<213> B.fragilis 



<400> 977 

ccaaataaaa 

cctgtacaaa 

tgctatctgt 

aatctgtgga 

cggatagccc 

gtcgcactga 

ctgcggcaac 

ctttacctgg 

aatgtagcca 

taa 



gtaatcctaa 
aagagttcct 
ataccacccg 
aagcttatcc 
tcaacacttg 
cccgcgaagc 
tgtaccggat 
aagaaaaaag 
ctaagctgag 



aagaagatca 
ctcggtaatc 
gaacgctacg 
caaattccgg 
catcagcttc 
cgactggatg 
gatcaatcaa 
ctacgaggaa 
ccggatcaag 



atgaacctga 
aaggaatacg 
ctcggcgatc 
aaagaatgca 
atccgcaagg 
acagaagaga 
ttgggacaac 
atagccgaaa 
gacaaactta 



atccggcgca 
agcgggttat 
tttaccagga 
aaatatcgac 
aaaagaatgt 
aagacgaact 
tggacaaatc 
tcaccggact 
aaaagatgaa 



tatcaacgag 
ctacaaagta 
agtgattctc 
ctggatttac 
gccggaaatc 
gacggaaatg 
gatcgtactg 
gactgtgacc 
aaaggaggaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

543 



<210> 978 
<211> 3252 
<212> DNA 
<213> B. fragilis 



<400> 978 

ctatgcggga 

tctaaaataa 

gtggcgttga 

accctgcccg 

atagaggcta 

tattctacct 

atagatgcgg 

gatggagtga 

tttatgtcgt 

gagcatatca 

gctactccca 

actttgagtg 

tataatgtcg 

acaaaagatg 

atcagtctgg 

cgcatcaacg 

ttgcaattga 

ggatatgaaa 

atttatctgc 

ctgaatccgc 

gtcattttct 

gtttcgctca 

cgtaacctga 

gtgatcatct 

gtgattatca 

gaaaagatcg 

cgggcctctc 

cgtaaattgt 

ccggtgttta 

aaaaccttgg 

ggaggtagtt 

gaagaagtcg 

aacgaactga 



ggacaattgg 
aaacgcagac 
tcggtctggc 
gttttacggt 
ccagtaaact 
ccgataatgg 
tacgttttga 
gctatcctta 
ttacactcaa 
aaacaagact 
tggagtgggt 
atatccagca 
aatcatctac 
agggttttga 
acgagcttgt 
gacttaactc 
gcaaacaagt 
ttcacaccag 
gtaccggcct 
gttacctatt 
attatctttt 
atctggtgat 
aagcctttat 
tcttccttga 
acctcgccgt 
ggttgaaaaa 
ttccgcgaag 
gtcgttggcg 
tgcttccgga 
gatcgtccac 
tgcgcctgtt 
tgttgtatgt 
tcaagaagat 



tgcgatagtg 
gaaagcctct 
cttaatcccc 
gcagttcagt 
ggaggctatg 
ttcgggaagc 
ggcttctact 
tatcaggatg 
cgctccttct 
ggcacagatt 
cttggagtat 
ggctgtcagc 
agggggtaaa 
tgcatcccgt 
aactgtgtcc 
catttatctg 
gaaagaagag 
ttatgatgcg 
tacggttctg 
cttgattgta 
tgggttggag 
agacaatacg 
gtctatcctg 
cgagaagata 
atccttgttt 
gaggaaacgc 
aataacggtt 
tgtggctgtc 
taaagtggag 
ttacaaagag 
tatccagaag 
atatgccaat 
ggaaatctat 



ctaactaaca 
tcttttacgc 
ctgcttcccg 
atgcccggta 
ttggcacgta 
atcaccattg 
attatccggc 
aagcgcccgg 
actcctatct 
cagggcatct 
gatagcgaac 
cgctattatc 
gagtggatcc 
atccgggtga 
cacatggaag 
tcgataacag 
atggaggcca 
acggaattta 
atcttgctgt 
gtcagcctga 
atgcagctct 
attgtgatga 
gccgctactc 
cggttgaact 
gtcgctctgt 
cgtcggaccc 
tactttaccc 
tgcatcttgt 
ggtgaaggac 
aagatcaagc 
gtatacaacg 
cttccgaacg 
ctgagccagt 



ttgaaacgat 
tgattgtcgc 
taaagctgaa 
cttcgtcgag 
tcaagggcat 
aattggataa 
agacctggcc 
acgagaatgc 
tgattcagca 
ataagataga 
aactgagacg 
tgaaagagtt 
ggttggcgct 
agagtgccga 
aggcccccca 
cagaagaaac 
tacaaaaggt 
ttcacgaaga 
tctttgtgct 
gcatcaatat 
attcgctggc 
ccgaccatat 
tgactacaat 
tgcaggactt 
tcttcgtacc 
aatcccgctt 
gcttctatgg 
tgatactatt 
gtgccacgga 
cgatagtcga 
gcagctattt 
gcagtacgtt 
ttaaagaaat 



ggataatcct 
ctttatatgc 
cccttcgaga 
ggtggtggaa 
aaagaatatc 
gtatgcggat 
gcagcttccc 
ttcccgccct 
atatgccgat 
cctgagcgga 
attgggaatt 
cctcggtacc 
gatgcctgaa 
aggtaaattg 
aagctattat 
tgccaatcaa 
gttgcctgcc 
attgaacaag 
gattatcact 
tgcggtagca 
cggtattacg 
cctgcaccgg 
gggagctttg 
tgcagccgtg 
tgccctgatc 
cttccttctc 
ctggatgata 
gttcggactt 
gtggtacaac 
caaagcattg 
tacccggaac 
ggagcagatg 
taagcagttc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 



398 



cagacctcgg 
cagaatagcg 
ggtggaggta 
gagggagccg 
tgggctgaaa 
aattcgtatt 
gaacgtatgg 
tatggtaaga 
ctttcttcca 
acagacgata 
caacaggttg 
tcgggcgagc 
ccgatggggt 
aagcaatact 
ttcaactctt 
gtattcctga 
gttttgctat 
atccggaggc 
aaaatccttc 
gtcggtacgg 
gtgatgtcta 
ggtaagcgtt 



tatacaatgc 
gtttccctta 
gctggggtgt 
gttcgtttca 
agttgaaagc 
tctcttattg 
cgcaggagaa 
atatggagat 
agcagtctca 
aacagtataa 
ccaaggagaa 
agggaaacaa 
atacggctca 
tgcttttgct 
tgaagcagcc 
cgttctattg 
gtggtattac 
gtcatccacg 
ctatcttcct 
ataaggaggc 
tcatcggaat 
ga 



ccgtcggggt 
tacactgaaa 
atacggcttg 
ggtgaagatg 
caagttactg 
gaaagacgat 
tatcaatgcc 
cggctcggta 
ggaatatgat 
gctgtctgaa 
ccagcagtac 
aatcctgaag 
gtcggagaga 
ggtagtgatt 
tttggccatc 
gtttaagctg 
ggtgaacgcc 
aatgtcggct 
gacggtggtt 
attctggttc 
ctttttcttc 



aatatcaata 
gccaacatta 
caggatcagg 
tacggataca 
acgcaccgac 
tatcaggagt 
aatattctgt 
gtggcggaaa 
atctgggcca 
ctggccacta 
aggctatgcc 
cgggatctgg 
gagagctggg 
gccattatct 
atatttatca 
aactttgacc 
agtatctata 
ttgagagctt 
tccaccatcc 
ccattagcgg 
cttccggtat 



tctattttac 
tcagtaaggc 
ggttcagtaa 
attatgatga 
ggatcaagga 
tctattttaa 
tctccaccat 
atggttcgga 
tgcaatattt 
tggaaaaagg 
tgcaatacga 
aagaattcaa 
gttggggaaa 
tctttactac 
ttcccgtgtc 
aaggtggctt 
tcctcaacga 
ataccaaagc 
tgggttttat 
caggaactat 
tcgtgttgaa 



caaagagcat 
cctgcaactg 
tgatgttcgt 
actgtacgag 
agtcatcatc 
tctgaaccgt 
ccggccgata 
aaagataaag 
tccgtatgga 
gcagatgccc 
atatatcggt 
taaagagttg 
aaaggataat 
cagtattctg 
gtatatcggg 
tgcttcgttc 
atacaatgcc 
ctggaatgca 
tccctttatg 
cggaggattg 
gaagagggtt 



2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3252 



<210> 979 
<211> 1653 
<212> DNA 
<213> B.fragilis 



<400> 979 

tggtctatta 

accgggatca 

catactctgg 

agtgtattgt 

aaagaaagaa 

aacattcatg 

gcatctgtgg 

gagttgtcgt 

ccgttgcagg 

ttctctcctc 

gccgaattat 

gagcaaacat 

ttgaacctca 

tat cgtgtgc 

caatatctgc 

accttgattc 

gagctgataa 

gagtatattc 

aaagaaacat 

aatcagaaga 

ttcattttgg 

ccgattgacg 

atgtcggcca 

ctggatgcca 

gccggacgaa 

gtaccgttgc 

atgataggta 

tggtttattt 



ttttcaccgg 
aaggtggtcg 
accgtttcta 
tctgtgccat 
tgccggatat 
tggacgagaa 
agcagacagc 
catcggaggc 
aacaaattta 
ctgaaacagt 
atgcccgcaa 
tcgggcagaa 
gtattaatca 
tgagaactgc 
ctatcagcat 
agacacaacc 
aagtaactcc 
cttataaatt 
ctagcgagac 
tgctggacga 
cggctcagtt 
tggcgtttgc 
tcggattgat 
tcaacgaatt 
gaagattacg 
tgttttcgtt 
ctatgacaat 
atagaaataa 



taacatgttg 
aaagtggttg 
cgataagggt 
ttcttttccg 
cgacgaaaat 
tcaacgccgg 
gtccatcgga 
ggaactttat 
tcagaagctg 
ttttgagaaa 
taaggacagg 
gacaggaata 
ggaaaagcta 
tttcaaggaa 
tgcaggagac 
ggacagcaag 
cgcagaagat 
ctatggagtg 
aggagattgg 
actggtggtc 
cgagagtttc 
attggtattg 
tgtgacttgc 
gcgtaaggag 
accgattatc 
cgaccttggt 
cggtaccctg 
agaaaaacga 



gttactgttc 
aggttgaaaa 
atcgattggg 
ttatgtatct 
gaattgatta 
gtcgacgaat 
ttgcaggatt 
tttaagaccg 
aaagaacgtt 
ctgtttgtga 
gcacccggcc 
cctcctacag 
ttgttgtatc 
aacagtgtag 
gaaaagacgg 
acgggagagg 
ttgaagagta 
gagaatgcgg 
gacatcgcct 
attctgttca 
atgcagcctt 
ctttgggtgt 
ggtatcgtga 
ggagtaccat 
atgacatcgt 
tccgaacttc 
gtgagtttgt 
tag 



tgtatatgtt 
ttaataatcc 
tattcagtca 
tctttttcta 
cccgtattga 
tgttccgtga 
acattttgaa 
aaacctcgaa 
atcctttggc 
ccggtgaggc 
ccggtacttt 
gtattgcttt 
aaatctctta 
ctatgttaca 
ttaaccaggt 
tgaactttat 
ttacagccgg 
agaagttgat 
tttcgggaag 
tctccctttt 
tgctggtgtt 
gcggacatac 
tcaatgactc 
tactggaggc 
tgaccactat 
agaaaccgct 
tcatcatccc 



ggtatatcgt 
tttgaaagaa 
taagacgttg 
ctttatcgat 
atggaacgaa 
gttgcaggga 
ccgggaacaa 
agagattgct 
tgtgatttca 
tgatattgtg 
gcgtggattg 
cgaaaaccaa 
taacgagttg 
ctcttatcag 
tttacaggaa 
tcccttacgg 
aagaaatggg 
gacccaggtt 
cttcttttcc 
gctgatgtat 
gatggaaata 
attgaacctg 
catcctgaag 
cattcatgaa 
ttttgcaatg 
gtccatagcc 
tttgttgtat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1653 



<210> 980 



399 



<211> 459 
<212> DNA 
<213> B.fragilis 



<400> 980 

agtaccccca ttgcaaacac ccataacgat atgatgcaga accccgatta ttttgaacgt 60 

acccgcagcc ggttacatcc caagaaatac cgattttatt tctttgatta cctttactat 12 0 

tgcggtgacc ggtggtcgaa aagaaattct cgtgtatggg gaagcggagt gatatttaat 180 

tattggacat tttgtatatg ggggcctgtt gccttttgga caagattaaa tgggattcat 240 

ctttttagcg agagcataga tgtgacaatt gtttttgccg gcatgttact ccctttcgtt 3 00 

tgtacccgtc tacgataccg gaaagaccgg gtatcggcta tcaggcacca ttaccgccgg 3 60 

agtgcctgga gaagcatcat tcctccccgg ctggtggtgt tcggatggtt tatcatccta 42 0 

ttgcttgaag tgataggagc aaagttatgc gaggcataa 459 



<210> 981 
<211> 1461 
<212> DNA 
<213> B.fragilis 



60 



<400> 981 

cacagaacaa ttatgattaa attcctgata caacgtccca ttgccgtatt gatggctttt 
acggcttgct tcatagtggg gttggtgacg tacttcacat taccggtatc gctgttgccg 12 0 
gatatctcca tcccggagat taccgtacag gtatcagcta aaaatacctc ggcacgtgaa 180 
ttggaaaaca cagtcgtgaa gcctgtccgt cagcaattga ttcaggtggc tgccctaaaa 240 
gacatgacca gtgaaacgcg tgacggtgcg ggtattatcc ggcttagttt tgattttggt 3 00 
accaatacgg acctggcatt catagaggtt aacgaaaaga ttgacgcagc tatgaactat 3 60 
ttgcctaaag ataccgatcg tccgaaggtg atcaaggcaa gtgctaccga tattccggta 
ttctatctga atctgacttt aaagacagac agtgcttatg aagagacgga tcagcaggct 
ttcctgaatt tatgtgagtt ttcagaatcg gtaatcaaac gccgtatcga acagttgccg 
gaagtggcaa tggtagacgt taccggtttg ctggaaagac aattgcagat tgtacctgat 
atggataaac tggctatgct tgaattatcc attgaagata ttgagacggc tctggcgcaa 
aacaatgtag agccgggaag catgaccgta cgggatggat actatgaata taacatcaag 72 0 
ttctcaactc tgctccgtac tgcggaagat gtggagaata tatatatccg taagggagat 
cgcatcatac agttgaaaga attttgccgg atagcgatag taccggtcaa agaaaaagga 
gtatctgtgt cgaacggtaa aagggccgtg acgcttgcca tcattaagca ggccgacgaa 
aacatggaca atatgaaaga tgctctgtcg gaaacaatgg attatttcaa aaagatctat 
ccggatatcg agtttagcgt gagtcgtaat caaaccgaac tgctggacta tacaatatcc 
aatcttcagc agaatctctc actcggtttt gttttcattt gtatcgttgc cgtactcttc 
ttgggagatg tcaaatctcc attcattatt gggctgagta tggtggagtc tattgtcatc 1140 
agtttcttgt ttttatacct gtgtaaaatg tctctcaata tcatctacct gtccggactg 12 00 
atcctggcac tgtgtatgat gatcgacagt tcgattatcg taacggataa tatatcgcaa 1260 
tacagggaaa agggttattc gttgcggaga gcctgcgtgg cgggaacaag tgaggtggtg 132 0 
actcctatgc tgagttcttc gtttacgaca atcgacgtat ttgtaccttt ggtatttatg 1380 
agtggtatcg cgggtgctat cttttacgat caggcttttg ccgttagggt aggattgatg 1440 
gtctattatt ttcaccggta a 1461 



420 
480 
540 
600 
660 



780 

840 

900 

960 

1020 

1080 



<210> 982 
<211> 1293 
<212> DNA 
<213> B.fragilis 



<400> 982 

agaaaaacga tagatatgaa gaaaagatat tatatagtaa tagcagcctt gctgtttggg 60 

gcttctgtag cgaaggctca ggatcatata aaactcgatt tgcagaagac gatacaattg 12 0 

gccaatgaca gttcactgga ggcattccgt acgcagaata tgtatctttc cggttactgg 180 

gagtatcgga cttacaaggc caatcgcctg ccgagcctta ctttgaatat gactcctgcc 240 

gagtataacc gggatatcac caagcgatac gattcggaaa aggacttgga tgtttatcgt 3 00 

agccaacagt cgttctatgc atcgggtaat ctggctatcc agcagaactt cgatttgacc 3 60 

ggcggtactt tctacctgca atcgcaattg ggatatatgc gtagttttgg tgggaacaag 42 0 



400 



acaacgcagt 
aattcgttca 
tttgtgtata 
atggcgcagg 
agcattggag 
aagttggatg 
gccatgtttt 
ttgcctgtcc 
gaaaacaatc 
gacaagacga 
caggtggctg 
gtcagtgtca 
cgcaataacc 
gaagtgatca 
agaagcacta 



ttaccagtgt 
agtgggagag 
atgtggaagc 
ccgagtataa 
tgcaacgcca 
tggtgaatgc 
cactggtttc 
ggcctcagga 
ctcagttact 
aaaaagagtc 
ataattttgg 
gtattccgtt 
tgaatgtggt 
tgacgtgtga 
gacctctcta 



acctatccgg 
aaagattgaa 
cgtatccgtg 
cctggccaag 
gaagatagca 
acgcaatacg 
attcctgaac 
attggtgata 
ggggttaaag 
gcgtttcaat 
agatgtgtac 
ggttgactgg 
gaaaacttct 
acgatttcaa 
tcctggctta 



ttgggatatt 
cccttgaaat 
caggccacta 
gagaatatgg 
gccatctcga 
ttgcagaaca 
ctggataaga 
ccggtggaca 
cagaacgtac 
gcgagcgtga 
cacaaaccca 
ggggtaagga 
gcccgccagg 
tattcaacag 
taa 



cacagagcct 
atgaaaaagt 
cgtatttctt 
tttcttcgga 
aagccgactt 
aggctagtgc 
atacggttat 
aggcattgca 
tggaagccga 
atgccagtat 
tgcagcagga 
aaggtaaata 
atgaaatcag 
aatatgatca 



ggtcggatat 
aaagaaagag 
taacctggct 
tacgctttat 
attgacactg 
cctgaaacgc 
tgatatcgac 
gatggcacat 
acgcaatgtg 
cggtttcaac 
cttggtatcg 
taacatggcg 
cctggacgaa 
ccagtgccga 



480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1293 



<210> 983 
<211> 486 
<212> DNA 
<213> B. fragilis 



<400> 983 

caaatggaag 

ggatggactt 

gtgaaagtaa 

gggaatggag 

gcgggtgatt 

gaactgcaat 

gaaaacgagc 

gtagccagga 

ggctga 



tacccttagt 
atgcacccat 
aaggaagtat 
aattaggact 
atgtacatgt 
tgtgtttgca 
ggcacaatta 
tggccaaagc 



ggataaagat 
tcccgaagtg 
tgatggtgtc 
ttctgtaaaa 
tgttttgtac 
agacgaacct 
tgtgaaatgg 
gattgacagg 



tacttgcttg 
ccgcaagata 
gaaatcaaaa 
gctgaaatcc 
cttgatgaag 
cgagcattgg 
atctattctg 
cttgcaagca 



agagaactcc 
aaaaggcacc 
agcaccattt 
gtaagaagat 
agccgtcgga 
aatttttcaa 
caaagaccga 
acctgaagta 



cggcaatgga 
tttcggctgg 
gatgccaatg 
caaaaagcag 
gattcccgaa 
ttcactggct 
tcgggcaaaa 
ttacgataaa 



60 

120 

180 

240 

300 

360 

420 

480 

486 



<210> 984 
<211> 1170 
<212> DNA 
<213> B. fragilis 



<400> 984 

attaattcct 

aat cttgtat 

ggaaacgagt 

tattgccgac 

tctgaattta 

catggaaata 

tatggagcct 

tcgaaagtgt 

aatagtgcaa 

gatccgaatg 

gctttgatgg 

gttactttgg 

ggagttcccc 

ctgccgaaag 

acggagatta 

attacaaaag 

caaggtgtca 

gtttcaggtg 

aacctgggtg 

aggccggata 



ctttatttat 
cccaattgcg 
tgatcatttt 
aggctgattt 
tgaaagggaa 
cttgccctat 
ctaaacgggc 
tggtataccg 
tagctacttt 
tggagatgaa 
gcaatgaaca 
gagcaattgt 
atgtgggaga 
atggatttgg 
tccgcagtac 
gtaaccattg 
tccgttttcg 
ataagcttga 
atacggatat 
cttattttga 



gaaagtttta 
caatattcaa 
tgaatatgat 
tatttttaat 
ctttggtttt 
catgatatcc 
gggtgaacaa 
ttttcccaat 
ttgctataac 
tcttgtatat 
tcgggaagga 
ggaattgcta 
tgcttttact 
ctatcctttg 
ggaccgggga 
gcatcatacc 
taacatgtat 
aataattgat 
ggtaactttt 
agaagtttaa 



gtaaccggtg 
agtggcaaag 
gtagatagtg 
ctggccggtg 
gcttctacgc 
tcttctacac 
ttgttgtttg 
gtctttggta 
atagcacatg 
atagatgatg 
gcttattgta 
tattctttcc 
aagaaacttt 
aagatgaatg 
cagttctcgg 
aaaaatgaaa 
gattcatcct 
attcctaccg 
atgtggtgta 



cgaaaggttt 
caaaaaatta 
acccttctga 
taaaccgtcc 
ttcttgcttc 
aagccgcttt 
agtattcccg 
aatggtgtcg 
atcttctcat 
ttgtggatga 
aagtatctgc 
gtgagaaccg 
actccaccta 
tggatgcccg 
ttaatatttc 
agtttgtggt 
ctgagattct 
gttataccca 
acgagtgttt 



tgtcgggcgg 
cgctttgtcc 
attggatgac 
actggatcag 
attgaagaga 
ggataatcct 
ggaaacggga 
ccctaattat 
tcaggtcaat 
attaatctct 
tgtatacact 
taacaatttg 
tctctcttat 
cggcagtttc 
caagccacat 
tgtaagtggc 
agaatatttt 
taatattgag 
tgatcccggt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1170 



401 



<210> 985 
<211> 201 
<212> DNA 
<213> B . f ragilis 



<400> 985 



60 



agacaaatat caaaaatata taaacaatca caattacaca gctatcattt gaagaaaaca 

atatcacccg gtgctagaca ccaagcacac actctgaaac ataaaattat tcgcaaagcc 12 0 

agttctttaa agtaccacct atatttatcc aataataatt attacatcat ttttcttata 180 

cagtctagtt ggaagaactg a 2 01 



<210> 986 
<211> 1899 
<212> DNA 
<213> B. f ragilis 



60 



<400> 986 

tctaatggat ttaaagacaa atcctgtttt cctaaaaacg aatatacaat gatggactct 

catgacacta accaaccttt gaaacaaggg gaattagaag aagaaaaaaa agcagttgag 12 0 

gtttctgaag aaattacaga aactccggct gaagaaacta ttgtggaaaa accgacagaa 180 

aatgcttcga aactaagcac taaagaagag gtgctgctcc ggttaaaaga agttgcccaa 240 

gatgctgaaa atgcaaacaa gcaagaactg gatggtttaa agcaaacttt ctataaaatt 3 00 

cataatgccg aaatcgaggc tgcgaaaaaa acgttcgtag agaatggtgg tgccgaagaa 3 60 

gaatttattg ctcagcccag tggcgtggaa gaagaattta aaagtttgat ggcagctatt 42 0 

aaagaaaaaa gaagtgcctt ggcagctgag attgaaaagc aaaaggaaga aaatctacaa 480 

gttaaactat cgattatcga agagttgaaa gagttagtgg aatcacccga tgacgccaac 540 

aaatcctaca acgaatttaa aaagctacag cagcagtgga acgaagtgaa attggtgcca 600 

caagctaaag tgaacgagtt atggaagaac taccagttgc atgttgaaaa gttctatgat 660 

atattaaaac tgaataatga attcagagaa tacgacttca gaaaaaacct ggagattaaa 72 0 

acacatctct gtgaagctgc cgaaaagttg gccgatgaac aagatgtagt ctccgctttc 780 

catcaattac agaaactaca tcaggagttc cgtgacaccg gtcctgtcgc caaagaatta 84 0 

cgtgacgaaa tatggaatcg ctttaaagcc gcttctacag ccgtcaaccg tcgccatcag 900 

cagcatttcg aagctctaaa agagaccgaa caacataatt tggatcagaa aacagttatc 960 

tgtgaaatag tagaagctat tgagtttgac caattgaaaa catttgcggc atgggaaacc 102 0 

aagacacaag aggtgatcgc cctgcaaaac aaatggaaaa caattggttt tgctccgcag 10 8 0 

aaaatgaacg tgaaaatctt tgagcgtttc cgtaaagctt gtgacgaatt ctttaaaaag 114 0 

aaaggagaat tcttcaagtt gctgaaagaa ggtatgaatg ctaatctgga aaagaaaaag 12 0 0 

gcattgtgcg aaaaagcaga atctctgaaa gatagtacag aatggaaaga aacggctgaa 12 60 

atcttaacca agctccaaaa ggaatggaaa acaattggcc ctgtttctaa aaaatactcg 132 0 

gacgctgttt ggaaacgttt cattactgca tgtgattatt tctttgagca aaaaggcaag 13 80 

gccacttctt ctcaacgttc tgtagaacaa gagaatctag aaaagaagaa ggcaatcatt 1440 

gcccgcttaa ctgctattga cgaaacgacg gatgccgatg aagcaagcaa agaggttcgt 1500 

gaattgatga aagaatggaa tggtatcgga catgtaccgt ttaaagagaa agacaggctt 1560 

tataaacaat atcacggttt gattgaccaa cttttcgatc gatttaatat cagtgcatcg 162 0 

aacaaaaaac tgagtaattt caagtcttct atcggcaata ttcaaagtgg aggctcccag 1680 

tcactctacc gtgaacgtga gaaattagtc cgtacatacg aaaacatgaa aaatgaactc 1740 

caaacttatg aaaataattt gggcttcctg actacctctt ctaagaaagg aaatagtctt 1800 

ttgacagaaa tcaaccgcaa ggtggaaaaa ttaaaatccg acttagaatt agtattgcag 1860 

aaaataaaag taatcgatga atcaatcaaa gaagaataa 18 99 



<210> 987 
<211> 342 
<212> DNA 
<213> B.fragilis 



<400> 987 

ggagggacat ctatgagaca aggagtcgta tacttgaata aagaacgggt aggcattatt 60 

acggaattat cttctaacga atataaattt cgctatgatg acgaatattt caatgatcca 12 0 

tcaaagccct ccataagcct gacattgaca aaacaacaac aggaatatac ttcccattat 18 0 



402 



420 
480 
540 
600 
660 
720 
780 
840 



ctatttcctt tttttgccaa catgctgtca gaagggcaca accgcatcgt tcaggcaaga 240 
ttattgcaga ttgatgaaaa agatgatttt ggtattttat tagctacagc acataccgac 300 
acggctgggg ctgtaaccat aaaacctctc gactatgatt ga 3 42 

<210> 988 
<211> 1032 
<212> DNA 
<213> B. fragilis 

<400> 988 

agtattatgt cactttttaa agataaatct ctcttgatta ccggtggaac aggctctttc 60 
ggcaatgcgg ttttacgtcg ttttcttgat tctgatatca gggagattcg tatattctct 12 0 
cgtgatgaaa agaaacaaga tgatatgcgt cactatcttc agaacccaaa agtaaaattc 180 
tacattggcg atgtccgtga caagcgctct gtggatggag ttatgaatgg agtggattat 240 
atcttccatg ccgctgcgct gaagcaagtc ccttcctgtg agttttttcc cacacaagcg 3 00 
gttaggacaa atgttctcgg tacagaaaat gtgttggatt ccgccatagc tcacggtgtt 3 60 
aaaaatgtgg tggtactttc taccgataaa gctgcctatc ctataaatgc aatgggtatc 
agcaaagcca tgatggagaa agttgctatc gccaaaggtc gtcagttggg taattgtgga 
ggaacaacga tttgctgtac ccgttatggg aatgttatgg ccagtcgtgg ttctgtgata 
cctttgtggg tagagcaaat taagaaatgt aatccaataa caataacaga tcccaacatg 
acccgcttca tgatgacttt ggatgatgct gtcgacttgg tgatttatgc ctttcagcat 
ggaaaaaatg gtgatttgtt tgttcagaag gcgcccgctg ctactctgga tgtattagcc 
gatgcattaa agtctcttta ccatagtaac gcggatgtca aagtgattgg tacccgtcac 
ggtgagaaac tctatgaaac tcttgttacc cgtgaagaga tgtctaaagc agaggatatg 
ggtgattatt atcgtatccc atgtgatacg cgtgatttaa attatgataa gttttttgtg 900 
gaaggaagtg aggaggtctc caaaatagaa gattaccatt ctcataatac ccgtcgtctt 960 
gatgttgagg ggatgaaaga acttctcttg aaacttgatt ttattcgcga agatcttggc 1020 
cttgaaaaat ag 1032 

<210> 989 

<211> 1245 

<212> DNA 

<213> B. fragilis 

<400> 989 

acaaaatcgt ctggagaaaa tcctgataat atcgaaatga atattttgtt tctgaccctt 
aaccgtgttt cagatctttc tgaacggggg atatacacgg atttgatgcg ggaatttatt 
tgtcatgggc atagggtcta tatggttgtt cccgccgaac gtcgctttca tgaatctact 
tcaataaaag agagttgtgg cgctcaaatg ttgagggtga agacattgaa tatccaaaag 
agcaatgtgg tggagaaagg catcggtaca ttgttattgg aaatgcagta tcaatgtgcc 300 
ataaagagat attggaagga tatccggttt gatttgatac tttattcaac tcctcccatt 360 
actttcaata gggtcatcag ttcacaaaag agacgttgta aggcgaaaag ttatctttta 
ttgaaagata tttttcctca aaatgccgtt gatttgggaa tgttttcaaa gagaagctta 
atttatagac ttttccgtaa aaaagagaag gatttatatc agatatcgga ctttataggc 540 
tgtatgtctc ctgccaatgt ggattatgtg ttgacacata atccggaaat aaaggctgat 600 
agagtagaga tatgccccaa tagtattaaa ttgttagaga agtcattaat ggcttcaact 660 
gtaagaaaaa acatattgca gaaattgcat attccaatta ataagactct ttttatatat 72 0 
ggtggcaatt tggggcgtcc acaaggtttg attttcttgt tggacgtgat agccgcaaat 780 
gaggaacgta atgacagtta tttcatcatt gtaggcagtg gcactgaata tggcaagata 
aagtcttggt ttgaggcgaa tcatccggat aattcaatgc tgctttcttc acttccaaag 
aaagagtatg atgatttggt aaaggcttgt gatgtcggtt tgattttcct tgatagacgt 
tttaccatcc ctaattaccc ttcccgttta ctctcttatt tagaaaaccg gatgcccgtt 102 0 
ttattggcta cagacctgaa tacggatatc ggacggattg ctgaacggaa tggttatggc 1080 
ttttggacag aaaatgggaa tttggataca tttatggaaa tggtggattc cttatctgca 1140 
gacagagaaa aaataaaagt gatgggcgag aaagggtatg aatacttgaa gtctaattat 12 00 
acagtagaaa gagggtaccg gatgataatg aaacattttg agtag 1245 



60 
120 
180 
240 



420 
480 



840 
900 
960 



<210> 990 
<211> 183 



403 



<212> DNA 

<213> B.fragilis 



<400> 990 

ctgcatctgt tagaactcca acagttacag aacctagttt ggaagcctct cttattatat 60 

ttaaatgacc tggatgaatc atatccgcac tcattccaac ataaactttt ttacattttt 120 

ccatcttaca tagaatttat tagaaaaaac aaacaggaag tgttagttat aacaatttat 180 
taa 183 



<210> 991 
<211> 489 
<212> DNA 
<213> B.fragilis 



<400> 991 

agaatttccc ccctcagctc cccccaaagg ggaggaagaa gagcggaagg gggattctgc 60 

ttatctcccg ataccgggat acgccttcaa tacaatgacg cacaattact cgggactgat 12 0 

ggacacgcta aagagattga gcattaccga caccggggaa gtaaactcca tactcaggct 180 

gtcggactat gggaggaagg gaacgacggt atggaaactg attgccaaca cttgctggag 2 40 

cgacatcgga gccaaaggaa gatacctgat agcggcgcta aacaagacga aaagaaggta 3 00 

gcagagagtg tcagt cccct atttgtagtt gacaaaaaag caaatataca ggcttttgac 3 60 

cagaaaggga ttcagcgaaa caaagaagta aaaagtgtgc ttaacgaact aaaacacagt 42 0 

gtttttaaag cacaagattt ctctcgccca aagctttgtt ttaacgctac gttaaagctt 480 
gttctttaa 



489 



<210> 992 
<211> 186 
<212> DNA 
<213> B. f ragilis 



<400> 992 

gaatgcctac cgaaagatag gaatcttttc 
ttaattgatt cggtaaaaat gatttttcga 
aaaagcttat tagctcttta tgaactcttt 
ctatga 



tcatttggta atggagtaaa tattgtttct 60 
ctaaatcttg tacttgttct gccttttggc 120 
tggggggaat gccgatgcgg ataccaaatt 180 

186 



<210> 993 
<211> 297 
<212> DNA 
<213> B.fragilis 



<220> 

<221> unsure 
<222> (56) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 993 

aaagcagcta 

tcctggtata 

caagttattt 

ggcattttcg 

tgcaacaacc 



cggcaaccgc 
catttatatt 
cacgccagct 
aaggaaaggt 
tgcgcaagat 



atcattgcca 
aaaaggaata 
taatgtgaat 
cgggttgtat 
tccgaatata 



ctgaatggga 
gataatgtag 
atccggaaac 
gtgcacgatg 
aagtcggtga 



tacacacaag 
ggctgctgaa 
tggatatgga 
tggaagatgt 
cacgtgtaga 



aagctntcgt 
tgaaatcaca 
aacggacgat 
aaaagctatt 
aaactaa 



60 

120 

180 

240 

297 



<210> 994 
<211> 1164 
<212> DNA 
<213> B.fragilis 



<400> 994 



404 



aattctaaca ccatgctgaa atcaaaatat aaaatctatc tgttactact ctgcctgaca 60 
ggttgtgttt cagaatacaa cgcacaacta ccttcttccg atgaagaatt gctggtagta 12 0 
accggcgaca ttatcgctaa tacagaagcc atattctcat taagcaaaag tattccacta 180 
tccgaagaca tgccggaaga ttatcgaaac atttatgcca gaattgctgt agtaggcagc 2 40 
gacggctatc gaagtgattt cggaacggct cttggtgatg gtaaatacca ggtcagtatc 3 00 
ggtgaattgc aggatgatgt atcctacgga atagagatag aatacgacgg agagatttat 3 60 
acctcgtctc cttccacacc gatggtatct tctgaaatag acagtgtttc gtggatacaa 420 
ccagaacctg aacaagcact ttctatacgg gtatcgaccc atggtgatcc cggaaaaact 
caatactaca tgtggaacta tcgggaagac tgggagataa gagccagcta cattacaact 
tgctactttg atccggatat gaaccgcatc tatgaagaca gcaattatcc aactttctat 
tgttggaaaa aggaaatatc aagaaatata ttgattggct ctacggaaaa gttgaaagaa 
catctgatca taaataataa gctactcgat gtgccggtca atgaagacag attcactgta 72 0 

780 
840 
900 
960 



480 
540 
600 
660 



ctatacagca tacaggtaca gcaacgggca ttgagtaaag agggatatga atattacttg 

aatgtacagc aacagaatga agaaatggga ggaatcttta ctccacaacc ctctgaaatc 

caaggaaaca ttagttgtat cagtcagcct ggacgaagga cgatcggtta tgtaggcgtc 

tataaaaaca tctctgaaaa gagaatatac attcatccca acgaaattaa acgtcctcct 

ctatacagtg gctgtgaaga agtgtcggat agcgaaatgg atgaacaggg ctatagcaca 102 0 

tatctgataa gataccttgt cggttatcgt ccagtcggta caggcactca cattgaccac 1080 

tgggccctac ggagatgtac agaatgtgaa gccaacggag gaagtaaaaa caagccttca 1140 
ttctggccca acgatcatca ataa 1164 



<210> 995 
<211> 366 
<212> DNA 
<213> B.fragilis 



<400> 995 

ttgagaaaca ctctacaaac aaaaagaaat ctctgtcatt ctaaaatcaa aagtacaaat 
agccgtaatc aaattcacaa tatattggga atttccttgc ttatcaaaca aatacttagt 
atttttgttt caaattcaca atatattggg aatatgaaac aaattggaat acagattcgc 
caacgaagaa aaatgttggg tataaatcag caaacacttg ccgatttagc acaaatcagt 
atcaatacta taacaaaaat tgaaaatgga gaaataaata ttaattttca aaagctctat 
gccatattgg aggtattagg attagaactt tctctgaaaa ttaaaaataa ggagggacat 
ctatga 



60 

120 

180 

240 

300 

360 

366 



<210> 996 
<211> 2046 
<212> DNA 
<213> B.fragilis 



<220> 

<221> unsure 

<222> (1088) , (1885) , (2007) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



60 
120 



<400> 996 

attatggaca ctgagaacca gaaagaaata gctgaagagc agatgattga acaagcgttt 

caggaattgc tgaacgatta tcttgctacc aagcaccgca aacgtattga gattataacc 

aaggccttca atttcgccaa tcaggcacat aaaggcatca aacgacgctc gggggaaccg 180 

tatatcatgc accccattgc cgtcgcgaag atcgtatgca atgaaatagg ccttggctcg 240 

acttccattt gtgccgcttt gctgcacgat gttgtcgagg acaccgatta tacagtagaa 3 00 

gatatcgaaa atatcttcgg ggccaagatt gcacagattg tcgacggact gaccaaaatc 3 60 

tccggaggta tttttggtga ccgggcttcg gcacaagcag aaaacttcaa gaaactcctg 42 0 

ctcaccatgt ctgatgatat ccgggtgatc ctgatcaaga ttgccgaccg cctgcacaac 480 

atgcgtacac tcggttccat gttgcccaac aagcaatata agattgcagg cgaaaccctt 540 

tatatttacg cccctcttgc caatcgcctg ggactgtata agatcaagac ggaactggaa 600 

aacctcagtt tcaaatatga acatcctgaa gaatatcagg agattgaaga aaagctgaac 660 

gcaacagccg ccgaacgcga taaggtattc aacgaattca ccgctcccat acgcgagcag 72 0 
ttggataaaa tgggattaaa atatcgaatc ctggcacgtg tgaagtccat ctactctatc 



780 



405 



tggaacaaga tgcagaccaa gcatgttcct ttcgaagaga tttatgatct tctggctgta 840 

cggatcattt tcgaaccacg caacatagat gaggaactga acgactgttt cgatatttat 900 

gtttccatct ccaaaatcta taaaccgcat cccgaccgtc tgcgcgactg ggtgagccac 960 

cccaaagcta acggatacca ggcactgcat gtcactttga tgggcaataa tggccagtgg 1020 

atcgaagtcc agatacgcag tgagcggatg aacgatgtag ccgaacaggg atttgccgcc 1080 

cactgganat ataaagaaag aggaggcagc gaagacgaaa gcgaactgga gaaatggttg 1140 

cgtaccatta aagagatact cgacgatccg cagccggatg ccatcgactt tctcgataca 12 00 

atcaaattaa acttattcgc ctcggagatc tttgtcttca ccccgaaagg agagctcaaa 12 60 

accatgccgc agaactccac tgccctggat ttcgccttct cactgcacac ggatatagga 1320 

agccactgta taggtgccaa agtgaatcat aaactggtgc ctctaagcca taagctgcaa 1380 

agtggtgacc aagtggaaat cctgacatcc aagtcacagc gtgtacagcc gcaatgggaa 1440 

gtgtttgcca ctactgcgcg tgcaagggct aagattgcgg ctattctgcg taaggaacga 1500 

aaaaccttcc agaaagaagg agaagaattg ttgaatgaat tctttaagaa agaagagatc 1560 

cgcccggagg cagccgtcat cgagaagttg tgcaaactgc ataacatgaa gaacgaagaa 162 0 

gagtttcttg tagccatcgg taacaaaacc atcgttctgg gagatgccga caaaaatgaa 168 0 

ctgaaagaga aacaaagcag caactggatg aagtatctga ctttctcttt tggcaataat 1740 
aaggataaac agcaggagga aaaagaaccg caggaaaagg aaaaaatcaa caccaaacaa 
attctcaaac tgacggaaga tgccctgcaa aagaaatata tcatggccga atgttgtcat 

cccatccccg gtgacgacgt actgngatac atggacgaga atgaccgcat catcatccac 192 0 
aagcgtcaat gtccggtagc ggccaaactg aaaagcagct acggcaaccg catcattgcc " 
actgaatggg atacacacaa gaagctntcg ttcctggtat acatttatat taaaaggaat 



1800 
1860 



1980 
2040 
2046 



agataa 

<210> 997 
<211> 888 
<212> DNA 
<213> B. fragilis 

<400> 997 

tctaaaaaaa caaatatggc aatagcatac gacgggatca attattttcc ggtgggcgta 60 
aacttcatgg aagagaacgc aatggaagtg atagaagcaa aatatggaat aaaagggtcg 120 
gcaattgtgc tgaaactgat gtgtaagatt tacaaggagg gatactacat acgatgggat 180 
gaagaacaat gcctgatttt cgcaaacaaa gcaggaagag aggtgcaggc agaagaggtg 240 
caggggatca tcgagattct gttcaccaaa ggaatactgg acagaaacag ttatcaggaa 3 00 
aacggaatac tgacttcgga aagtatacag aaagtatgga tggaagcgac aaagcgaagg 3 60 
aaaagagagt tgtcggagct cccttacctg atggtgaaac cggaaaaaga aaatggaaaa 42 0 
gccgacactc ccccggcact acaagaaatt cagcaaccag agctgttcaa aaaggaaaaa 
acacctgtta acccgaaaaa tgtagtacat catgtagccg ttgacgcaaa aaatgcatgc 
aattccggac aaagtaaagt aaaagaaaag aaagcagagg aaaataaaga atttcccccc 
tcagctcccc ccaaagggga ggaagaagag cggaaggggg attctgctta tctcccgata 
ccgggatacg ccttcaatac aatgacgcac aattactcgg gactgatgga cacgctaaag 72 0 
agattgagca ttaccgacac cggggaagta aactccatac tcaggctgtc ggactatggg 
aggaagggaa cgacggtatg gaaactgatt gccaacactt gctggagcga catcggagcc 
aaaggaagat acctgatagc ggcgctaaac aagacgaaaa gaaggtag 



480 
540 
600 
660 



780 
840 
888 



<210> 998 
<211> 366 
<212> DNA 
<213> B. fragilis 

<400> 998 

actacgaaaa caaacaacaa gaagaaaaaa aaagaattca aaaaaaacaa aacgggcaac 60 

aagcaaatct ctgacaatag agaccttaag tcaaacagag gacgaaaaaa ggagaaaccc 12 0 

atacaaggga ttgttttgaa acactacgaa tgcttaaagc tactaatcac actctatcaa 180 

gatggggcaa tgggtataaa aaaggagaca tcacaagttg cattagcacg atatatagac 2 40 

gacaaaaaac tattagggaa tattcgaaat ggaatattca ttccattgaa gttcagcact 3 00 

attctaaagg aaacaaacac catctggaac gaaatgctac gagataaatc cattggcata 
aaatag 



360 
366 



406 



<210> 999 
<211> 360 
<212> DNA 
<213> B. fragilis 



<400> 999 

atgaacgaaa cgaaagtatt aatagaaaag ataaccgaag gtatacaaga aaaaaaaggt 60 

aaaaacattg tcatagcaga cctgacaaac atagacgaca cgatatgcaa atactttgta 12 0 

atctgtcagg ggaactctcc cagccaggtc attgccattg tagattccat aaaagaattt 180 

acccgcaaag gtgccggcac caaaccctct gccatcgacg gacagcgaaa tgcagaatgg 240 

gtagctatgg acttttcaga tgtattagta catgtattcc taccggaagc cagaaacttt 300 

tataatttgg agcacctgtg ggcagatgcc aagttaacta caattcccga cattgattaa 360 



<210> 1000 
<211> 225 
<212> DNA 
<213> B. fragilis 



<400> 1000 

aacctctcga ctatgattga attaacttgc tgtccttcta ctttacaaaa gggattctca 

acctattcgc ctgttgcatt gaaagagctg ttcaatagcc aaaaggtaaa ccatatactg 

ccatacaatg gcatggacaa taatgaaacg gaacaaaaag aatttcagga taacaacaaa 

cacatgtcta tatccggagc tcaacaaaat aagtccagcc aatga 



<210> 1001 
<211> 1104 
<212> DNA 
<213> B. fragilis 



60 



<400> 1001 

ttaatatctg ccatggaaca tcctgagaat aacgaagcgt ataaaggttt ggttgtgaat 
gcaggcattg aacaaccgtc atctgtaaat ccttatctga aacggaaggt aaagaagcgt 12 0 
caattgtcgg ttagtgagtt tgtggaggga attgtcaagg gagatgtgac gatcttgagt 180 
caggctgtga ctttggtaga aagtgtgcgt cctgaacatc aagctactgc ccaggaagtt 240 
attgaaaaat gtctgcctta ttccggaaat tcaatccgtg taggtatcag tggtgtaccg 3 00 
ggagccggta aaagcacctc gattgatgtc tttggattgc acgttctcga aaagggaggt 3 60 
aagttagctg ttttagccat cgacccgagc agtgaacgca gcaaaggaag tattttgggt 42 0 
gataaaaccc gtatggagca gctttcagtg catcctaaat catttatacg tcctagccct 480 
tccgccggtt ctttgggggg agtagcccgt aaaacccgtg aaacaatcat tctgtgtgaa 540 
gcggccggct tcgataagat atttgtagag acggtgggag tgggacagag tgaaacggct 600 
gttcactcga tggtcgattt ctttctgttg attcagttgg ccggtacggg agacgaactt 660 
caaggtatta aacgcggtat catggaaatg gcagatggta ttgtgattaa taaggctgat 72 0 

• 780 
840 
900 
960 



ggtagcaata tcgataaagc caaattggcc gctgctcagt tccgtaatgc tttgcatctt 

tttcccgctc ccgattccgg atggacaccg cgtgtactca catattccgg attctacaat 

cttggggtaa aggaaatatg ggatatggtg tatgagtata tcgattttgt gaaaggtaat 

ggctattttg aatatcgccg taacgaacaa agtaaatact ggatgtatga aagcatcaat 

gaacagttac gtgacagttt ctatcataat gccaagatcg aatcgatgtt acaagaaaag 102 0 

gagcaacaag tgctcagggg aaatctgacc tcttttgttg ctgccaagag cctactcgat 1080 
acctattttg aagatctgaa ataa 1104 



<210> 1002 
<211> 1206 
<212> DNA 
<213> B. fragilis 



<400> 1002 

ttattgtatg ccaatcaaag aaattttaga 
atgatttcac caaaattttt tattgatagc 
ggtgttccgg attctttatt aaaaaacata 



attaatacct ggtacaaaag cattaaaaag 60 
ctttcagata ggcaaattga ttttttttca 12 0 
tgtgcttata ttgcggataa caaggatgca 180 



407 



aagcataata 
ttggctacaa 
aatcctcttg 
ggatggcgtg 
actattcctt 
gatttcacta 
gctttaataa 
gtattacaac 
ggaaaaaagg 
agaacaaaaa 
gcttcacaaa 
gatggtgacg 
cctgaaaatt 
ccaactgtag 
gtatatagtg 
gcaggaccta 
ccatctatta 
aaatag 



ttataacagc 
gggaaattcc 
cgtcattgac 
gtgagccagg 
tactggatgt 
agcaattaga 
tagaaaaaga 
aattatcaat 
atgtaatagt 
tgaatgaaag 
tagcattagg 
gtgctgtgat 
tgatccatgt 
gacttaagat 
ttgattgtga 
ttcttctaga 
ctccaataca 



aaatgagggg 
agttgtgtat 
tgataaagaa 
agttcatgac 
aatggggatt 
tgatgcgttg 
tacattcgaa 
gagtcgagaa 
ttcaacaaca 
ccatcagagt 
gattgcactg 
tatgcatatg 
tgtgtttaac 
taatatccct 
agaagcttta 
ggttaaggtt 
aaataaatta 



gcggcagtcg 
atgcaaaact 
gtctataata 
gaaccgcagc 
aagaatacag 
gtctatatgc 
tcatattcgc 
aacgctattc 
ggtatgattt 
gattttctta 
gaaatacccg 
ggatcaatgg 
aatggatctc 
gcaatagcaa 
aagactgctt 
aaaaaaggga 
tcttttatga 



gtttggctgt 
caggagaggg 
tacctattct 
atgttaagca 
tgatgagtaa 
gtgaaactaa 
ttaaattaaa 
aaatggttgt 
cgcgagagct 
cagttggttc 
atcgtaaaat 
caattattgg 
acgattctgt 
gagctgtcgg 
tagaaaaggt 
atcgaaaaga 
cttttttgaa 



aggtcattat 
aaatattatc 
tttactgatt 
gggaaaagtg 
gagtgaagtt 
cgaagcattt 
ggaggattct 
agattcaatt 
ttttgaatac 
tatgggacat 
ttattgtttt 
agataagggc 
aggtggtcag 
ttataaagtt 
cataaaagaa 
tttgggaagg 
taatgaaaaa 



240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1206 



<210> 1003 
<211> 1260 
<212> DNA 
<213> B.fragilis 



<400> 1003 

aagagttttt 

tttgactcca 

ttagccgatc 

accaaagcaa 

cttgtccccc 

ctcgatttta 

agtgcatttc 

ccattgaaag 

gttatcagtg 

gctgccatcc 

ctttctgcaa 

gacggtacgg 

atcactacca 

caagagtccg 

ttctgtaccc 

gagcagaccg 

ttggtcaatg 

ttgaaaagtc 

accgaacaag 

tacgaaggaa 

ttgcgcgtat 



atatggaaca 
gtgaagtagg 
ttatttgtcg 
cctgggatga 
ccatcattat 
tccgtcgtat 
tgtcggctgt 
gtttgttgca 
tattgataga 
tgatgttggt 
ataatatgct 
tgattgaagt 
tccctcccta 
gtggacggcg 
ccgaaatgtt 
agcaggtggt 
ggcgacgcca 
ttcccgatgt 
gtattccggt 
ttcaggccga 
tccagaatcc 



gataaccgaa 
tcccattatg 
taacattctt 
tattgtgttc 
ttatgtgttg 
ctgcatgatt 
ttatcatgta 
aacggcacaa 
taaatctccg 
gtttaaagac 
gaaagtaggt 
gacgctcaat 
tctgctggtc 
tgtgaagcgt 
ggctaaatat 
gaaagagtat 
gaccaatctc 
caataagaac 
cgaactttac 
tgtattcgat 
tacgggagaa 



aaaataaatg 
acactggtac 
ttaagagtag 
gaccgtaaag 
attcctttgg 
tatatcattg 
tacagtgagc 
gtgatactat 
atggtattgc 
agtatcatgg 
gactggattg 
acggtgaagg 
agcgattctt 
tctatcaata 
aagaaaatcc 
aataaagaac 
ggagtattcc 
ctcacttgta 
tttttttctg 
catctgcttg 
gattttcggg 



acttattcgt 
tgattattgg 
ttgccaaact 
ttttgattta 
caattccgaa 
cggtttttct 
gagaacagtt 
ttttcattgg 
tcaccgggct 
gatttgtgtc 
ctatgcccaa 
tgcgcaattg 
ttcagaattg 
tcgatatgaa 
aattgttgac 
atcacataga 
gtgcctatct 
tggtacggta 
ctgtgaaaga 
ccattgttcc 
agtggaatag 



ttcctgggga 
cattgccttt 
agtgaaaaag 
cctcagtcac 
tgtaagtgcc 
gcgttttatc 
tcgtgatcga 
aggaattgtt 
tggtgcttcg 
cggcattcaa 
atacggagcc 
ggacaatacc 
gcggggtatg 
cagtgtaaga 
cgattatgtg 
caactctatt 
gaccaactac 
tcttcagccc 
gtgggtacct 
ggaatttggt 
aagaaattaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 



<210> 1004 
<211> 840 
<212> DNA 
<213> B.fragilis 



<400> 1004 
gcaaagtttc 
aatttagata 
acaacgacta 
gttgatgcag 
gaatgtacta 
accgaaattg 



tattttttag 
atatgggaaa 
cgattaacct 
acccacaggc 
tctacgaatg 
attcgttgaa 



taggcgctct 
aataattgct 
cgcagcttcg 
aaatgcctct 
cattatcgac 
agtcatttca 



gaggatttat 
ttggccaatc 
ctggctacgc 
tccggattgg 
agagccaacg 
tcgcacatta 



cacttatttc 
aaaaaggtgg 
tcgaaaagaa 
gagtcgacat 
tacaggacgc 
atctcgtagg 



gctgccaata 
tgtaggaaaa 
agtactggtt 
caagcaatct 
tattcatgac 
cgccgaaata 



60 

12 0 

180 

240 

300 

360 



408 



gaaatgctaa atctcaaaaa ccgtgaaaag atactgaaag aagtgctgac tccgttaaag 42 0 

gaagagtatg attatatttt gatagactgc tctccttcgc tgggactgat cacaatcaat 48 0 

gccctcacgg cagccgattc ggtgattatc cccgtacaag cggaatattt tgcccttgag 540 

ggaatcagca aactgctgaa taccatcaag atcatcaaat cgaaactgaa cccggcactc 600 

gaaatagaag gttttctgct gaccatgtac gactcacgtc tgcgtcaagc caaccaaatc 660 

tatgatgaag tgaaacgcca cttccaggaa ctggtgttca aaaccgtcat ccagcgtaac 72 0 

gtaaaactga gtgaagcccc cagctacggt ctccccacca tcttatatga tgcagagtcc 780 

accggagcga aaaatcattt ggcgctggct aaagaactaa taagcagaaa cagtaaataa 840 



<210> 1005 
<211> 615 
<212> DNA 
<213> B.fragilis 



<400> 1005 
gtaggtatgt 
ctgattatta 
ggtgccggta 
atcaaattta 
aaacgattga 
ttaatcaata 
tatcttcctt 
ggttgggctc 
gtctggtatg 
aaaaaagttt 
acagggaata 



atcagtatgt 
tctggcctgt 
ctttttttct 
aaaccatgac 
cgaaggttgg 
ttctgaaagg 
tatacaataa 
aggtgaatgg 
tagaccattg 
ttgtgcgaga 
attaa 



tattaaacga 
attgcttctc 
tcaggaaaga 
agatgaacgt 
taagtttgtt 
agatatgtcc 
agaacaggct 
acgaaatgcc 
ttcttttttt 
gggtatcagt 



ttaatcgatt 
gtaactcttt 
cccggtagac 
gatgcagaag 
cgttctactt 
tttattggtc 
cgtcggcatg 
atttcgtggg 
ctggatttga 
tctgatactt 



ttgtagtcgt 
ggcttcattt 
atggtaaaat 
ggaacttact 
cgattgacga 
cccgtccgtt 
aagtccgtcc 
taaggaagtt 
agatcttttt 
cagtaacaat 



gttttttgtc 
tgccaataaa 
ctttaaggtc 
tccggatgat 
actcccacaa 
attacctcaa 
cgggataacc 
tgagttggat 
tttgactata 
ggaacctttt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

615 



<210> 1006 
<211> 1068 
<212> DNA 
<213> B.fragilis 



<400> 1006 

agtagtatgc agattttttg tacacttttt aatgttgctt atttagataa ggctatcaca 6 0 

atgtataatt ctcttgagag agtctctagt gaatttactc tatatgcttt ggccatggat 12 0 

gataggtgct atgaaatttt agttgatcta aattttagga acttgaaacc gattaagcta 180 

tcagactttg aagatgatga tttgcttaaa gtaaagtcag atagaacctt tggtgaatat 240 

tgttggactt gttcatcttc tttgatatct tacgttctgc atgaatattg tgagccacat 3 00 

tgtacgtaca ttgatgcgga tatctacttt ttttcggatc ctatagtttt gatgaacgaa 3 60 

atgcttcata agaatgcttc tgtattaata gtaggtcatc gttttaatga ctataataga 420 

gatttaatgt gtcggactgt tgggaaatac tgtgttcaat ataatacttt tttgaatgat 480 

gaaaatggta atatattgct tgaaatatgg cgtcgacaat gtataacgca ttgttcttgt 540 

gatggtgatg gtgtctattg gggggatcaa aaatatatgg ataattggac tactgactat 600 

gattttgtac atgaaactct taatgtaggt gctggaatag ctccttggaa catctctcaa 660 

tataaattgc gtttaataaa tgactcaggc tgtgttattg taagtaggaa taaagttgat 720 
tgctctacag tattttatca ttttgaaaat attaattaca taaatgataa gattgtgaaa 
attaatgtgt tcaatacatg gcatatagat aaggaactag tgaaggcttt ttatattcca 
tatttgactg aggtttatga cattaaacta atgctaaagg aaagatatgc tgtgaatatt 
ttgcttaaaa aacatcctgg tgttaaatgt gataaaagaa cttttgtcca aaagattatt 
gataggttaa gttatcttat tgataaggaa aaacaaaaac tctatataat gtcagttctt 

ccaactagac tgtataagaa aaatgatgta ataattatta ttggataa 1068 



780 
840 
900 
960 
1020 



<210> 1007 
<211> 1527 
<212> DNA 
<213> B.fragilis 



<400> 1007 

cttatagtta atgagatgag cgacaataag cgtattgcag ttaatacatt gattatttat 



60 



409 

gctcgaatgg ctgttacgac aataatcagc ctaatagcta caagatatgt cttacttgaa 12 0 
ttaggacaag ctgattatgg attatataat gttgttgggg gcatagtgac gatgctcaat 180 
gtggtaagta taggaatgta tatgaccacg cagcgtttta ttaatgtaga aatgggtaaa 240 
ggacctaatg gaaatttaaa taaagtattc aatgtttgta tagttctgca tataggattt 
gctttatcta tttttatcat aggtctgact gttggtttat ggtatattta taacattttg 
aacgtattgc cagaaaaact ttccgatgca gttttgatat attttatatc tactacagtt 
tctgctatcg gtattattaa tattccattt caaggattga tgttagcatt tgagaaattt 
aaaaagatgg caataattga tttgctatct aatttcatga aagtgccttt agttatttta 
cttatgtgtt ggtctggtaa taaacttctt ttttatgcga ttggtgtttg ttttatttct 
cttttctctt ttcttttcta ttatagttat tgctatcgaa agtttgggga tattgtgaaa 
tggcatctgt cacgtgaaaa atatatttat aaggaaattt tagttttcaa taattacact 72 0 
tcgattggaa ctattgcata cctttctcgt actcaaggtg cttctgtggt tataaattac 780 
ttttttggaa ttattgtgaa tggagctttt gctatagtat tccaaatcga aaatttcatt 
atgatgtttg ttaataatct tgggactgct tcagatccac aaataactca atcctatgcc 
tctggtaatt atagagatgc attttctctt gttgagaaaa tttctaaata tagtatgttc 
ataatgcttc ttgtaacatt ttcaattggt gttgagctgg aatttctttt aagattatgg 
ctcggtacat tgccggaggg tattttagta ctttctcgct ggatgttagt aagtctttta 
gtgcggagta taaatagctc atgtggctct attattcaag cctctggtca tgtgaaatgg 
tttcaaataa taagttctgt attattattg cttggattac caatatcttg gcttttatat 
aaatggggga tgccccccgt aactattata attactttta cagtaaccga ttttattagc 1260 
agaatgatat atttatggtt aatgcatcga attatcaaat ttgatgtttt gcatttttca 132 0 
aaaaaagttt ttttacccgt aattaaggtt ctgtgtctat caggcttata tctgtatctc 1380 
tacaattcca ttatgctaca aactgatttt atgcgtgtta tggggattgg cgtgtcatgt 1440 
atgttttatg tgtgtctatg tctattcgtt gggatgaatc gtttggaacg gaatagtatt 1500 
tttttttata ttaagaataa aatatga 1527 



aaccgaaaaa ataaatga 

<210> 1009 
<211> 765 
<212> DNA 
<213> B.fragilis 



300 
360 
420 
480 
540 
600 
660 



840 

900 

960 

1020 

1080 

1140 

1200 



60 



<210> 1008 
<211> 1038 
<212> DNA 
<213> B.fragilis 

<400> 1008 

catttaacta aaacacgacc tatggcaatc agtctcaaag acaatctgac ttcttcctat 
ttcaatgctg ctcataagtt atactctaaa aaggcgcgcc gccggattgt agcttatgtt 12 0 
gagagttatg acgatgtagc tttctggcgt acactgcttg aggagtttga ggacgaagaa 180 
cattattttc aggtgatgct tccttcggct acatctttgg ctaaaggcaa gaaaatggta 240 
ctgatgaata cccttaatac ggccgagtta ggcaaaagtc tgattgcctg tgtggatagc 300 
gattatgact ttttgttgca aggagctact gctacttcac gtaaaattaa tcgtaataga 
tatatttttc agacctatgc ttatgctatt gaaaactatc attgttatgc cgatagcttg 
catgaggtct gtgtgcaagc cactttgaac gacagacacc tgattgactt caatgagttt 
atgaaacgat actctcagat tgcttatccg cttttcctgt ggtctgtctg gttttatcgt 
cgtcatgata cttatacgtt tactatgagt gaatttaatg cctgtgttcg tttgcacgat 
gtcagcttga ggcatccgga acgttctttg gaggcagtga ggcgttcggt aacgtctaaa 
ctttctgagt tatccacgcg ttttccacaa ggtatcgaag aggtcgacaa gttatcggtc 
gaattaaaag gacttggagt gcttcctgat acaacatatc tgtttattca ggggcatcac 
atcatggaca atgtcgtgat gaaagtattg actcccgttt gtacagccct gcgacgcgaa 
cgggaacaag aaatcaaaaa attggcggaa catgacgaac aatttcataa tgaactgact 
tgttatcaaa acagtcaggt caatgtggag gtaatgcttc gcaaaaatag tgcttacaaa 
gatttatacc tttatcaatg gttgaaagaa gacataaaag agtttttata tggaacagat 



360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1038 



<400> 1009 

tcacaaacta aggcgcaaca ttttatgaaa aaccgacatt acttaaggca catactggcc 
attactgcgt tattattcaa tggagaagcc atttactcgc aaacttatcc aatagaaaac 



60 
120 



410 



tacttaaaag cagcaggaga ctatgttact atttacaatg gtgaaatcga attaacatac 180 

agtcttgccc aatacgacaa tcttccctat tttcaaggtg atgaatttac cacaggagag 240 

attatcttca aaggaaaccg atacccggga ctggatcttc atttggattt acacaaagac 300 

caactttgtg cactgactcc tgacagccat tacagcatga ttatcaataa tgaaggaatc 3 60 

gaacaagtca acctgcacaa tactacattt atctatttcc gtccgacaaa gaagacagat 42 0 

ctcaataagg gattttacga attactacaa gacggaaagc gactgagact gctggcacga 480 

aaaacatact ccgttgctca gatcaacgta gaaaaaatag ccaaaacccg caaacatcaa 540 

actgaatact tcatatacgg agtaaaatat tatctggaat acaatgggat atattatccc 600 

gtcagtaaca acaagtcgtt tgccaagatc tttccggagc aacataaact gataaaacgt 660 

tacgcacgaa aacataaact taattttcgc catgacgctg atgcctcatt aatcgctctt 72 0 

actaactttt gtgaagaatt gatagaccaa aaacaaacac gatga 7 65 



60 



240 
300 



<210> 1010 
<211> 360 
<212> DNA 
<213> B. fragilis 

<400> 1010 

aatcattata atatgaagaa gttgaaaagc ccggcgtcac aaagtgaagc catgaaactg 

agatggaaaa aacggatcgt cttcgagaaa ggatacaccg agtcgtgcgc cgaatggatg 12 0 

gcagaacgac ttgaagcact cctggaccat atgcaatatg gacatgcgac ggtagcttac 180 

cgaaaacaaa acgggagttt ccaactggta aaagcaacat tgatttacta cgaagcggag 

ttccgtaaga agtatgatcc cacaaaaata gaaggcgcag tagtctactg gaatgtggac 

gaacagagat ggacgacgtt tcaagtggag aattttatgg agtggagacc ggtggtatag 3 60 

<210> 1011 

<211> 1002 

<212> DNA 

<213> B. fragilis 

<400> 1011 

atgaagaata aagaattagg gatgaagaaa ctaataaaaa aggcgctgaa acttatctta 
ccattggttt tgggaggctt tatcttatat tgggtctatc gtgacttcga ttttgtgaag 120 
gctatggaag ttttgcaaca tggcacaaac tggtggtgga tggctttctc gcttcttttc 
ggcatatttg cacaggtatt tcgtggttgg cgttggcgcc agacgctgga gcctttggga 
gcatttcccc gaagaaggga ttgtgttgat gccattttca tttcgtatgc agctagtttg 
gttgtaccga gggtaggtga ggtgagtcgt tgcggggtac ttgctaagta tgacaacgtt 3 60 
tcttttgcta aatctttagg gactgtggtt accgaacgtc tggtagatac tgtgactatt 420 
cttttgatta ccggtgttac ggttctattg caaatgcctg tgtttgttac cttccttgag 
caaaccggaa cgaaaatccc ctcattcatg catttactta cttctgtctg gttttacatt 
gttttatttt gtacaatcgg agttattgta cttctctact atctgattcg tacgctttct 
ttctttgaga aagtgaaagg agttgtgctt aatgtttgtg aaggaattat gtcactgcgt 
aatgtgaaga atcttccgct ttttctactc tatagttttt tgatatggct tagctatttc 720 
ctgcattttt atttcacttt ttattgtttt gcttttacgg cacatttggg cttacttgct 
gcattggtta tgtttgttgg aggtaccttt gctgtaattg tgcctactcc gaatggagcc 
ggtccatggc attttgccgt tattaccatg atgatgcttt acggggtaaa tgcgacggat 
gcagggattt ttgcactaat tgttcatggc atccagactc tgctggttat tttattgggt 
gtttatggat tggtgactat ttctttttta caccggaagt ga 



60 



180 
240 
300 



480 
540 
600 
660 



780 
840 
900 
960 
1002 



<210> 1012 
<211> 1335 
<212> DNA 
<213> B. fragilis 

<400> 1012 

attctatgta agatggaaaa atgtaaaaaa gtttatgttg gaatgagtgc ggatatgatt 

catccaggtc atttaaatat aataagagag gcttccaaac taggttctgt aactgttgga 12 0 

gttctaacag atgcagctat tgcaagttat aagcgccttc cttatttaga ttatgaacag 180 

cgtgcggaga tagttaaaag tatcaaaggt gtagattctg ttatacctca agagacttta 24 0 



60 



411 



gactatgttc caaatcttga aaaactacgt ccggattatg ttgttcatgg tgacgattgg 300 
attgatggtg ttcaatctaa tactcgtaaa cgtgtaatta agtgtttgtt agagtggggg 3 60 
gggaaagtgg ttgatattgc atatactaaa ggtttttctt ctactgcaat gaatgagagg 42 0 
ataaaagaaa taggtacaac tccggaaatc aggcagaaaa gacttcgcag gctaataaac 480 
gcaaaaccta ttgttcgtat tcttgaatca cataacgggt tgactggact cattgcagaa 540 

600 
660 
720 
780 
840 
900 



aatgcttcgg taataattaa cggagtgaag catgaatttg atggtatgtg gtcctcttct 
ctaacagact caactagcaa aggaaaaccg gatatagaag ctgttgattt aacaactcgt 
ttacatgatt taaatgatac tttagaatgt acaacaaaac cagtaatttt tgatggtgac 
acaggtggaa aggttgagca ttttgtattt acggttagaa cgcttgagag gcttggtatt 
tccgccatta tcattgagga taagatcggt ttaaaacaaa actctctatt tggtactgat 
gctgttcaaa cacaagattc gatagaaagt ttctgccata aaattcgttc gggaaaaaat 
gcacaagtaa cagactcttt tatgattatt gctcgtatcg aaagtcttat tgctggtaaa 960 
tcaatggagg atgctttgga aagggctgct gcctatgtta aggcaggggc tgatggagtt 102 0 
atgattcata gtaaagacaa gtctgggatg gacataaaga atttttgtac atgtttcaga 1080 
aaaatcgact cgacgacacc aatagttgct gtaccaacca cttataatca gtttactgaa 1140 
tcagaattgg cttcatgggg tataaatgtt gttatttatg ccaatcacat gcttagaagt 12 00 
gcttatcctg caatgctgga ttgtgcaaaa tcaattttga ctcatgaacg ttcgttagaa 12 60 
gcatccaatg attattgtat gccaatcaaa gaaattttag aattaatacc tggtacaaaa 1320 
gcattaaaaa gatga 133 5 

<210> 1013 
<211> 1152 
<212> DNA 
<213> B.fragilis 

<400> 1013 

actgaaatcc tgggaattat gaaatattat ctgattgttg gcgaggcttc gggcgatttg 60 
catgcttccc acttgatggc tgcactgaaa gaggaagacc cggaagctga atttcgcttc 120 
tttggcggtg atttgatggc tgccgtggga ggaacaatgg tgaagcatta taaagagttg 
gcctacatgg ggtttatccc tgtgctgcta catttgacga ccatttttgc caacatgaag 
agatgcaagg aggacatcgt ggcgtggtcg cccgatgtgg tcattctggt ggattatccg 
ggctttaatc tcgatattgc taagtttgtg catgcgaaaa caaagatacc ggtttattat 3 60 
tatatctctc ccaagatttg ggcatggaaa gagtatcgga tcaagaatat aaaaagagat 
gtggacgagc ttttttccat acttcctttt gaggtaggat ttttcaaggg acatcgatat 
cccattcatt atgtgggaaa tccgacggta gatgaggtga ccgccttcaa ggcgtcgcat 
caggagtcct ttgccgattt tattgccgat agtgaattgg cagataaacc tatcatagct 
ttgcttgcag gtagcagaaa acaggagatt aaggataatc tgcccgatat gatccgggct 
gcttcagctt ttcccggtta tcagcttgtg ctggcagctg ctccgggcat ttctccggaa 720 
tactatgcca aatttgtaaa aggaacggaa ctggcggtga tttttgaccg gacttatcgt 
ttgctccaac aggcggatgt tgccttggtt acttccggta cggctactct cgagacagct 
cttttccgtg ttcctcaggt ggtttgttat catactccgg tgggcaaatt ggtgtctttt 
ctccgaaggc atattttgaa ggtgaagttt atctcgttgg tcaatctgat tgccggacgt 960 
gaagttgtca gggagttggt ggccgatacg atgacggtag agaatatgcg ggccgaattg 102 0 
gagtgtttgc tgtttcggga ggattatcgt cgcaaaatgt tggacggtta cgaagagatg 10 80 
gcacggttac tcggaccggc cggagccccc cggcatgcag ctcgtgaaat ggtgaaattg 1140 
cttaaaaaat ag 1152 

<210> 1014 
<211> 855 
<212> DNA 
<213> B.fragilis 

<400> 1014 

aaacgtaaca acaccttgaa gaacaacttt ttacaacgcg ccataacagg aatattattc 60 

gtagccatca tagtgggttg tatactttat gatccactgg ctttcggcac tctttttgtc 120 

attgtcagcg ctctgactat acgcgaattc ggacatctcg tcaaccaatc gggagaggta 180 

agcatcaacc ggactatcac catgttggga ggagcttatc tgtttctggc cattatgggt 

ttctgtatcg acgctgccgg ttctaaaata tttattcctt acctgatatt aatcatttat 



180 
240 
300 



420 
480 
540 
600 
660 



780 
840 
900 



240 
300 



ctgatggtaa gcgagttata tctcaaaaag aagaatccgg ttttaaactg ggcttactcc 3 60 



412 



780 
840 



120 
180 



atgctgagcc agatgtacat cgcgcttccc tttgccatgc tgaatgtgct tgctttccag 420 

aatgatccgg aagcaagcag cgtatcatac aatccgatat tgcctctatc catctttgtc 480 

tttttatggc tgaatgatac gggggcatat tgtttcggat cactttttgg caaacaccgc 540 

ctgtttgaac gcatatcacc taaaaaatca tgggaaggtt ccattggcgg aggtattgta 600 

gccattgcct cttcatttgt ttttgcctgc tacttcccca tcatgacatg ggcagaatgg 660 

gcgggactgg cattggtagt tgtcattttc gggacttggg gtgacctgac agagtctctg 72 0 

ctgaaacgcc aattgcagat taaagattca ggaagtattc tacccggaca tggaggtatg 

ctcgatcgct tcgacagttc actaatggct ataccggcag gcgttattta cctatatgca 

ctgacattgg tctaa 855 

<210> 1015 
<2H> 945 
<212> DNA 
<213> B.fragilis 

<400> 1015 

aagaaattaa aaaagtgtgt tatatttgtg gctaataata ctatattagc tatgaatgca 60 
ctacaaagca acattattcg ggagatcact ccgctgtccg ataaggattg tttctacatt 
gccgaacggt ataaaacgga gtttacttat cccattcaca atcatgccga atttgagctg 

aactttacgg agaaagcagc cggtgtgcga cggatcgtcg gtgattcggc agaagtgatc 240 

agtgattatg atttggttct gattaccgga aaggatttgg aacatgtatg ggagcagcac 3 00 

gattgccatt cgaaagagat ccgtgaaata acgattcagt tctcttccga tcttttcttc 3 60 

aaaagtttta tcaataagaa tcagttcgat tctattcgtg atatgcttga gaaagctcag 42 0 

aaaggtcttt gttttccgat gtccgccatc ctgaaaattt atccccttct cgatacgctg 480 

gcatccgaga aacaagggtt ttatgctgtc atcaagttct tgaccatact ttatgaactg 540 

tcacttttca atgaagaggc ccgtacgttg tcaagttctt ccttcgcgaa aatcggcatt 600 

cattcggata gccgccgtgt gcagaaagtg caggaatata ttaatgccca ttatcaagaa 660 

gagatccgcc tgaatcagct ggccgatatg gtaggaatga ctccggtatc tttcagtcgc 720 

ttctttaaat tgcgtaccgg taagaatctt tcggactata tcattgacat tcgtttgggg 780 

tttgctgccc gcctgctggt tgattctact atgtctattg ctgaaatctg ttatgaatgc 840 

gggtttaata atctttctaa tttcaatcgg atcttcaaga aaaagaaaga atgttcgccc 900 

aaagagtttc gtgaaaacta caggaagaaa aagaaactgg tataa 945 

<210> 1016 
<211> 324 
<212> DNA 
<213> B.fragilis 

<400> 1016 

cttaaacaaa aggaggcaat tatgaaacgg attttcacac tatatctctt tatcttattc 60 
tgtctgattt tgcaagcaca agaagaatta tatgaacggg tatacgtaca tacggataaa 12 0 
acgtgttatt tggccggtga agaagtatgg ctcaaatttt atactattga cacacatttt 
cgcccatctt ctttcagcaa agtgggatac atagaaatat caaatactga acggcctaaa 
gcacagctta aactggcact tgacaatggg agcggttcgg gcaaagtaaa gattcctaca 3 00 
gacgctcctt cgggaatttt atga 324 

<210> 1017 
<211> 867 
<212> DNA 
<213> B.fragilis 

<400> 1017 

atggcaaaaa aaactaaaat atacccgtta tacattgccc tgcttctctg tttttttcag 60 

gtagcaggga ttgatgtgta tgcacaggag cctgtcaaag tatcccaaga ctccatttct 12 0 

ccggtacgcg aagcccccaa agcacgggca cgccgccatc gcgagccggt cgtttctact 180 

ccggccaccg acagtgtgaa agtggagaaa gcagtcgtcc tcccaccgat agacagtttg 240 

gagaacctga aacccgccat cgttacggca gacagcctgg aggaagtcaa ccgacagaac 300 

ctggaaagga tagaaacacc cgtcatgcca tcggtcgtaa aggcagatag cctgccaccc 3 60 

gtcatgccca agaagctttt cgtacctaat ccgacgaaag ccacctggta tgccatcgta 42 0 



180 
240 



413 



tttccgggcg 
ggatttgccg 
caggcgtata 
cctccgaatc 
acataccggc 
atcatcgacg 
atgagggtgg 
ggcgtgcaat 



gaggacaaat 
gatgtgctta 
tggatattat 
ataactatac 
gctatcggga 
cctatgtgga 
aacctactat 
gcagcctcag 



ttataaccgc 
cgcattgagc 
ggataacaat 
cgatacgcaa 
tctcagtata 
tgcggaacta 
tataaataac 
attttaa 



aaatactgga 
tggaacggga 
cctaatacca 
ctgaaagacc 
ttcgccgtca 
tcgaatttcg 
aacccgttgc 



<210> 1018 
<211> 1206 
<212> DNA 
<213> B.fragilis 



agttacctat 
aaatgtataa 
acagttttca 
tgctccgcaa 
tcggtgtata 
acatatcacc 
aacccggcag 



tatatatggt 
agactatgct 
ggatttgctt 
acgaaaggac 
tctgatttcc 
cgacctgagt 
caagtcggta 



480 
540 
600 
660 
720 
780 
840 
867 



<400> 1018 
ttatatatga 
agaaaacgta 
tctggttttg 
agcttcccat 
atgaagggag 
caatctgtga 
ataattgtgt 
tattccaata 
tcatccattc 
caggaaattg 
attcgtcctt 
aagaggagga 
ctgattgaat 
gggacagatc 
ggacttgtag 
cctcgtaaag 
atggcttcgg 
aat catttga 
tgggggaata 
ttggataata 
acatga 



agttacttta 
ccaaaggaca 
aaaatttaga 
tacgctgtaa 
ttaatggctc 
taaatgaagc 
actctctcat 
tgaaagtgtg 
tccatagagt 
attcctttat 
ggtatctttt 
aaacgatact 
catttatcaa 
gagcttttgt 
atcaaaagcg 
gagatgcaga 
gaacacctac 
ttcttatacc 
agggacagga 
aaaattcaga 



tattggtgct 
tataacagtt 
aaaaaaatta 
taatccattc 
atttcttaat 
taaacgatgg 
gtatccatac 
ttgtattgtc 
gttagaagca 
tttactgaca 
agagggtatt 
ctatactggt 
gattgatgat 
tgagactgca 
tgtatttgag 
atacactaag 
cataatgtat 
ggaccattca 
tgaattggat 
aaatcaggca 



ttttgtgagc 
agtgctacga 
gactatatca 
ttttcaagaa 
attacttact 
cttaatttac 
cttaaggcag 
ttggatttgc 
agaaacacta 
gagtttatga 
tatagtcctg 
aagttagatg 
agagagtttt 
gccaagaacg 
atgcagcaac 
tattcttttc 
aaattaccag 
cgagaaacac 
gactttggta 
aggcgattgt 



cttcaacaga 
cttttcagaa 
taaatattcc 
cgaacttcca 
tgaagaagta 
atagggatga 
ctattgattt 
ctgagtattt 
ataagattta 
aggataaatt 
tagaagttgc 
ctcgatttgg 
ctttgtggat 
attgtcgtat 
aagccacgtt 
cttctaagac 
gacttcctgc 
ttactacatt 
agcatgcaag 
tggaatttct 



tttcttaatt 
agctcttctg 
cgatatcgga 
atttgctttt 
tagtatctat 
ggaagtaact 
gaaaaaacat 
tggagataat 
ttctttagtt 
gagagttggt 
tctacaaaaa 
catacgtgat 
atgtggtttt 
aacatattgg 
gttgattaat 
aatggagtat 
taactactta 
gttaaaagag 
acaatttata 
ggtgaataag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1206 



<210> 1019 
<211> 1029 
<212> DNA 
<213> B.fragilis 



<400> 1019 
gacatgaaga 
acacaaaagt 
ttatccaata 
ttttatcgat 
tttactggag 
caagtgctat 
tctgatttaa 
catgtggttg 
gttgtctgga 
atctttgctg 
aagaaaggtg 
tttgttgaat 
aaatattatt 
tgtgccaaca 
gatcaacttc 
aaattggctt 



aaaaaaaaat 
tctatcagat 
ggatagttga 
attcattttt 
gtattgataa 
tttttaagct 
agaatatagc 
atactgcaca 
tgggtgagga 
aattgaagaa 
agggaactgc 
tagtgggtga 
ttcagctctc 
atattctgat 
tttttgatat 
cttttacttc 



cttgttttat 
agatattgct 
cagtcttaag 
tgcttcattt 
tttagatgaa 
ctgttattgg 
aaaagtattt 
gtttatgtct 
agggaatgtg 
gataacttca 
cttggttcaa 
agtctcagaa 
attgtatgaa 
tcattcgggt 
agataatgat 
attgattcca 



tcgagtgtta 
ctgctgagga 
ttttgggaat 
tttgctaaat 
aattatgcct 
gtctcaagtt 
cattttagaa 
gatgttccga 
cgacgaaaag 
ttttctgatt 
tcattgatta 
gaagaaaaaa 
ggttttggct 
agggggggct 
tttgataaag 
aatgatgagg 



gatcactcga 
atctaggtta 
atgatatttt 
gtttaggtaa 
ctacccgaaa 
cttgtattat 
gaagattgag 
aagagaagct 
gggtggataa 
accgatttat 
atgaatatga 
ttgctttctt 
tagcagcttt 
tggctaatcc 
agttttcgat 
tcttaagtta 



attgttgaat 
tgatgtctgt 
gttctcttat 
aagaacctat 
ttataagatt 
tgtgtctcaa 
ttatagtgaa 
atttactacg 
ggcattgcga 
tattatgggt 
tttagaaaaa 
aaaacgttct 
agaggctttg 
tatctacaca 
tttaactaaa 
ttatgatatc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 



414 



cagagaagaa aagaagattt taaggctatt attacagaca acaataggaa ttataaatta 1020 

aataattaa 1029 

<210> 1020 
<211> 375 
<212> DNA 
<213> B. fragilis 

<400> 1020 

ttgttaaacc ttaaatctat ttttatgaaa ataaaaaaat tatttactat tctaacagtg 60 

ctttgtttct ctgctttagc aggagttgtg tttcttaaat tcgtagataa taaaaagaac 12 0 

gccagagata ttgcattggc aaatgttgaa tctcttgcta atgctgaagg cgatggtata 180 

gggaatgagg ataatccatc aactacgata aaaaagtgtg taagctccag tgaatttaca 240 

aaagatgatt ctacgggaga gtattatttg gtgtgtaatt caggaactac cgaaagcgtt 3 00 

atttaccgtt gtccatctgg tactaccgaa ggtcacaaag attggatata tagttttctt 360 

tattgtacga gatag 375 

<210> 1021 
<211> 981 
<212> DNA 
<213> B. fragilis 

<400> 1021 

acaaatatag aaaattcaaa cgataaattg gttacttttg catcagtttt caataaaggt 60 

aacattttcg gtatggagat tgataagaat ctgaaaggac acgcattggc atttacagcc 12 0 

aacatgatgt gggggctgat gtcccccatc ggtaaatcgg cattggcaga gttctcagcg 180 

ctttcggtaa ccaccttccg catggtaggt gccgcagcag ctttctggat actttccgct 240 

ttctgcaaac aagagcaggt aggacaccgt gacatggtga agattttttt tgcttctctg 300 

tttgcccttg ttttcaatca ggggatattt atattcggat tgtctctcac ttctccgata 360 

gacgcatcta tcgtaacaac aacttcacct atcatcacta tgattgtagc ggccatctat 42 0 

ctgaaagaac cggttaccaa caaaaaggtt ttgggcatct ttatcggagc aatgggagcg 48 0 

ttaatcttaa ttctaagcag tcaggcagta agtgcaggag gaggaagcat ttggggagat 540 

ttactttgca tgattgcgca acttagcttc tccatctatc taaccgtatt caaagggtta 600 

tcccaacgct attcggccat cacgattaat aagtggatgt ttatctatgc atccatctgt 660 

tatattcctt tctcatacca ggatatagca agcattaagt gggacagcat ttcgacagcc 720 

gccatctatc aagtacttta tgtggtacta tgtggaagtt tcattgctta catctgcatc 780 
atgaccgcgc aaaaactaat gcgccctaca gtagtaagca tgtacaatta tgtacagcct 
attgttgctt ctattgctgc tattttaatg ggaatcggaa gcttcggctg ggaaaaagga 

gttgcgatcg cattggtatt tcttggagtc tactttgtga ctcaaagtaa atcgaaagca 960 

gatttggaag gtgtgtcata a 981 

<210> 1022 
<211> 756 
<212> DNA 
<213> B. fragilis 

<400> 1022 

ctaaaaaaag agatgaaaaa aataaaacta ctttggatgg caatgttgac actgatgctg 60 

ccggcattgc aatcgtgtga cgataatgat ggctattcat tgggagatat agcggtagat 12 0 

tgggctacgg tacgtgtggt cggtggcgac acttattcgc tgaatgctga ccgttgggga 180 

actctttggc cggctgcaac tgctattcca ttttataagc cgatagacgg gcaacgggtg 240 

attacttact tcaacccact ttacgataac tatgaaggat atgatcatgc tgtgaaggta 3 00 

gagcataatt ataatgtcct gaccaaacag gtagaagatt tgacggctga gaatgaatcg 3 60 

gaatttggga atgatccggt ttgggttaac aaggatatga tgtggattgg cgggggatac 42 0 

ctgaatgtca ttttccgtca gaatttaccg gttaaggaga agcatcttgt cagtctggtt 480 

cgtgataagt gggctacagc tgctgaggga gaggatgatg gatacatcca tttggaattt 540 

cgctataata catacgatga tgtgaccgct cgccaggcga atggtgccgt atctttcaac 600 

ttgaattcat tggatctgac cggtaagaaa ggcattaagg tgaaattgaa ttccgtaaag 660 

gacggggaaa cggaagtggt ctttaactta aagggccagt caatgccgga ggaagcaaag 72 0 



840 
900 



415 



caggtgacgc tttcggatga agtgcaaata aaataa 



756 



<210> 1023 
<211> 903 
<212> DNA 
<213> B. fragilis 



<400> 1023 
ataaacgaag 
ctctccatgg 
aaaatctccg 
ttggccgatt 
gacgacgagt 
ctggatacta 
ctgatagaga 
cacctgatag 
cgtacgacca 
ctgcaaaaca 
aaactacaag 
gaagagatcg 
accccgaaac 
ttctttaaca 
cccttcagca 
taa 



atatggcaac 
aagaggtgaa 
tcaaccccaa 
cgatcagaga 
accagattat 
ttcccgcata 
atatccagcg 
agcaatatga 
ttgccaacta 
agcagataga 
ttaaaatatt 
tgaaatcgct 
gggccaaact 
ccaaggtaca 
atgaagaaga 



acagagaaga 
aaccgaaggt 
ccaaccacgc 
aataggaatt 
tgccggagaa 
tatccggaca 
tgaggatctt 
cctgacacag 
tctgcgcctg 
catgggacac 
cgaagagatt 
gagcgaggga 
gcccgaagaa 
actgacctgt 
gttggaacgt 



aatgcattag 
tcttcgtcta 
cgtgagtttg 
atccaaccca 
cgccgctacc 
gccgatgatg 
aattcagtgg 
gaacggctga 
ctgaaactgc 
gcccgggcac 
ctggaacacg 
gaagccgtaa 
ttcaatatgc 
tctgaaaaag 
atcatggaaa 



gccgcgggct 
ttaacgaaat 
atgaaacggc 
tcaccttacg 
gcgcctctca 
aaaacgtgat 
aaatcgcact 
gtgaacgtgt 
cggcaccgat 
tgatcacatt 
gatactccgt 
agagcggaac 
tgaaacagca 
gaaaaggaaa 
tcttcgattc 



ggacgcccta 
agaactgtcg 
actggaggag 
taaagtctcg 
gaaagccgga 
ggaaatggcg 
ggcctaccag 
cggcaagaag 
acaaatggcc 
aggcgatccg 
tcgcaaagta 
caaaaagata 
cctttcggga 
aataagcatc 
actgaagaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

903 



<210> 1024 
<211> 810 
<212> DNA 
<213> B. fragilis 



<400> 1024 
accatatact 
ataacaacaa 
ttaaccatgc 
ttt tttcaaa 
agaaaattca 
cccaactaca 
gtctcagcat 
ttttcaaatg 
tttatacttg 
gtatttgctt 
gttacaggaa 
cataaagagc 
tcatttttac 
tcctatcttt 



gccatacaat 
acacatgtct 
aaatagcctc 
atgatgaacc 
gaaaagaaga 
aatatgatgt 
catctgtaga 
gagatgctca 
ctccggcata 
tacaacgtgg 
aagaattcat 
taataagctt 
cgaatcaatt 
cggtaggcat 



ggcatggaca 
atatccggag 
gcaagtttac 
tgcctatatc 
ttttgcttct 
attaagttat 
ggttctaaaa 
tgctaaaaat 
tgatttgcta 
cttgtttaaa 
agaatttggt 
ttgccaaaag 
aaagaaacaa 
tctcacataa 



ataatgaaac 
ctcaacaaaa 
aatattccta 
acacgacgtt 
ttagccggaa 
gaagagatgg 
ttctttcgat 
ttctctttgt 
aataccagac 
gagaatacat 
atccgcatcg 
gcagaacaag 
tatttactcc 



ggaacaaaaa 
taagtccagc 
cagcagccaa 
ttgacattgc 
tatcaaaggg 
cggatattat 
tagttatttt 
tagaaactcc 
tgcatatttt 
taaacggaaa 
gcattccccc 
tacaagattt 
attaccaaat 



gaatttcagg 
caatgaacac 
tgggctctgt 
acccaatgga 
taataaaggt 
caaacagtac 
caacttcctt 
ttctggagat 
tgatgatcat 
tgacggtgcg 
caaaagagtt 
agtcgaaaaa 
gagaaaagat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

810 



<210> 1025 
<211> 1443 
<212> DNA 
<213> B. fragilis 



<400> 1025 
caatgggagc 
atttggacgg 
cccattgctg 
tctgccgaga 
tcccggtcga 
gaagtatccg 
cttgaagaat 



ggttcgggca 
aatccacccg 
taattaacac 
tatatccgaa 
actacaacac 
atcttactgt 
ccacctggag 



aagtaaagat 
atatatgagg 
ttcccgagta 
agggaaaccg 
acgccaactc 
ctcggtaagc 
caaacaagtg 



tcctacagac 
aatgagggag 
tcagactccg 
ggtaccaccg 
gtagaattga 
cgtaatgatt 
accgctactc 



gctccttcgg 
aaaaagtgtt 
atcctattga 
aaaatattca 
ccattaatag 
cactggtaac 
ccggaacatt 



gaattttatg 
tttcagaaat 
actagcggat 
tataaaaact 
actaccggat 
cctcccgcct 
ttccggaaaa 



60 

120 

180 

240 

300 

360 

420 



416 



tggataccgg 
acattgaagc 
atacgatatg 
gtatatggaa 
atgaacatct 
tatcggaata 
gtgctcgatt 
ctgaattaca 
ttcgtacgca 
gaaggagaaa 
catgaccatg 
aacggccgtt 
agaggagact 
ccgcaactgc 
cgtagaccgg 
atagatacca 
gaaggcttca 
tga 



aatatgaagg 
aagtacaaaa 
tgcaaggaca 
cgaacgatgt 
tatcaccctt 
aaaaacgact 
cattggatca 
atttagatga 
gtgtgataat 
aaagattcaa 
aggatatttt 
atggatttgg 
taccttccat 
ctgttacatt 
atttccgcca 
cactttcatt 
cattaaaagg 



gcatatcatc 
tgaacctata 
agtggaatcg 
agtagccgct 
cagtgaaaag 
gttggaaagg 
tgccattcct 
atacacacga 
acgtaaagtt 
tatcggaaat 
aaaatataat 
tggtgaagta 
acagcttagt 
taagatgcct 
tacgctatac 
ctatacttct 
agagctgatc 



tgcggacaga 
tcagccgaca 
ggaggaaata 
gcatggaaca 
ttaccccaaa 
agcataggta 
ctgcaatctt 
ttcaatacaa 
aatggaaaac 
acattggtac 
cccagattgg 
ttcgaatgca 
gacgattcac 
gaatataaag 
tggaatccct 
gaccttgaag 
agaggcgagg 



tagaaagccc 
tcgctttcgt 
cactgttcta 
ttaacgggga 
acttgccatc 
tacagctgca 
gttatggact 
tgacagaaac 
gtagattaag 
tgctggacgg 
tgaagaaaat 
tgatttcact 
gactaacggt 
acgctactga 
ctgtagagac 
gagaatttaa 
taaacttcca 



tacaggagaa 
aggaaaagat 
cacaagccat 
accgttcaga 
attaaaactc 
gcaagtcaca 
tcaaccttat 
tttcgtggaa 
agtactcaaa 
tgttcccatt 
cgaaatatat 
tactacccaa 
ttacgagtgt 
caaaaaatca 
agaagcagga 
ggtggtagtg 
tgtgaaaaaa 



480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1443 



<210> 1026 
<211> 951 
<212> DNA 
<213> B.fragilis 



<400> 1026 
ccctgtcata 
ttgaataatt 
gaaacatttg 
tatggaacgc 
gatgtttata 
gatcattatg 
ttcgtagata 
tacatagatg 
agatatgata 
attttgcatt 
ttctataaaa 
ataaaaaagt 
gtattggaca 
gaagtgccaa 
cctcccaagg 
ttgtctttta 



gttacaaaag 
atttgtcaat 
cagcttttga 
ttatcggtgc 
tgttacgtga 
atattatgga 
cgaataccac 
tatttcctct 
agtattcact 
cattgtttag 
gacataaata 
ctaaaggtaa 
aaaaattgtt 
ttggttatga 
aaaagagaat 
gtgaaattag 



acataataga 
gaatgaaggt 
tcgattctgt 
tgttcgacat 
agattatgat 
tattaatgac 
tctttgggag 
agatgagtgt 
tttggtggct 
gttgaaactg 
cgcatattac 
tttcttggtt 
ttcggagtct 
ggagtgttta 
ttctcaccat 
tagagaaatt 



acgattgaaa 
ttaacaaaag 
aaagcaaatg 
aaaggactaa 
aaattctgct 
gatggttatt 
tttaaaaatc 
aatatggagc 
caatcaatga 
aagtgtttat 
aaacagaagt 
tcttatgatg 
gtgttggttc 
aaaagtatat 
agtcgttatt 
tcattaagag 



aatgttctgg 
aatatcgggt 
gaataaaata 
ttccatggga 
ctctgaaggg 
ggttactctc 
gccctcttat 
aagtaataac 
tgagatattc 
tattacaaat 
atttaagttg 
gaccttatgg 
catttgaaag 
atggagatta 
atttgaattt 
gcaaaatttg 



gaacaataaa 
aaagttgtta 
ttatgcggct 
tgatgatata 
aaaagtcatg 
tttagcaaag 
tttgggtgtt 
tcttaaaaat 
tttaggagat 
ctcttgtgtt 
tgttgaagat 
aatgggagag 
tatgtctatt 
tatgaaacta 
aaaccatcgt 
a 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

951 



<210> 1027 
<211> 2778 
<212> DNA 
<213> B.fragilis 



<400> 1027 
agaattgata 
ctactatttg 
agtataacta 
agcactatta 
ttaaaagaag 
gtgctgaaag 
aaaggcgaaa 
tcttcagaaa 
gcggtactca 
ctgattttaa 
gaacttccta 



gaccaaaaac 
catggccgca 
ttgactcttt 
gtgctccttt 
cacttgagcc 
agcaggaact 
gttattatgg 
acaaagtgta 
aagggcaagt 
aggacccatg 
ccggacacaa 



aaacacgatg 
acaaatgacg 
attggagact 
caaggttttg 
gactccctat 
catcacgcta 
tgacgtatat 
taatgtggga 
tactaacttt 
gattgcgacg 
acaaattgat 



aaaaaaagta 
gctcaacacc 
gtagaaaaaa 
gtgaaaggaa 
aaattgtcag 
ttacctgcta 
acttacctaa 
gatgtacgaa 
aaaaccggtg 
acaacagatg 
atcaaaggac 



catacttatg 
ccctctcact 
atacacccta 
aagcttcacc 
tatccggaaa 
aactgaccgg 
gcggagaacc 
tcaagcaacc 
agcccatgat 
ttaagggaaa 
ttaacattaa 



gatgttgccc 
tcctgctgac 
taggattttc 
cctgcaacaa 
caatttattc 
agaaccggaa 
tgaaaaggca 
accacgaaaa 
aggcatcaat 
ctttactctt 
agatacccgt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 



417 



cgacaaatta tgctttacag tgatggtaca ctggacatcg aacttgaaga aactacgcat 72 0 

atgctggacg aagtaaccat tacttccgga cgtatacaaa atgtgaaaag tacgcaatta 780 
ggtgcagaaa cactgcgtcc aacccaattg aagaatatcc cgatggcctt gggagaagta 
gacattttaa agatggtaca ggctttaccc ggtgtaaaaa cagtaggtga agcttcaagc 
ggcttcaacg tacgtggtgg agctaccgac cagaatctaa ttctgctaaa tgacggaacc 
atctataacc cgaaccattt attcggcttt tttgcagcct tcaattcaga tatggtaaaa 
gaagccgaga tatataaaag cagcatcccg gcacaatatg gtggacgtat ttcatccatt 

ctagatatca ccggtaaaga agccaataaa gaaaaattca ccggttcggc cggtatcggt 1140 

ctggtaacga gtaaactgaa tctggagatt ccgattatca aagacagaac gtctgtatta 1200 

ctaagtgggc gtactacgta ttctgactgg atcatgaagc agcttccgga gaaaagcgaa 12 6 0 

tacaaaaacg gtaccgccgg cttttatgat ttggctgcta ttgtggcaca taaattcaat 1320 

gacaaacata gtcttaatgt ctacggatac tatagtcatg accgtttcgc tttcaattca 1380 

aacgaaaaat acggctacaa taatctcaat gcttccgcac gatggagagc tgtatttaac 1440 

gaaaaactga taggatactt ctccgccgga tacgatcatt acgattacaa taaccgcgag 1500 

accgtcaatg catcaactgc ttataaactt tcatttgata ttaatcagta ttttgtcaaa 1560 

gcagacttca caaacatact ggccgataag cacacgctca actttggttt caagtccatg 1620 

ctctatcata tcaattcggg tacttatgaa cctgaaggaa gtgaatcatt tgtaaaaaag 1680 

gacgttttac aaaaggataa agccttggaa acggcatttt atttaggtga tgaatgggaa 1740 

atcactccca aactatcggt caacgcaggt atccgttact cactgttcag tgcactcggt 1800 

ccacgttcgt actatcaata tgcatcaggc atgctcccac acgaatcgac cataacggac 1860 

accatcactg caggagcagg aaaattcatg aagacttatc atgggccgga attccggtta 1920 

tccgcccggt atgccttcac agataatttc tcggtcaaag ccggatttaa ctcgatgcgg 1980 
caatatatcc ataagttgtc gaacactgtc attatgtcgc caacggatac atggaagttg 
agcgatgtga acatcaaacc ccaaagaggc tggcaagccg cagccggact ttatctaaat 
tctccgagtg gcatctggga atattctgtg gaaggatatt acaaacgaat gtccgattac 

ctggattatc gtggaggagc aaagctactt atgaaccacc atattgaaac tgacgtcatc 2220 

aacacgcagg gacatgctta tggcgtagaa ttgcaggtaa aaaaacaagt cggtaagctc 2 2 80 
aacggatgga tgagttacac gtattcacgt accttcttga gacagaatga taaacgaatt 
gagaaaccgg tgaataacgg tgactggtat cctacagaat acgataagcc tcacgacttt 
aagtttgtag gtaattacaa atttacgcat cgatacagta tgtcaatcaa cgtggattat 
agcacaggac gccccactac catacctgcc ggacagtatt atgatgaatc aacgcaatcg 
atgcgagttt actatacgga aagaaactca taccgcatac cggattactt tcgtacagac 2580 
atctctttta atatagaacc cagccatcat ctgaccctgt tgacacatag ttctatttcg 2640 
attggggtat acaatgtaac cggaagaaag aatgtgtatt ctatttatta tatgccggaa 2700 
gaaggacaaa taaaaggata ccagatatct attttcggag ttccgattcc tttcattacg 2760 
tataacataa aattctaa 



<210> 1028 
<211> 1017 
<212> DNA 
<213> B. fragilis 



840 

900 

960 

1020 

1080 



2040 
2100 
2160 



2340 
2400 
2460 
2520 



<400> 1028 

cagtggtgca gaaatattct ttcagatttg tttaggcgat atgcttgttt cgtaagtgag 
tttagtgact ttcaggaaaa tcctatatta gaagaagtta ataatggttg caattttttc 
gagaagtcaa aatcggacat tataatagca tgtgggggcg gaagtgtact tgacatggct 
aaattaattc gttttaaagc tgcttatgat ggcgatttgg ttgattctgt ttttgaaaag 
aaaaaggaat taactcctct tattgcatta ccaaccacag ctgggactgg atgtgaagcc 
actccttttg ctgtatgtta taagaattca ataaagtact cagtggctca taatgatatg 
cttcctgatt atgctgtgat atttcctcag tttacttata ataattcttc atatctgaca 
gcctgtacag gtttcgatgc actttctcaa agtatagagg cctattggaa cgtgaatgcc 
acagcagaat ctgatgaata tgccaaaaga gctatttctg ttctttggga taatcttcca 
aaggtagtaa attctccttc aaatgagatt cgcgatttga tgtctgttgc agcttattgg 
tctgggtgtg caattgctat tactaagact acagctccac atgctttttc gtatgctttt 
actactcatt gcggttatcc acatggacat gctgtagctc tttcatttcc cttttttatg 
gcattaaact tattggaaaa acaggacttt gctttccaac caagaataaa tattgatgag 7 80 
tattataaaa agacagcatg gcttcagtct caattaggct tctctgatga gattaatata 840 
cagtctgaaa tgcaaagtta tctgaacaat ataggtttat gtaataatgg atatggagat 900 
aatgacttga ccataatgtt gaatcaggta aatattcagc gattggtaaa taaccctgtc 960 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 



418 



ggatacttat 


tttgaagaag 


60 


tcaagtttcg 


gtatgatggc 


120 


ttatccggtt 


ggctgccgta 


180 


ataccggaca 


gaattatgat 


240 


aggctccgga 


tgtctatatg 


300 


ttttgaatgc 


gagctataaa 


360 


gtgataccaa 


ttcttgccta 


420 


atatggaagc 


cggtaaccgt 


480 


ttgtagacat 


tatttctgat 


540 


atgcttcggg 


tgtggcaaag 


600 


tgtctgagaa 


cctttctgct 


660 


aaggacaata 


tattctgctg 


720 


tcgcttcatt 


gtttgaaggc 


780 


acagttgcca 


tcctcgtagc 


840 


gggtgattcg 


gcatgcccct 


900 


atgccgtggt 


cagtgatagc 


960 


gtcactcttt 


tccggctgtt 


1020 


agggatgttt 


tattcttgcg 


1080 


ctgtggaaat 


gaataggaat 


1140 


atgtatcgac 


aaaggtggtg 


1200 


tctggagaaa 


atcctga 


1257 



atagttacaa aagacataat agaacgattg aaaaatgttc tgggaacaat aaattga 1017 

<210> 1029 
<211> 1257 
<212> DNA 
<213> B.fragilis 

<400> 1029 

cttttatgtg gtgtaacgag tgttttgatc ccggtaggcc 

tttaatagca ttatggaaat caaactggat tattcagata 

aagctgagac tgttgattat agtcggtacc cgtccggaga 

ataaataaat gccgtcgata ttttgattgt attttggctc 

tataatctga acggagtctt ttttcatgac ctgagcttac 

gatgctgtag gggatgattt gggttcgaca atgggtaata 

ctgatgtcac acttacgccc tgatgccgtt ttggttctcg 

agtgtaatca gcgctaaacg cttgcatatt cctatttttc 

tgctttgatg agtgccttcc tgaagagaca aaccggcgta 

atgaatcttt gctattcgga gcatgccaga cggtatctga 

gaacgtactt atgtaaccgg ttctccgatg gccgaggttc 

atagaatctt cggatatcca cgccagattg ggtttgagga 

tctgcacatc gggaagagaa tattgatact gacaagaact 

ataaacgcga tggctgaaaa gtatgacatg cctgtacttt 

cgtaatcgcc tggaatcaag tggatttaaa ctggatagcc 

ctggggttcc atgactataa ctgcttgcag atgcatgctt 

gggacattac cggaggaaag ttcttttttc acttctgtcg 

tgtattcgca caagtaccga acgtcccgaa gctttggata 

gggattgata aagcctcttt gcttcaagct gtagatactg 

ggtgataatg gtgtccctgt tccggattat atggatcgaa 

aagttgattc aaagttatac aggaatagta aacaaaatcg 

<210> 1030 
<211> 426 
<212> DNA 
<213> B.fragilis 

<400> 1030 

attgagtgtg gtatggatat gcaggatgtg aaggtattgg atttacctaa gatattagat 60 

aagcgtggta atttatccat tatacaagaa gttgaaaata tcccttttaa aataaagcgt 12 0 

atatactgga tttatgatgt accggggggg gaaagacgtg gtggccatgc ttataagaaa 180 

aatcaagaat ttatagtagc tctttctggt agttttgatg ttgtattgga tgatggaaat 240 

tgtgagaaag ttttttcttt gaatcgctct tattatggaa tttatgttcc tcaaggtata 3 00 

tggagaaaaa tgcaaaattt ctctactaat gcactggcgt tagtgttatc ttctacaaat 3 60 

tatgatccag atgattatat tttggaatat atagattttg tgcaaagtaa aaagaattca 42 0 

ttatga 426 

<210> 1031 
<211> 594 
<212> DNA 
<213> B.fragilis 

<400> 1031 

ttactcatgg gaaacttaac agaaaatgat tttcagcgtg tggcggattt acttggcatt 60 

gaggtagcag tggtaaaagc tgtacaggca gttgagacca gtgggcatgg gggctttgtg 12 0 

gctccggggc gaccgatgat cttattcgaa ggtcacatct tttggcgtga actcaagaag 180 

cggggactag atccggagag gtatgttgcg ggcaatgaaa atattcttta tcctaaatgg 240 

gagaagggtc attattatgg cgggatgaaa gagtatgaac gtctggaaaa ggcctggcaa 3 00 

atacataaag aagctgctga cgcttccact tcatggggaa tgttccaagt gatgggcttt 3 60 

aactatgcga tgtgcgggta tggcagtgtg gaggaaatgg tgaaagatat gtgtgtcgga 42 0 

gaagataagc aactggaagc ttttgcgagg tttgtgaaac ttgctaagtt gcagtcctat 480 



419 



240 
300 
360 
420 
480 



ctggagcaga aagactgggt cgggtttgcc aagaggtata atggacccgg atatgcccgg 540 
aatcagtatg ataaaaaact ggaaggggct tatcggaagt ttacgaagga gtag 594 

<210> 1032 
<211> 501 
<212> DNA 
<213> B. fragilis 

<400> 1032 

ttcaaattta ttatgactct ttccgaagaa gtagcttccc ttcagcgtgc cgcgcacgac 60 

ctgatgtatt tgggcatgga cgggagtccc atttacagcg atgacttgtc ccgccgcaac 12 0 

aatgaagttt accgcttgac cacaacattg tataattccg gtgtccaagg ttccacggtt 180 

gaagaacagg cctctgtctg tctcgctctc ctaatgggtt acaacgcatc gttcatcgac 

cacggagaaa agcgcgaaca tgtccagaag atattagatc gttgctggga tatcctcgat 

actcttcccg cttcactatt gaagcttcgt ctgcttaccg cctgctatgg tgaggtattc 

gacgagcctt tggctgacga agcccgttca atcatcgctt cttgggattc ggtgtcactt 

actactgaac agcaagaggc tatcaacgag tttcagactg tggtggataa cccttatccg 

tgggagtatg ttgaagaata a 501 

<210> 1033 
<211> 891 
<212> DNA 
<213> B. fragilis 

<400> 1033 

gttttaatta tgaaaggtat tgttttggcc ggtggttccg gcactcgctt atatccgatt 
accaaaggag tcagtaagca gttacttccg atattcgata aaccgatgat ctattatcct 
atctctgtac tcatgttggc agggattcgt gagatcttga ttatctccac tccttatgat 
ttacccggct ttcaacgttt gctgggtgac ggttctgatt atggagtgcg atttgaatat 
gcggaacaac cttctcctga tggtttagca caagctttta ttattggtga gaaatttatc 
ggtgatgatt cagtatgttt ggttttgggt gataatattt tttacggaga tggattgatt 3 60 
gaaatggttc aggctgctgt gaaaaaggct gatttagaga ataaagctac agtttttggt Aon 
tattgggtga gtgatccaaa acgttatggg gtagttgagt ttgataaaga aggaagtgtg 
ttaagtcttg aagaaaaacc acgtgatcca aaatcaaatt atgcagttat aggtctttac 
ttttatccta atgtagttat tgagttagct aaaaatgtta taccctcatc tcgtggcgag 
ttggagatta cttcaattaa tcaagaattt ttatataaga aaatgttaac agtgcagcta 
ctaggacgtg gctttgcttg gctagataca ggtacacatg attctttggc agaagctagc 72 0 
acatttatcg aagtgattga gaaaagacaa ggattgaaga ttgcttgttt agaggatata 780 
gcttttggac aaaggtggat tactattgat aaattgcgaa aactggcaga aaagatgaag 840 
aataatcagt atgggaaata cttgttgaaa attgtagaag gactgaattg a 891 

<210> 1034 

<211> 798 

<212> DNA 

<213> B. fragilis 



60 

120 

180 

240 

300 



420 
480 
540 
600 
660 



60 



<400> 1034 

aaatgtgtta tggctaaact ttacccgatt gggatacaga actttgagaa aatacgtagg 
gaaggctatc tttatataga taagactgca ttagtctgta gattggtaaa aacgggttca 120 

240 
300 



tattatttcc tgagccgtcc ccgtcgcttt ggcaagagtc tgctgatatc tactcttgaa 
gcttattttc aagggaaaaa ggacttgttt cgtggtttgg ctatggagga gttggaaaaa 
gattggataa aatatccgat tttacatctg gatctgaaca ccgaaaagta tgatacgccc 

gaaagcctgg atcgaatatt gaacgatacg ttggctaaat gggaaatggt gtacgggact 3 60 

gctccttctg aaacttctat tcctttgcgt ttcaagggta ttgtacagcg tgcctgtgaa 420 

cagtcggggc agcgggtggt gattttgatt gatgaatatg ataaaccgat gttgcaggct 48 0 

atcggtaatg aggagttggg agagaagtat cgtgatacac tgaaaggttt ttattctgtg 540 

ctgaaaacga tggacgggta tatccgcttt gcccttttga cgggagttac caagtttggt 600 

aaggtaagtg tgtttagtga cctgaataat ctgaacgata tctctatgga cgaaccttat 660 

gtggagttgt gtggaattac agaaaaggaa atccatcatt atctggaacc ggaaattcgt 72 0 



420 



cagttggcga aatatcaaaa gatgtcgtac gaagatgctt gccgtcttca ccacagggct 
ggaaggatcc tcgaatag 

<210> 1035 
<211> 888 
<212> DNA 
<213> B.fragilis 

<400> 1035 

ttatcatcta tggaaaatga agagactttc gctttcctat ttggttctgt tgtcgataca 



780 
798 



agttcttttt gttacttttg tgcatttcga aaacataaaa taacagataa ggtagattgt 

atggaaaata aaagacctct gatcctcgtc tccaatgacg acggcatcat ggcaaaaggt 

attagtgaac tgataaaatt cctccgcccg ctgggcgaga tagtggtaat ggccccggat 

gcccctcgtt ccggcagtgg atgtgcatta acggtgacac agccggtgca ctatcagtta 



60 
120 
180 
240 
300 

ttaaagalag atgtgggact gactgtttat aaatgttccg gtacaccgac cgactgcata 360 
aaactggcac ggaatcagat actcgaccgg aagccggacc tggttgttgg tggaatcaac 
catggtgaca attccgctac caatgtgcac tattccggta cgatggggat cgtgatcgaa 
ggttgtctca atgggattcc ttctatcggt ttctctattt gtgaccacgc ccccggagct 
gattttgatg cagcaggacc ttatgtccgg agaatagctg cgatggtgct tgagaaagga 
cttccgccac tgacttgcct caatgtgaat tttcctaata ctcaggagat aaaaggggtg 
agaatctgcg aacaggccaa aggacattgg agcggagaat ggcaggcttg cccccggaga 
gacgatgcga atttctattg gttaaccgga gaatttatcg atcatgaacc ggaaaacgaa 
aagaatgatc actgggcact ggctaatgga tacgtagcga ttacacctac tgtagtggat 
atgaccgctt atcattttat ggatgaactg aaatcctggg aattatga 

<210> 1036 
<211> 549 
<212> DNA 
<213> B. fragilis 



420 
480 
540 
600 
660 
720 
780 
840 



60 

120 

180 

240 

300 

360 

420 

480 

540 

549 



<400> 1036 

aataatcgta taatggaagt ggaaaaagaa accgaaatat ggttcgctat gcgtgccact 
tatcgtcgag agactgacgc tatgcggttg cttgcgaaag agaacttggg ctgttttgtt 
cctatgcaat ataagataag tataaagaaa gggaaaaaag tccgtgtttt ggttcctatc 
attcacaatc taatttttat tcatgcttgt ccttccgaag tgaagcgtgt caagtctatg 
gttgcttatt tgcaatatat caccgatacc cgtagcggca agaagatcat tatccccgac 
aatgaaatgc agcgtttcat tgctgtagcc ggtacttaca gtgaccatct tttatacttt 
caacccgatg aactcaactt gtccaaagga accaaagtcc gtattacagg tggtgacttc 
gagggccaag aaggtgtttt cctgaaagtg aaaggtgccc gggatcgtcg cgtagtcatt 
gctatacaag gtgtcatagc cgttgccatg gccactattc accctgatct tatagaagta 
atcaaataa 

<210> 1037 

<211> 2043 

<212> DNA 

<213> B.fragilis 

<400> 1037 

tctatcagta tggacaataa cagcaagaaa cccaacaata aagtaaatat gcccaagttc 
aatctgaact ggatgtatat gattatcgcc ctaatgcttt tagggctgta tttcgctaat 
ggaagcagtt ctgtcagtaa gaacatctct tacgatgagt tccagcagta cgtacgtgac 
ggctatgtaa gtaaagtgat cggttatgat gataattcgg tcgagattta tatcaaaccc 
cagtacgtag gaaccgtatt caaacaagat tccacccgtg taggccggaa tccgatgatc 
actacggaag ccccttcacg cgagaacctg gataactttc tacaaaaaga aaaagaggag 
acgcactttg acggttctgt cagctatgat aagaaaaaag actatttcag tgcaatactt 
tggaatgtac tgccgattgt cttcctgatt gctttatgga tattcttcat gcgacgcatg 
ggcagtggtg ccagcggagg tgcaggcgga gtattcaatg taggaaagtc gaaagcccag 540 
ctttttgaaa aaggcggttc catcaaagta actttcaaag atgtagccgg actggcagaa 600 
gccaaacaag aagtagaaga aattgtggaa ttcttgaaag aaccccagaa atatactgac 660 



60 

120 

180 

240 

300 

360 

420 

480 



421 



ctgggaggta 
ttgcttgcga 
gatttcgttg 
gctaaagaga 
cgcggtaaga 
ctgacggaaa 
cgtgtggatg 
gtagatttac 
aaaatagacg 
gcagacattg 
ttcgtaggca 
aaaacaaaga 
gccagcattt 
cgcggacggg 
gagcaaatgc 
atcggccgtg 
ggcatgatcg 
gatgagtatt 
gtcaaaagaa 
gagcaacaca 
gtagaacgta 
aat aacaaac 
ccgcaagcaa 
taa 



aaatccctaa 
aagctgtggc 
aaatgtttgt 
aagctccttg 
atcctgcaat 
tggatggttt 
tactagacaa 
ctgatctgaa 
atacggtaga 
ccaatgtatg 
agcaagactt 
ttactacgga 
cctggttatt 
ccttgggcgc 
tcgacgagat 
tatcaagcgg 
catatttggg 
cattccagcg 
tggtaaacga 
acgaattggc 
tctttggaaa 
aagaaaacgc 
cagagtctca 



aggcgctcta 
cggtgaagcc 
cggtgtaggt 
tatcgttttc 
gggcggaaat 
cggctcaaat 
ggcattgctc 
cgaacgtaaa 
tgtagactta 
taatgaagct 
tctggacgca 
agccgaacgt 
ggaatatgcc 
tgcctggtat 
gtgtgctact 
agctgctaac 
tatgagtgaa 
tccatatagt 
acagtatgaa 
acagctactg 
acgtccttgg 
cgttcatcct 
agagggcaat 



ttggtgggcc 
aatgtacctt 
gcatcccgtg 
atcgacgaga 
gatgaacgtg 
agcggtgtta 
cgtgctggac 
gaagtatttg 
ctggcacgcc 
gctctgatcg 
gtagaccgta 
cggtctattg 
aatccattga 
ctgccggaag 
ttgggcgggc 
gatcttgagc 
aagctaccca 
gaaaaaactg 
cgtgccaaac 
atcgataaag 
gcttctcgtt 
gcagatggag 
acgcaacaag 



ctccgggaac 
tcttctcttt 
tacgcgacct 
ttgatgctgt 
aaaatacgtt 
ttatcctggc 
gtttcgaccg 
gcgtacactt 
agacacccgg 
ccgcgcgtca 
ttatcggcgg 
ccctgcacga 
ttaaggtaac 
aaagacagat 
gtgctgccga 
gcgtaaccaa 
atttatgcta 
ccgaactgat 
agattctctc 
aagtcatctt 
cggaggaaat 
aagatgtaga 
agtcagcggc 



aggtaaaacg 
ggccggttcc 
cttcaaacaa 
aggacgtgct 
gaaccagttg 
agccaccaac 
acaaatccat 
gcgccccatc 
attttcgggt 
cggaaagaaa 
acttgaaaag 
ggccggacac 
tatcgtcccc 
cacgactaaa 
agaccttttc 
acaggcgtat 
ttataataat 
tgacgaagag 
ggaacacaaa 
tgctgaggat 
catggcagct 
cacaactact 
atcacaaaac 



720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2043 



<210> 1038 
<211> 423 
<212> DNA 
<213> B.fragilis 



<400> 1038 
aaagaattca 
cgttcaggaa 
gttttttatt 
tgccatcaat 
aatagaaaga 
tgggcttccg 
tatgatgaaa 
tag 



ttatgacagt 
atataactcc 
cttatgacat 
gtttaattgc 
cagtactttt 
aacaaggttt 
atgattatgt 



gtttgattgt 
tgtacatagt 
tcctggtgga 
agcaagtgga 
aaatcgtcca 
ttcctctggt 
gagaaattat 



tcaataatag 
gaagagaatg 
ggagctcgag 
agttttgaag 
tattacggct 
gcaatctgtt 
cagaactttt 



aatttcctaa 
tgccatttga 
gagcccatgc 
ttgttttaga 
tacatattcc 
tagtattagc 
taaataacaa 



aatagaggat 
tataaaacgt 
tcataaagaa 
cgacggtgtt 
gccaggtgtt 
ttctcatgta 
agagatacta 



60 

120 

180 

240 

300 

360 

420 

423 



<210> 1039 
<211> 597 
<212> DNA 
<213> B.fragilis 



<400> 1039 
tcagtaacaa 
ttacgagccg 
cttcttggtc 
ggaaataatc 
attcatcctt 
caagggagta 
gcctctgtgg 
ctctgtggaa 
cccggtgtga 
attccggatc 



tcatgttttt 
gtcatgaatc 
atcccgtatt 
ggatacggaa 
tgtccattgt 
ttatacaggt 
atcatgaatg 
atgtattggt 
agataggaaa 
atgttttggc 



atatggtgcc 
gattgaagcg 
acgtccgtca 
gaggattgtg 
atctgaattt 
gtgtgctcag 
tgtcattgag 

gggggagggt 

gtggagtgtg 
ggtgggaaac 



agtggtcatg 
ttgtttgatg 
gaagtgcggg 
gatactttgt 
gctgatattg 
gtaggcagac 
gattatgtgc 
agctggattg 
atcggtgctg 
aaatgtaaaa 



ccaaagtgat 
ataatgtgga 
gaccactgat 
ccgtggagtt 
gagaggggag 
attgcattat 
atatttctcc 
gcgccggtac 
gttccgtggt 
ttattaagag 



cattgatatt 
ggttacttct 
cgtcagtatt 
tggatgtgct 
tgtcgtgatg 
taatacgggt 
acattccact 
taccattatt 
gactaaggat 
tatataa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

597 



<210> 1040 
<211> 618 



422 



gtgaagatga 


ttcgatttgg 


gcttatgacg 


60 


ggaa.agct.tg 


gaattcttag 


tctgatgttc 


120 


aagaaaaacg 


tatatccgga 


actagaagct 


180 


aaaaacttta 


atacggttgg 


taataaggct 


240 


gtcgattcgg 


gtaattgtac 


accttgtagt 


300 


aagcgtcggt 


tgttggatac 


gcatacagga 


360 


tttactgtta 


atgaggtgtt 


tagacaaatg 


420 


ttgggatgtt 


ttaaaaaaga 


aaatggtatt 


480 


atagaccgct 


caaataaggt 


agtatggttg 


540 


cagcaatacg 


agaaagccat 


atctatgtta 


600 
618 



<212> DNA 
<213> B.fragilis 

<400> 1040 

atcctcaaaa taatagattg tacgctcagg 
tttcctggct tagtaaatga agtattgatg 
tgcaccattg ttctattatc ctgtggtaaa 
tttatcggaa aagaaattat cttttccgat 
tttaaagacc aatttttact gatatcttat 
ttggagaaga tgcaatatat gaaaacaaat 
gttcttatga ttgtgcatga aaaagatact 
catgtcagtt atccgatatt ttttgattca 
tttgataatc ctttatatca agatttcgtt 
ggaaaccctt tacgtaataa gcagtcatgg 
atggatagtg gtaaatag 

<210> 1041 
<211> 897 
<212> DNA 
<213> B.fragilis 

<400> 1041 

gttatagaaa tatatgttat attactttta ataaatatgg tttcggtaat aattcctttg 60 

tataataagt atttagcaat tgggcgtact atagaatcag taattgtgca aacttataaa 12 0 

gattgggaac ttctgattat tgacgatggt tcgaatgacg gcagtgggca agtggctgag 18 0 

caatatactt ttgatgagcg tatccattat atttataagt caaatggtgg tgtatcgtca 240 

gcacgtaata tggggataca aatggcgaaa ggtgaatggc tgctttatat tgatgcggat 300 

gattatcttt tacccaatgc tttagagacc ttattaaact tagcggaaaa atttgaagta 3 60 

agtattgcgg ctagtaattt ttatgttgag tttgaaggaa aaaagaggcg ttgcctttac 42 0 

aatgtaagtg aaggggtcgt gttaaataac tttcgttcac ttttctttaa ttccttcgat 480 

atcagagcag gagctacact ctataaatca tcattaatta aacagtataa atttgatgag 540 

actctaatac gttatgaaga tgccaaattg gaatttgaca tattaagaaa tcataaagtg 600 

gcaataactc ctcaatttac tatggtatat actaaggatt atgctggttt aagtaaacca 66 0 

gcaagtgact tttcaaagga ttatatatca tgtatgtctt ttgaaggaaa accgttttgg 72 0 

gaaaaaatga agcttggttc tttagctaat tatggtttag atatctaccc taataatcga 780 

agtgaaatta aagctatata ttcaaatgac ctcaaatgga tatatcttag cgctaaaatt 84 0 

ggcttttttg tatacttatt taataaatgt tgtaacctat tgcataagtt gaagtag 897 

<210> 1042 
<211> 1497 
<212> DNA 
<213> B.fragilis 

<400> 1042 

cctgtagaaa ttgatttaaa acctataaat aattcattta tgagcacaat tcttgattta 60 

gctccacaaa atgtatggaa gcacttttat tctttgacac agattccccg gccatcggga 12 0 

catatggaaa aaattaccga atttctggta aacttcggta atagtcttgg attgaaaact 180 

tttgtggatg atgccggtaa tgtgattatc cgtaaacccg ccactccggg gatggagaat 240 

cgtaaaggag ttatccttca ggcacatatg gatatggttc cgcagaaaaa taacgataca 3 00 

gttcacgatt ttgagaaaga tccgatcgaa acttatatag atggtgaatg ggtaaaagcg 3 60 

aaaggtacta cattgggggc cgataatggt ttgggtgtag ctgctatcat ggctgttctt 420 

gaagatcaga atctaaaaca cgggccattg gaggctttga ttacgaaaga tgaagaaacg 480 

ggtatgtatg gtgcttttgg cttaaaaccg ggtacggtga atggcgaaat attgcttaat 540 

cttgattctg aagatgaagg tgaactttat attggctgtg ccggtggtat ggatgtaact 600 

gcttctcttg aatataagga agttgctccg gaagaaggtg atatcgctat tagagtgaat 660 

ctgaaaggtc ttcgtggagg acattccgga ttggagatta atcagggacg tgccaatgcc 72 0 

aacaaattgt tggtacgttt tatacgtgag gcagtagcaa cttatgaagc ccgcctggca 780 

agctgggaag gcggaaatat gcggaatgcc atacctcgcg aggcacatgc tgtagtgact 840 

attcctgctg aaaatgaaga agaattgttg gctttggtga aatattgtga agatcttttc 900 



423 



aatgaagagt tcaaagcgat cgaaactcct atctgcttta ctgcagaacg ggtagaatta 9 60 

cctgctggag aagttcctga agaaatccag gataatctga tcgatgctat ttttgcttgc 1020 

cagaatggtg tgatgcgtat gattcctact attcccgata ccgtggaaac ctcttcaaat 1080 

ttagctatca tcaatattgg tgaaggtaaa gcctctttca aaatccttgc gcgcagttcc 1140 

agtgacagta tgaaggaatg tctgactacc agtttggaat gctgtttttc tatggcaggt 12 00 

atgaaggtag agatgaccgg aggctattcc ggatggcaac ccgatattaa ttctccgatt 12 60 

ttacatgcta tgaaggaatc ttacaagaag cagtttggta cagaaccggc agtgaaagta 132 0 

attcatgccg gattggaatg tggcatcatc ggagctatta ttcctgggtt ggatatgatt 1380 

tcatttgggc caacactacg ctctccgcat tctcccgacg aaagggcttt gattccgaca 1440 

gttcaaaaat tctatgactt tttaattgct actttggagc agactccaat gaaataa 1497 

<210> 1043 
<211> 5784 
<212> DNA 
<213> B.fragilis 

<400> 1043 

tatatgagaa ttaaactgat ttgtattatc gtgttacttt ctatggggat gatgtcatgg 60 

acgcatgctc aatcatacga tagattatgg aagcaagtgg agcaggccca gcagaaaagt 12 0 

ttgccccaaa cagttgtccg attaaccggt gaaatttatc aaaaagctaa agcagagaag 18 0 

aattctcctc aaatgctgaa agcttatatt tggcagatga aatttaggga agagattact 240 

ccggatagct tttacgtgtc gttgaatggc ttagagcagt gggcggtgac tacggacaag 3 00 

ccattggacc gtgcaatcct gcattcgctc atcggtagta tgtatgctga ttatgcttcc 360 

caaaaccgtt ggaaactgaa tcaacgtacg gatttggaag aagaagctcc ctctgttgat 42 0 

atccgggagt ggagtaagaa tcagtttgta actaaagtaa tgacagaaat agccgtaaca 480 

tttcaagact ctttgctact gctcgatact tcctcccgaa gctacattcc ttttgtggaa 540 

cttggagtaa ccagtgatta ctatcatcat gatatgtatc atttgttggc ttcccgtgcc 600 

attacttcac tggagaatct atccggattt ggccatgatt ctttaataaa tgtacgcatt 660 

gaagaaattt atcagcatat gatgaattca tatcatcgga ctgataatca cgatgctctg 72 0 

ttgcttacta ctttggatta tttgcagtgg aagaggcgta ctgatatcga ttttcggcct 780 

tatcgtgctc cggaggggaa acttggcttg acacaggatc cttatttggc tgcactggac 840 

aaactgattg ccgaaaataa gtcacatgat gtctgtgcag aagtttattt gctcaaggca 900 

caggcggcaa tggatgcagg agtgcccgct tctgcgttac aattgtgcga agaggctatc 960 

tctcgctatc cggactatcg ccgcatcaat gctctgaaag aactgaagca agaaattctg 1020 

cgtcccgatc tgacagtaca atccccgtca acagtttatc cgggtgagga gtttgattta 1080 

aaagtcagct ttaaaaattt gaaagacttt acggtagagt tgtatgcaat taatttaccg 1140 

gcacgtccga atacggtgga agcacccaat gacgcatttc taaaaaaaca tggacgttta 1200 

ctttcatcgg aacattatgt tctcttcccg tcggatgact ataaagtgaa agattccatc 12 60 

taccacataa aggcacctga aacaggactg tacgccctcc gtgttattcc gggtgttaag 1320 

gttcgttcca atgtttcaaa atttctctat tctacttgct ttaaagttct tacccgatct 1380 

ctaccttcta atctgagtga ggtagctatt ctcgatgcta tgagtggcaa gcctttgcag 1440 

ggtgttgttc tttcattctt cgatcgacag aacaaacaac ttctgacagc cactaccaat 1500 

accgaaggga aagtacagtt tgcttcgtcg gaaaaatata ggtatctgac tgctgccaaa 1560 

gggaatgata cagctatgcc gcagatgtat ttatggggag gagattataa ttttgcagac 162 0 

cattcaaaac ctgtttctgt ggtaactttg cttaccgacc gttctgttta tcgtccgggg 1680 

cagactgtct atgtaaaagg cattgcctat gaacagtatc ctgactctgc ccatgtgatt 1740 

gcgggacagg aatatacgct gactctttcg gatgccaatg ggcaagaaat cagtgcaaag 180 0 

aagctgcgta ccaatgattt tggatctttc accgcagaat ttattttacc gtctgtttgt 1860 

ttgaacggta ctttcagttt gaatactcaa aacggatttc gttccattcg tgtagaggat 1920 

tacaaacgtc ctacatttga tattactttt gagccggtga ctgaaagtta tcgattggga 1980 

gatcgtgtcg agttgaaagg gagtgtaaaa actttcagtg gtgtgccgtt acaggatatt 2040 

cctgttactt ataccattac ccgctcattg tatacttggc ggatgtgggg gatgaatccg 2100 

gttattttgg cctctgacac tgtccgtttg ggagtagatg gaaacttcga aattcctgtg 2160 

gatctgaaac cagacacttc gaatcccgat ttgggagacg gtgataatac ctctttatat 222 0 

tatgactata aagtgcagct ttcagtaacc aatgtcgcag gtgaaacgca aacctcagaa 22 8 0 

acaagtctgc gggcaggaaa gacctctttg ttattgtttg ctgatatatc cggactgatt 2340 

tgcaaggatg attctatgaa agcgactttt cgggtaaata atctggatcg taaaccggtc 2400 

agtgtcgaag gcagttaccg gttgtttttg atttccgatt atcagaaatc aaaacccttg 2460 

aaagaacagg acgtttccga tcaaccggct ctttccggtt cgtttaggtc caatgaagag 2 52 0 



424 



atcttgttgt ccgattggaa aaaacttcct tcaggtgctt acaaacttgt agcttcggtg 2580 

aaagatgatc aaggccgtaa ggtagatgcg gaaaaggtag tgattctttt tgcttccgat 2640 

gataagcgtc ctccggtatc tatgcctttg tggtgttatg aagtgaatac ccggtttgat 2700 

gcagcacatc ctgcgctgtt ctattttggt acttccgaaa aagatactta tgtcttaatg 2760 

gatgtgttct gtggtaataa gcatctggag agtaagcttc ttcatttgtc cgattctctt 2820 

gttcgttttg aatatccgta tcgggaagcc tacgggaatg gtttgggtat tacctttgta 2880 

tttgttcgta aaggtgttgt ctacgaacag gaggtaagcc tgataaaacg tttgcccgat 2940 

cataacttga atatgcgttg ggatgtattc cgtgataaat tacgtcccgg gcaggaagag 3 000 

gaatggaaac tgactatccg taatccacag aaatcacctg ttttggccga aatgctggct 3 060 

actatgtatg atgcctcttt ggataaaatt tggaagacca accagtcatt gcaattacat 312 0 

tatcaacttt ctgtccccat tgctcgttgg aggagagatt atgttggctc aaactatttt 3180 

tattttggtt tccgccggac tgatttgaag gtgcctccgt ttagctatga tcattttgac 3240 

ttgcctccgg ttttatatgc ggttgccgaa atgttgtcgg tcaccaatga tgctgctccg 3300 

actacccgat atgcacgtct gcgcggaatg ggtgctgcaa aaccacaaat gaagagcgct 3 3 60 

gccgtagccg atgtcgtttt cgaatctgaa atggttcctg ttacggaaga gagcggaatg 3 42 0 

gcaatgtcaa tggacaatgc cgatatgggt aggacaacag atatagagtt acgtaccgac 3 480 

tttgcggaaa ctgccttctt ttattcgcag cttcacacga acgttcaggg tgaggtcagt 3540 

ttctctttcc gtatgcctca aagtctcacg acctggaatt tccgaggata tgctcataca 3 600 

caagatatga tgacagggca gatggatgct actgctgtta ccagtaaaga gtttatgctt 3660 

actccgaatc tacctcggtt tgtccgggta ggagatcaca cttccatggc tgcttctgtt 3720 

agtaatctga ccggtaaaaa tctgtctggt actgtgaaac tggtactttt cgatccaatg 3780 

acagatcagg taatatcgac ccaacagaag aaatttaatg ccggagccgg acagagtgtc 3 840 

ggcgtaagtt tcttgttcac agtaactgat aaatatgaaa ttttgggatg ccggatgatt 3 900 

gccgaaggag gaaatttcag tgatggagag cagcatttgc ttccggttct cagtgataag 3 960 

gagaacttga cagagacttt gcccatgcct gttcgtggcg agcaaacacg tactttttct 4020 

ttggccgatc tgttcaatca ccatagtaag acagcgacca accgccgttt aactgtagag 4080 

tttacttcca atccggcttg gtatgcagtg caagcattac cggcactatc acaacctcgg 4140 

aatgatgatg ctatttcgtg ggctacttcc tggtatgcca atacaatggc ttcttatatc 4200 

atgaatgctc agccacgtat tcaggcaata tttgatagtt ggaaactcca gggaggcact 42 60 

aaagaaagct tcttgagtaa tttgcaaaaa aatcaggaag tgaagaatat ccttctttct 432 0 

gaatctcctt gggtgatgga agcgacttcg gaaagcgaac aaaaagagcg tattgccact 4380 

ttattcgatt tgaataatat ccgtaacagt aatacagcag ctttgttgaa attaaaggaa 4440 

ctccagttac ctgatggttc gtggagctgg tataaaggaa tggacggaag tctttttgtc 4500 

accgacttca ttgtagaaca gaatgcacgt atagccctgc ttacagggaa gccgctggag 4560 

ggaggagcgc tggatatgca acaagtagct ttcggttatc tccataaaga agccttgcag 462 0 

gaatatcgtt ctatccgtga ggcagaaaag gttggcaata aatcagaggg aatttcacga 4680 

agtgcattga agtatctcta tttgattgct gtctctggcg agaaagttcc ggcatcggcg 4740 

aaagaaggtt atgattactt cctatccaaa gttgctccat cattgtctca acaatccgtc 4800 

accgagaagg cctggtcggc tattgtatta cagaaggcag gtaaggtaaa agaagctcag 4860 

gagtttatgg catctctcaa agaatatctt actcaaacag acgagcaagg catgttcttt 492 0 

gataggactg atagtccgta tgcttggaat aatttaaaag tgcctgccca tgttgatgta 49 80 

atggaggctt ttgagatggt aggcagcaat gccacgattg ttgaagaaat gaaaatgtgg 5040 

cttttaaaac agaagcaaac tcaacagtgg gactctccgg tagccacagc cgatgctgtt 5100 

tatgctttgc tttaccgggg tactaattta ttggataatc agggagacgt gcgtattgtc 5160 

cttggcaatg aagttttaga aacgataagt cctgccaaga cgactgttcc gggattaggc 5220 

tatatcaaaa agacttttac cgataagaag accgtgaata ccgatgagat tatcgttgaa 52 80 

aagagagatc cgggtattgc ctggggagca gtttatgcac agttcgaaga gaatcttgat 53 40 

aaagtggtcc gccagggaag tgggttgaat gttgataaga aattgtatgt agagacgatt 5400 

gttaataata atcgccggtt acagccggtt atcggtaaaa ctcaattaaa ggttggtgat 5460 

aaagtagttg ttcgtctgac cgttcgtctc gatcgtacaa tggactttgt acaattgaaa 552 0 

gatcaacgtg ctgcttgtct tgaaccggta gaggtcctgt ccggatatcg taatgtcggt 55 80 

gatgtcggtt gttatgttgc agtaaaagat gcttctaccg atttcttctt cgatactttg 5640 

aataagggga cttatgtttt agaatacagt tatcgtgtag atcgtgcggg aagttatgaa 57 00 

gcaggaattg ctactattca aagtgcctat gcacccgaat atgctgccca ttcagcttct 5760 

gcccgttacg aagtttctca gtaa 5784 



<210> 1044 
<211> 1089 
<212> DNA 
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<213> B.fragilis 



<400> 1044 
caccgggtga 
ttgatatttg 
aataagataa 
ggtataggaa 
caggaaccat 
gtatttttct 
tcagataaga 
caatcgttta 
cttatattaa 
tctgcatttt 
atgcttattg 
tttatatcaa 
tctcttatta 
ttatgcttag 
ttcaaccttt 
caacgtgcta 
tgtttttctc 
tttataagtg 
tggaattaa 



tattgttttc 
tctctacagt 
ctgcattctt 
atgaccatgc 
ttacattatt 
atatctatgc 
gtattcgatt 
atttgattcg 
ataataaaaa 
ttggactact 
tttatataat 
aatatgaatt 
atgatgatgt 
tcgtttatac 
tttttgttgg 
cttattttcc 
catattggaa 
cttcgttggc 



ttcaaatgat 
catatttatt 
atgggtaatt 
taattatatt 
tatattctat 
ttttcttact 
tattattgta 
ccaaattttg 
aggatattgg 
tatagtttgg 
gtctgtagct 
tttgatacgt 
agcagcaaat 
caaaagaaaa 
tcaaatatta 
ttatttctct 
ttctaggtta 
ttctcaagac 



agctgtgtaa 
tcgcatatat 
atgatgactt 
gctcagataa 
ataattagat 
tattattttt 
atgatgatac 
gcttgcgcaa 
tttctattat 
atagcctcaa 
cttttatata 
atgacttact 
tttggtgttg 
tttttcaaat 
tacaatttat 
atgatattag 
gctttagtac 
tgtccatacg 



ttgtgattgt 
atgttaaaag 
tgtttgtggg 
aatccccatg 
cttgtgattt 
tgtataaagt 
ttcaatctct 
tatttttgta 
tagcatgttt 
aaataagagt 
caggtggttt 
atggctacta 
tatataagat 
cgaacggtca 
catccgcaaa 
taatcccttt 
tgtattcatt 
taccttataa 



ttatatattt 
cgagaaagta 
gtctcaagat 
ggccactcct 
atctgtttat 
gatattatta 
attatttttt 
tggaatatct 
ggtgcattat 
aagtactgtt 
tattggagat 
tttagactcc 
tacctttatt 
aattttactc 
tattacgatt 
attggccaaa 
atattttctt 
aaatataata 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1089 



<210> 1045 
<211> 846 
<212> DNA 
<213> B.fragilis 



<400> 1045 
cgcgcggcga 
ggtgtctggc 
tgtacgccaa 
tcgaaacgtc 
aggataataa 
gtattttcac 
gcatcaatct 
tcgcgtacac 
gagaagaaag 
cccggagggc 
tggggttctt 
gctacatctt 
tttcctacat 
aagaatatcc 
aaatag 



tcagagcagc 
gtgccagtaa 
atacttcttt 
cagcacggag 
caccgctatt 
gttcatcatt 
cgtcgatgaa 
gggatgcacc 
gtacattggc 
ccaccaatag 
tcaagaattc 
tgaaagttac 
tgaatactcc 
ataaagcaat 



ttcattacat 
gtctacatct 
acgttcgttc 
caatgccttg 
tgagccgaaa 
tccgcccatt 
aacgatacaa 
tacaccgaca 
ttcaccggcc 
agcgccttta 
cacaatttct 
tttgatggaa 
gcctgcacct 
caggaagaca 



acattggcaa 
accgtatcgt 
agatcaggta 
tctagtacat 
ccatccattt 
gcaggattct 
ggagctttct 
aacatttcaa 
acagctttcg 
gggattttac 
tctacttctt 
ccgccttttt 
ccgctggcac 
atcggcagta 



tgtctgcacc 
ctattttgat 
aatctacatg 
ccacacggtt 
ccgtcagcaa 
taccgcgagc 
ctttagcttg 
cgaaatcgga 
caagcaacgt 
ctcccaggtc 
gtttggcttc 
caaaaagctg 
cactgcccat 
cattccaaag 



cgaaaatccg 
ggggcgcaag 
gatttgtcgg 
ggtggctgcc 
ctggttcaac 
acgtcctaca 
tttgaagagg 
accggccaaa 
tttacctgtt 
agtatatttc 
tgccagtccg 
ggctttcgac 
gcgtcgcatg 
tattgcactg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

846 



<210> 1046 
<211> 1323 
<212> DNA 
<213> B.fragilis 



<400> 1046 
cccatgaaga 
ttatttttgt 
aacggcaaag 
agcctgctca 
gcggagatta 
actgttatgg 
cgcctgcgga 
gaggaggccc 



ttgaacaaat 
ttgccacacc 
aacgccaaga 
acgactggaa 
acccgctgtt 
agatgccata 
atcaagtctc 
tcgatgcgta 



aagaatgaag 
caaggtaaac 
gagcatcgaa 
agccaagaat 
cagcgactcc 
caacgaaatt 
ttttatgttg 
taatttacca 



aaattagcaa 
gcacagagtg 
cttcccaaaa 
tatatcgact 
gtatacatcg 
gtacgtaaat 
agtgcctgca 
ttggaattga 



actattgccc 
tagacgtagt 
gtatgactta 
taggaaaaga 
accgcctatc 
ttattgatat 
acttctatat 
aatatctgcc 



cctgatttta 
gattcgcgac 
tcctctcgat 
ctgcagtaca 
acgcatgcca 
gtatgccgga 
gcctatcttt 
tattatcgaa 



60 

120 

180 

240 

300 

360 

420 

480 
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tcggccttga acccgtcggc agtatcgcgc gccggtgcag gaggcttgtg gcagttcatg 540 

atcggaaccg gcaaaatgta cggtctggag tcaaacagcc tggtggacga ccgtcgtgac 600 

ccgataaaag caacctgggc agcagcacgc tacctgaagg atctatatga catttatcat 660 

gactggaatc ttgtgattgc ggcttacaac tgcggaccgg gcactatcaa caaagccatc 72 0 

cgccgctcgg gaggagaaac cgactattgg agtatttata attatctgcc caaagaaaca 7 80 

agaggttacg taccggcatt tatcgccgcg aactatgtaa tgacctatta ctgtgatcac 840 

aacatctgcc cgatggaaac caatattccc gaaagcactg atacgattca agttaacaaa 900 

aacctccatt tccagcagat cgccgactta tgcaacgtcc caatggacca aatcagaagt 9 60 

ctcaatccac aatataaaaa agaaatcata ccgggcgaaa gcaagtcgta cacactgcgc 1020 

cttccacaaa acgcggtcag ttcgttcatc gaccgtcagg atacgattta tgcccaccgt 1080 

gccggcgaac tgttcaagaa tcggcgtaca gtggccatca gggacgacag ttcagcctcc 1140 

aagaggagag gtagctctgc caaagccggc agtggtacac ccacttacta taaaatcaaa 1200 

aacggggata ccctgggagc cattgcggca aaatatggtg ttcgcgtaaa agatcttcaa 12 60 

aactggaacg gattgcgggg aactaacatt tccgcaggaa aacgtttgaa aatatacaaa 132 0 

taa 1323 

<210> 1047 
<211> 1140 
<212> DNA 
<213> B.fragilis 

<400> 1047 

ctaatacatg ttagaatgat ggttttaaaa aataaatgga agaataggtt tattttaatc 60 

cttttcatgg tcggatttgt tttttttgct tgtaaagaag agacggatct ttactttaat 12 0 

ggtgatataa cagttatcaa atcgtttgat aacgatactt tgttgtctcc ggtaaaagta 180 

gagcttgaag atatttatga cggttcagta ttagcgtatg actcattatt gttttttact 240 

tcgcataaat atagtgattg ctggatgtat gtatttagtg tgaatagtgg caaacatatc 300 

gcttctttat gtcccaaagg gcaagggcca aatgactatt tatcctgtaa aaactctcag 3 60 

cagtttataa gggagaatgg cgaattgaag ttatgggtca gagataatgc caaatcagct 420 

agattattga atattacaaa gtctatagag acaggagcta ctgtttgcga tgctattatt 480 

cccatggatt ggaataaata tttcgtttac ccggctacta ccctgttctt tttgaaagat 540 

ggatatatat tagggcaaaa tcagtgtgag gaacagtatt ccaaaggtaa agaatatatt 600 

ccgcgcaaat tttatttata taaagattct ttggggaata aagtgaagga atataagttg 660 

ttcaaccgac cggttatttt gaaggatgat aaatatgatg ttctgagtgg catgttttac 72 0 

gctaatcata gttatataca tccggatcag actaaagtgg ccattgccat gcagcgggtt 780 

gcccagatca ccatattaga tgtaaagtca ggaaaacaag tagggtacag gatggatgat 84 0 

acgcttgact tcagtgatat agaacaaaat cttgagtgca tacgctatta ttatacaagt 900 

gcggctgtga attcacgata tatatttgcc ctttatattg atcaggcaga aatgggaggt 960 

aagtatcctt ttaaatcaaa gactgtccat gtcttcgatt gggaagggcg gcccgtatat 1020 

aagatacagt tggataagga gatctcttgg atcactttag atcctcaaaa taatagattg 1080 

tacgctcagg gtgaagatga ttcgatttgg gcttatgacg tttcctggct tagtaaatga 1140 

<210> 1048 
<211> 819 
<212> DNA 
<213> B.fragilis 

<400> 1048 

atgaaattag taaagcctaa aaagtttctc ggacaacatt tcctgaaaga cctgaaagtg 60 

gcacaggaca ttgccgatac agtagataca ttccccgatt tgccaatttt ggaagtcgga 12 0 

ccgggaatgg gtgtgctcac tcagtttctt gttaagaaag aacggttggt aaaagttgta 180 

gaggtagact acgaatcagt agcctatttg cgagaagcct atccgtcatt ggaagataac 2 40 

atcatcgagg atgacttcct gaaaatgaac ttacaacgtt tgttcgacgg acatcctttc 300 

gtcttaactg gaaactaccc ttacaacata tccagccaaa ttttcttcaa aatgctggat 3 60 

aacaaggatc tgatcccctg ctgtactgga atgattcaaa aagaagtagc cgaacgcatc 42 0 

gccgccggac cgggtagtaa aacgtatggt atactcagcg ttctgataca ggcctggtat 480 

cgggtagaat atctgtttac agtaaatgaa caggtgttca atccacctcc caaagtgaaa 54 0 

agcgcagtca tacgaatgac acgcaacgag acacaagagc tcggttgtga ccccaagcta 600 

tttaaacaaa ttgtaaaaac aactttcaac cagcgtcgaa agacattacg aaattcaatc 660 
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aaaccgattt taggtaaaga ttgcccgttg acagaagacg ccctgtttaa taaacgaccg 720 

gaacaactat cggtacaaga gtttatccac ctgacaaatc aggtggaaca agcactaaaa 7 80 

gttccgatag aaccagtttc tcagatagaa aatccataa 819 

<210> 1049 
<211> 1416 
<212> DNA 
<213> B . fragilis 

<400> 1049 

actaacaaag taggaaatgt aatatacaaa acttgtcttc cttattatta caggaagtta 60 

agtaaggctg taattatgaa cgaagaatac attgacaacg taaaagaact tatcgaagaa 12 0 

aaagatgccg ataaggtaaa agagcttctt atcgacctac accctgctga catagcggaa 180 

ttgtgtaacg agttgaaccc ggaagaagcc cgcttcgtct accgattact tgataatgaa 240 

acagctgcgg atgtacttgt cgaaatggat gaagacgttc gtaaagagtt tctcgacatc 3 00 

ctgccatcag aaactattgc caaacgcttc gttgactata tggatacgga cgacgcagta 3 60 

gacctgatgc gtgaactgga tgaggataaa caggaagaaa tactttcgca cattgaagac 42 0 

atcgagcagg caggagacat tgtcgacctg ctgaagtatg atgaaaatac tgccggtggt 480 

ttgatgggta cggaaatggt aaccgtcaac gaaaactgga gtatgcccga atgtctgaaa 540 

gagatgcgcc aacaagccga agaactcgac gatatctact atgtatatgt aatagatgat 600 

gatgaacgcc ttcgcggaat atttccactg aaaaagatga tcacatctcc ctctgtatct 660 

aaagtaaaac atgtaatgca gaaggatcct atctcagtac acgtagacac ccctatcgat 72 0 

gaagtggcac aaattattga gaaatatgac ttggttgcca ttcctgttct tgacagtata 780 

ggccgactag taggacaaat caccgtagat gacgtcatgg acgaagttcg tgaacaatca 840 

gaacgtgact accagttagc atccggtctt tctcaagatg tagaaacaga cgataatgta 900 

ctccgccaga ctactgcccg cttaccttgg ttgttaatcg gtatgattgg aggtattggc 960 

aactctatga tattggggaa ctttgattcc acttttgccg cgcatcccga aatggccctt 1020 

tacattccat tgattggtgg tacaggcgga aacgtaggga ctcaatcgtc agctctcgtc 1080 

gtacaaggct tggccaacag ttcccttgac gccaaaaata ctttcaagca agtcagcaaa 1140 

gaagccgtag ttgccttgat caatgctacg atcatctctt tactggtata tacctataat 12 00 

tttatccgtt tcggagcaac cgccacagtc acttattcgg tatctatcag tctgttctca 1260 

gtagtgatgt ttgcctccat cttcggtact ttggttccaa tgacactcga aaagatgaaa 1320 

atagatccgg ccatagctac aggaccgttt attgccatta cgaacgatat catcggcatg 13 80 

atgatgtata tggggattac ggtgttatta tcgtaa 1416 

<210> 1050 
<211> 1104 
<212> DNA 
<213> B. fragilis 

<400> 1050 

aatatggaag taagagtttg gaattattta aaagagtatg catcttctaa agaagaaata 60 

ctaaaggctg tagaagatgt ttttgaatca ggacaactca ttcttggtgc aaaaggaaaa 120 

cactttgaac aggcgtacgc tgaatattgt ggtgttagtc atggtgttgg ttgtgataat 180 

ggtactaatg ctataagttt ggcattactt gctgttggtg tgaagcctgg tgatgaggta 240 

attactgtgc ccaatacggc cattcctact gtttcggcta tagtaactgt cggggctacg 300 

cctgtttttg tagatattga tcctcttact tatttaatgg atgtgacaaa ggtggaaagc 360 

catattacgg aaaagacaaa atgtattctt ccagttcatc tttacggaca gtgtgtcgat 420 

atggatgaac ttatagcttt ggcctggaaa tataaattat ccattattga agactgtgcg 480 

caagctcaag gtgcagaata taaagggtat aaggcaggtt caatgtctaa tgcttctaca 540 

acctcattct atcctacgaa aatattaggt gcttacggtg atggtggaat gattattacc 600 

aatgatgcag aggtggaagg gaaattgcgt cgtttgcgat tttatggtgc agagaagatg 660 

tattatgcta tcgaacatgg ttataattct cgtttggatg aagttcaagc tgccatcctt 720 

ttgacaaaat tgcctcattt ggatcaatat atcaaacgta gaagagaaat agcgtatttg 7 80 

tataatgaat tgctgaaaga taccaattta atattgccaa aagaagcaga ttacggtaaa 840 

catgcttatt atttgtatgt agttcgtcat tctaatcgtg atgaaatcat ggcagcatta 900 

aaagaaaata atatatttgt aaatattagt tatccatggc ccattcatac catgacaggt 960 

taccagttcc ttggttataa ggaaggtgac ttccctgaaa cagaatctgc tgcaaaagaa 102 0 

atattttctt tgccgatgta tcctacatta acggatgaag aagttcatta cgtgtctgat 1080 
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atattacaca aaatattaaa atag 1104 

<210> 1051 
<211> 1140 
<212> DNA 
<213> B.fragilis 

<400> 1051 

aaacccctat ttttgtactc taaacaacta ataattataa aacagagacg aatgaaacgt 60 

agtttactct tcatatttac tttacttact attactttat cggctgtagc tcaacctcgt 120 

atctcttcta ataaggagac ccatcatttc ggacaaatcg aatggaaacg tccggtttct 180 

gtagaatata ctattaccaa tacaggtgat aaacctttgg tactgactaa tgttaccact 240 

tcttgtgcct gttcggttgc taactggacg aaaactccga ttgctcccgg agaaaaagga 3 00 

acagttagtg ctacgtttga tgctaaagcg ctagggcatt tcaataagtc aatcggcatt 3 60 

tacagcaatg cacagcctag tttggtttat ctgaattttg acggggaggt ggttcaggaa 42 0 

atcaaggact ttactaaaac acatccttat gcaatcgggc aaatccggat tgatcgtaca 480 

gatattgatt ttccggatgc acacagtgga gagaaaccgg taatcacatt aggagtagtc 540 

aatctttccg atcgtccata cgaacctgta ttgatgcacc ttccgcctta tctgaaaatg 600 

gaaactaatc cgaccgtcct tttaaagggt * aagaaaggaa ccattactct cacactcgac 660 

accaaacaac taatggattt gggcttgacc cagtcttctg tttatctggc tcgttttgct 72 0 

ggtgataaag tgggtgaaga gaacgaaata cctgtgtcag cagtccttct tccggacttt 780 

tcaggaatga ccgaacagga taaggcggtc gctcctgtta ttcgcctgtc cgaatctaag 840 

attgatttaa gtcaggtgct ggccaagaaa aacaaagcca gacgagacat tgttatcact 900 

aatacaggta aatctcctct gcaaattagc aaactgcaag tgtttaatcc ggcagtgggg 960 

gttgctttga aaaaaactgt actgcaaccg ggtgaaagta ctcggctgag ggtgactgtt 102 0 

ctgaaaaaga accttggaaa gaaaaagaga catttacgta tcctgatgat caccaatgat 1080 

ccggtgcaac cgaaagtgga gatcgatgta aaagctacga ataacgaatc acataattaa 1140 

<210> 1052 
<211> 1209 
<212> DNA 
<213> B.fragilis 

<400> 1052 

ctaaggatat tccggatcat gttttggcgg tgggaaacaa atgtaaaatt attaagagta 60 

tataaaatta tattattgat gaacaaacga atctggcttt cgcttgctca catgggtggc 12 0 

cgtgagcaag actttataaa agaggctttt gatacgaact gggttgtccc tttgggacct 180 

aacgtggatg cctttgagca atctttggtt gaatatttgc atgaagaccg ctatgtagtg 240 

gctttgagtg ccggaacggc tgcacttcac ttgggcttga ttcttctgga tgtgaagccc 3 00 

ggtgatgaag tgatctgcca aagctttact tttgctgcct ctgccaatcc gatttcttat 3 60 

ctggaggcca aacctgtttt tgtggacagt gagaaggata cctggaatat ggatccggta 420 

ttgctcgagg aggctataaa ggaccgttta cgcaagacgg gcaggttgcc gaaagcgatc 480 

attcccgttc acctttacgg tatgcctgcc aagatggacg agatcatgga tattgcgggt 540 

cgttatggta tctccgtatt ggaggatgcc gcggaggctt tgggttcgga actgaacgga 600 

cggaagtgtg gcacattcgg tgaactggcc gctctctctt tcaatggcaa caagatgatc 660 

acgacttccg ggggaggtgc tctgatctgt cgtacggaag aggaggcccg acagacaaag 720 

ttctacgcta cgcaggctcg tgatgccgct ccgcattacc agcataccca tatcggttac 7 80 

aactatcgga tgagtaatat ttgtgcgggt atcggtcgtg ggcagatgtt tgtcctcgat 840 

gaacatattg cccgtcgccg tgccattcac tctttgtatg ttgatttgct gaaagatgtg 900 

gcgggtatta cggtcatgga gaaccctgat tcgcggtttg cttccaactt ttggcttact 960 

tgtattctgg ttgatccgaa gcttgcgggt aagagtcgtg aggatatccg tttgaggctg 1020 

gactccgaga acatagagac gcgtcctctg tggaagccga tgcatcttca gcctgtgttc 1080 

acggatgctc cgttctatgg gaatggtacg agtgagaggt tgttcgatat cggcttgtgt 1140 

ctgccttcgg gacctacatt gacagatgag gatatcagga gagtggtgga tacgatcaga 12 0 0 

gcgatataa 1209 

<210> 1053 
<211> 840 
<212> DNA 
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<213> B.fragilis 



<400> 1053 

atgcaaccat ttatggaact atcagccaat tatatcaaac gaattgaaat agacggatta 60 

tgggaccgtt tcaatatagt ctgggacctt cgtccggatg tcaatatact atccggaatc 12 0 

aatggagttg gaaaaaccac tatcctgaat cgatcggtcg ggtatctcga gcaactgtcg 18 0 

ggtgaagtga agagtgatga aaaaaacgga gtacatatct ttttcgacaa tcccgaagct 240 

acttacattc cttatgatgt tatccgtagt tacgatcgtc cgcttattat gggggatttc 3 00 

acagcccgta tggcagataa gaatgtaaag tccgaactcg actggcaact ctatctgttg 3 60 

cagcgccgtt atctggatta tcaagtcaat ataggcaata agatgattga gctgttgagc 42 0 

agcaataacg aggaagaacg tagcaaagca gccactcttt ccattgctaa acgtcgtttt 48 0 

caagacatgg ttgacgaact attcagctat acccgtaaaa aaatagaccg tagacgcaat 540 

gatattgctt tctatcagga tggtgaactt ctgtttcctt ataaactttc ttccggcgaa 600 

aagcagatgt tagttattct gcttactgta cttgtacagg ataatgccca ttgcgtattg 660 

ttcatggacg agcccgaagc ttctttgcat atcgaatggc agcaaaagct gatatcgatg 72 0 

atccgtgaac tgaatccgaa tgtgcagatt atactaacga cacattctcc tgcagtgatc 780 

atggaaggat ggctcgatgc tgtgaccgaa gtaagtgata ttgcaaccag ctataagtag 8 40 



<210> 1054 
<211> 303 
<212> DNA 
<213> B.fragilis 



<400> 1054 

gtatcacatt ctcaggaaat aaaaaagaga gtatcctttt ctggacaccc tcttccctac 60 

gcatctgcgt cacatacttt actattgttt gatttcccat attatacccg tttaggtaca 12 0 

aatacaataa aagactccca cttttatacc cgatatggta taaaagtggg agcaagtcag 180 

gaatctatac ccatttgggt atattgttac tcattctatc tgattacatc tatgactttc 240 

ctgatatttt cattagaacc tactttctcc tatctattac atttcttttc tactccttcg 300 

taa 303 



<210> 1055 
<211> 234 
<212> DNA 
<213> B.fragilis 



<220> 

<221> unsure 
<222> (92) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 1055 

gaaagccgga agtgcaacgg tcaaacagtt cgggaatcaa agggtaccta tcttacggag 60 

gaatatgtat gggaaaataa gttggacatc gngcgtacgg catggcagta tgatccgtat 12 0 

acgcacgaat gggtagacat gcctttggta tcgaaaggca aaaaacaatc tgaagaactt 180 

ccagagcctg agtatggtga caaacaacaa tgtaagtgtc tctcagaaag gtaa 234 



<210> 1056 
<211> 1560 
<212> DNA 
<213> B.fragilis 



<400> 1056 

acatgccttt ggtatcgaaa ggcaaaaaac aatctgaaga acttccagag cctgagtatg 60 

gtgacaaaca acaatgtaag tgtctctcag aaaggtaaat atggaagtgt acgcacttca 12 0 

ttgacccatg tgtataataa aggacagtat ccgaaccaga aactgaataa gatcacttat 180 

tcggtgt egg gtgatatgaa gtggaagaaa ttctcttttg aeggaggatt gacttataat 2 40 

aagcgctttt atcccaatga catgggagee ggatacggtg gtageggatt cctttataac 3 00 

ctgttggtgt ggtcgggtgc cgaatatgat atacgegact ataagaacta ctggatcaag 3 60 
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caggacgaac agcagaactg gatggatacc aagtggtatg acaaccctta tttcatagca 420 

aatgaaattg tccgttcgag tgattacgat ttgattaacg gatatctttc tgccaactat 480 

gattttactc cctggttgaa cctgtcgctg cgttcgggtc tggattcata ctcgcagaag 540 

aaagagtggc ggaatgccgt cagtgccgta ggtggctggc ataaacaagg ttattatggg 600 

ctgcaacgtt taggaggata cagcttaaac aatgacctga ttttgtctgc cgatcacaaa 6 60 

ttcggtgatt ttaatgtcga tggttttatt ggtggaaacg tttattattg gaagagtgac 720 

aatatcctgg gcgaaacgca gaatgggttg aaaattccgg ggtattattc attgaagtca 7 80 

tcgattgatc cggtgaagac aaccagcggt attaccaaaa aactggtgac cagtgtatat 840 

gccaaagcct ctgtttcctg gaaaagtaca ctgtttctgg atgtgacagg acgtaatgac 900 

tggtcttctt cattgccgtc ggagacacgt tcttatttct atccttctgt agccggtagt 960 

gtggttcttt cacagttcat cccaatgccc gaagtgattg acttctggaa agtgaggggg 102 0 

gcatggacgc agaccaagag tgacttggga gtatacgata ccaacaatac ttacagtgtt 1080 

tctaccgatt tgtggaacgg tgagagcgcc gcatattatc cgacatctat ccgtggtgta 1140 

gcggtgaaac cctcggccac gcgttcttat gaaatcggta cggcaattca catgtttaag 12 00 

aatcgcctga aactggattt tacatattat aataaactct attacaactt gacccgcagt 1260 

gcaggtatca gtaactcttc cggatttacg tctacattga tcaatatcga tgaagaatat 132 0 

gtgggacggg gagtagagtt gactttatcg ggcgatatta tcaggacgag agacctgaaa 13 80 

tgggagtcgt ccttcaactg gtcgcgtgac cgttggtatt ataccaaaat agacccggtg 1440 

tattctacac aaaaaccttg ggtagccgtc gggaaacgtt gggactggta cggtatttac 1500 

gattgggagc gtgattcaca gggaagtctt caccacgggg gtggaaggag caacgtctga 1560 

<210> 1057 
<211> 825 
<212> DNA 
<213> B.fragilis 

<400> 1057 

cagacacgat ttgccacaga acgtatatgt ggggcgactg ggcccttccc cgatcccgac 60 

agggctgggc caaatatccc gttttcaatc cggcggaccc cgttccggca tccggtatcg 12 0 

gtcggctatt ccaacatata ctccttttgc agggactttg tcagcacgcc ccgtttcctg 180 

ggggctttcc cgtggctgga actggcgggc accatcgccc ccttgctggc agcctataat 2 40 

gccaacgcgt cggcactgag cctgcatata gagagcccgc aggcctattg ggacgcggca 3 00 

gaggacagga tcagggaaat atgcaagcgt aagggggtgc cttattcggc cagaatgctc 3 60 

gaggaattca aggatgaagc catggagaag ttcgcttcag gagtgaccgg cagggagaat 42 0 

gtgggcaaat acatgcacac gaccaggttc tgggatgcgg acgccaatga cttccagggc 480 

tggacgataa cccccatcga caagaaaata agggattata tcgaaagcca gatcaagatc 540 

gccaacaagg ccgatgccgc ggccacatcc gggttcggac tggatccggt gctctcaaac 600 

ctgatcatgg ataacaagct gtcctcaggg tcggaaaaac tgtactccat caaggtgtac 660 

aatgccagtg agacggccat accggacatg atcctgtgca aaccgttgat gcactacata 720 

cgggccaatt atccgggaag cagaacacag gtggggcttt accggagcgt agtggaatcc 780 

gaacagagtg tatcacccgc aaacaggatg aaagaaaata tataa 825 

<210> 1058 
<211> 477 
<212> DNA 
<213> B. fragilis 

<400> 1058 

attcgaaacg ccatgaacga ggatctgata aaacaagagt ttgtccggga gaatatcgaa 60 

agggatatcc gggccatttt cgaggcgcaa tacctgatcg ccaccgaaag ggtgtatacc 12 0 

tctgccatct atccgactca ggtcggacag gggcggagcc ttgtccggga acaggggtat 180 

ggacgcctgg tgcggggtac taccggccgg ctgctcagcg ccttacacaa ccccgtttac 240 

agcgtcgggt tttccgggcg gggggtggtc gccacttcca acatccccct ctatatccgc 3 00 

ttcctggata tgaagaaaca tggaaactat ggcatctata accgccaggt atggggaatc 360 

ctctggaaca attcgctcca gaccataaaa tacggatatg gcaaggaggt ccgcgaccgt 42 0 

atttatgccg gattacagga agctttccaa agaatggaaa tacgtacgga ttcctaa 477 



<210> 1059 
<211> 456 
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<212> DNA 

<213> B.fragilis 



<400> 1059 

aactgttcag ctatggtaga tttacagcaa tatgaagagt attggtcggg tataacggag 60 

aggattccgc aaataaagaa ggtggtgcct gtcaccttcg accccgacat gggcgctttg 12 0 

gtccaagggc ttaaagcgga cgaactaccg gcgctgctac tcatcatccc aagcgccaaa 180 

ggaaaatccc cggatgtgga caacctgttg gaattaaacc tttgcgtagc gttcctgatg 240 

gacaagaccg atccgcagcg taaggggact tatcaggtgt taaaggagtt gcagcccgtc 3 00 

atggagaaga tgaaagcgca gatgatcgat gacaaggctg cgggatgtca cctgctctcc 3 60 

cgtctggacc tgtcgagctt atccaccatt cccgaagcgg gattttattc ggtctttgcc 42 0 

ggatggagcc tgggatttga attcgaaacg ccatga 456 



<210> 1060 
<211> 402 
<212> DNA 
<213> B.fragilis 



<400> 1060 

gaagcaatga agtattttac aatcaaggaa ttaagccaca gcgatacggc cgtggcgcgc 60 

gggattgaca atacccctac gggggaggtg gttcacaacc tgacagagct ggtggaaaac 12 0 

gtcctcgacc cgctccgtga aaagtacggg aagcccatcc gggtaagttc cggttaccgg 180 

agcgctgtgc tgaacagaag cgtgaacggg gcgacctcca gccagcacct actgggtcag 240 

gccgccgata ttaccgtagg cagcaaggag ggaaaccgcc ggcttttcga gatcatccgc 3 00 

aaggaactgc ctttcgacca gctgatcgac gagaaggatt tctcctgggt gcatgtgtca 3 60 

ttccgcacag gcaaaaacag aaaacaggta ttaaaactct aa 402 



<210> 1061 
<211> 2847 
<212> DNA 
<213> B.fragilis 



<220> 

<221> unsure 
<222> (2724) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 1061 

gaatcggtga atgacctgga gctcatgcag gccgcggtac gcgccaaaga ctttcgtatc 60 

ccgctcgagg acctggggaa atacttgcag tttgcccggt taaaagcaca gcagaccgga 120 

cagagcgtgg attatctgac tgactcgatc ataaccggac tgggtcgtaa atcactgctg 180 

atactcgaca acttaggcct gtcggccgcg gaggtcaacg aggagatggc caaaacgggg 240 

gatctgatgg cagcggtggc tgcaatcgtg gacaggcaac tcgcgcaagc gggggagaat 3 00 

tacgtgtcgg ctgcggacaa ggccgctcaa aaagccgcgg agctgcaaaa caggcagatg 3 60 

gaaatcggaa ggctcctgct tcccctgcaa gaaaaatgga gcggtctctt ccagtctctc 42 0 

aagctcggtt tttcagatgt ggcgctccgg gtattggaac ataaaaaaag catcattacg 480 

cttatctccg tggttaccgg ttttattctg gtttataaaa cggttattct tctgcaaaaa 540 

acatggaacg ggcttttaat gctgggcaag gcggtcggcc tggcctacgc ctcggttgtg 600 

gccatgcaga gaggaaacat cctcagaagc gctgccgcca tgaaaatgta taatgcctcg 660 

gtggcgtcca acaacatcct ggtaaaggca tgcaccgcat ccacttatct gtttgccgcc 72 0 

gccaaggcgg tgcttaccgg gaatatcaac aaggcccgga ttgccatgca ggccttttat 780 

gcaatcacca aaataagccc gctggccata gtggccaccg tggtcgccgc cctgacatac 840 

aagctggtgt cttatcgcag ggaacttacg gcgacggaaa aagcggagcg gagcctgcac 900 

cgggtgcggg cgcaggccgc cgacaccgta gccaccgaaa cccgggaact gaataccctc 960 

ctgggaatcg cacgcaacga gaaaataagc aaggagcagc ggatggaggc cataaaaagg 1020 

ctcaacgcct taagtgagga ataccttggc ggtctcaggc tggaaacaat caacaccagg 1080 

gaagccacgg ccgcagtgaa ggactacacg gacaacctgt tgtccatggc caggatacgc 1140 

tcggcaaact caaggctgga ggagatccag aaagaaaaaa gggcactgga ggaacagcgt 12 00 

aaggatatcc atgccaaccg gaatctctgg gacagtttca aactggggct cgccaaaggg 1260 



432 



ttcaattctt tgtccgtagc ggtaaagggg tattccgacg cctggtcgga gggggtcatc 132 0 

catgactatt ttgcaaggga attcgaccaa atacaagcct tgaaccggga agaaaagaaa 13 80 

cttacgcagg agatcacggc ctcacagcag gatatcatta aggtcgatac ccaatccgag 1440 

gcaaaaacca aggaccttat ccaggccaaa aaggaagaga ttgcccaggc tgagcgggaa 1500 

gtcgcctcga cgccggccct gctggccgcc aaaaaccgga aactccaaca gttgaacgag 1560 

gagcttaaag cgttgcagca gctgggaact atccgggaaa ccccggacgg attcgcctcg 162 0 

cagaccgaca aggtcctctc ggccctgaat gagaggcatg aaaaggagct gctcaagatc 1680 

cgggaaaaca aggagaggca gcagcagaca caggctcaat acgataaggc cgtgctggcg 1740 

gaagacataa ggtttcacac ccaaaggctc gtcatcctcg aagggctgga gaaaaagacc 1800 

gcccggacca aactcaggca gctggccgac atccgggcaa aaatgacgga aagctccgcc 18 60 

aaaatactgg agttacagcg gaaactggat gaaaacgagg ttgccctgct acaggaacag 192 0 

cgggataaaa aactggccat acaggaggat acgtacaagg ccaccagggc acagatagaa 19 80 

ttgaattatg caaacctgca tattacgcag cagcagcgcg acatgttgct gttgagcctg 2040 

gaggagtcca attcccggga aagactcggt atcctgaagg aataccggaa ggatgtggaa 2100 

gccctggagc tacagacggg ggatgtgaaa atacaggccg tcaaactctc cgggcaaaag 2160 

gtactggagg cggagctggc caacgccaaa gacagggccg cgcagcaaaa ggcgatcgaa 222 0 

accatgcttt cctctttcaa aaaagagttc aaccttttca atctgccgga tgaaacggac 2280 

cttcagctca aggtgctgga agcgtcatac cgggcccggc tggaactgat ccgcaatgcg 2340 

ctcaaaaacg agcttgtcac aaaggatcag gccgcccggc aggaaaagga gctggaagaa 2400 

gcgtacagca ccgcaaaact gaacataacc cggggtgccg aagagcgcag aaacgggatt 2 4 60 

ttggagaagt acgggctgct cggattccag caacgctatg ccatgcagat ggcggccctc 252 0 

aagcgggaga aggaacaggg gttgataggt gccgaagcat atgcaaaggc cgaaaagatg 2 580 

ctcaagatac agttctggaa agaggctttc gattattatt ccaacctgtt ttcaggggct 2640 

gtctctgccc tgcaaaacgc cgagatcgcg aacatggagg ccaaatatga cgccgagatc 2700 

gccgcggcac agggaaacgc gcangaagtc gaacgcctga aaacggagaa agcgcagaag 2760 

aagctggaga tcgagaagaa atacgccgac gtgcagtttg ccgtaaaagc caccagatca 282 0 

ttgcccgaca cggcgtggcc atcatga 2 847 

<210> 1062 
<211> 951 
<212> DNA 
<213> B . f ragilis 

<400> 1062 

tatacaaata tgaaactgat atttgataaa gattcaaacg gcacgcagga actggtcgac 60 

gccttgggcc tgatcgatgt ccgcacggac ttctccaaat ggaagccgta cataccttta 12 0 

agcatacgtc gcctgaccgc catcataggg caggaggttt atgacaaggt tctcgacttc 180 

taccaatcgg caagcgtcga tccggatggc aagctcaccc gcctgttggg aatggtgcag 240 

cagtccgtag cgctgtttac ctggctgaaa atcatcccca cactggatgc gcagcatggg 3 00 

aacacaggca ggcagaagcg cttgggggag cacgaaaaag ggctgacagc cttacaggag 3 60 

tacaaggatg aagccaacat cctgagtcag gcctacgagt cggtagatgc cctgatagca 420 

tatctggagc aggaaaagtt cgatttctgg atacaaagcc ccaaaaggaa ggctgtatcg 480 

gaattgctcc tgaatagcaa ggaggcattt gatttttact atgtaaccgg cagccaccgg 540 

ctgtttctga ccctggcacc catcatccgg gaggtgcaac agaggcatat catcccgata 600 

atcacgtacg gccgttatga aaagctggta gcgggccagc aggtggcaga ggggttccga 660 

gacgccgtct gtcggccgct ggccctgctg tccatgagca aggccgtgga acgtttgccc 72 0 

gtggaggtcc tgcccgacgg tgtggtgcag gtgcagcttg caggaagcgt ccgtgaaaag 7 80 

ctcagggcgg aagccgaagc gcgcaagaca gtggcaaaaa gcctggaaca agatggcatg 840 

cgggatcttg ccgcgctgga ggacctggtc gcggcgctcg acgccgcacc ggatgaaccg 900 

gatctgtatg taccctcgat cacccttcaa tcaaaaggca taacattctg a 951 

<210> 1063 
<211> 648 
<212> DNA 
<213> B.fragilis 

<400> 1063 

aatacggata tggcaaggag gtccgcgacc gtatttatgc cggattacag gaagctttcc 60 

aaagaatgga aatacgtacg gattcctaaa gacacattgt ccttttgtcc gggagacggc 12 0 
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ccgcctattt 
tggacgcttt 
accaaagagc 
cagggaaaac 
cgtaccatct 
gacaaatcgt 
acggtaaaat 
aaggaggcga 
ccttaccagg 



ttgtccttaa 
tggtcaatgc 
tcaaggcgga 
tccagagcaa 
cggaaaacgg 
atgcccagct 
cgttacaacc 
tggagcagct 
atgaaatccg 



aaataacgtt 
ggacgggttg 
gaacgtgcgc 
ggagtataag 
ggagaagctc 
ctcaaaacag 
ccaagagtac 
aaggcccaaa 
cggtagtccg 



atgggaaaat 
cagaaggaga 
ctgaaatcct 
gaactgaatg 
cgcctgctcg 
gccagacagc 
gcccgtctgg 
acccgaggcg 
gttttttcgc 



tacaaccgga 
tgctgaaggt 
ccatggaaaa 
cgcaactcaa 
aaagccgtct 
ttcgcaggga 
aggcggaact 
gtgaaagagt 
cggtatag 



ttatatcacc 
gcgcaacaac 
tcttgccatg 
ggccaacaac 
gaacaacgcc 
attggacaat 
ggcaaagacc 
catttttcag 



180 
240 
300 
360 
420 
480 
540 
600 
648 



<210> 1064 
<211> 795 
<212> DNA 
<213> B.fragilis 



<400> 1064 

attatggaaa tcagaagaac cggaaatacc ggatttatag ataccgggga ggggcagctt 60 

atctccttcg ccatgggaaa gggatggtct ccttcttcca tcagcttcag ccgcccggag 12 0 

agctggcaga cccgcaagat acgggtcgcg ggtgtgaata tcgtgcccat gggtgccaat 18 0 

aacgaccttc ccggcgacgt acaacgcctg ctggataact tttacggcgg tgagggtatc 240 

atgggtaaaa tacagggatt gcagtgggga gagggccccc gcttctttga ggaggccatc 3 00 

gactccgaaa acaaccgctt ttaccgcaaa tggatactcg atgacgtcat acaggcggat 3 60 

ctggagagtt gggattaccg cgactatatg ctccgctgcc tggtggacct ggtgcacatg 42 0 

caagggttct gggtaaagtt catccgtaac cggggaccgc gtatcggaga ggatggaagg 480 

ataatcaggc tggaacatat cccttacagg aaatgccgct tcgaatatcc cgatgacaga 540 

cacgatttgc cacagaacgt atatgtgggg cgactgggcc cttccccgat cccgacaggg 600 

ctgggccaaa tatcccgttt tcaatccggc ggaccccgtt ccggcatccg gtatcggtcg 660 

gctattccaa catatactcc ttttgcaggg actttgtcag cacgccccgt ttcctggggg 72 0 

ctttcccgtg gctggaactg gcgggcacca tcgccccctt gctggcagcc tataatgcca 7 80 

acgcgtcggc actga 795 



<210> 1065 
<211> 858 
<212> DNA 
<213> B.fragilis 



<400> 1065 

accggatctg tatgtaccct cgatcaccct tcaatcaaaa ggcataacat tctgactatg 60 

aaagaaatta cgtacaacaa tcaaaagaaa gagattccgg actccctgga ggagttatcc 12 0 

cccaaggagt attaccgtta cctggagttg gtattaatga tgaacgcggg ggagatttct 180 

cctttccaga tgcgctgcaa gctgctttcc tgccttctgg ggatgaagca cagccttctt 240 

ctgtgcctgg gagaaataca ggaagagctt ttggcgcaac tccccgccct ggacgggttc 3 00 

ttcgatatca cctcgcagga ggggatgatg gtttacgacg cccgcctgaa aactggccgg 3 60 

aacctgctgc ccgcctataa ggagtggaaa ggcccggggg atatgctctc ggggattact 42 0 

ttcggacagt ttatcgagtg catgggggtg atggcggaaa tggagcgcgc ccgggagcag 4 80 

ggaaatgaag aagatatagg ggaactgata tcttctatag gcagactgct ttataagaaa 540 

cagggccctc aggaaaccgg cactcctcct ttcccggtct gcttccacgc atacatcttc 600 

tttctcgcgg tctgggagct gatttacagt gtccccattt caaccaacgg gaaggacatc 660 

gacttctcga tcctgttcga gaaatccggg cgggggaatg caggggacaa taccggctgg 72 0 

gtgggaatct cgtacgacgt ggccgcatcg ggtgttttcg gtgatttcag acaggtaaac 780 

gacacccctt tctgggatgt gatgctatac atttataaat gcaggtttga aatgttacat 840 

aacaataaga agcaatga 85 8 



<210> 1066 
<211> 507 
<212> DNA 
<213> B.fragilis 



<400> 1066 
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aaaggcgaat cgaaaacatt aaaaatagga aaaaagacta tggggcaatt agacaaaacg 60 

gatgttgaaa tacttcaggt attacagaaa gatgcgaaag tgaacactaa agagctttct 120 

gagaagctcc atatatcaaa aacgccgata tatgaacgca tcaaacgact cgaaaatgat 180 

gggtatataa aaggatatgt cgctttggtg gataataaaa aagtcggatt gcctttgatt 240 

gttttctgta atgtctctct ggcagttcac gacgacgaac atataaagcg ctttcaagag 3 00 

gagatcaagg agatcgatga aattatggag tgctattcta ccggcggtat ttatgatttt 360 

ttcattaagg tggtcttgaa agatctggat gcctataacc gattcgtttt tgagaaactg 420 

actaaagttc acggtatagt taagatgcag agttcgtttg ttcttagtga gattaaacat 480 

acgacagttt tgaatataga ccgatga 507 

<210> 1067 
<211> 648 
<212> DNA 
<213> B.fragilis 

<400> 1067 

cagctcttgt tggcccgtca ggaagtggaa aagagtacgg taatgaaact ttgtgcccgt 60 

ttttatgatc cgacaaaagg gcgtatactg tttggtggag taccggtacg agagattgaa 12 0 

cctgaaaaat tgatgagtcg tatttcgatg gtttttcagg atgtttattt atttcaggat 180 

agcatacgca acaatattcg gtttggtaaa agtgatgcca cagatgaaga gattgtagca 240 

gcggccaaaa aggcctgttg tcacgacttt atcatgcatc tgccacatgg ttacgataca 3 00 

atggtgggag agggaggctg tacgttatca ggtggtgaaa aacaacggct ttccattgcg 3 60 

cgtgccatgc tgaaagacgc acagatcgtt ctgctggacg aggcaactgc ttcgcttgat 420 

cccgagaacg aagtagagat acagaaggct atcgatacgt tgattaaagg acgaacggtt 480 

attgttatcg cccatcgtct caagacaata atgggggccg accacatcgt tgtcttatcc 540 

gatggaaaag tggaagaaca aggtacgcat tcggaattga tgtgccggga tggtttatat 600 

cggaagctct ggaacattca agaaagtaca ttgggatgga cattatag 648 

<210> 1068 
<211> 423 
<212> DNA 
<213> B.fragilis 

<400> 1068 

attatatacc caaagggaat taatatcatg atacagacaa tacaagtaca aggaacagaa 60 

aaacgcttat accaacttat tgctccattg gtgatgaatc cggatgtttt aagtgcaaat 12 0 

aataattatc cttttaaaac gacagaacaa tacgtgtggt tcattgctat cgataaaaaa 180 

tcggttgttg gttttatgcc ggtggagcat agaaggagcg gatgcgtaat caacaactat 240 

tatgtcagcg gtgataaccg tgaaacactc tcattattaa tctccagtgt tttggaagca 3 00 

atcggaaaag aagtacgttt gtttgccgtt gttatggtca accatcaggc tgtatttgag 360 

gaacacgggt ttataatgga gaaggcatgg aaacgttatg taaaaatgca aaaagatgaa 42 0 

tga 423 

<210> 1069 
<211> 1827 
<212> DNA 
<213> B.fragilis 

<400> 1069 

ataaatcggt tgatcatggt aaataagaag aaagaagggc tgtcccgtct gtttgagatt 60 

gcaggacaga aaaaaagtct gcttctgttg gcaggcttgt tatcggctgg gagcgcggtg 120 

tgtatgctca taccttattg ggcgatctac cggatactct atgaattgtt gaaccatagc 180 

cgggagctgt cgtccatcga tgagaccaat atgatccgtt ggggttggat agcctttggc 240 

gggctgatcg gcggattatt gttgctgtat gcttccctga tgtcatctca tgtggcagca 3 00 

taccgtattc tctacggact gcgtatccgg ttgacggaac atatcgggag attgccgctg 3 60 

ggttatctga acggaacatc aacgggagcc atcaagaaga cgatggaaca gaatgtagaa 42 0 

aagatagaga acttcatagc ccacacgatt cccgatttgg tgaacgttat ggcaacagta 480 

gtggtgatgt tcctcatttt cttttcgctc gatggatggc tggcaggtgt ctgtttggca 540 

gtgatcgtac taagtatatt cttgcaattt tccaatttca tgggaaaaaa ggcacgggaa 600 
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tttacacgca tctattacaa cgcgcaagag cagatgagtg cttctgccgt gcaatatgtg 660 

cgcggaatgc ctgtggtgaa aatctttgga cagagtgtcc gctcattccg tcagttcaat 720 

gccgaaatcg aagcttacaa gacctatgca ttgaaagttt gcgacactta cgaatcgggt 780 

atgacatatt ttaccgtact gctcaattcg attgtcacct tcattctccc tgtcggtatt 840 

ttactaatgc aaaatgactc ccggagtctt acgctggcag ctgtatggct tttctttatc 900 

atactcggtc cgggcgtggc ttcacccgtc tataagttga tgtatctggg cagcagtacg 960 

cgggaaatca atgaaggtgt atcgcgtatt gatcgtattc ttgaaaatca gccggtctcg 102 0 

gaacctgctt gtccgaaaat tcccgcgacg tatgatatag agtttcgtca tgtctcgttt 1080 

tcctatgaaa acaaggagca ggctactcgt accgaagcgt tgcacgatct ctgtttcacg 1140 

gcccctcaag gtaaaattac cgcttttgtc ggtccgtcgg gaagtggtaa atctaccgtc 12 00 

gccaatctga ttccccggtt ttgggatgtg gagcagggag aaatccttat cggcaatgtg 12 60 

aatgtgaagg atattgcaac ggagcagtta atggatctcg tttcgttcgt ctttcaggat 132 0 

acattccttt tttacgatac actctatgaa aatattgccg taggttcgtc caaggcaacg 13 80 

agagatacgg tcattgctgc cgctcgtgct gcgcaatgcc atgagtttat cgagaagtta 1440 

ccgaacggat acgaaacacg tatcggagat aaaggtgttt tcctttccgg tggtgaagca 15 0 0 

caacgagtct gtgtggcacg ggctattttg aagaatgctc ctatacttgt actggatgaa 1560 

gcaacggctt ttgccgatcc cgagaacgag tacaagatgc agcaggcttt gaaatcactt 162 0 

attaaggata agacggtcat catcatagcc caccgccttt cttccattgt atcgtccgac 1680 

cggatcatcg tactgaaaga tggaagggca gtacaatgcg gacggcatga agaactttcc 17 40 

tctcaagaag gggtatataa aaagatgtgg aatgcttata cgagtgcgtt ccgctggcaa 1800 

ttgaatgtga aacaagaaaa agaatag 1827 

<210> 1070 
<211> 558 
<212> DNA 
<213> B.fragilis 

<400> 1070 

aaacatattg caatgagtat aaagaaaagt ccggtatata atgtaatagc agttcccgta 60 

gaaaaagtac aggccaacga ttacaatccg aatgtggtgg ctcctccgga gatgaggctt 12 0 

cttgaacttt ctatctggga agacggcttc actatgccct gcgtctgcta ttatgataag 180 

gaaaaggatg tttatatcct tgtcgacggt ttccaccgtt attctgtgct gaagacttcg 240 

aaacgtatct ttcagagaga aaacgggatg ttgcctattg tggtaatcga aaaggatctt 3 00 

tccaatcgta tgagttccac tatccgccat aatcgtgccc ggggtacgca caatatagaa 360 

ctgatgtgcc atattgttgc cgaacttgat aaggcaggca tgtccgatca atggattatg 42 0 

aagaatatcg gtatggatcg ggacgagttg ttgcgcttaa agcaaatatc gggtttggcc 480 

gatctgtttg ccaatcgtga cttcagtgtt cccgaagatg accagccggg aaatgtagat 540 

aagaaaccta ctcgttaa 558 

<210> 1071 
<211> 1014 
<212> DNA 
<213> B.fragilis 

<400> 1071 

agaatgagtg caataagaaa tattacaata ggccgtacgg aaaggcttta taaacctgta 60 

ggctacacta tgcttgccaa tttggtgaac attgttcctt tttgcctttc tatcgaggcg 12 0 

attcgtatta tattccgtgc tttcaacgga ggcgggcaat cgcttgatac cacccggttg 180 

tggtgtatat ttggctgtat gacaggttat atagctgtta tggtactggc ggaaagggct 240 

gcctatcgtg ccaatttccg tggtgcttac gaaatgagtg catcgggacg catctctttg 3 00 

gcagaacatt tacgcaaact ttcgttaggt tttctgggta aacgggatcc gggtgattta 3 60 

tcatccatgc ttattaccga ttttacaatg gcggaaacag gtatctcgca ttatttgcct 42 0 

caactgatgg gagcattggt gatgcctgta ctggcttttg tttcgcttct ttggatcgat 480 

tggcgcatgg cggtcgccat gttcgtggct cttccgtttg caatgggcat tttgtggttg 540 

agcacgagcg tacaggagag gctgagtggc aggcagatca aagcaaaagt caatgccgga 600 

aaccgcctgg aagagtacct gcaaggcatc cgggtgatga aagcctacaa tctgctgggt 660 

gatcgttttg ttcggttgcg tgatgctttt gccgaattac gtcgtgcctg cattcggttg 72 0 

gaggctctat tgggaccttt tgttctattg gctattacac tcgtgcgtgc aggattgaca 780 

ttgatggtac tgtgcggaac atacctgctt ttaggtggcc agttgtcgat tctcacgttt 840 
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gtcatgttcc ttgttgtcgg ttcccgtgta ttcgacccgc tgacttccgc tcttaccaat 900 
tttacagagt tccgtcattt ttctatttcg ggaggacgta ttctttctct tatgaacgaa 960 
cccgaaatga aagggacaaa agaagctccc gaagacggta atatcatctt ttga 1014 



<210> 1072 
<211> 354 
<212> DNA 
<213> B . f ragilis 



<220> 

<221> unsure 
<222> (280) , (285) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 1072 

tcgtggaggt atccacttga tttaggggct gtaaagctgt ctgcacagca aatgattgtg 6 0 

cttacaccgg tcttgcgata tacagaagga gaagaacagc agctattagc tccggtagtg 12 0 

atagccggac cccgtcgcta tcgggtgctg aaacgatctt tagctttcgg tactgacaat 180 

tttgaaatgt ttcctatgct tgtcgagaat cgaaagagcg gtactcccca gactgtgaat 240 

attcacttcg gattacccta tcatgaatgg atgcgccggn catanctgat tatacgtgaa 3 00 

aggtgcccgg ttgtgccgat cgtcttgtca gtcaggatga ccataccgtt ataa 3 54 



<210> 1073 
<211> 471 
<212> DNA 
<213> B. fragilis 



<400> 1073 

atacctttgc aggtacaaaa gtatatagaa tatatggaaa caaagattct ttcaaatgcc 60 

acacacaaat gtgttttagt gatcgataac gctcaaccta cgggcatagt agccaatatt 12 0 

gccagtgtct tatccatgac gctagggtgc agagtcagca acattgtgag tcatgatgta 180 

tatgataaac aaggtgaaag gcatttgggc ataacacaac tgccgattcc tatacttgga 240 

gcttcacagg agaagataaa agagctccgg aactatttcc actctttaga aattgaagat 3 00 

ctggtactgg ttgacttttc cactattgcc caacaatcca gaacttatga tgaatatgaa 360 

cgtgaaatgt atagtgccaa tgaagatgat ctgcactatg taggtatcgg tatttgtgca 42 0 

gagaagaaag ctataaataa agcaaccggc agcctgagtc tgatcagata a 471 



<210> 1074 
<211> 183 
<212> DNA 
<213> B. fragilis 



<400> 1074 

ttgataaggc agctatactt gccgttattg gcgaacatac ggatttcgcc acacgtaaaa 60 

aatcggttgc aggcgttgaa tgtaggaact acctacagga tgttactgag ggactattat 12 0 

ccgcctacac gtagtaatgt atataccatt tttaatgtgg cgagagcttt caatgtagtc 180 
tag 183 



<210> 1075 
<211> 1305 
<212> DNA 
<213> B. fragilis 



<400> 1075 

atgatgacta aaactacaac tgtacctaaa aatgtgtatg aactggcgca agaacgtttg 60 

cgtatcgtct tcaatgagtt cgataatgtc tatctttcct tttcgggagg aaaagatagc 12 0 

ggagtgctgt tgagcttatg tattgattat attcgccgga ataacctgaa gataaaactc 18 0 

ggggttttcc atatggatta tgagatacaa tataagatga cgattgacta tatagcccgt 240 

atgttggaag acaataagga tattcttgaa gtataccggg tttgtgtacc tttcagggta 3 00 
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gctacctgta 
ctctgggtgc 
acacaaatgt 
gatgccgtgc 
cgctgtattt 
gtaggaaacg 
actgctaacg 
ggagtcaatc 
ctggcacttt 
ggagtgaact 
cttccggaag 
cgtgcccgaa 
ggagggtgtc 
gtaatggaca 
gatattgata 
ctgaaaaatg 
aaacgaaatc 



cttccatgta 
gtccgttacc 
gggattatga 
gcacctgctg 
acctcaatcg 
atatctttaa 
gaaagtttaa 
tggaacgtca 
accgagccat 
tcacgagtat 
gatatacctg 
acggatatct 
tgagcgacga 
actccaatta 
ttccggagtt 
atcatgcttg 
aagtaatcga 



ccagtctttc 
tgagaacgcc 
gttccaaatg 
tctgatcggt 
caagtaccag 
tgcatatcct 
gtgggattac 
gcgcgttgcc 
cgatccgaat 
gtatggagga 
gcgggaattc 
ccggaagtta 
aactattcgt 
taaaacgacc 
cagggaaatc 
caagtatatg 
acaatataaa 



tggcgtccgt 
atgactaaag 
cgttttgcca 
attcgtacgc 
atgtatcatc 
atatacgact 
aataaactat 
agcccgttca 
acgtggggga 
acccatgcga 
atgtattttc 
caggtcagtg 
aggctgaatg 
aagaagccgg 
cccacctaca 
ggattctctc 
aacatattgc 



gggaagatag 
aagactttcc 
gttggcttca 
aggaaagttt 
gttatcggtg 
ggaaaactac 
acgactatta 
taggtgaggc 
aaatgatagg 
tgggatggca 
tgctttccac 
tacaattctg 
aagccaaggt 
tccgcatgga 
agcgcatgtg 
cgacaaaaga 
aatga 



taaaaaagac 
tttttataat 
tgagaaaaag 
caatcgttgg 
gacttcgaag 
ggatgtatgg 
ctattgggcg 
acaagagagt 
ccgtgtcaat 
gtccattaag 
tttgcccgac 
gcgtaataag 
acctattatt 
atatcaggat 
catctgtatt 
ggaaatgagt 



360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1305 



<210> 1076 
<211> 291 
<212> DNA 
<213> B.fragilis 



<400> 1076 
agcggaatca 
tacttttatg 
acaaaaaaag 
gatttatatt 
caaacaatat 



ataatgtttc 
atgttagtgt 
aagagaacct 
tttgtcttct 
atactcatgc 



cgccacgatg ttttcaaatt 
cagagtagcg ctgaaaaatg 
tcgtaaagat tctcttctca 
attgctcata ctctcagcaa 
acgatatact ccatttcctt 



catgtttttg tttcattcta 60 

aacggtattc aaaatcgttc 12 0 

attttaatcc ggtaaacggc 180 

agccctccaa ccttgaattt 240 

gctatcgata g 291 



<210> 1077 
<211> 327 
<212> DNA 
<213> B.fragilis 



<400> 1077 
aacatggaaa 
ttaaattaca 
cttcatccgg 
atagcccgcg 
atggatgact 
aattggaaag 



aaatagagat 
tcaacggata 
aagtcgaagt 
tcaataaaaa 
ccataaaaat 
cgctactccc 



tgtattacgc 
ttggtatact 
cagaaaatgt 
aattataaaa 
actacttccg 
tccataa 



cgggaacaaa 
tacgaatggt 
attggagtgc 
aaattagaaa 
ccgttcaacg 



ataacagaaa 
cagccttttt 
aacccgatga 
ggaagtatca 
aagatgaaaa 



tggcatttat 
gctatgtatg 
aaactatgct 
gacatccatg 
tatcttcctg 



60 

120 

180 

240 

300 

327 



<210> 1078 
<211> 924 
<212> DNA 
<213> B.fragilis 



<400> 1078 
gtgcaaagaa 
aaacttcact 
acccatagcc 
ggaatgggaa 
catatccggc 
gactttctgt 
acaggagagg 
gttatggtga 
tggcagaaat 
cacacaagta 



aaaaaaacgg 
ggggtaaagc 
tggcacgtgt 
tgggggcttt 
agactttatt 
ttatgtattt 
tcgtcagccg 
tgatactcta 
tatttttcgg 
ttgtagcttt 



taaaatgaaa 
ggttatgtcc 
cctgaaagag 
cggattattc 
gggattgttc 
tgccaaccgt 
gccggagtat 
tctgttctgt 
caagcacaaa 
tatggaagtg 



gatagtaaat 
atagtagtga 
ggaaccacag 
atggtgattg 
ggaggaatgt 
ttcggtacgc 
ctgttactgc 
actcggaacg 
aaggagattg 
atcactatgc 



taaaaacggc 
cactggctgc 
gagtggagca 
cgggagtgtt 
tctattggat 
aggcacaact 
ctgctacctt 
gatgcaactt 
ttgtccgggc 
tatggacgtg 



aggtatgcgg 
tatgccgttg 
gttttatgcc 
tgtgaaaggg 
gggagctgtc 
tgatccggtg 
cggtttttgg 
cttgaactgg 
catgacgcgc 
ttacctggta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 
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ctgatgtttt gttatgatga acggttcttt ggcgatcacc atccggtgac actgttggtg 660 

ggaatgttgg gacttatcgg ttcgattttt atgtttgcca aactgctgcg tcatgcctcg 72 0 

tgggatatga gtttacggtt cggctttgcc accgtcatta ttttttggat agcggtagag 780 

gtgtttgatc gtattcattt gttccccggt ctgtgggaaa atccgggtgg acataagcag 840 

gaattgcttc ttatagcggc ttctatcatt tttacagggt ggtggcttgg atataataac 900 

ctattagttt taaagaataa atga 924 



<210> 1079 
<211> 954 
<212> DNA 
<213> B.fragilis 



<400> 1079 

aatggtggtt gtttgataaa aaggaacttt ttctgtcctt tgcagtacct ccgtcacata 60 

aaactttgga atgaacgcaa aatggcagat aatgtaaaag caaacaaaat gaactctcag 12 0 

gccgacgatg acattggatt tgggatctat acggatattg cagatcttcc tatgaccggt 180 

tgtccgtctt atattgaaga ggggatcggc ggtgtatgcg aatcgggtac ggcaactatc 240 

gtagtttttg atgttccatt ccaaatcgta ccgaatgtgg tcattaccct gatgccgtgg 300 

caactcgttt ttatcaaaga gattagtgag gatttcagga tcactttttt caaaatttcc 3 60 

aaagatatgt tttcagaaac tctgagtaca ttatggagac ccgcttccgg gtttttgctg 420 

tacatgcgca agcatattgt atcaatcccg gacggggagt tgatcggtcg ttttttggcg 480 

tattgtaatc ttctggtata caggatgaag catacaccgc aaaattgtcg tcaggaatca 540 

atcatgcaac tgttaagggt ttacttctgg gatgtctata ctgtgtatat caatgatcct 600 

caggctgaga agagtctgaa gtttacacgt aaagacgaat atgtctatca atttgtacgt 660 

ctgattatag aagatcattc tccggataaa gatgtggcct attttgcaca gaaactgggt 72 0 

atttctccca aaaggctcac aaatcttatc cggagtatca gtggtcaatc agcgcgtgaa 780 

tggattgttt attataccat tcttgagatc aagtcattgt tacgggagtc atccctggac 840 

attaagtcga ttgccgccag ggttaatttt cccgatcaaa cgacattgag taggtatttt 900 

cgtcattata ccggagtaac gccatcccaa tacagaaaaa atatttattt ttga 954 



<210> 1080 
<211> 645 
<212> DNA 
<213> B.fragilis 



<400> 1080 

ttttacgtcc ggaaacaaaa cgctgttcgt catatgcaaa cactcaaatc ggatatacgc 60 

aaccggattc tgtcggccgc aaaagagcaa tttgtgcaga gaggatattt gaagacctct 12 0 

atgcgcgaaa tagccgatgc tgtagatgta ggcgtaggaa atctctataa ctattttgag 180 

aataaagatg agttgttttg tgtgatactt cgtcctgtat cggatgcttt ggagcgaatg 240 

ctgcaggaac atcatggagc caaaggagca gatattatgc ttatatgttc cgaagagtat 300 

ctcaagtctg ctgtcgatga atatatatcc ttgataaaca agcatggtga gctgatgaag 3 60 

attctattgt tccattcaca aggctcttca ttggaaacat tcagggaaga ctatacaaac 420 

cgttcgacgg agatggttaa aacatggttt gccgaaatga aagagaagca tccggaaatc 480 

aatgtggtgg tatcggattt tatgatccat ctgcaagcag tctggatgtt cacccttttt 540 

gaagaaatgt tgaagcatgc tatcgatagc aaggaaatgg agtatatcgt gcatgagtat 6 00 

atattgtttg aaattcaagg ttggagggct ttgctgagag tatga 645 



<210> 1081 
<211> 867 
<212> DNA 
<213> B.fragilis 



<400> 1081 

acaagaaata gtttaactat gaaattatta catatagaac gacacaccac ttgcctcaat 60 

tatgtatcag attacaatat atgctttatt catcaaagac tcttctccgg gggcgatttt 120 

aagatagata accatcacca ctcgtgtatc ttatttcttt taaaagggga aatactgaca 180 

tcctgcagcg agtttcacga tcagcacatc gttgaagggc acatggttct ctttccccaa 240 

aacgatccta atcaaagcaa aagcatgaca gaaacagaat ttatactact gttcttcgac 3 00 
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aatcaagtca atcttcacag taaaatgtcg attgaattgt ctgccattca tcttgagtct 3 60 

gaaaagagtt gtttttattc cttatctatc tgtcctccgc tacgacatgt gttggacagt 42 0 

atttgcttct atctcaaaca gaaagttcag tgtagtcata tgcatgaact gaaacagaaa 480 

gagatattca tggtattcgg tacattctac aatcgaacag atatggccca cttcctgatg 540 

cccatcacag ggcgagatcc gaatttcaaa agtttcgtat tggaaaacta cctgcagata 600 

cgaaacatca aacagtttgc acaattatat cattgttccg aacgttcttt caatcgtaaa 660 

ttcaaaagct gctttcacga tactccctac aattggattc ttaatcaaaa aacacgccat 720 

attaaagggc aattagccaa tcgtaatata ccgatcagtg aaatagccag aacctttcat 7 80 

tttgcttcac cttcacattt cactacttac tgtaaaaaaa gacttggaat cactccgagc 840 

gaattcagag aaaaaattgc aaaataa 867 

<210> 1082 
<211> 603 
<212> DNA 
<213> B.fragilis 

<400> 1082 

aaggtgcccg gttgtgccga tcgtcttgtc agtcaggatg accataccgt tataacatcg 60 

gtattcgggg aacaatttgc gcctcgtact gagctgagta tcgtgactcc gccggtagat 12 0 

ccgttgaagc agcgtagcga gacacataca gcctatctga attttgaggt ggacaaatat 180 

gtaatgtcgc gcaattataa gaataatgcc aacgtgcttg ctgatgtcga ccggattgtc 240 

aatgagatac aaaacgattc caacctgacc gtaacggaat ttcgggtgac aggctatgca 3 00 

tctcctgaag gtgactatag ccgcaacatg gagttgtccg aaaaccgtgc attggcattt 3 60 

gtcggttatc tgcagaatct cggaggagtt gacgaatctc tgctgacagt cgattggaaa 420 

ggagaagact ggtccggcat gcgtcgtgaa gttgcggctt cgagtatgat tgataaggca 480 

gctatacttg ccgttattgg cgaacatacg gatttcgcca cacgtaaaaa atcggttgca 540 

ggcgttgaat gtaggaacta cctacaggat gttactgagg gactattatc cgcctacacg 600 

tag 603 

<210> 1083 
<211> 594 
<212> DNA 
<213> B.fragilis 

<400> 1083 

tcagtatata taatagaaga aacgagtatg aaaaagctaa ttctgtttgg agccgcaata 60 

tcgatttcag tagctgtaag tgcacagcac gtcgctctga aaaataatct cctgtacgat 12 0 

gctaccacca cacccaatct ggcattagag gtagggttgg ggaagaagac cacgcttgac 180 

ctgtatggcg gctataatcc gtttacgttc ggaaatcaca agcgtttcaa gcactggttg 240 

gcacagccgg aattccgtta ctggacctgt gagcgtttca atggaacctt ctggggtgta 3 00 

catctgcatg gaggtgagtt tagcgtggcc ggtatcagtt tacctttcaa aatattccct 3 60 

tctcttaaag accatcgcta tgaaggatac ttctatggag gaggtgtcag tgtgggacat 420 

caatggctgt tgagtaaaca ttggagcctg gaggcctcgg tcggagtggg atatgctcat 480 

tgggtatacg ataagtatcg ttgtgtgaat tgcagtccta aaataaagag tgggcataaa 540 

aactatgtcg gtcctaccaa agcggctgtt tcattggtct actttattcg ctaa 594 

<210> 1084 
<211> 360 
<212> DNA 
<213> B.fragilis 

<400> 1084 

caggttagaa ttttaataat cattaaaaaa attacgatca tgaaaaaggt attagtagca 60 

gtagcattgg taatgggatt aggtagttct gtagcatttg cacaggaagt tgaaaactct 120 

acggcagtag aaacgcaggc acaagctcca caagatgagt ttacgaaaat tgatgcccaa 180 

aaacttccgg atgcagttat gaatgccttg gctaaatctt atgaaggtgc ctcaatcaaa 240 

gaagtttatt cggctgacaa agagaccggt aagatttata aggtgattct tacaaccaaa 3 00 

gattctcagg aagttaccgt acttctggac gaaaagggcg aagagataaa agaggcataa 360 
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<210> 1085 
<211> 828 
<212> DNA 
<213> B.fragilis 

<400> 1085 

aagtatagaa tgaaacaaaa acatgaattt gaaaacatcg tggcggaaac attattgatt 60 

ccgctttaca tgagagccaa ggagaaccgt cggaaaaatc cgattctatg tgacaaattg 12 0 

gctgagcaac tggttgagaa catcgaatat gattattcca ggttcgatgg ggccaagttg 180 

agtgaagtag gttgtgtgat acgcggttgg tattttgatc atgctatccg gcggttcatt 240 

gacactcaca cctgcccggt agtggtaaat gtgggttgcg gactcgatac ccgttatcag 3 00 

cgtgtcggaa atgacggaaa ggctgtattt tatgagttgg atctaccgga ggttattgct 3 60 

atacgtcgtc ggttgatacc cgaacctgag aatgattgtt acctgtctgc atcgttgttg 42 0 

gaaaccgatt ggatggatcg gatccggctc cttcatccca atggagattt catctttgtt 480 

gtggaaggag tattgatgta ttttcgtgag gaacaggtac ggacatttct acataacata 540 

accatgcgct tcgaaggtgg cgagttgtgg ttcgatgtat gcggaacgat gatgagccga 600 

cgtggtgtga agcccgattc cttgagggaa cataaggcgc agatacgttc ggggatagat 660 

gacgggcata tggtggagtt gtgggaaccc ggattgcatt tgttggaaca ggccaattat 72 0 

atgaaatttt tccgttcccg ttggggattt tttttcgggc agatattggg caggatgacg 780 

aagttgtgct acaagttcag ttccatgctc gggtataaaa taggataa 82 8 

<210> 1086 
<211> 186 
<212> DNA 
<213> B.fragilis 

<400> 1086 

agaacgcaga tatccggact gatgggcccg ggtagcaaga ctgacataaa gcaacccata 60 

gcggaaccta taaaagaaga agtgcgttgc tccggacaaa agcacctctt ttcatccatc 120 

cggcacagtc cggatgtgaa gtttgcattt cacttcattc atataaatta caataaccca 180 
ctataa 186 

<210> 1087 
<211> 717 
<212> DNA 
<213> B.fragilis 

<400> 1087 

atactcatgg taattatagg cgtgtttgcc caaggagata cccgccagac gctatggggc 6 0 

ttttttggag gactgctttt ttggacgggt tgggtggaat ttttatttat gtattttgcc 12 0 

aatcgtttcg gtacacaacc tgaactggat ccggtgacgg gagaaatcgt gacccgtcct 180 

gaatatctga tactgcccgc ttcttttggc ttttggatga tggtgatggt aatgtatttg 240 

ttcagtacga agaatggctg caatttcatc aactggtggc agcgtttgtt attgcgcgga 3 00 

cgaaaagccg atatagcagc acgtcccatg acacgccatg tgtcgatcat cacctttatg 3 60 

gaactgatga tgatcctgtg gacttcttat cttgtgctga tgttttgtta tgacgatgta 420 

tttctcggcg aacatcatcc ggtgacactg ttagtgggat taggatgtct ggtaggagcg 480 

ttctttatct ttgtcaagca attacgcatt gcctcgtggg gagcgaatat acgtatggct 540 

attgctacag tggtggtgtt ctggacaccg gtggagatac tgggacgcat gaatctgttc 600 

agtgagatat ggattgatcc gatgaaccac gtgatggaga tgggtattat tcttgctgtg 660 

ttcattatcc ttactgttta tctctggtac atgagtgcaa agaaaaaaaa acggtaa 717 

<210> 1088 
<211> 1536 
<212> DNA 
<213> B. fragilis 

<400> 1088 

atgtcgaatt cttctattaa atcaaaaacg gcgctactcc gaaacgggga aactcaaaaa 60 
ggaaatggct atccggaagc caatgattat tccgccagga ttcttgaaac catgcagacc 12 0 
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ggtgtgatct ttttcaatac cgaacaaatc atttccggca tcaacaactt ggcatgcgag 180 

gatttacaga tcccccggga tccttccgga cataaaataa cagacatcat ttcaatcatc 240 

caccaagaaa aagatatctt cccggaactg atcgcccgat tgaagtcttc ggaaacggat 3 00 

atggagaaat tgccaataga cactttgata cgttctctgg aaacgaaggt gcaattcttt 3 60 

gccagtgggt gtatcatgca gttggagacc ggacgttatt tattagcatt ccgtaatacg 42 0 

atggatgaag taacgcatga acaccttctg agcatgattc tggcaaggac aaagatcttt 480 

ccatggtttt tcgacttgaa acgtaataaa atgttaatcg atgcccactg gttttcttat 540 

ctgggtatcc cggcagaaga ctgtgagata acaatcgaga agttcttctc cagagtacat 600 

cccaacgaac gggatatgct tgcagatgct ttgcaaaaac agttatcaga aaaagaaata 660 

cccgattcat tctcctatcg gctgcaacgg ggcgatggaa gttgggagtg gttttctgaa 72 0 

cagtcgatgt atctcagcaa aaccaatgac ggttcacctt atcgtattgt aggcgtatgc 780 

catagcattc aggagcataa aaatactgag gataaattgc gcgctgcacg caataaagcc 840 

caagaaagcg acagactcaa aagtgcattt ctggctaaca tgagtcacga aatacgaact 9 00 

cccctgaatg caattgtcgg tttctccaat cttattgcag gcgggattgt cgacttggat 9 60 

acagaggaag ccagagatta ctcggcatta atcagtaaaa actgtaatta tctgctcaca 1020 

ctggtctcgg atgtccttga tctttcatgt atagagtccg acacgatgac ttttaagttt 1080 

acagtatatc cacttacccg acttctgaca gaaatctatc agaaatatga aaacagaata 1140 

cctcaggagg tacagtttaa tttgctgcta cccacagata atgttgaaat agaaacagat 12 00 

gctgtgcgcc tacggcaagt gatagagcac ttgttggata atgcggcaaa atttacagta 1260 

aaagggcata tagatatcgg atatgccctg tcggatcatg gtgagaaaat atatgtattc 1320 

gttgccgata ccggttgcgg tattccaagc gatcaatata aaaaagtatt cgagcgcttt 1380 

tataaaatcg attcattcgt acagggtgcc ggtttaggac tttcagtctg caaaaccatt 1440 

gtagaaggtc tgggaggtac gattaatgta tattcgcaac tgaaagaagg ttctcgtttt 1500 

tccgtgatcc taccgctaaa cagactccat aaataa 1536 

<210> 1089 
<211> 1455 
<212> DNA 
<213> B.fragilis 

<400> 1089 

agcaactttc cacatacgtt tatttggggg gaagctcaag gtgaagcctt aattacttct 60 

ggtactagtg ccgatggtat gggtatgcta tatggtgata ttaaacttca aaatttgatt 120 

gaagatcata cagaagggat caaaaaattt ggtgatattt ttggtcgtct aacaaactta 180 

aatcttttta ttgcaagagt aacagatgct acttatatgg atgatgtcaa aaagggatat 240 

tatcttggac aagcttatgg cttaagggca ttttattatt tcgatttata tcgtacctat 3 00 

ggcggtgtgc ctctacgttt gactgctgat gtggtagaag gggttattga tcctaataaa 3 60 

ctttatatgg cacgtgccac tcctaaagaa gttatggatc aaataaaaaa ggatttggat 420 

aaatcaatgg aatcttttgg agataataat tcgtttgatc ctaataatcg tggaaataaa 48 0 

aaagggtatt ggtcaaaagc tgcaaccgaa tgtttaatgg gggaggtcta tttatggatt 540 

tcaaaagtgt cgacaggaga tgatgctgcc aatgaggcca atctggagat agccaaaaca 600 

catttgcaaa atgtcatcaa caattacggt ctaaaaatgt tagacgattt ttcgtcagta 660 

ttcgatgcca aaaatggtaa gggaaactct gaaattattt ttgctgtcag atatatggaa 72 0 

ggcgaagctg gcaataataa caacttattc acttatgcta tggctacagg tagtacgaaa 7 80 

gacaattatc tggctaatgg cgagaaattc ctggatgctt tgaatattgc aaatacgggc 840 

agtcagcagt tggaatacaa acatgaaatt tataatagtt ttgatgtggc tgacacacgt 900 

cgtgaagcca cattcattgc ttcatattct aaaaatactg aaaccaaaga gttaacttta 9 60 

agaggaacac acgttcgcaa aaacatcggt tatgtgaatg ctcaaggtag tcgtatctat 102 0 

tgtggggatt atattattta tcgtctacct ctcgtatatt taatgcttgc cgaaattgag 10 80 

aatatgcagg gaggagatgt tgccaaatat attaacttag ttcgtgaacg tgcttatagc 1140 

accaattggg ataaggcgat ttatgggtat acaaatgccg atttcacaac taatgaattg 12 00 

gccattcttc atgaaaagga taaagagttt attcaggaag gacagcgttg gtgggatatt 12 60 

cgccgaatga cgttaactaa ggggggcaaa catcttgtct ttgtcaaaga aggtagtatc 132 0 

ggaacagata tgcctacttt agatgaagcg actgaagcgc ataaagtcct ttggccggta 13 80 

gataaagatt tgttgggtaa tgacccttta atttaccaga ccccgggata tgcaacttat 1440 

aaaaaagcag aatag 1455 



<210> 1090 
<211> 3270 
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<212> DNA 

<213> B.fragilis 

<400> 1090 

ctcaaaaaag caatacttat gaagaaaacc atcttcttga ttttgtgcat tttatgttct 60 

cttggagcca tggcacaaaa gaaatcaatc acaggtgtgg ttatggatgc tagcggtgaa 12 0 

tcaatcatcg gagcgagtgt tgtcgaggtc ggtaccacca atggtgtaat tactgacatc 180 

tcaggcaaat ttacgttaat ggtcgatcct aacggaaaga tcaaagtttc ttatatcggg 2 40 

tatcagcctc aggtactcga tgtaaagggt aggaattctt tcaatattaa attgaaagaa 3 00 

gactctgaaa tgttggatga agtagtagtt acaggctatg gaggaaaaca gttgcgtacc 3 60 

aaagtgacga attctatttc caaagttagt gaggaatcat taaaggttgg tgtcttttct 42 0 

aatccggcac aagcattatc cggtgcagtt tccggtttaa aagtgacgca gagttccggt 480 

aatccaggaa gtacgccgac cattgtactc cgtggcggta cagaatggga tggatctggt 540 

tctcctttag taatggtcga tggacagctg cgtgatggtt taaatgatat caatccggaa 600 

gatatcgaat ctatggaagt tttgaaagat gcaggtgcta ctgcattgta tggtgcgcgc 660 

gccagtaatg gtgtaatatt gattacaacc aaaacgggta aagtcggtaa ggcagaaatt 72 0 

aatcttaagg ctaaagtagg tatgaactat attaataatc cctatgattt tctaggagct 780 

aaggatttta tcactgccat acgtacagct tatgacacaa caccatgggc tagtaaatca 840 

tcattggatg gtgcctccgc ttatggaaca ggaaataaat atggcagcga tttggtttgg 9 00 

aacttattgg taaaggatag cggaaacgaa ttcctgttga acaaaggttg gcaacaaatg 9 60 

caggatccgc ttaattcttc aataaccctt ctttataaag atatcaaacc ttccgattat 1020 

aatttgaata acccttcttt aactcaggat tataatgtaa atatgtcagg cggtaatgat 1080 

aaaggaactt attatgctgg tttagggtat aataagtcag aaggacttcc catttcttct 1140 

ttttatgagc gttatagctt tattttcaat ggcagttata agttggctga ttggattacc 12 00 

gcaaattcta attttaatta taatcgtgct aattggcgtt ctatgcccgg ttcgcaagat 12 60 

aatgaaggga actattttgg acgtataatg tccttgcccc ccactgttcg atatgaagat 13 2 0 

gaagatggaa atcctgtcct cggccctaat catagcgatg gaaaccagtc gtatcaaccc 13 8 0 

gaaaaatggc ttgtagataa ccaaacggat aaatttacaa tgatccagtc gttggaaatc 1440 

aggccaatga agaatctcgt aattaaaggt accgctaact ggtattactc ggaaggtgtc 15 00 

tatgaaagtt ttaccaaaga ttttgagaca gctccaggta aattcaacac aactcgagct 15 60 

tcctcagcca aatttgagcg tgacttctct caaacttata acgtagtatt aaattacaat 162 0 

aatacatttg ctcaaaatca taatatagat gttatgttgg gttctgaata ctacgataaa 16 8 0 

aagacaaaag gatttagtgc gtcaggttcc ggtgctccca ctgacgattt tgcagatctc 1740 

aacctgacag ataatgggga agggaaacgc acaattgatt catggcatag ccagtaccgt 180 0 

attctttctt attttggtcg tttgaattat gattatcagg ggaaatatct gttatcagga 1860 

gtattccgtt atgatggata ttcttcctta ttgggagata accgttgggg attttttccg 192 0 

ggagtatctg ccggatggat ttttggcaaa gaggacttta taaaaaatgc tgtgcctgac 19 8 0 

ctgtcatttg gtaagttacg tttcagctat ggtgtgaatg gtaatgcaac cggtattgga 2 040 

gcttatactt tacagggttc ctataactct cagaaataca atggtaatgt gggatattta 2100 

attggtgctc tccctaatcc ggggttaaaa tgggagaaga cccgtacaac tgaagttggt 2160 

ttagacttaa gcttctttga taaccgctta aacgcaaact ttacttatta taaccgttta 2220 

acgatggata aatatgctga tttgagttta cctactacta ccggtttctc atcggtaaag 22 80 

aataataatg gagatttccg taatagtggg attgagatgg agctatctgg tacaatactc 2340 

aaaataaagg attggacctg gaaaatggga ggtaatattt catataataa aaataaagtt 2 400 

gttaccttac ccgataatgg tcagccaaag aatcgtattg gtggccaaca aatttatacc 2 460 

ggacgcaaag ttttagatga agcagggaat caagtggatg aagtaatctt tgtaggcggt 2 52 0 

aaacaagaaa ggcaggaacc gggtatttta gtcggatata aagcggaagg attatataaa 2580 

agttgggatg atattccaga gaatctaatt gtaaaaacgg gaaattatca aggaaaatat 2 640 

caatatggtc cgaaagcgta tgcagcattg tcagatgcag aaaaagcgaa agctctccaa 27 00 

attgcccccg gtgatgtaaa atggaaagac attaataatg atggtacgat cgatgccttt 27 60 

gaccaggtag taatgggaaa taccactcct cactggtttg gtggtttcaa tactacattg 2 820 

acttggaaag gtctgacact gtatggacgt tttgacttcg cactggacta ttggatttat 2 88 0 

gataatacga ctccctggtt cttgggatgt atgcagggtg gatataatac aacaaccgat 2 94 0 

gtattcaata cttggagcga agaaaatcct aacgctaaat atccgagata tgtttgggcc 3000 

gatcaattag gtactgccaa ttattatcgt acgtctacca tgtttgctta taaaggtaat 3 060 

tatctggcaa tccgcgaaat ttcattgtct tattctttac ctcagaatat tgcacggaaa 312 0 

ttttattgcc agaaattaga tgtatctgta acaggccaaa acttaggata tatcacttca 3180 

gccaatgtag caagtcctga agtttcaact gccggttctg gatatgcctt accacgcact 32 40 

ttactcttcg gcgttaatgt tacattttaa 3270 
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<210> 1091 
<211> 1629 
<212> DNA 
<213> B. fragilis 

<400> 1091 

aaagaaaaat atatgaaaag aataaaatct acaatattat atggtttact ggtggcatct 60 

tcggggctgt tagtaacgtc atgtgccgat aaattggatc tgtctccgat tgattactac 12 0 

ggaagtgggt cttattggaa aacagaagct caggctaccg cctatataga tggtattcat 180 

aagcatttac gcgatgcggc atggcaacat acaatcacat tcggagaact tcgtggtgga 240 

cgtttcatca ccggtgcaag tagtgatggc atgggagtta gtaatggtga tattattttg 3 00 

caaaattttg atgaaacaca taccggagta agtaagttcg gagatttatt cggccgtatt 360 

actaacttga atcttttcat agcacgcgtt acggatgcca cctatctgtc cgatgaaatg 42 0 

aaaaacttct atttgggaga agtgtacggt ttacgtgctt tctattattt tgatctatac 480 

cgcatctatg gcggggtacc tttgcgtttg acggctgatg ttgttgaagg agttattgat 540 

cctaataaac tgtatatggc ccgttcgacc cccaaagaag taatgaccca aataaaaagc 600 

gatttgaata aatcgatgga gtattttgga aatatgaatg attttgatcc atacaaacgt 660 

ggcaaaaagg tgtattggtc aaaagctgca accgaatgtt taatgggaga agtttatttg 720 

tggacttcta aagtaaccac aggagatgac gtagcaaatc ctgctgatct gactatagct 780 

aaaacccacc ttgaaagtgt attgaataat tataatctga aaatgctgga tgacttttca 840 

caagtattca atgccaaaaa caaggcaaat gacgagatta tatttgccat tcgtttctta 900 

gaaggtgaag caaccaatag taatggtaca tttacttata atgtaggtac cggtagtacc 960 

aaaaacagat atcaagccaa tggtgaagta tttggtgatg ctttagacat acagaatact 102 0 

ggcaatcaga cgtatgaata caacaaagct gtttatcaaa attttgatga tgcagatacc 1080 

cgtaaggaag cgacctttat cgcctcatac aataaagatg gcaaaacagg tgagttatct 1140 

ctctatggaa cacatgtacg taaaaatata ggttatgtaa atgcacaggg agcccgtgtt 12 00 

tactgtggtg actatatctt ctatcgcctg ccgtgggttt atcttactct tgcagaaata 12 60 

gcaaacatgg aaggagataa tgcagctgtt gccaaataca tcaacctggt aagaaaacgt 1320 

gcctatggca atgcatggga tgaaattctg tatgcatatc cggaaacggc agattttaca 1380 

actaatgaat tggctatttt gcatgagaaa gataaagaat ttatccaaga aggacaacgt 1440 

tggtgggatt tacgacgtat gactttgact aaggggggaa cacctttggt tttctgcaaa 1500 

gaaggaagtc ttttgggaga tgccccgata ttgaataaat ctacagaagc acataaactt 1560 

ttgtggccaa ttgaaaaaac aatgttggat aaagaccccg cactggagca aacacccgga 1620 

tacaaataa 1629 

<210> 1092 
<211> 1263 
<212> DNA 
<213> B. fragilis 

<400> 1092 

aatttaaatc aaatgaaaaa tattttttta ataattggaa 
agcctatatg ctcagtccga tgactggtct cctaagaatc 
cgtgaggacg ggcgattctc aagttcttat ggtgtagtgc 
gagccacgct atgcttttca tagagagttt tctcccaaag 
ggacttcgcc atgcgatgga agaaataatg aaatttcctc 
cctgtctgta taaaaagaga acagcgggaa gggtatcgat 
ccgcttcctg aatgcgtttc tacttttctt gttttaatac 
gtacctgcca ttttgtgtat tcccggttcc ggaggaaata 
ccggggatag ctcccaaatt gaatgaccgg tacaaagatc 
aactttgtaa aagaagggta tatagcagtg gcagtagata 
tcagaccttg agagatatac attgggctct aattatgatt 
cttttagagt tgggatggag ttatttggga tatgcttcat 
aattggatga agacccagaa gcatattcgt aaagatcgca 
ctgggaaccg aacctatgat ggtattgggt acgcttgata 
tacaatgatt tcttatgtca aactcaagaa cgggcggaag 
aacggacgtc gtccattccc taattctata cgccatttaa 
tttaattttc cggacatcgt agcggctttg gcaccccgtc 



tatcactgtt 


ttttaatggc 


60 


ataatttaat 


taagtctgta 


120 


atgccatgct 


cagaaatact 


180 


aatttcgaaa 


atggcaaaag 


240 


aaataaaaaa 


ctctccagct 


300 


tagaaaaatg 


ggaattttat 


360 


ctgataatat 


aaacaagccc 


420 


aggagggact 


tgcaggtgaa 


480 


cgaaactgac 


ccaagccctc 


540 


acccggctgc 


cggagaagcc 


600 


acgatgttgt 


atctcgctat 


660 


atttggatat 


gcaggtttta 


720 


ttgtagtaag 


tggattttct 


780 


cttcaattta 


tgcttttgtt 


840 


taatgactat 


gcctgacaaa 


900 


tacctgattt 


ctggaaaaat 


960 


ctatcatact 


gaccgaagga 


1020 
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ggattagatc gagacttgga ccttgtgaga aaagcgtatg ctatagcagg cactcccgat 1080 

aacgtgaaaa tatatcatta taagaagttc tcagatccgg atacacgaaa aaatgtagaa 1140 

tatttacctg aaggactaga tcgtaatgaa tattttcgga tggtaaatgt agatggtccc 12 0 0 

aatcattatt ttaaatcaga actggttgta ccctggttga gaaaattatt ggaagaaaga 12 60 

tga 1263 



<210> 1093 
<211> 1632 
<212> DNA 
<213> B.fragilis 



<400> 1093 

aatacaagaa atatgaaaac aataaaatca ataattatat caggcatgtt actggtagta 60 

tctggtggca taatgacttc atgcagtgat ttattggatt tatctccaat tgatttttac 12 0 

ggaagtggtt cttattggac tactgaagcg caagttaccg gttatatgga tggtcttcat 180 

aaacatctgc gtgatgtagc cgaacagcac atcttcacct ttggagaact aaggggcgga 240 

atctatagaa gtggtaatgc atctgatggt aacgcactga attacggcag tattatattg 3 00 

cagaattttg ataggaacaa tactggtgta accggttttg gagggcatta tggacgcttg 3 60 

gctaatatca atttgtttat tgaccgtgtg tcgaaagcag attatataga tgatgccaag 420 

aagaaattct atttaggaca ggcatatggt ttacgtgctt ttatctattt tgaactttat 480 

cgtatttatg gcggtgtacc tttgagactg gatgtagaag taattgatgg agtacttgat 540 

cccaataaac tgtatatggc tcgtgcgacc cccaaagaag taatgacgca aatcaagaaa 600 

gatttggacc tttcaatgga gcattttggt aatgtaacag cttttgatcc atataatcgc 660 

ggtaaaaaag tatattggtc caaagcagct actgagtgtt tgatgggaga agtctatcta 72 0 

tggacttcta aagttactac cggtgacaat gaggccaata tcgctgactt ggcaatagcg 780 

aaacaacatt tacaaagcgt cattgataat tacggtctga gcatgatgga taatttttca 840 

gatgttttcg aagccaaatc ccataaaggc aacaatgaaa taatatttgc gattcgttat 900 

cttgaaggag aagcgaccaa tagaaatgtc aactatacat acatgaatca gggagagata 960 

gataaaggag gttttcgtga agatggaact ccatggaacg atcctttagg attaaagaaa 102 0 

agtggtgctc aatggtgtga gtatattcct gaactctttc aattatttga cgtggaagac 1080 

actcgtcgtg atgcgacttt cctggcttct tataaaaaag ataaagatgg taatttaagt 114 0 

ctttggggaa ctcatgtcca aaagaatata ggttacataa attctgaagg caatcgtgtt 12 00 

ttttgtgggg attatgcttt ttatcgtctg ccctgggttt atctttcatt agctgagatc 1260 

gctaatatgg aaagtgatca ttctggtatt gagaaatata tcaatctggt tcgtaaacgt 13 2 0 

gcttatgctt ccaattggga tgaaaataag catggatata aatcaggaga ttttactcaa 13 8 0 

aatgagttgg ctatactaca tgaaaaagat aaagaatttg ttcaagaagg gcaacgttgg 144 0 

tgggacgtgt tacgcatgac tctgacaaaa ggcggtaagc atttagtatt ctgtaaagaa 15 0 0 

gctaatttga aaaatgatgg agtaccaatc ttgaatgaag caacagaaag ccacaaagtt 1560 

ctttggccaa tcgaacagaa tatgcttgat aaagacccct cgataaaaca aactccgggg 162 0 

tatgataaat aa 1632 



<210> 1094 
<211> 216 
<212> DNA 
<213> B.fragilis 



<400> 1094 

gcgaaaaatg gagagaaaag aaagttttgt gaatattttc aaacctctaa aaatcaagca 60 

tttattcttt ttcttgtatt agtttttagc ctccacaaag ggcttaaaat gaatctttta 120 

gaaggggata aatctagtat aaaaatcaga ctcattcatt tactaaagaa tgaattaatg 180 

atatttttcc aaaatccaac agaagtgaga tgttga 216 



<210> 1095 
<211> 1332 
<212> DNA 
<213> B.fragilis 



<400> 1095 

acaaaggata tgaagtttta caatagggag aatgaattag ctgaattaca aaggatacaa 



60 
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gaattatctt ttgaagagaa ctctcgtctg acagtagtta ccggaaggag aagaataggt 120 

aaaacaagtc ttattatgag agcttttgaa aaaactccta ctatctattt atttgtgggg 180 

agaaaaaatg aagcatcttt atgtagggaa ttcataactt tagtttccca agcacttgat 240 

atttatgtgc cagaagaaat atcgactttc aaatctctct ttcggtatat tatggaagtt 300 

gctaccagac agtcattcaa tttggttata gatgagtttc aagaattcta taatatcaat 3 60 

aagtcgattt atagtgatat acaagatatc tgggatcagt atagacaaaa aactcacatg 420 

aatttcgttg tgagtggttc tatttattct ttaatggaaa agattttcca taatgaaaag 480 

gaacctcttt ttggccgtgc tgacaatatt ataaaacttt cagctttcag tctgaatgtt 540 

ttaaagaaaa tcataaaaga ctatcatccc caatatacaa atgatgattt attggcacta 600 

tactcatttt ccggtggggt tcctaaatac gttgaattat tttgtgataa cagagtatta 660 

accgttgatg gaatgattga tttcatggtc agagacaact ccccttttac agatgaagga 72 0 

aaaaatctgt taatagaaga attcggcaag aattatggta cctatttctc aatcctaagt 780 

gctatctcag gtggatataa tactcagaca gaaatagaag cgttgcttgg cgaaaagagc 840 

ttaggcggtt atctaaagcg attaattgaa gattataaca tagtagtgcg ccaacgtcct 900 

gtcttttcaa aagagggttc tcaaactgtc agatatggga tatgcgataa ctttattcat 960 

ttctggttta attatttcga tagaaatcgt tcactcattg aaataaaaaa tttcgttggc 1020 

ttacgaaaat taataaaagc tgactatccg acatattcag gaaaaatcct ggaacagtat 1080 

ttcaaacaaa aatatgctga aagttacgag ttccgtctta ttggttcgtg gtgggagcct 1140 

aaaggcaatc agaatgaaat tgacattgta gctatttatt tagataacaa aagtgcaatt 12 00 

gtagcagaag tcaaacgtca aaaaaagaat ttcaagccag aacttttcca aaagaaagtg 12 60 

gaacacttag agaataaagt cctggctaaa tatcaaataa acacagtctg cttatcatta 1320 

gaggatatgt ag 1332 

<210> 1096 
<211> 213 
<212> DNA 
<213> B.fragilis 

<400> 1096 

tgtccctgta tcgtgctaat caaaggagtg accggaaccg taatagaaaa aacaaaatct 60 

gaaccatttc acaaaatcaa agggaaagta aaattatcat tctactacca aatagaatgt 12 0 

ttttacctat ttacgactta ttcttatttg ggaaaatttc aaaaaaggaa tattatcaca 180 

tatttcttct ttcttaattt cattgtcaga tag 213 

<210> 1097 
<211> 3303 
<212> DNA 
<213> B.fragilis 

<400> 1097 

acatctttat tattaactca aaaagcaata cttatgaaga aaaccatctt cttgattttg 60 

tgcattttat gttctcttgg agccatggca caaaagaaat caatcacagg tgtggttacg 120 

gatgctagcg gtgaatcagt catcggagcg agtgttgtcg aggtcggtac caccaatggt 180 

gtgattactg acattgacgg taagtttacg ttgtcggtcg atcctaacgg aaagatcaga 240 

gtatcttata tcgggtatca gcctcaggta cttgatgtaa agggcaaaaa ttcttttaat 300 

attaaattga aagaagactc tgaaatgctg gaggaagttg ttgtaacggg gtatggtggc 360 

aaacagctgc gtacgaaagt gacgaactct attgcaaaag taaaagatga agcattgaaa 42 0 

gtcggcttat tctctaaccc cgctcaggca ctctccggag cagttgcagg tttaaaggtt 480 

acccaagcct ctggtagccc gggtgcggct cctaaagtaa cgcttcgtgg cggtactaac 540 

ttcgatggtt caggtgaccc tctggttatt gtagacggac aattgcgtga cggtatgcag 600 

gatatcaatc cggaggatat tgaatccatg gaagtcttga aggatgccgg agcaaccgct 660 

atttatggtg cgcgagcaag taatggcgta attttaatta ctacaaaaac aggtaaagaa 72 0 

ggacgtcgcg aaatcaactt caaagccaaa atgggtttga gctatgtaaa taacccttat 780 

gattttttgg gagccaaaga ttatatcaac gtactgcgta caggctatag taaatccgga 840 

tttacaacct cagacggaga gtatgtctct attgccccac ttggtaactt gacaagtgct 900 

tctccattcg gtactggtaa tacactgaat gataaaacga tctggaatat tatgaataaa 960 

acggcagaca atgcctatct gttacagaaa ggatggcaag aaatgccgga tcctctggat 102 0 

cccagcaaaa ccattttata taaagatact aatccggcag attataacct gaataatccg 1080 

gcaatatctc aggactataa tatcaatatg tccgggggta atgataaggg tacttactat 1140 
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gcaggattag gttacaaccg tcaagaggga cttcctatca agacattcta tgagcgctat 12 00 

agttttgttt tgaatgccag ttataaaatt acagattggc ttaccagttc atccaatttc 12 60 

aattataacc gtgcaaattg gaaaaacatg ccgggatcac aaaccagtga aggcaattac 132 0 

ttcggacgta tcatgtctac acctcccact gtccgcttcc aggatgagga tggaaatcca 13 80 

actttaggtc cggtagctgg tgatggaaac cagaattatc agcccgacaa atggtggaat 1440 

tttaatcaga gtgacaaatt taccatggta caggccttcc agattgatat tttgaaaaat 1500 

ctttctgtaa aaggtactgc caactggtat tactccgaat cattggctga aagtttcacc 1560 

agagactatg aaaacacgcc gggtcaattt gtgagaacac gtagttcttc agcaagtttc 1620 

tccagagatt tctctcagac ctataatgtg gtattaaact ataatcaaac tttcgctaaa 1680 

gatcataatg tggctgttat gttgggtatg gaatattttg atagatatag ccgcagcttt 1740 

agtgcatccg gttcaggagc tccaacggat gattttgccg atctatcatt gacagataat 1800 

ggagaaggga aacgttccat tgattcagga catagcgatt atcgtattct ttcttatttc 1860 

ggacgtctga attacgacta taaaggccgt tatttacttt ctgctgtctt ccgtcaggat 192 0 

ggatattcat ctttattagg tgacaaccgt tggggatttt tcccgggagt ttctgccgga 1980 

tggatttttg gacaagaaaa tttcgtaaaa aatgctctgc ctttcctgtc atttggtaaa 2 040 

ttacgtgcga gttatggtgt aaatggtaac gcaaccggaa ttggcgccta tgacttacag 2100 

ggatcttaca attctcagaa atataatgga aatgtcggct tcttaatcgg tgcactaccc 2160 

aacccgggtt tgaaatggga gaaaacccgt actgcagaag tcggtataga tatgagtttt 2220 

tttgagaatc gcctgaacgc aaactttacc tattataatc gtttaacttc agacaagtat 2280 

gccaacttaa gtttaccttc tacaacaggt ttctcgtcaa ttaagaacaa taacggaaaa 2340 

tttcgtaata gtggtgtgga aatagaactg tcgggaaaaa tcctaaaaac caaagattgg 2 40 0 

agttgggatt tgggtggaaa catatcgttc aataaaaaca aaatagtttc gttgccggat 2460 

aatggcttaa ttcgcaatca acaggatgcc gctcaaatat acagtggaag gcaattatct 252 0 

gatggcacat atgagaagat ttgggtcggc ggtaatcagg aaggttatga acccggtgtg 2580 

ttaattgcat ataaagccga tgggctttat cgcagttggg atgaaattcc cggagacttg 2 640 

gtagtcacat ccggtaacta tttcggtaaa aagatgtatg gaccggaagc ttggaagaag 27 00 

ttgagttccg cagagcaaaa gaatgcatta cccattcagc ccggagatgt gaaatggaga 27 60 

gatataaatg gtgatggtat gattgataat tatgatcagg ttgttgtggg aaatacaaat 2 820 

ccgcattgga ttggtggttt caacaccaca ttgcgctgga aaaacttcca actgtacgga 2 880 

cgctttgatt ttgcatttga ttactggatc tatgataata ctaccccttg gttcttgggt 2940 

tgtatgcaag gaacttacaa tacgaccaaa gatgtattca acacttggtc tgaagagaat 3 000 

ccgaatgcca aatatccgcg atttgtgtat gctgaccagc ttatgaatgc aaactattat 3 060 

cgtacttcca cattatttgc ttataaaggt aattatttgg ctatccgtga aatatctttg 3120 

agttattctt tacctaaagc atgggcaaac aaggcttact gtcaaaaggt ggatgtgtcg 3180 

atcaccggac agaatctggg atatatcaca tcggctaatg tagcttctcc tgaggtttca 3240 

agtgcaggtt caggatatgc tttaccaaga accctcctgt tcggattgaa tgtaactttc 3300 

taa 3303 

<210> 1098 
<211> 990 
<212> DNA 
<213> B.fragilis 

<400> 1098 

cagccgatgc agatagtgct cgaccgcaaa aagtggggtg gattgccaga gaaatataat 60 

ggaatcagtg atgcttgtat cctgaccgat gaaaagaacg gtactattta tgtggcggga 120 

ctctggatgt atggagtctt agatccccga tcgggtaaat gggtggaagg aatgacgcag 180 

gacagtaccc gttggataca ccaatggcat gcgaaaggtt ctcagcccgg gctcggggct 240 

aaagagacct gtcagttctt gattacgaaa agcgtggatg acggactgac ttggagtgac 3 00 

cctgtaaata taacagcaca aaccaagaaa ccggaatggt ggctgtatgc tccggcaccg 3 60 

gggcatggca ttactttgaa agacggtaca ttaatatttc ccacacaagg ccgtgataaa 42 0 

gatggaatac cattctctaa tattacgtat agcaaggatg ggggaaaaac atggatagcc 480 

tctaagccgg cttatcacaa cactacggag tgcatggcag tcgaattaca ggatggcagt 540 

gttatgttga atatgcgtga taaccgtaat cacggtaata aaaaggtcaa tggacgccgt 600 

atttgtgtca cctccgatct gggaagcaca tggacggaac attccacttc ccgaaaagca 660 

ttgatagagc ctacttgtat ggcaagtatt catcgacata cttatcagga aaacggcagg 72 0 

caaaagactc ttcttctatt ctgcaatccg gagtcttatg acagtagaga ccacatgacg 7 80 

ctaaagtgca gcctggatga tggaaatacc tgggattccg gccggaaaat catgttggat 840 

gagttgggaa gttttggcta ttcctgcata acttcggtca atgattctac gattggtgtt 900 
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ttttatgaaa gtagccaggc acagatggtt ttccaacaaa tacagttgaa agagctcata 960 
ggtaaaggta aatcatataa agagagatag 990 

<210> 1099 
<211> 747 
<212> DNA 
<213> B. fragilis 

<400> 1099 

ttggagtggt caccattgtc gtttcacaat tgcaataaat tgcggatgaa ttttaaagaa 60 

ggagaagtac tttattttaa taaaccgttg ggatggacgt cttttaaagt cgtggggcac 12 0 

gcccgttacc atatgtgccg gcggatgaaa gtgaagaaat taaaggttgg acatgcaggt 180 

acactcgatc ccttggcaac aggggtgatg attgtttgta caggcaaggc taccaagaga 240 

atagaggagt ttcagtatca tacgaaggag tatgtggcta ccatacagtt gggcgctact 300 

actccgtctt acgatctgga acatgaaata gatgctacat accctacgga gcatattacc 3 60 

cgtgagttgg tggaaaagac gttgaaaacg tttgttggcg agatacagca gatacctccc 42 0 

gctttctcgg cttgtaaggt agatggtgca cgcgcttacg atttggcccg taaaggccag 480 

gaagtggagt tgaaaccgaa attgctggtg attgatgaga tagagttgtt ggagtgtaat 540 

ttaccggaaa ttaaaatacg ggtggtttgc agcaagggga cttacattcg tgcattggca 600 

cgtgacatcg gagaggcttt gcaaagcggg gcgcacttga ccgggctgat acgtacccgt 660 

gtgggagacg tcaagttaga gcagtgtctg gatccggcaa agttcgcgga atggatagat 720 

cagcaagatg ttgagatatc tgattga 747 

<210> 1100 
<211> 429 
<212> DNA 
<213> B . fragilis 

<400> 1100 

gttatctgtc cgttgtttac tgtttattgt ttaccgtttg ttattattat ggcaaatacc 60 

ctgtgcaaag ccgaaaggct gaatagtaag attctgattg agaagatgtt tgcgggcggc 120 

tcaaagtcgt tttccatctt tccgttgcgt gtggtgtata tgcctgttga aaatcaagat 180 

gttcaggcat ctattttact gagcgtttcg aaaaaacgat ttaaacgtgc agtaaaaaga 2 40 

aatcgggtga aacgccagtt gcgtgaggct taccggatgc ataaacatca acttttgcag 3 00 

attcttactg ataagcagca acagttggct attgccttta tctatctttc ggacgaatta 3 60 

acgtcctcgg ccgaaataga ggaaaagatg aagattctac tggctcgtat tagtgagaaa 42 0 

ctggtatga 429 

<210> 1101 
<211> 222 
<212> DNA 
<213> B. fragilis 

<400> 1101 

attaacaata caggaagtta taccaagaaa ttcgtagtaa tcatcccttt tcctgtattt 60 

aaaaaaagaa atgaagaaac tctaaacttt tggtataact tcctcttaat agttagcagg 12 0 

tacaagaata tgtttgacta ccgtcaaaat gacagtttcc ccacttatca taaaccttgt 180 

agcaacgtcc tactccgtca gaacacgtac gggagcaagt ga 222 

<210> 1102 
<211> 1146 
<212> DNA 
<213> B. fragilis 

<400> 1102 

atcgaaatga caaaaaataa attgctttcg tgtgttattg gagtgataat cctatcacta 6 0 

ttggttggag cctatttcta tcaacgaaat aaggtggctg tacatcaaca ggcggagagg 120 

ctattcgtac agatgcttca agaagagata gaaagaaaag aaagaaattt aaatctattt 180 

catctgtttt ctgagagttc atctgatact ttacctttga aaatttgcat tatcacagaa 240 
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gaggggaaaa 
ctgagaaacc 
ttaaatgaac 
catgtgcgga 
tgggatactt 
atcggtcttt 
tggattgtaa 
aatcgccctc 
gttgagaaag 
gcacctttaa 
gttgtttttg 
ccccaacagt 
gaggatatta 
tctgccggaa 
cgttttggaa 
acataa 



aagagtatga 
ggtcaataca 
attggcagag 
tggaaaatct 
cttctgggat 
tagcttttag 
tttgtttgtt 
cggaattaaa 
aggtcattcg 
taaagcaaat 
atgcccagaa 
gtcagattct 
ttaagtttat 
acaaattagg 
gcgatcgtta 



ggttgattcc 
ctctatatta 
tatgcttaag 
tcaaggaaag 
tataacttca 
ttggaaaaca 
attgatgctt 
agaggttcct 
tgagataata 
ttgtaaagta 
tagagttctt 
aaagcttttt 
ttggaaaggt 
gaagagatta 
tcgtatgttg 



ctgaaaagta 
tgtgagaaat 
aaagatcaca 
attattagtt 
tatataggta 
atattatggt 
ttatttatct 
tatgaggttg 
gttgaaaaag 
gaaggacaac 
aattgtaatg 
ttagatgctc 
caatcgaacg 
gagcaagctg 
ttcatagacg 



aaaagaacat 
cccatttatt 
ttgatacaga 
catcaagtca 
atcgttgtga 
atcattggca 
gcttttatta 
tcgttgaaaa 
aaacatctcc 
tatatggttt 
gcaagaaaat 
ccgattatac 
ttcagataaa 
gttgtggtgt 
accttgttga 



ttctcagaac 
gccggattct 
gtccactata 
cgatggtgtg 
gatagaggtc 
accttttgga 
taaaaaggtg 
ggaagtcgtt 
ggaaaaaaag 
acgttatgga 
gtctttgtcc 
tgtgactgat 
tacgttttgt 
ttgtttcagg 
taatgatttg 



300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1146 



<210> 1103 
<211> 765 
<212> DNA 
<213> B.fragilis 



<400> 1103 
atttttagga 
aaatctcctt 
atcaaggttg 
catacggctg 
gagttgcgtg 
ctttatattc 
aagatcgagg 
atgtcggatg 
acggaatgcg 
tatgatatgt 
gattttgatc 
cgtgatgccg 
gctgccctcg 



cgttgaaaat 
attacgacat 
agagcgtctc 
ttatatttac 
tgacgatccc 
agaagtatgt 
atctgattcc 
tacacaacga 
taatgtatcg 
tggtattctt 
agaaggacat 
gactccgtct 
atttgtttat 



taagaaagta 
tgctgaaaag 
tgcgaaagaa 
atcgcgtcat 
tgagacgatg 
gcagtatcgt 
ttcgattgtg 
tgacgtgagg 
cactgttagc 
tagtcctgcc 
taagatcgga 
tgatcttgaa 
taaggaaaac 



ttagtgtcgc 
tatggtgtaa 
ttcagacaac 
gcgattgatc 
aagtatttct 
aagcgtaaga 
aaacataaaa 
gatctgctgg 
aatgacttta 
ggagtatctt 
acattcggat 
gcacctaacg 
aataaaggaa 



agccaaagcc 
aaattgattt 
agaaagtatc 
atttcttcca 
gtgtaacaga 
tcttctttgg 
cagaaaagta 
ataaaaataa 
tggagggaga 
cgctgaagaa 
cgactacagc 
tgaaagctcc 
aataa 



tgcttcagag 
ccgccccttt 
tattttagat 
tctttgtacg 
agctgttgct 
tgctacgggc 
tctcgtcccg 
catccagcat 
ggagtttgat 
gaacttccct 
acaagctgtt 
ttcaatgacg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

765 



<210> 1104 
<211> 843 
<212> DNA 
<213> B.fragilis 



<400> 1104 
gtttgccgct 
cagcatgtaa 
caggcggtaa 
acagacgatg 
gcacgcagca 
acgagcattt 
cagacctgcg 
ttgatggagc 
tatcttttgt 
aaaaccgata 
tttccgtttg 
ggttgcgttt 
tcctgtaata 
ccttgtctga 
tag 



atatgatttt 
ttagtgcagc 
gtaagaccgt 
ggttggctat 
agaagtttct 
tcgcttcttc 
ttcttggagg 
gtgtttcgcc 
ttaaatggat 
tatggctgga 
ttttgtttct 
tggtaatttt 
tttatggggt 
tagtgtatca 



tgtccaggac 
cggtgctccc 
gaccggggcg 
gatactgttg 
gttgcaacag 
tacggcggct 
tgtttgtatt 
gcatatactt 
tttatattcg 
gtcttattct 
ggtctatttt 
tactaaaata 
tttcctttta 
aggtatgatt 



tctcttggcg 
caggatacgt 
gagggtatgc 
ggatgttttt 
gtgaaagatt 
gacatgcgct 
tttaactatt 
ttgggagttt 
tttttgggat 
acgctgattt 
gatttgaatg 
ttgatgtttt 
attttatact 
cagctaaata 



cacaggaggc 
tgggtagccg 
cgattcctta 
ttgtttcagc 
ttatgttaca 
atttgctatt 
ttaatgatat 
atgtggctgt 
gggtgttttt 
attaccttgg 
tcaccttttt 
acaagtggtt 
tttgtgccct 
atgttttgat 



agatactgtg 
ggtggattta 
ttctccaaga 
gtatgtgttg 
tcgtgaaaga 
acttatcgtg 
caggcccgca 
ttgtttactc 
tgacaaaagc 
attcgcttta 
agtttcaatt 
aaagcttttt 
tgaaatcgta 
aataaatttt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

843 
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<210> 1105 
<211> 915 
<212> DNA 
<213> B.fragilis 



<400> 1105 

ttcgatgatt tatgtattca gaaagaaacg ctgatgaaga gtaaaagcag aaataacgct 60 

gtgtcatatt ttgatatgca gttcatcact tccagtatca gtaccacgtt ggtattgctg 120 

ttactggggc ttgtggtgtt ctttgtattg gcggccaata atttgtctgt ttatgtgcgg 180 

gaaaatatta atttttccgt gcttatcagt gatgatatga aggagacaga tattctgaag 240 

cttcagaaac ggctgaataa tgaacctttt gtgaaagaaa cagaatatat ctcgaaaaaa 300 

caggcattga aagagcagac ggaagccatg gggaccgatc cgcaagagtt tttggggtat 3 60 

aacccgttta cggcttcaat agaaattaaa ttgcattcgg actatgcgaa ctccgacagt 420 

attgcgaaaa tagagaaatt gattaaaaga aataccaata tacaggatgt gctttatcag 480 

aaagacttga tcgacgcggt aaatgaaaat atccgtaata tcagtcttgt tctgctggca 540 

ttggccgtga tgttgacatt tatctctttt gcgctgatta ataatacaat ccggctggct 600 

atctactcga aacgtttctt gatacatacg atgaaactgg tgggagcgag ctggggattt 660 

attcgtcgtc cgtttttgaa aaggaatata tggagtgggg ttctggctgc ttttattgca 720 

gatacgatcc tgatgggggc cgcttactgg ctggtatcct atgagcctga attgattcgg 780 

gtaattacgc ccgaagtcat gttactggta tcgggcgcag tattggtgtt cggtgtggtc 840 

atcactttct tgtgtgctta tctttctatt aataaatatc tgaggatgaa agcaagtacg 900 

ctatattatg tgtaa 915 



<210> 1106 
<211> 231 
<212> DNA 
<213> B.fragilis 



<400> 1106 

aacgtatatc ttatgaaaat gaaaaactat ttgaaagtaa ctgtattttg ggtcgctgtc 60 

ctttcggttt ggtgcttaaa gccgacaaaa aaatctcaag atactctctt gttgcagaat 12 0 

gtcgaagctt tggcaagtgg agaagagcct tcacagattc attgttattg gcgaggctct 180 

gtagattgtc ctgttagcca tgataaggta gaggttgtat atgagtacta a 231 



<210> 1107 
<211> 1314 
<212> DNA 
<213> B.fragilis 



<400> 1107 

ttcataattc ataattttat tatgggatat ttattcacat ccgaatcggt gtctgaagga 60 

caccccgata aagtggccga tcaaatatcg gacgctgtgc ttgacaaact gttggcttat 12 0 

gatcccagtt cgaaagtagc ttgcgaaacc ttagtaacta ccggacaggt ggtgcttgcg 180 

ggagaagtga aaacaggtgc ttatgttgat ttgcaactga ttgcacgtga agtgatccaa 240 

aagattggtt acacgaaagg cgaatacatg ttcgaaagta attcgtgcgg tgtactttct 300 

gccattcatg aacaaagtgc ggacattaac cgtggtgtag aacgcgaaga cccgatgaac 3 60 

cagggagcag gcgaccaggg tatgatgttt ggttatgcaa ccaacgaaac agaaaactat 42 0 

atgccgttgt ctcttgacct ggcacataga atacttcttg tgttggccga tatccgccgc 480 

gaaggtaaag aaatgactta tcttcgtccg gatgcaaaga gccaggtaac cattgaatat 540 

gatgataacg gtactccggt acgcattgat acgattgttg tttcaacaca gcatgatgaa 600 

tttatattac cggctgatga ttctgccgct gcccaactga aggctgatga agagatgttg -660 

gcagtgatcc ggaaagatgt gattgaggtg ctgatgcctc gtgtcattgc ttctattaat 720 

catccgaagg ttcttgcttt gttcaacgac catattatct atcatgtgaa tccgaccggt 780 

aagtttgtga tcggtggccc tcatggagat acaggactca ccggacgtaa gatcattgtg 840 

gacacttatg gtggaaaggg agctcatggt ggcggtgctt tctccggtaa agatccaagc 900 

aaagtagatc gtagtgctgc ttatgctgcc cgtcatattg ctaagaatct tgttgctgcc 960 

ggcgttgccg acgaaatgct ggtacaggtt tcttacgcta tcggtgtggc tcgtcctatt 1020 

aatatttatg taaatacata cggacgcagt aacgtgaaga tgagtgatgg agagatcgcc 1080 

aaaaagatcg atgaactgtt tgaccttcgt ccgaaggcta ttgaagaccg cctgaaactg 1140 



450 



cgttatccga tttatagtga aactgctgct tacgggcata tggggcgtga acctcagatg 12 00 

gtgactaagc attttcaatc tcgttatgaa ggcgaccgga ctatggaagt ggaactgttt 1260 

acatgggaaa aacttgacta tgtggacaaa gtaaaagccg ctttcggttt gtaa 1314 

<210> 1108 
<211> 1320 
<212> DNA 
<213> B.fragilis 

<400> 1108 

ttaaacttta aaacttatta tcattcgatg aattttgtag aagaactaag atggcgtgga 60 

atggtgcatg acatgatgcc cggcacagaa gagttattgg ctaaagaaca ggtgactgct 120 

tatgtgggta ttgacccgac agccgattca ttgcatatcg gacacttatg tggtgtgatg 180 

atattgcgtc acttccagcg ttgtggtcat aaaccattgg ctttgattgg tggtgcgacc 240 

ggtatgattg gcgatccttc gggtaaatcg gccgaacgca atctgctgga tgaggaaaca 3 00 

ctgcgtcaca atcaggcttg tatcaaaaag caactggcta agtttttgga cttcgaatct 360 

gatgctccta acagagctga actagtgaac aactatgatt ggatgaagga gttcactttc 42 0 

ctggattttg cccgcgaagt aggtaagcat attactgtga actacatgat ggctaaggaa 480 

tcggtaaaga aacgtctgaa cggtgaagcc cgtgacggat tgtcgtttac tgagtttacc 540 

tatcagttgt tgcaaggtta tgactttctt catctctacg aaaccaaagg atgtaaactg 600 

cagatgggag gctctgatca gtggggaaat atcactaccg gtactgaact gattcgtcgt 660 

actaacggtg gtgaggctta tgcattgact tgtccgttaa tcaccaaagc tgacggtgga 720 

aaatttggta agaccgaatc gggtaatatc tggttggacc ctcgttatac ttctccttac 780 

aagttctatc agttctggct caatgtgagt gatgccgatg ctgagcgcta tattaagata 840 

tttacttcac tcgataaggc agaaatcgac ggactggttg ccgaacataa tgaagctccg 900 

catttgcggg tgctccagaa acgtctggca aaggaagtaa ctgtgatggt tcactctgaa 960 

gaggattaca atgctgcagt agacgcatcc aatatcttat ttggtaatgc cacttccgat 102 0 

gcgttgaaaa agctggatga agatacattg ttggctgtgt tcgaaggtgt tcctcaattt 1080 

gagatctcac gtgatgcgtt ggtagaggga gtgaaagcgg ttgatttgtt tgtcgacaat 1140 

gccgctgtat ttgcttcaaa aggtgaaatg cgtaaattgg ttcaaggtgg cggtgtctct 12 00 

ttgaataaag agaaactggc tgcttttgat caggtgatta ctactgccga cttgcttgat 12 60 

gaaaagtatc tgttggttca gcgtggtaag aaaaactatt atttgattat tgcaaaataa 132 0 

<210> 1109 
<211> 897 
<212> DNA 
<213> B.fragilis 

<400> 1109 

cccctgttta atcaaatttt atttcaaatg aaaaatctta tactggtgtt aggttgtttt 60 

ttctttctaa tctcatgtca gcagaccgag aaggaaaaac ttgaagaact tgttaaaaat 12 0 

tggaatggga aggaggtact atttccgaca aatcctagtt ttacgttata tggaaaaact 180 

cctgtcgatt ttaaaatccc tgtttcggat tataagatcg tgacctatgt cgattcgttg 240 

ggttgttcca gctgtaaatt gcaattgcct aaatggaagg aatttatgaa atatgcggat 3 00 

tctatagtag gctatcaaat accggttctt ttttttcttc atcctgctaa tgttcgcgag 3 60 

atgaggtctg tgttaaaaca aaatcgtttt gattatcctg tttgtatgga tacggaagat 42 0 

acttttaata aagtgaataa gtttccttca cagctaaatt ttcaaacttt tttattagat 480 

aaaaacaatc atgtaattgc gataggaaat ccggtccata attacgatgt aagagaactc 540 

tatatccatc ttatttcagg aggaatagat ggagattctc tttcaaatat gcgaacagtg 600 

ataaaaatag aggaagatat ggttgatttg gggagttttg attggagacg agagcagcat 660 

ataacttttg agatacacaa tattggtaat aataatttgg tcgtttatga taataagaca 72 0 

tcttgcggat gtacttctgt tgaatattcc aaagaacccg ttcagcccgg aaagtcttta 780 

gcagttaaag taacttataa agcagaccat ccggaacact ttaataaaac tattatacta 840 

tattgcaatg cttccgcttc tcctttggaa ttaaagataa ctggaaatgc taaataa 897 

<210> 1110 
<211> 183 
<212> DNA 
<213> B.fragilis 
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<400> 1110 

tactttctta attttcaacg tcctaaaaat ttattatcaa aacattattt agctgaatca 60 

taccttgata cactatcaga caaggtacga tttcaagggc acaaaagtat aaaattaaaa 12 0 

ggaaaacccc ataaatatta caggaaaaaa gctttaacca cttgtaaaac atcaatattt 180 

tag 183 

<210> 1111 
<211> 270 
<212> DNA 
<213> B. fragilis 

<400> 1111 

attatgagta aaaagatttt tgcggccctg atagtcgctg tagtcgcaac ttttgcaggc 6 0 

tacaatatat atcagtcaca gaaaacagag aaaatagttt cagacttggt gatagctaat 12 0 

gtcgaagcat tggcaggcga tactgagggc ggtgctacta tcacttgctc ccgtacgtgt 180 

tctgacggag taggacgttg ctacaaggtt tatgataagt ggggaaactg tcattttgac 240 

ggtagtcaaa catattcttg tacctgctaa 270 

<210> 1112 
<211> 2031 
<212> DNA 
<213> B. fragilis 

<400> 1112 

ttattaataa aaggctctgt gatgctccgt ggtgaaaaga gttatatgaa tatgaaaatt 6 0 

atgaaatata ttgggttagg cttgcttctg cttgtctgct catgcggggg tagagacaga 12 0 

caagtggagg aggccttatc cctttcgggc aataaccgta atgaacttga agcggtgctg 180 

aagcattatg aaggagatgg ccggaagctg gaggcggcac gcttcttgat tggcaatatg 240 

cccggaagtt atggagctaa tccgatagta gagcaggatt gttctgcttt ttacgaggct 3 00 

tatgattcat tgggacaaaa gtatgactat cgggtaggaa cggaatgggg gaaacaggta 360 

gatagtcttt ggaaggattt tagtaatcga catcgggtaa ggcaggaact taactatgat 42 0 

attacccgca tgaaggcgga agatttaatc cgggaaattg atctggcatt tcgggcatgg 4 80 

gtggagaatg tgcattcaag aaactgttcg tttgaagatt tttgtgagta tatactgcct 540 

tataggcgac agaatggctt attgattgac aatgcacgcc gggagttcaa caaacggcat 600 

cagggaaagt attttgtgaa agagggaaag gattggcaac aagagatcga ttcgttgtta 660 

tatgaatata agtatctgac tcattccggt ttttggggga cgaagattcc gatatggaat 720 

gcggtgactc ttgagaagat gcgtcatggg ttatgtgcac agcgctgttg gtataactct 7 80 

ttgttattat catcattggg gattccggtt gccattgact ttgttccggc atggggaaac 840 

cgaaataatt cgcatacctg gaatgtggtg ctaataaacg gggaatcgca tgcttttgag 9 00 

gcgttttggg ataatgatcg ctggaaatat aagcggattt ataataaccg ggatgatgat 9 60 

gaactttggg gaagattccg ccttccgaag gtatacagat atacctactc aaatcatatt 102 0 

gaaggaccgt tggcagatgt agaggtggat aaagctgata ttccggagct atttcgtagt 1080 

gtgaaaaagg tggatgtttc ttcggagtat tttgaaacgg ccgatgtaac ggtggagttg 1140 

acaggtgagg cgcctcaagg ggtgaaatat gcgtatttgg ctgtgttcgg atatcaggac 12 00 

tggcaccctg tgcagtgggc aaagatagaa aatgggaggg ctgtctttcg ggaaatgggt 12 60 

aaggacatgg tttatttacc cgtttattac aagcggggag gattattgcc cgcagcagaa 132 0 

cctttcagat tgcggaatga cggaacgatg gagaagctga gcggaaatga aggaacagag 13 80 

gaggttgccg tgaggatggt gacgggagcg ccggcttatg atcagaatag ggaatatctg 1440 

gggtgtatga aaggaagccg gatagtagga ttacttgatg gaaaatcaga agaagaactg 1500 

tgcagatgga cggactcgct ggctctgcag tcggttgtac ggaaggtgtc cgcacgatta 1560 

ccttaccgtt ttgtaaggtt attattaccg tcggatagca ttgctttggg ggagctttct 162 0 

ttttatacgg aagaaggacg gatcgggaat atgaggataa ttactccgat gagggctacc 1680 

ggaaggaatg aagtgcccgg gatgataacc gatggtttgg gggcgacggg ctatcgaggc 1740 

agggtggcag aaaggctggt agatatagat ctgggaaaag agtatatggt cagtcatatt 1800 

ggaatgactt cttacctgaa aacacagttg ttctgtccgg atgaatttga gttaagatat 1860 

tgggataatg gttggaagac tgtggagcgt aagcaagctg atcataaagg gtatcttgta 192 0 

tttgagagag tgccccgggg agcattgctg atgttgaaaa actgtcgctg gaaaggaaag 1980 

acggcagagc gtatatttac ttatgaaaaa ggagatgtga agtgggaatg a 2 031 
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<210> 1113 
<211> 279 
<212> DNA 
<213> B.fragilis 

<400> 1113 

gtttttaaaa acattataag tatggtaaac ttttcattag agggcaaagt ggcatgggtc 60 

acaggtgcat cttacggcat tggctttgcc ttggctactg ctttctcgga ggcaggagcg 12 0 

aagatcgtat ttaatgacat cagccgggag ctggttgata aaggcttggc cgcatacaaa 180 

gaattgggaa tcgaggccag gggatatgtg tgtgacgtta ccagcgaaga gcaggtgaat 240 

gctttggtag cgcagatcct cttcaccgcg gggctgcaa 279 

<210> 1114 
<211> 807 
<212> DNA 
<213> B.fragilis 

<400> 1114 

aacaggatgg aatggtttga agcactgatc cttggattga ttcagggact gactgagtat 60 

ttaccggtaa gcagtagcgg gcatttggcc attggttcgg ctttatttgg tatagaagga 120 

gaagaaaatc tggcatttac cattgtggtg catgtagcca ctgtgttcag tacattggtg 180 

attctgtgga aagagataga ctggattttc cgtggtttat ttaagtttga gatgaacagt 240 

gaaacacgct atgtaatcaa tatcttgatt tcgatgattc ctattggtat cgtcggggtg 300 

ttttttaaag atgaagtgga ggccattttt ggctcgggat tactgattgt tggctgcatg 360 

ctgttgctga cggctgcgtt actgtcgttt tcgtattatg caaagccacg ccagaaagag 42 0 

aacatctcga tgaaggatgc atttatcatt ggactggcgc aggcgtgtgc agtattaccg 480 

ggattatctc gttcgggcag tacgattgca accggtttgt tattgggtga taataaagcg 540 

aaactggcac aattttcttt cctgatggtg atgcctccta tattgggaga ggcgctgctg 600 

gatggtatga agatgataaa aggtgaggct attgcgggcg atattcctac tttgtcattg 660 

atagtaggtt tcattgcggc ttttgtttca ggttgcctgg cttgtaagtg gatgattaat 720 

atcgtgaaga aaggtaagtt gatttacttt gctatttatt gtgcaatagt tggagtggtc 780 

accattgtcg tttcacaatt gcaataa 807 

<210> 1115 
<211> 246 
<212> DNA 
<213> B.fragilis 

<400> 1115 

ataatatttt ggatttattg ctacttttgc ttcgtattaa cagactacct tatgaaaaat 60 

aaactaaaaa ccgcttttac attattggtg tatattgctg tcacagtggg tatttacgcc 120 

ttaatctgtc atctgaatca ccagcccttc gacgatttgc gcatactgta tgccgtactg 180 

atcggctgcg tagcctacct tccccgacac ctgatggttc gcaaatcacg aaagagccag 240 

aaatag 246 

<210> 1116 
<211> 258 
<212> DNA 
<213> B.fragilis 

<400> 1116 

ggcgagtgct gccagttgtc ttatttgtct ttgatagtcg ctgtagtcgc aacttttgca 60 

ggctacaata tatatcagtc acagagagta gaaagtatca tgtcggattt gacgatggcc 12 0 

aatgtagagg cgttagctgg ttctgagatt aatgatgagg attgtgtcag tgcatctaat 180 

cgttattgct ctgttttgat agtgacccca aatgggaatt atctagaaac ttattttgac 240 

caaaaaacaa agtactga 258 



<210> 1117 
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<211> 1584 
<212> DNA 
<213> B.fragilis 

<400> 1117 

tcaatgaaac gtcgcgattt tctgaaatgt tcacttgccg 
tctccgtcaa cctatgcgtt taatggcgaa agcaaagaga 
ctgtcaaagg ctcctgccac aaagggaggg aagccgcata 
caacatcgtg gagatgcttt acattgcatg ggcaataagg 
gataaactgg ctcaggaagg cagtttgttt gtgtgtggtt 
acgcctgcac gtgccggttt gctcacgggg atgtctccat 
tatgggaagg tggcttccaa atataaatat gaaatgcctc 
tactacactt tcgggatagg aaagatgcac tggtttccac 
catgctacgt tggtcgacga gagcggacgc agtgaaaccc 
cgggaatggt ttcagttgca agctcccggg aagaatccgg 
aataatcata atgccgggac ttataaactt gaagagaggc 
ggtcagacag cttgtgaatt gatacgcaat tatgatagcg 
gtctcttttg ctcgtcctca tagtccgtac gatcctccga 
gagaaggtag atattcctgt accgttcgtt ggagattggt 
aaagatccgg aacgggtttc aaaggatgcg gcttttgcaa 
gtcaattcac gacgccatta ttatgcaaat gtgacattta 
atcattcaga ttctaaaaga gaaaggaatg tacgaaaatg 
gatcacggtg atatgttggg agatcattat cattggcgga 
tcggccaaga ttccttatat cataaaatgg ccttctgcca 
ggaaagcgga ttgaacagcc agtagagcta cgcgactttt 
gcgggaggga cggtaccgga tgatatggat ggaaaatctt 
aataaaaacg gctggcgaaa atatatcgat ctggaacatg 
aactattggt gtgcgttgac tgatggtaaa atgaagtata 
gaagagcaac tgttcgattt aagttcagat cccggtgaac 
agtcgttatg cagataggct ggttgaaatg cgtaaagcga 
agaggaacag agtttgtaaa agatggaaag ctggcggtaa 
agtcccaatt atccgaaaga ttga 

<210> 1118 
<211> 273 
<212> DNA 
<213> B.fragilis 

<400> 1118 

aatgatagaa gccacacctt gcaaaggctg cataaatatg gacgtgtcac acagcatcaa 60 

gcattggaac ctccagataa ccaatggacc ggaatgcagg caaagtcaca aaaagatgct 120 

ttcgttcatc tctacattta tgttaataca ctattgcgga agtcttcaaa ttataaaaaa 180 

gataatccgc ttacatggtg gaaagttgat tggcagctat cacggacggt caataactct 240 

ccgggttact gtacctatca acggatattg taa 273 

<210> 1119 
<211> 2079 
<212> DNA 
<213> B. fragilis 

<400> 1119 



tgggagcagg 


tttggcagcc 


60 


ccggtaatga 


ttcttccaaa 


120 


tcatttttat 


tatgagtgac 


180 


cggttatttc 


tcctaatata 


240 


acagctctgc 


acccagtagt 


300 


ggcaccatgg 


gatgctggga 


360 


agatgttgcg 


tgacttgggt 


420 


agaaagcttt 


gcatggtttt 


480 


gtgattttat 


cagtgactat 


540 


atttaacagg 


aatcggctgg 


600 


tgcatcctac 


ggcctggaca 


660 


atcaaccttt 


gttcctcaag 


720 


aacgatattt 


ggatatgtat 


780 


gtgggaaata 


tgctgaacgc 


840 


atctaggcga 


agagtatgct 


900 


ttgatgacca 


gattggacag 


960 


C1CL u CL i— l. i~y 




102 0 


aaacctatgc 


atacgaaggg 


1080 


tgactacaca 


ggctatcaga 


1140 


tacctacctt 


cattgaactt 


1200 


tggtagccct 


ggcttcagga 


1260 


ccacctgcta 


cagtgccgat 


1320 


tctggtttat 


tcatacgggt 


1380 


agaagaacct 


ttccggaaat 


1440 


tggttgacca 


cttgcaagag 


1500 


gagatcaaac 


cttactttat 


1560 
1584 



tcacggcttt 


caataccggc 


60 


gggccgcctc 


ccaatgggat 


120 


ccatggatat 


gcaggtacag 


180 


cacagttctt 


tcctcacgac 


240 


gagcctgtgc 


cgaagtgtat 


300 


ctgcctttgc 


atgcgaaata 


360 


ttaaagccga 


caataaagcc 


420 


tctacggcgg 


tatttatcgt 


480 
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cccgtatggc tgattgtaac cgaacagaat aacataacgg ttaccgattg tgcctcgccg 540 

ggtgtctaca tcacccaaaa ggatgtatcg aagaaatcgg ccgatatcac cgtaaaagtg 600 

aaattggata atgcaggact tcaacctgct gctgtaacac tcgaaaacac tatttatacg 660 

caggagggcc gaaaagtcgg tacacacagc cggtcgtttg acttgagtcc gcaagggaca 720 

caaacttatt tgtccacttt taaactgaag aacccacatc tctggcaggg acgtaaagat 780 

ccgtatcttt ataaagttgt ctgcaggctg atggcagacg gaaaagtaat cgatgaagtg 840 

gtgcagcctc tcggggtgcg gaagtatgag atagtagccg ggaaaggctt tttcctgaac 900 

ggagagaagt acccgatgta tggtgtgacc cgtcatcagg attggtgggg attgggtagc 960 

gcccttaaaa acgaacatca cgatttcgat ttggctgcca ttatggatgt gggagccact 102 0 

actgtccgtt ttgcccacta ccagcaatca gactaccttt attcccgctg tgatacattg 1080 

ggactgatta tttgggccga aataccttgc gtgaaccggg tgaccggata cgaaactgag 1140 

aatgcgcaaa gccagcttcg cgaattgatc cgccagagtt tcaatcatcc ttccatttat 1200 

gtatgggggc ttcacaatga agtatatcaa ccacatgagt atacagctgc attgacccgt 12 60 

tctctccatg atcttgccaa gacagaagat ccggaccgtt acaccgtttc ggtcaatggg 132 0 

tatggtcaca tggatcatcc ggtcaacctg aacgcagaca tacagggtat gaaccgttat 1380 

tttggctggt acgagaaaaa gatacaggac atcaagccat gggtggaaca acttgaaaaa 1440 

gactatccct atcaaaaatt gatgttgacc gaatatggtg ccgatgcgaa tctggctcat 1500 

cagaccgaat accttgggga tgccctgaat tggggaaagc ctttttatcc ggaaacattt 1560 

cagactaaga cacatgagta ccagtggagt attatcaaag accatccgta catcattgct 162 0 

tcttatctct ggaacatgtt cgattttgcc gtacctatgt ggactcgtgg cggtgtgcct 1680 

gcccgtaaca tgaaggggct gattaccttc gatcgtaaaa caaagaaaga ctcttatttc 1740 

tggtataaag ccaactggag cgaagagccg gtactctatc tcacacagcg tcgcaatgcc 1800 

gatcgtgaaa agcgaacgac agccgttacc gtttattcca atatcggaat tccgaaagta 1860 

tacttgaatg gacaggaact gagtggcatt cgcaatggct ataccgatgt acattatgtg 192 0 

tttgacaatg tatcacttgc cgacggaaaa aatatactga aagctgtagt ctcaactaag 1980 

gggaaggaat atactgacga gattgaatgg aattattccg gtgagaaaaa cagggaaatc 2 040 

gattcatatg aaaataagaa tgaacattcg ggcttttga 2079 

<210> 1120 
<211> 240 
<212> DNA 
<213> B.fragilis 

<400> 1120 

ataacaatgg gtgataaaca aaaatttgcc ttcgataaaa cgaacttcat tctgcttgct 60 

atcggcatgg cagtggttat tctgggcttt atcctgatga cagggccttc atcgtcggaa 120 

acggtgtttc aggcggacat tttcagtgtg agaagaatta aggtggctcc ggtggtctgt 180 

ttcctgggct ttatttttat gatatatggc gtgatgcgca aacccaaaac aaaagaataa 240 

<210> 1121 
<211> 204 
<212> DNA 
<213> B.fragilis 

<400> 1121 

ggtgttacgg ggttgcttga tatgtatatt ttttgttttt caaatttttc tttatgcagg 60 

gatggaacag acaagcggga ggaggctttg ttattttcgg gcaataaccg tgatgaattt 120 

gaaacggtac tgaatgcaat ctatcttgga ggaatattgg aattagttga ctcgtacaag 180 

agttcttgtt tgcgtgtgaa atga 2 04 

<210> 1122 
<211> 1065 
<212> DNA 
<213> B.fragilis 

<400> 1122 

tttattatta tgaataaaat tgtatcgttt tttattttaa ttcttgtttg ttcttgttca 60 

gataggaaag agcatatagc aaatattctg tcaatgaatt ctatagatat taaaactaat 120 

aggattaata aagacgaagt aattgctaga ggtgcatttc cttttgaaat gattgattca 180 
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ttattctttc ttttcaatgg agatccttcg tctggagcat tggttttatg tgaatcgaat 240 

gcttccgaac tgggacactt cttgcaaaaa ggaaatggtt ttggagaatg tatcactcct 3 00 

ggatatatag gacattgcaa tgatactatt tatgtttctg aacgttctag aacaaggcga 3 60 

atgacttatt tactatcaaa tcataatgat agtttgcaat ataagtgtct tgaagatgtt 42 0 

agtcctaaaa tgaattcaga attttattat cagatttgtc gtctacaaag tggtttattt 480 

gtaggtgccc gtttatttgg aaaagaacat ttgtttacat tgttagacga aagtttggat 540 

acacttacca cttttgcacg ggtgccgata gacattgagg aaaatgcaaa taataagctc 600 

gctcctttta ttggtcattt atgtatagat gataatacgg tttattatgc ttctaatgac 660 

ttttcttata tggctgctta tgatatttta tctgagaaag agataaaacc agtatttgag 72 0 

aggatgtata tatctccaat aatccaaaaa tcagcgaatg ggatttcatt agataaatac 7 80 

aaacatcttt tgggctttgg tgatatcagg gtttatcaga attatatttt tgcgacgtat 840 

atagggaaac ctgatataac aatggatcaa gagaatgata tttcagcttt agtccccact 900 

catttgctgg tttttaataa agatggagtt ccaattgtaa aatttaagtt tccgtttaaa 960 

ataagatcat tcgtgtttac caaatccaag atgtatctat tagatgtgga ttgtaatata 102 0 

gaatctgtcg atttggtaga gttgtggaag catttgcccg attga 1065 

<210> 1123 
<211> 1074 
<212> DNA 
<213> B.fragilis 

<400> 1123 

acgcacatta ttatcatgaa actgtcgcaa tttaaattta agttacccga agaaaagatt 60 

gctttgcacc ctacaaagta cagagacgag tcgcgcttga tggtactcca caagcgtacg 12 0 

ggagagattg agcacaagat gtttaaagac atcctgaatt attttgatga taaagacgtg 180 

tttgtattca acgataccaa ggtgtttcct gcacgcttat acggaaataa ggaaaaaaca 240 

ggtgcgcgta tcgaagtgtt cttgttgcgc gagttgaacg aggaattgcg tttgtgggat 3 00 

gtattggtag atccggcacg taaaatccgt atcggcaata agctttactt tggggatgat 3 60 

gactcaatgg ttgctgaagt aattgataat actacttcac gtgggcgtac gcttcgtttt 42 0 

ctgtatgacg gacctcatga tgaatttaaa aaagcattgt atgcattagg agaaactcca 480 

ttgccacata cgattctgaa ccgtccggtt gaagaggagg atgcagaacg tttccagtct 540 

atcttcgcta aaaacgaagg ggctgtgaca gcaccgactg caagtctgca cttcagccgt 600 

gagctgatga aacgtatgga aattaaggga atagactttg catatatcac attgcatgca 660 

ggactcggaa acttccgtga tatcgatgtg gaagacctga caaagcataa aatggactct 72 0 

gagcagatgt ttgtgacgga ggaggctgtc aagatagtga atcgtgctaa agatctgggt 7 80 

aagaatgtat gtgccgtggg aacaactgta atgcgtgcta ttgaaagtac ggtaagtaca 840 

gacggacatt tgaaggaata cgaaggatgg acgaacaagt ttatcttccc tccatacgac 90 0 

tttactgtgg caaatgccat ggtatcaaac ttccatatgc cgctttctac gttattgatg 960 

attgtggctg cttttggtgg ctacgatcag gtgatggatg catatcacat agcgttgaag 102 0 

gagggttacc gctttggtac ttatggagat gcgatgctga ttttggataa gtga 1074 

<210> 1124 
<211> 852 
<212> DNA 
<213> B.fragilis 

<400> 1124 

ataagaaata tgaaaacaaa ctatgagatt cgctatgctg cccatccgga agatgcaaga 60 

agctacgaca ccaagagaat tagaagagat tttctgatag aaaaggtttt ttcagccgat 120 

gaagtaaaca tggtatattc catgtacgac cgtatggtgg taggtggggc catgccggta 18 0 

aaggaagtgt tgaaattgga agctatcgat cctttgaaag ctccttattt tctgacccgt 240 

cgtgaaatgg gtattttcaa tgtcgggggg tccggtatcg tgagggcggg tgatgcgata 3 00 

tttcagttag attataaaga ggcactttat ctgggggcag gtgaccggga cgttaccttt 3 60 

gagagtacgg atgctgcaca tcccgctaaa ttttatttta attcactggc cgctcatcgc 42 0 

aattatcccg ataaaaaggt gactaaagcc gatgctgtag ttgctgaaat gggaacgttg 480 

gaaggttcga atcatcgtaa tatcaacaag atgctggtaa atcaggtgtt gcccacctgt 540 

cagttgcaga tggggatgac cgaactggct ccgggaagtg tgtggaatac gatgcctgca 600 

catgtccata gccgtcgtat ggaggcttat ttctattttg aagtaccgga agagcatgct 660 

gtgtgccatt ttatgggtga ggttgacgaa acccgtcatg tgtggatgaa gggcgatcag 72 0 



456 

gcagtcttgt caccggagtg gtctatccat tcggctgcgg caactcacaa ttatactttt 780 

atctggggta tgggaggtga gaatcttgat tatggcgacc aggacttttc attaattacg 840 

gacttgaaat aa 852 



<210> 1125 
<211> 639 
<212> DNA 
<213> B.fragilis 



<400> 1125 

tctatggata ttctcgatat acatacacat cggatgcctg ttgaacttgg acaggcgata 60 

caaaattgtc agcccgcaga gtttgatccg ttggccggtg cttattattc tgtcggaatt 120 

catccgtggt atctgactcg tgaaaacctt gaccggcagt gggagatgtt gcttgcagcg 180 

atacagtgtc cccaggttct ggcaataggt gaagccggtc ttgacaaatt ggttcggaca 240 

gactatatgt tgcaacagga agtatttgag aaacaggcta tgcttgcaca cgaaatgaaa 3 00 

tatccgttgg tgattcatgc agtgcgttcg gcgaacgaaa ttatctgcct gagaaaaaaa 360 

atgaaaccct ctaatccttg gattatacat gggttccgtg gaaagaaaga actggctttg 420 

cagtacatcc gggaagggat ctatgtttca ttgggtgaga aatatcagga ggaagtgctt 480 

tggggcattc ccttggaata tttatttttg gaaacggatg aaagtatgat agatattcat 540 

tgcctttatg aacgtgctgc tttgctattg gagatacctc tttgcaaact tatgcaacaa 600 

gtgcgtcaaa acattaataa cgtctttttt aggcaataa 639 



<210> 1126 
<211> 231 
<212> DNA 
<213> B.fragilis 



<400> 1126 

gaaactggta tgaaaggcct gttgtcatat atattgttgc ttcctatcta cttttacaga 60 

gcatgcattt cgcccatgac tcctccttca tgtcgtttca ctcctacttg ttcgcaatac 120 

gctattgaag cgattaagaa acatggccct tttaagggac tttatcttgc tgtcagacga 180 

attctccgtt gtcatccctg gggtggatca ggatacgatc cggttcctta a 231 



<210> 1127 
<211> 606 
<212> DNA 
<213> B.fragilis 



<400> 1127 

actaaaccca tgcaagaaat gaaccaaata acttccgtat gcgtatattg cgcttcaagt 60 

acaaaaatag accagactta ttttgatgca gccataaaat tagggcatct gttggcaaac 120 

cggcatatcc gtttgataaa tggagcgggg aatataggat tgatgcgttc ggtggctgat 180 

gcagtattgc agaatggggg agaagtgacc ggagtaattc ctcattttat ggtagaccag 240 

ggatggcatc acacaggatt gacagagctt atcgaggtag aaagtatgca cgagcgtaag 3 00 

aggttgatgg ctgaaaagag tgatgccgtg attgcactgc ccggaggatg cggaactttg 360 

gaagagctgc ttgagattat tacctggaaa cagttggggt tgtatcttaa cccgatcgtg 420 

atattaaaca ctaatggttt ctttgacccg ttgctggaga tgcttgaaaa tgctatagaa 480 

ggtaatttca tgaggaaaca gcatggagat atctggcatg tggcacatac tccggaggag 540 

gctgtagagt tggtttattc cataccggtg tgggatggtt ctattcgtaa gtttgccgct 600 

atatga 606 



<210> 1128 
<211> 1128 
<212> DNA 
<213> B.fragilis 



<400> 1128 

actgactttt tggttagggt gttacgggtt tgcttgatat actttaaaaa tgaaatatac 60 
gttagttgtt taaaaataac gaagatgaaa caaaaatata tcttatttct ttcattgtgt 120 
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tttgtgcttt tttcatgtag gaagaacgat gttggttcgg tctattcatt tgacacagta 180 

tgcaaaattg tagactataa tcttatttgt tcagatgaaa atgctccttg gggagctata 240 

atgaatatgg aaatagtaga cagcatttta attctccagc atgcaatgga tgaatatgca 3 00 

ttttcattta ttaatgtaaa taatggggag ctattatctc aatgggggcg tacaggcgaa 360 

ggaccagaag agtttataga ttttgggtct ggttttgaaa tcgttgactc gagaattgtt 42 0 

tttctggatc gaatgaagaa agaaaggatt tcggtattat tatctgatat cttaagcaaa 480 

aaagaacatc cggatataac aagggaggct tatccttata atgtggattt cagggttttg 540 

gagattaatg ctgttggcaa taaaaaaata gtaacaggcg ggtttaagaa aggttattgg 600 

ggagctcttg actcacagaa tcatattata cctaatgtgg cagagcttcc tttcgatgcg 660 

ggtgaggtgt ccggtttaga gaagggtatc gttttcggag gtatattgaa ggcaaatagt 72 0 

aaacaatcca aatttgtgct ttcaatacgt gcttctgata ttttcgaaat ttatcgtgtt 780 

tctgacgatg gaataaatcg tgtctatgtg agtcctttta agcatattcc gaaaacctgg 840 

aagaagggag gcggttatgc aattgattat aaccaaagta ttggaggaat aaaaaatata 900 

gcagtctcgg atgacttgat ttgtttttca ctctttttac aaaattacaa tgaggctgca 960 

aaaacagact ttgcgtctaa tgaactgttc tgttttgatt gggatgggaa taaagtgaaa 102 0 

aaatatgtgt taccttttcc tataggtaat ttctgtattg atggaactca tatctatgga 1080 

gttcggaact ttgaagataa aattatcatt tatcgtttta acatgtaa 112 8 

<210> 1129 
<211> 3297 
<212> DNA 
<213> B.fragilis 

<400> 1129 

agtaaacaaa taaattcagt aagtatgaga aaactttttg tatgtattgc actgggtctc 60 

accacgctta ccggcaatgc cacatcaccg ctatggatgc gcgatgtaca gatttcgccg 120 

gacggaacag aaatagcgtt ttgctacaaa ggagacattt ataaggtatc tgcgggagga 180 

ggaacagcca tccagctcac aacacagcct tcgtatgaat gtacccccat ttggtcaccc 240 

gacagtaaac aaatagcttt tgccagtgat cgtaatggca actttgatat ttttgtaatg 3 00 

cctgcgacag gaggtacagc acaaagactg actacccatt cttcatccga actgccttcg 360 

gcttttacac cggatggaaa atacattctc ttttcagcat ccatccagga tccgtcacaa 420 

agtgctttgt tcccgacaac agccatgaca gaactataca aggttcccgt gaacggagga 480 

cgtacggagc aggtactggg tactccggcc gaagccattt gttatgcgcc atcaggagag 540 

ttctttctct atcaggatcg taaaggtttt gaagatgaat ggcggaaaca ccacacttcg 600 

tccatcaccc gcgacatttg gctgtacgac actaaaacag gaaaacatac caacctgacc 660 

aatcatgccg gagaagaccg caatcccgta ctttcaccgg acggaaaaag cgtatatctt 720 

ttaagcgaac ggaaagggtc atttaatgtt tatagttttc cattggacaa cgcacaagac 780 

ctgaaagcag taacatcgtt caaaacacac ccggtacgtt tcctgtcaat gagtcacggc 840 

ggaacgctat gctatgcata cgacggagaa atatataccc aaaaggataa tgccactcca 900 

cagaaaataa acatagatat tgtccgtgat gatcaggaca aaatagcaga cctgactttt 960 

acaaacgggg caacatcagg gactgtatca ccggatggga agcaaattgc atttatcgta 1020 

cggggagaag tatttgtaac ctcaactgat tatgcaacta caaagcaaat cacccataca 1080 

cccgcacgcg aagccgggtt aacatttgct ccggacaatc gtacactggc ttacgcaagt 1140 

gagcgtaacg gcaactggca actttttctt gctaaaatag cccgtaagga agaagctaat 12 00 

ttccccaatg ccaccatcat cgaagaagag gtgctgttac catccgcaac cgtggaacgg 12 60 

gcctatccgc agttctcacc ggacggtaaa gagctggcat ttatagagga gcgtaaccgt 132 0 

ttgatggtaa tcaatttgga tacgaaaaaa gttcgtcaga tcaccgatgg ttccacctgg 1380 

ttcagcacag atggaaactt cgactatcaa tggtcacctg acggcaaatg gttcaccctc 1440 

gaatttatcg gcaaccggca cgatccttac tcggatatag gattggtaag tgcaaagggt 1500 

gacagtccga ttaccaacct gaccaacagc ggttacatga gcggatctcc ccgttgggta 1560 

ctggacggca atgccatttt gttcacaacc gaacgatatg gtatgcgtgc acatgcttcc 1620 

tggggttcac agaatgatgc catgctggta tttctcaatc aagatgcttt cgacaagttc 1680 

cgcctgagca aagaagatta tgaattgcaa aaagaactgg aaaaggaaca acagaaagac 1740 

aaagaaaaag cctcaattga cccgaaaaaa gataagaaga aggatcccca aacagatact 1800 

gagaagaaag atgagatcaa aaatatccta gtagaactga atggccttga ggatcgcatc 1860 

atacgcctca ctcccaactc ttcgaacctg ggcagtacta ttatctcaaa agacggcgaa 1920 

actctctact atctgtcagc attcgagggc ggatttgatc tatggaaaat ggatctccgt 1980 

aaaaaagaga ccaaactgct tcataaaatg aatgccggat gggcttctat ggatatggac 2 040 

aaagatggaa aatccctgtt tgtcctggga ggtaatgcca tgcaaaagat ggacctcagc 2100 
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ggagaaaccc tgaagccgat caactataag gcagagatga aaatggacct ggctgctgaa 2160 

cgagaatata tgttcgacca tgtatataaa caacaacaga aacgtttcta caacaccaac 222 0 

atgcacggag taaactggga taccatgtct gctgcttatc gtaaattttt gccacacatc 2280 

aataataact atgactttgc cgaattactc agcgaatggc tgggagaact gaatgtatca 2340 

cataccggcg ggcgtttctc tccatctata ccgggagatg ccacagccag tctgggggta 2400 

cttacagatt ggaattataa aggaaaaggc gcatcgatca tggaagtgat tgagaagggc 2460 

cctttcgatc acgcccgctc gaaagtaaaa gccggaacta tcattgaaaa aatcaacgga 252 0 

caggaaataa cccctgaaac agactatcat acgttattga acgacaaagc aaacaaaaag 2580 

acactcgttt cattatacaa tccgcaaagc ggtgaacggt gggaagaagt agttatcccg 2 640 

atcggcaacg gaatactcaa taatctgctc tacaaacgtt gggtgaaaca acgcgcggcc 2700 

gatgtagata aatggtctga cggacgtctg ggatatgtac atatacaatc gatgggtgat 2760 

gacagtttcc gttccgtcta ctcagatatt ttaggaaaat acaataatcg cgaaggaatc 2 82 0 

gttatcgaca cccgtttcaa tggcggcggc cgccttcacg aagatattga agtattgttc 2 880 

agtggtaaaa agtatttcac ccaagtcgtc cgcggacgcg aagcttgcga tatgcccagc 2940 

cgccgatgga acaagccgtc tatcatgcta acgtgtgaag ccaattactc gaatgcacat 3 000 

ggcacaccat gggtatacag ccatcagaaa ctaggtaaat tggtgggtat gcccgtaccg 3 0 60 

ggaaccatga ccagcgtttc ttgggaacgt ctacaagacc cgtctctggt attcggtatt 3120 

cccgtcatag gctatcgact tccggatggg agctatctgg aaaacacaca gttggaaccg 3180 

gatattaaag tagccaactc accggaaaca atcgtcaaag gggaagatac acaattgaaa 3240 

acagcggtag aagaattgct gaaagaactc ccggcaggca aagggaaaaa gcattaa 3297 

<210> 1130 
<211> 1773 
<212> DNA 
<213> B.fragilis 

<400> 1130 

gaacataaaa ttaaaattta tctagctata attatgaaat caattctcac tttcttacta 60 

attatcttaa tggacataca atttaattac gcatgccctt gtagcccaac caatacactt 120 

attgaaatga atgaaagttt cgcgagtcag tttcaaacgg ccaccattat tccaatgttc 180 

ttatggcaac cgtcatggtc ttatcctatt gagggcctgg caatagggtt acttatctcc 240 

cttatcgtat attaccgaat ggtatacagc acaaagctat ttcctcacga aaagctgaga 300 

ctgatcttaa acataaccca taaaactcag acaccgttaa ctttgatcca ccacctactg 3 60 

gaagaaatca tttcggacag tctctccgaa tctacatccc aaaaagtaaa gcggatactt 42 0 

agatacacca gtcatattat gagttgctac cagaacattg cggtattcga cgataaggag 480 

aatgaactgc acccgggctc ctctcccatt gaattcgaac tttacacttt tataacctca 540 

atcgtcaacc aatgccgggc gtacgccgat actcgtcaaa taaaattaaa tattaataaa 600 

gacttcagtt atatcagttg ccgggtggac gaaataacga tgactgccgc tctgcaatgc 660 

ctgctgaata aaatgataga agccacacct tgcaaaggct gcataaatat ggacgtgtca 72 0 

cacagcatca agcattggaa cctccagata accaatggac cggaatgcag gcaaagtcac 780 

aaaaagatgc tttcgttcat ctctacattt atgttaatac actattgcgg aagtcttcaa 840 

attataaaaa agataatccg cttacatggt ggaaagttga ttggcagcta tcacggacgg 900 

tcaataactc tccgggttac tgtacctatc aacggatatt gtaataccat ccaatgtccg 960 

gaagtagtgc ctcctgtaat gaaagatgat aaaatcatcc gtcccgataa aaaacagcat 102 0 

cacatactgt tggttatggc agatacagag ttaagcaact atttgcataa ggcgttctcc 1080 

atacttttca gaataacgat ccttgaaaat ccggaacaga tattacattt ctcgggagat 1140 

cggctaccgg atattatcgt tattgacgaa acggtaaacg gcatacgcgg caaggaaatc 1200 

tgttctaaaa taaaatcgaa tacaagcatg gttcatattc ctgtcattct cctgatcagt 1260 

aacaatgata acggaagtta tcttgcccat gcggactgtg gagtagataa attggaaccc 1320 

cgcgcaatca atatttgcag actcaaaatg gatatacaaa tacttatcaa taagcatgaa 1380 

cgtatcatga aactcctgga gaaaaacctg tcggacaatc tgccttcacc aactgcaaaa 1440 

agtgaagagg acgcactgtt cataaacaaa gtgaacaagc ttctggaaaa gaatctttca 15 00 

acagaaagct atacagttga catgttaagt gccgatatgg gaatgtgtcg taccaaattc 1560 

tacaccaaaa taaaagaaat tacagacaag acacctacag aatacatgca ttatttcaaa 1620 

atgaataaag ctaaaatttt attggttacc caacaatata cagttacgga aatagccact 1680 

tttctaggct tttgtaatgc caaatatttc ggaaaacgat ttaagaaatt ctataaagtt 1740 

ccacctacac aatatattaa agaggttttc taa 1773 



<210> 1131 
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<211> 1131 

<212> DNA 

<213> B.fragilis 

<400> 1131 

tgtaataaaa ttatgaagaa tatcttgtta acacttctct tgtttatact tttttcatgt 60 

agaagcactg gagataaaac cgactgtgaa gtattacatg tcgatttggt tgaacgccct 120 

gttccaacag aagaattatt ttctaaaata tctgtcattc cattggaaac caatgatagt 180 

tcctttcttg taaggcctgt gaaagttatt ataaaagata acagatatta tattgtcgat 240 

gaaggggttc cggctgtgtt ttcttttgat gaagaagggc atcttttgca taaaataggt 3 00 

aaaaagggac aaggtcccgg agagtatcgt gaaatatacg atgccgttat taaagaaaaa 360 

gaaaatacag tgtatatgct gtctccattt ggctctcttt atgtgtattc tctggatgga 42 0 

aaattcataa aagaaataaa actgccaact aggtcgaatt atcaattgat agaggagctg 430 

gatagtaagt atttcgttac atggacatta cctgcttctg agaatgaaaa ttgtatcagc 540 

gttatttcta aagagtcttt caataatgtg aaagaatttt ggcatgtccc tcccgttctc 600 

actactctga attctaaacc tttttataat tatgaacata aagtatattt ttcgaatcct 660 

tatcaaaatg aagtatatga agtaaggaca gatagcttac gggttgcata ccgttgggat 72 0 

tttggaaaag ataatcttga tttgaaggag tatggattca ctttattaga ggatcaaaag 780 

gttgaggaat ataaattaat gttgcagtgt ttacgtgatt ctactgtacc ttatttatta 840 

aggcatcaat ttcagaataa aaaatattat tataccatgt tgacgtttgg ctttcggcat 900 

cggataaatc tgttttatcg aaaggatgac ggcaagagtt tcttttttga gaaaacagcg 960 

gaaggtgttt tgctccat cc tttagccttt aatgaagatt ttctgacttg tattgttttc 102 0 

aatgaagact ttccaaacta tgaaaaagtg cttcctt egg aggaatataa gaagctggaa 1080 

gagegtttag aagatgataa cccctgttta atcaaatttt attttaaatg a 1131 

<210> 1132 
<211> 1068 
<212> DNA 
<213> B.fragilis 

<400> 1132 

ctttatagaa taaatatgaa taatatgaga tttaatttag tegttttatt tgtaatttta 60 

ctttctttct attcttgtgg cagggaagaa aaaactgtgt atgattttcc tttagaacaa 120 

tegcttaaga gtgataagga agttagttta aacaaggaac tattagctcc ctatctgatg 180 

tgttcttatg attccactct gtgtctaata gattggactg ccaatccgat ggtgcatgtt 240 

tataacatga atacagggaa agagatggtt gcttttggga ataaaggcat gggaceggat 300 

gattttctat ctatatccca aatgtatgta gatatgggca agcgttcttt ggtactgtat 360 

gatcagtctt tgcaaactat aagttctttt caaattgata gtttagctca aggcagtctt 42 0 

tcaaagatag attgtgtttc agctcctaag ttaggaatga atagggtata tgettatteg 480 

gattccatat tttacggaag tgggactttt gaaagtggct tgatagcgaa atgcaatcag 540 

aaagagattt taaatcaata tctccctttt ccacacacag ageaageggt aaategggat 600 

gtaaactatt tgttgtttca gggggatctt attatgaagc eggataaaaa aegttttget 660 

tacttggcgt atgagtgcga tttattatct attcagaaag tggtaaatga cacgtgcctg 72 0 

gaaagtgtag tacatttgaa tacgtatacg ccgttatttg agaatcaatc tactaacgaa 780 

gtgtcttctg ttaatgtctc taccgattct cctaaaggat ttcttcgtgg ggtagecact 840 

gaaaactatg tttatgeact ttacagtggg caaattggga aaaataaggc aatagcaaat 900 

gaaatttatg tatttgattg ggaaggacgt gctgtaaaga aagtgatact ggatagatgg 960 

ggtgtatgca tctcggtgga tagtaatgat gaacgactct gectgatgae aaaggaaacg 1020 

gatggtggag aagagcgtta tcattattat tgttatcagt taaactga 1068 

<210> 1133 
<211> 1080 
<212> DNA 
<213> B.fragilis 

<400> 1133 

attaaaaaga tcatgggaaa atatatcagt atattaatct taatagagac tattctttta 60 

ggatgtcact cgacaaaaga gaagattgag ttttcaaaca gagtgatttg tagegatage 12 0 

atatcgegag aactggttgt tttaaatgat aegtttttat tttegtatec tttgeaaata 180 
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gaatgtatag 
tttactctaa 
tttataaatg 
acgtcattac 
tctgaggtta 
tatgatatgc 
cggtttggat 
gtaaatacga 
ttaagacctg 
tttgatttgg 
aaatatggta 
attggttttg 
gggagtgaaa 
aaaattaaaa 
tatgtaattg 



actcaatgct 
agggtgtacc 
tggaatcatt 
gtaaaatagt 
tacaggtgaa 
tttctcttaa 
tattaaaaga 
acgatgacga 
acaggacaaa 
atgataattg 
ttgccgaagg 
aggacatata 
ctttaccttc 
cagggtgctc 
ccgagaacga 



actggtattg 
cataaaatca 
caatttatct 
gaaatatgat 
ttatgatagc 
agattcaaac 
tggaaaggtt 
agaagtctgg 
gatgttgaat 
ttccttatca 
ggcaatacct 
tgtaaccaat 
ggaaattact 
cctctctaat 
acagaatgct 



gataatgtta 
tttggagaaa 
gaagatagaa 
gtttcttctt 
ttgcctcaaa 
tttctggtaa 
actcaattat 
tcagttttct 
gcaacctatc 
ttggcaaaga 
aaatatgttg 
aatagtatat 
gtatttgatt 
attgccgtcg 
tatgagcttt 



ataataattt 
aagggcaagg 
aaatcatgta 
ttttgaaaga 
ccgaagtgcc 
aagcaaatca 
acaattcttt 
gtagcaatac 
ttggaggagt 
tactttatat 
ttttcaatga 
atacattatt 
gggcgggaat 
atgggaagga 
cttgtttgtc 



cttccatcta 
tcctattgac 
tgcctacgat 
ttcattgaaa 
tacaataatc 
taagggcctt 
ctctgattgt 
caaaaccaaa 
attggaattg 
ttacgagcct 
aacgactcag 
acatagtata 
tccaataact 
taacacaatc 
tttaaattga 



240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 



<210> 1134 
<211> 246 
<212> DNA 
<213> B.fragilis 

<400> 1134 

cacttatact ttgaagccat gaaaaactat atgaaattaa tctttgggtt aatagtcttg 60 

atcgggtatt ggcctaccac gaaaattcct aagagagtaa attccctatt tttgcaaaat 12 0 

gtagaggcgc ttgccggcag tgaacacgtt accaatttag gttgcttggg tgacggatct 180 

gtagattgtc ctattaacca tatcaaagta gaacatgtgg ttcaaggatt tagtcttggg 240 

gagtga 246 

<210> 1135 
<211> 252 
<212> DNA 
<213> B.fragilis 

<400> 1135 

cacttatact ttgaaaccat gaaaaactat atgaaattga tctttgggct tggtttaata 60 

gccttggtcg ggtattggcc tgccgcgaaa actcctaaga gagtaaattc cttattcttg 120 

caaaatgtag aggcgcttgc cggtagtgaa cacgttacca atttaggttg cttgggtgac 180 

ggatctgtag attgtcctat taaccatatt aaagtagaat atgtggttca aggatttagt 240 

cttggggagt ga 252 

<210> 1136 
<211> 1230 
<212> DNA 
<213> B.fragilis 



<400> 1136 
gataatattg 
ttatttttat 
cagaggaaca 
gaggaactac 
gatactattc 
gatgctaaaa 
gctgcttgta 
cttaaaagcc 
aatactattt 
ttcacttttt 
agatgggccg 
gttcttatca 
aatgataaaa 



taaagtatta 
ggatttttgg 
tggaagcaat 
acagaaagca 
ctttaaagat 
agagtaaaaa 
tgaagagtcg 
agcaaatttt 
cttattgtaa 
atgtgggaaa 
tatatcagta 
ttattctctg 
aacatttagc 



catttgtttg 
gatattgatg 
gtatgccgaa 
gcaagaattg 
tcgtgttaca 
gaatatttct 
tttgtctaca 
tgctaaaacc 
atgtaagaat 
tagatgcgaa 
tcatagtatc 
tagttggtat 
taatgaccga 



caaaaacaaa 
caagcatttt 
accgagtatc 
aatctatttt 
acaagtaagg 
caaagtatgg 
gatactttaa 
gatgtacata 
tgcaaagatt 
atcgaggtta 
ccatttgaag 
ttgataaaga 
gatcgtgaaa 



cagtgatgaa 
taattagtat 
ttctgaagga 
atatctccaa 
gagtcaagac 
cagagcggtc 
atttgctttg 
taaccacaac 
attgttttgg 
tagctttttg 
ttatatggag 
agtatatatc 
gaaaggttcg 



acgttctagg 
gcacttctat 
agttttgaat 
agtaacaatc 
ttttactgtt 
ttggcattca 
gaatagaagg 
tcatttagat 
aacgcataaa 
ctcttattta 
tgtgacggct 
taaaattcga 
tatacaattg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 



461 



gaaaaggatc 
ttttcagcaa 
gagtatgaga 
ctttacagat 
gaccaaacaa 
tccgagtata 
gatatgattc 
gaaaatttct 



agaaaaggtt 
aaggcgaaga 
atcaaattca 
tgagtccaaa 
tatctcttac 
ttttaactta 
gtttgcgagt 
atttttcaga 



ggaggtaaaa 
atacgaggaa 
aaaactgaaa 
agttactttc 
ttcacaagca 
tgaggagctc 
cgcaatttct 
aggatattaa 



caaaaagaat 


atgaaaaacg 


gataaaagac 


840 


gagagaaaaa 


gtatggagaa 


aatactgaaa 


900 


y dcLu u cLdy y y 


^ 3 t* c* f~ rrrrrTPt ^ 
cLcLL-L-uyyycLct 


rrrr^i ^ r^c^ f~ f~ 
y y CLCLO OCL L. L- Ct 


9 60 


gattcttatg 


caaaggtatt 


aatttgtagc 


1020 


tgtcaactgt 


tggatgcttt 


tctgaatgct 


1080 


ttgcgatatt 


tatgggaaga 


tggtactggg 


1140 


cgtttacgtg 


ttgcgctaag 


tatagatcct 


1200 








1230 



<210> 1137 
<211> 1131 
<212> DNA 
<213> B.fragilis 



<400> 1137 
tctaataaag 
aaaagcactg 
gttccaacgg 
tcctttcttg 
gaaggggttc 
aaaaagggac 
gaaaatgcag 
aaattcataa 
gagagtaagt 
gttatttcta 
actactctga 
tatcaaaatg 
tttggaaaag 
gttgaagaat 
tgcgatcaat 
tcgaaaaatc 
gaaggcattc 
aatgaagact 
gagcgtttag 



ttatgaagaa 
gagataaaac 
aagaattatt 
tagggcctgt 
cggctgtgtt 
aaggtcccgg 
tgtatatgct 
aagaaataaa 
atttcgttac 
aagagtcttc 
attctaaacc 
aagtatatga 
ataatcttga 
ataaattaat 
atcagaatga 
tattttatcg 
attttgaacc 
ttccaaacta 
aagatgataa 



tatctcgtta 
cgactgtgaa 
ctctaaaata 
aaaagttatt 
ttcttttgat 
agagtatcgt 
atctccattt 
actgccaacg 
atggacactt 
caagaatgtg 
tttttataat 
agtaaggaca 
tttgaaggag 
gttgcagtat 
taaattctat 
gaaggaagac 
tttagctttt 
tgaaaaagtg 
cccctgttta 



atccttctct 
gtattacatg 
tctgtcattc 
ataaaagata 
gaagaagggc 
gaaatatacg 
ggctctcttt 
agggcaaatt 
cctgcctctg 
aaagaatttt 
tatgaacata 
gatagcttac 
tatggattca 
ttacgtgatt 
tatatcatgt 
agtaagagtt 
aatgaagatt 
cttcctccgg 
atcaaatttt 



tgtttatact 
tcgatttggt 
cattggaaac 
acagatatta 
atcttttgca 
atgccgttat 
atgtgtattc 
atcaattgat 
agaatgataa 
ggcatgttcc 
aaatatattt 
gggttgcata 
ctttattaga 
ctactgtgcc 
tggtgttcgg 
tcttttttga 
tcctgacttg 
aggaatataa 
atttcaaatg 



tttttcatgt 
tgaacgccct 
caatgatagt 
tattgtcgat 
taaaataggt 
taaagaaaaa 
tctggatgga 
agaggagctg 
ttgtatcagc 
tcccgttctc 
ttcgaatcct 
ccgttgggat 
ggataaaaag 
ttatttttta 
gcttaagcat 
aaaaacaaca 
tattgttttc 
gaagctggaa 
a 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1131 



<210> 1138 
<211> 198 
<212> DNA 
<213> B. fragilis 

<400> 1138 

cctgcaaaag ttgcgactac agcgactatc agggccgcaa aaatcttttt actcataatc 60 

tacttttatg atgttaatta tttttctccc atgtctctct ttcggggagc cacctcctcc 120 

gtaaaaaaag taaatgctac atttactcat tccaattatt ttttcgctat cttcgtgccg 180 
ggcaacctcc ttttttaa 198 

<210> 1139 
<211> 465 
<212> DNA 
<213> B.fragilis 



<400> 1139 
gtgatgaaag 
gttgccctac 
gctactgctc 
gagacagtat 
gggcgtttgc 
ttgctctatg 
accgaccgta 



tatatttggg 
aaaaaataga 
cctggggatt 
tgtcccctgt 
ataaatcacg 
gcgataagat 
agtttgttct 



attaggtacc 
ggagcggata 
tcagtcggaa 
cgggatactg 
ggacggtgtg 
actacaggat 
ggaacccctt 



aacttgggtg 
gggaaaatca 
aacaactttc 
gaaagtactc 
tatagtgacc 
gaaaggctta 
gccgagattg 



ataaggagct 
tttctctttc 
tgaatgctgc 
aacggataga 
gcctgattga 
tagtgcctca 
cacaggacgt 



aaatcttcgt 
cgctttttat 
ggtgggagtg 
gcaggaaata 
cattgatctt 
tccgttgatg 
tgttcatccg 



60 

120 

180 

240 

300 

360 

420 
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gtgtttcata agacgattaa agagctattt ctggctcttt cgtga 465 

<210> 1140 
<211> 936 
<212> DNA 
<213> B.fragilis 

<400> 1140 

cgtttcgggg tacacttaaa tatgcagata aaaccaatga ataaactcac tataaatgcc 60 

tgtccgctat gtgggggcgc acatttgaaa cgtgctatga cctgtacgga tttttatgct 12 0 

tccggtgaac agtttgactt gtacacctgc gaagattgcg gatttacttt tacgcaagga 180 

gtcccggtag aggcggaaat aggcagatat tacgaaacac ctgattatat ttcccattcg 240 

gacacgaaga aaggtgccat gaatgccatt taccatcatg tacgtcagta tatgcttgga 3 00 

agaaaggcgc gtttggtgat gaaagagtct catcgaaaaa ccgggaggat actggatatc 360 

ggtacaggta ccggttactt tgcccatacg atgcagaata ggggatggga agtagaggcc 420 

gtggagaaga gcggacaagc ccgtaatttt gcacgcgaac atttcgggct gaatgtgagg 480 

ccggaggctg cattgaaaga attagttccg ggaacgttcg atgtaatcac gttgtggcac 540 

gtcatggagc acttggaaca tttggacgaa acgtgggaat tgttacgtga actgttgacc 600 

gagaaagggg tattgatagt ggctgtgcct aattgctcgt cgtatgatgc gatgaaatac 660 

gggaagtact gggctgctta tgatgtaccc cgtcatttat ggcattttac gcctgccacg 72 0 

attcagcagt tcgggtcgaa gcacggattt attctggcag cccgtcatcc gatgccgttc 7 80 

gatgctttct atgtatcgat gctgacggag aaacataaag gtagtgcata ctcctttgtg 840 

aaaggcatgt ggaccggaac ggtggcatgg ttgagtgccc aggctaaaaa ggaacggagt 900 

agttcgatga tttatgtatt cagaaagaaa cgctga 93 6 

<210> 1141 
<211> 234 
<212> DNA 
<213> B. fragilis 

<400> 1141 

tgggtgatgt cagtcttttg tctgcaagaa acttgtttct tgcagacaaa agactacatg 60 

gataactctt ccaagcaaaa catccatagg gaaaaacacc ttgtgttttc tttcactcac 120 

cgatacattt tgaacatcac tttcggcaga caccccctcg gctctacggt catacctgac 180 

actcttaaag cttatactga cataatgtct gctatccaaa caagacatat ttaa 234 

<210> 1142 
<211> 333 
<212> DNA 
<213> B. fragilis 

<400> 1142 

aataaaaaca ctatgtacac aatacaggca aatccaagtg gcacacgcag catggaaata 60 

tctgaagaga atttggtaac cattgaaaaa tactctttat tccagcatct gatagacagc 12 0 

aatggaattg tagatgaagc tgttctggaa aagctgaaac tcaatatacg ttctctgatc 180 

gcaagtcagg aagaagacag taaagacctg ctcgaccttt gtatagatgt gatttatcac 240 

aacaatatga aagcattcgg gttgcagcaa ctcatcaagc tctatctcac ttggttgtca 3 00 

aagcaggaag cagaagaaga ggaggaggca tga 333 

<210> 1143 
<2X1> 453 
<212> DNA 
<213> B.fragilis 

<400> 1143 

tcatttgtag cgtactacat gaaccaattt tatataatta tgaaagcaaa aatgtttttt 60 

ttagtagtag ccatgctact ttgcagaggt gtagcctatg catacagccc ttcggcgaac 12 0 

gatccgattt caaagtcggg aggaattgag caagatattc cacctcccca tattccaccg 180 

cctccgttgc cgagtcagat atttcttcat gtcggtgaaa tttccgaaac tccatatccc 240 
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atgatcgggg caatggtatg ggaatgtgat aatccaccgg taggtataga acaagaagaa 3 00 

tcatatatct ttggaactat gattatcctg cgtataaaag gagttgcagc gggaagatat 3 60 

caagtggata tgaaatggta caatccgctc aatccgggcc agcctccatt aggaagacaa 42 0 

accattgagg tcatcgttca gccgtggcct taa 453 

<210> 1144 
<211> 450 
<212> DNA 
<213> B.fragilis 

<400> 1144 

ggtgtagctc tgccttgcgg atggctacac cctatttcgg tcccttctca gacgaccgga 60 

caaccggaca gacagagcac ggatttattt aattatttca aatttattac tcagaaccaa 12 0 

agaaagtcat tatctttacc gaaaatattt tctatatact cacataaaac atttggaata 180 

atggcaaaga gaagagaact taaaaaaaac gtaaattata tcgcaggtga attattttca 2 40 

gagtgtctga tcaacagtaa gtttataccg ggtaccgaca aaaagaaagc tgacgagttg 3 00 

atggtggaaa ttatgaaaat gcaagatgaa ttcatcagtc gaatcagtca tacagaaccg 360 

ggtaacgtga aaggattcta caaaaaattc cgttcggact tcaatgcaaa ggtaaatgaa 42 0 

atcatcgacg ctatcgcgaa actgaattaa 450 

<210> 1145 
<211> 2241 
<212> DNA 
<213> B.fragilis 

<400> 1145 

atttgtagta acattcataa taacactata atgaaaaaac atttgtcatt gatattagtt 60 

ctgttaccgg tactttttct ggcact cccc gctctggcac aagaacgtaa aaaagtcgga 12 0 

gtggtactca gtggaggagg tgcaaaaggg gtagcacata tccaggcact gaaagtcata 180 

gaagaagccg gcattccgat cgattatata gtaggtacca gtatgggatc cattatcggc 2 40 

ggattatact ctatcggata tactcctcag caattggaca gcatggtgcg caaacaggac 3 00 

tggatgttct tactgagcga ccgggtgaaa cgtagcgcca tgtcactcaa tgaacgtgaa 360 

aagtcagaaa aatatgtttt ttcgtttccg tttaccaaaa gtcccaaaga tgcagtttca 42 0 

ggtggcatca taaaaggaca aaatctggcc aatcttttta cggaactgac agtgggatat 480 

cacgattccg tagatttcaa caaacttccc atcccctttg cctgcgtttc acaaaatatc 540 

gtaaatggcg aacagattgt gttccacaat ggaatacttg ccacagctat gcgggccagt 600 

atggctattc cgggagtatt caccccggta cggaaggata gtatgatcct gattgacggc 660 

ggcatgatca acaactaccc tgtagatgtg gccagatcga tgggtgcgga tatcatcatc 720 

ggggtagatg tacaaaacaa tctgaaagga atcgacaaac taaacagcgc tccggacata 780 

ctctctcaaa tcatcgacct gacaactaaa aacaaccatc aaagcaatgt cggcctgacc 840 

gatacttata taaaggtaaa tgtagaagga tattcctcgg ccagttttac tcccgcagcc 900 

atcgactccc tgatgcaccg gggagaagta gcagcccgca agcaatgggc ttctctgctc 960 

gctctcaaaa agaaaatcgg cattgcagac acgttcgtac cccaatcgca cggcccttat 102 0 

accatgttct caaaggaccg gaccctgcat gtgaaagaaa tcaccttctc cgatgtagaa 1080 

gaaaacgata agaaatggtt aatgaagaaa tgtaaactgc aagaaaacag cagaatcagc 1140 

atgcgtcaga ttgagcaggc actgttcatc ctgcgtggaa accagtctta ctcaaatgcc 1200 

agttatacac tgaccgatac tcccgaaggt tacaaactaa acttcctgct agagaaaaag 12 60 

tacgagaaaa cgattaatgt aggcatccgg ttcgactcgg aagagatagc ttcattatta 1320 

ataaacgcta cggcacagtt aaagactcat attccctcca aagtctccgt caccgggcga 13 80 

ttgggcaaac gatacatggc acgggtagac tatacattgg agccgatgca acaacgaaac 1440 

gtcaactttt cgtacatgtt ccaatacaat gacatcaaca tttacgatca tggtgaccgt 1500 

gcctataaca ctacttacaa atatcattcc ggtgagtttg gattttcgga tgtatggtat 1560 

aaaaactttc ggtttgggtt cggagcacga atcgagtatt tcaaatacaa agatttcctc 1620 

ttcaagaaac cggaatttac gatgaatgtc aattccgaat atttcatcag ctattttgca 1680 

caattacgtt acaacacttt cgacaaaggg tattttccat ctaagggaag taacttctcc 1740 

ggagcttatt cgctgtacac agacaacttt gcccgatata acggacatgc ccccttctct 1800 

gcactcagtg cttcctggga aagtgtattc tccataagta accgcctgac attgatcccg 1860 

gcactttatg gcagggtatt gatcggccaa gagatccctt atgcttatga aaatgcgtta 192 0 

ggaggcgatg tattcggccg ttaccttcct caacaactcc cgtttgcagg aatttacaat 1980 
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atagaactaa ctcacaattc cgttgccgtg gcttccttga aactacgcca acgcatgggt 2040 

agcaaacatt atatcacgtt ggccggaaat ttcgccttga gtgatgacaa ttttttcaaa 2100 

atcctgaaag gtaaccggat ttacggttgc agtatcggat acggcctgga cagtatgttc 2160 

ggtccgttgg aagcttcatt aggatattcc aatcaatcca aagacgtggg attctacgta 222 0 

aatttaggat tctcttttta a 2241 

<210> 1146 
<211> 3072 
<212> DNA 
<213> B.fragilis 

<400> 1146 

ctacctgaaa aatcaattgg gagtagatac ccacagattg agaatgctgt ataccgaata 60 

aatggacagt tagcaacttc ctcagccaat gacttatcgg caattgccgg tttttcattt 12 0 

atttcttcta cttttgcacc cgataatcat tcaaccgagc atatgaaaaa actatgtata 180 

tttctcctgc tgctctttgc tgcgacagga atcttgtttg cacaagaaat agaaaaaagc 240 

gtgaaagagc ggctcagtaa ttactttgag acttacaccc cggcatctgc caataccgga 3 00 

agctgtaagt taaaaagcgt agacatagac ttcgaaggca ggaaactatc tatctatgct 3 60 

tccgagagtt ttgcttatca gccgtttgta ccggaaacag tagacgaaat ctatcatcag 42 0 

atagaagaat tgctgcccgg cccggtacgt tttttccgaa ctacaattta tgccaacaac 480 

caacctatcg aggagttgat tcccaatttc tttcgcggga agaagaaaaa agataaatcg 540 

cggctttcaa acgcagaata taaaggagca ccttgggtga taaacacctc ccgcccttac 600 

gaaataacca aaggattgca gaaccggcac atctctttgt ggcagagcca tggcaaatat 660 

tacaagaacg ataaaggcga atggggatgg caacgtccac gtttgttctg cactaccgag 720 

gacctgttta cgcaatcttt cattctgcct tatgtcatcc ctatgctcga aaacgccgga 780 

gctaatgtct ataccccccg ggagcgggat actcaaaaga atgaggtgat tgtggacaac 840 

gatacacgaa acggttccat ctatctggag atgaaaagcc gcaaagcccg ttgggagaaa 900 

accgacggtt atgggttcgc acaaagaaaa cctgtatatg aagatggaga aaaccctttc 960 

ctgacaggta gcgcacgctt cacccggact gaaaagaaaa agaataaggc atttgccgaa 102 0 

tggattccta caatccccga aacagggagt tacgcagtat atgtatccta tcagacactt 1080 

ccaaacagcg tcagtgacgc caaatatctg gtatttcaca aaggtggtgt tacagaattt 1140 

aaagtcaacc agaggatcgg cggtggtaca tgggtatatc tcggaacctt tgagtttgac 12 0 0 

aaaggcagca atgattatgg catggtggta ctaagcaatg agagcagcga aaacggggtt 12 60 

atctgtgccg atgccgttcg tttcggcgga ggaatgggaa atatatcccg cggcacagta 132 0 

agcggactgc cccgttatct ggagggagcc cgttattctg cccaatgggc aggtatgccc 13 80 

tatgatgtct acggaggcaa acaaggaaca aatgactatg ctgacgacat caatgcacgc 1440 

tccaacacca tcaattacct gtccggtggt tctgtattca atcccggaca aaaaggactg 1500 

ggtgtcccct ttgaaatgaa cgtggcgctg catagtgatg ccggatacag taaaacgaac 1560 

gatatagtgg gatcactcag tatctatacc accgatttca ataacggact gcttaactcg 162 0 

ggaaacagcc ggtatgcttc acgtgacctg gcagatctcc tgctcaccca aatacaaaaa 1680 

gacattcgtg ccaaattcaa tatacagtgg acacgccgta gtatgtggga tcgcaattat 1740 

agcgaaacac gcctgcctgc cgctccatcc actatcgtcg aattgctttc acaccaaaat 1800 

ttcgcagata tgaaactcgg tcacgacccg aatttcaaat ttaccgtagg acgtgccatt 1860 

tacaaagccg tattacagtt catcagcagt cagcacaaca aggagtatgt agtgcaacca 1920 

ctccccgtca gcaacttcgc catcgagttt ggcaaaaaaa gaaacaccct ggaactctca 1980 

tggcagggtg aaaacgatcc gttggagcct accgcccgtc cccgcgaata catggtatac 2 040 

actcgtatcg gatacggtgg tttcgacaat ggagtacgtg tgaataaacc ttcgtacacc 2100 

ctgaaaatag aacccggatt ggtctattca tttaaggtta cagctgtcaa ccacggaggc 2160 

gaaagttttc catccgaaat cttatctgcc tataaagcca aacaagaaca tgcacgagtg 2220 

ctgatcatca atggttttaa ccgattgagc ggacccgcag taatcgacac accggacgaa 22 80 

gccggatttg acctggaaca agaccccggt gtcgcttatc aatacaatat ttcactttgc 23 40 

ggggcacaga ccggctttga tcgctctcag gcgggaaaag aaggaaaagg aagtctgggc 2400 

tatagcggaa acgaactgga aggaatgaaa attgccggaa acactttcga ctatcctttt 2460 

gtacacggca aggcaatcca agctgccgga aactacagtt tcgtatcatg cagcgatgaa 2520 

gctgtcgaaa acgggcgtat acaaccggaa cattatccca ttgtggattt tatcctggga 2580 

ctggagaaag atgatatttt aagcaacccg gcacgcaaaa cgtattataa gacattctct 2640 

tcacccatgc aacggatatt aaccgcttac tgtcagtcag gaggaaacct tctcgtcagc 2700 

gggtcttaca ttggcagcga catgagtaat tcacagggta accgggagtt tacggaaaaa 27 60 

attctgaaat acggcttcca aggttcactc aaagataccc gttcgggaca gatcaccgga 2820 
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ttgggacgca ccctgcaaat cccccgtttg cccaacgaga aggcttatgc agtaacagct 2 880 

cctgattgta tcgttcccgt agactccgcg tttccggtat ttgtctatca acccggacaa 2940 

tacagtgccg gaatcgctta taaaggaaat taccgggtat tcgcaatggg atttccattc 3 00 0 

gaaagcatcg aaagcgaaac agaccgtgcc atagtaatgg cggcaatact aaaattcttc 3 060 
ggagaaaaat aa 3 072 

<210> 1147 
<211> 3075 
<212> DNA 
<213> B.fragilis 



<400> 1147 
tttaaaagaa 
atgaaagtta 
gtcatcatcg 
cctccggtag 
gctgtagcaa 
tcgaacagtt 
ccggacctgg 
gctgaagtta 
cttacattga 
attaatgtac 
agccgttatt 
actgtgcagg 
cttgggcagc 
ctttctactg 
attcgtctgc 
ggcatcaatg 
atggaagtag 
gggctgagct 
gtttataaga 
cagagttggc 
tttggtttca 
cttgccatcg 
atggaggaag 
ggtgcgttga 
agcggaatta 
atttcgactg 
gacaatggca 
aatcataaat 
ggtttcggag 
cttccggtgg 
ctggaacgta 
atagcctatg 
agtgaattga 
attatgtcaa 
actcctcctg 
cgtggcgagg 
tctcagcgca 
tatttcgatg 
tcaacaatga 
atctataagg 
ttgttattcg 
tcatatacta 
cgcggtgagg 
gcccggtatc 
aaaaaggcgg 
ttcctggcag 
gtggctgcct 



actcttcctc 
gtttttttat 
gcatcatcgg 
tgaagatcag 
ctcccatcga 
ccaattccgg 
ccgctgtaga 
ttcaaaacgg 
tgtcgtccga 
ttgatgtaat 
atgccatgca 
atttgcagaa 
aaccggtgaa 
tagaacaatt 
gagatgtggc 
gtaagaatgc 
cgaaaagtgt 
atgaggttcc 
ccttgtttga 
gggctacgtt 
tgctgatttt 
gtattgtggt 
agaagttgcc 
ttgctacctc 
cgggccaact 
ttgtggcgct 
aaaagaagaa 
atgtaatcgc 
tggtgcttat 
aagaccaagg 
cccgggaggt 
tacagaatgt 
ctgttatctt 
acgtccgtca 
ttattcccgg 
ctactttcga 
aagagttgac 
tggaccgtga 
aagcttatac 
tatacattca 
tgaaggcttc 
cgggtcccgg 
cggcacaggg 
atttgcccga 
gaggacagac 
cacagtacga 
tgggggctta 



cctctcgttg 
agatagacct 
actgaccatg 
tgcctcttat 
acaggagatt 
aggcttctcg 
aattcagaac 
aatttcagtt 
tccgaaattt 
ccgccgtatt 
aatctgggca 
tgctttgaag 
ggggctcgat 
cgaaaatatt 
acgagtttca 
tgccgttctg 
aaaagaggcg 
gttcgatatg 
agcattgatc 
gattccgatt 
cggtttctcg 
ggatgatgct 
gccatatgaa 
gctggtactt 
ttaccgccag 
gactctgagt 
cattgttttc 
tatccgtagg 
tggtattttg 
gtacttcaag 
gacggatcgt 
taccggaagc 
gaagccgtgg 
cgacctgagt 
tttgggaacc 
aaatttggtg 
gggactttct 
taaagtgaag 
cggttcggtt 
ggctgaggca 
caacggggct 
cagcattaaa 
gtatagttcc 
taatatcgga 
cggattggta 
aagttggacg 
tctgggagta 



ggagaggggc 
gttttctcga 
ttgcctgtcg 
ccgggagcca 
aacggaacac 
gcaacggtta 
cgggtgaagc 
gaaaagcagg 
gacgaaatct 
ccgggagtag 
gaaccggata 
gatcagaatc 
gtgacaatcc 
gtgattcgtg 
ctggaagcct 
ggaatttata 
atggatgaaa 
acgacttata 
ctggtagtac 
gttgccgttc 
ctgaacatat 
attgtagtgg 
gctaccaaga 
tgtgccgtat 
tttactatta 
ccggtcatgt 
cgtaaaatca 
gtaatcggta 
ttgattcacc 
atagagttgg 
gccatcactt 
agtccccgtg 
gaagaccgca 
gagtatccgg 
tcgggtggtt 
caggcggcgg 
tcttcacttc 
atgctaggtg 
tatgtaaatg 
ccgtatcggg 
atgattccgc 
cgtttcaata 
ggacaggcta 
ctggagtgga 
ttggcgctgg 
gtaccgattg 
tgggtttgcg 
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ttccagatcg gactggtgat gcttgtcggg ttggcagcaa aaaatgccat cctgattgta 2 880 

gagtttgcca aagtgcaggt agaccgtgga ggtgatttaa tacagtctgc cattcatgcc 2 940 

gcccaattgc gttttcgccc catcttgatg acttccctcg cctttgtgct gggtatgctt 3 000 

ccgatggtgc tggcaaccgg tcccggttcg gcaagccgtg ctgctattgg tacaggagtc 3 060 

tttttcggaa tgatc 3 075 

<210> 1148 
<211> 777 
<212> DNA 
<213> B.fragilis 

<400> 1148 

ttattcatga aaatctcgat tgtacagaca gatattatct gggaaaataa acaggaaaat 60 

ctccgtttgc tccgcgaaaa gctatcacct cttcgcggaa caacggagat tgttgtttta 12 0 

ccggagatgt ttacaacagg attcagcatg aacagccggc tattagccga accggtttcc 180 

ggtaccacgc tccggagtct caaaaattat gccatagaat ttcatttgtc attggccggg 2 40 

agtttcattt gtgaagaaca aggttcttat tataaccggg ctttcctgat cactcccgat 3 00 

ggacaggaat tttactatga caaacgccac ctcttccgca tgggacacga agcggaacat 3 60 

ttttcggcag gcagccggaa agtgatcatt ccctacaatg gttggaacat ctgcctgcag 42 0 

gtatgttacg acctccgctt tcctgtctgg agcaggaatg tgaacaatga atatgacctc 480 

cttatatatg tagccagttg gccgactcca cgtattcagg catggaatac attattatgc 5 40 

gcacgtgcca ttgaaaatca atgttatgta tgcggtgtga accgtatagg acaggacggc 600 

aacgggctct gttatccggg gtattccgct ttatatggac ctaaaggaga aaacctggca 660 

ggaactcccg attcggaaga aaaaatacaa accattgaac ttagcctgga agccctcact 72 0 

acttttcgtc ataaattccc ttgctggaaa gatgcagacc cctttctcct ttactaa 777 

<210> 1149 
<211> 339 
<212> DNA 
<213> B.fragilis 

<400> 1149 

acaataaacg aggaacgggg aacaataaat gcgaggagaa tgggaaatat attgaaagat 60 

aaaagtatgg ctttcgcgat acaaatcgta aacctgcata aatatccgaa caaaagaaaa 120 

gcttactctc tatccgacca gattctaata tccggtacag ccatcggagt tctgcaaaaa 180 

gaaacggaat gcgccgaaag caacgccgac tttattcata atatagcatc gccccaaaag 2 40 

aacgtaatga aacccttttc cggcttgaat tgttatttaa aacagaatat ttatccgaaa 3 00 

cagaatattc aagcatgttt gcagacgcaa atgaattaa 33 9 

<210> 1150 
<211> 378 
<212> DNA 
<213> B.fragilis 

<400> 1150 

tggagaacgg atgactttac agtttactat tttatgagcg aaatacaaaa tcaaattaaa 60 

aaatggccgg taacggcaat caaaaaaatc aaaagtacat tcggtagcgc agaaaagttc 12 0 

tacgctaccg tttatcttat agcccgcaac gaacatcatt gccagatgat gggagtggcc 180 

ggagcggaac aacgcttgaa gacgattcat gcctatcagg gtatgattcg ctttatgctt 240 

gatgaagaag gactcaatgg taaggaaatc ctggacacaa tagccggaga gtatctggaa 3 00 

gactttgtga actatcgcga acaagacttc ggaatgacca atgaagaatt tattgccatt 3 60 

atcaaaagaa taggttga 3 78 

<210> 1151 
<211> 1230 
<212> DNA 
<213> B.fragilis 



<400> 1151 
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ctaataaagg ataagttaat tttaataata caaacaagca tgagactgtt ttttacgaga 60 

aaagagctaa aactgaggag aaaaagaaca attgcaggca tcgtctgttt ggctttggtg 120 

gcaggcattt actggattct gacacgacca cataaagtag agccggaagt gccgactgtg 180 

attgtagagc ccgcagaaag ggataatgta gagattttcg gtgagtatgt ggggcgcata 240 

cgtgcacaac aatttgtgga agtgcgggcc cgtgtggagg ggtatctgga aagtatgctc 3 00 

tttgccgagg ggacgtatgt gaataaaaat caggtgcttt ttgtgatcaa tcaggatcaa 3 60 

tatcgtgcaa aggcagataa agcacgggca caactgaaaa aagatgaagc acaggcattg 420 

aaagcaaagc gggatttgga acgtatcaag cctttgtatg cccagaatgc ggccagccag 480 

ttagatctgg acaatgcgga ggcggcctat gaaagtgcgg tggcaaccgt tgccatgagt 540 

gaggcggacc tggcgcaggc agagttggag ttgggttata cattggtccg ttcgccgttg 600 

tccggacaca tcagcgaacg taatgtggat ttggggacac tggtgggacc gggtggaaaa 660 

tcgcttttgg ctacagttgt gaagagtgat acggtgctgg tggacttcag catgactgct 72 0 

ctggattatc tgaagagcaa agaacgtaat atcaatatcg gtcagcagga ctcttcccgt 7 80 

tcctggcagc cgaacatcac cattacctta gcggataata cggtataccc ttataaagga 840 

tatgtggatt ttgccgaacc tcaggtagat ccgcagaccg gtactttttc ggtaagagcg 900 

gagatgccga acccgaaaca ggtattgctt ccgggacagt ttacgaaggt aaagctgctc 960 

ttggatgtac gtgaaggagc catcgttgta ccgcataaag cagtgactat cgagaaaggc 102 0 

ggagcatata tttatgtgat gcgcagagat tctacggcag agaaacggtt cattgagttg 1080 

ggacctgaat ttggtaataa actcgttgtg gagagaggtc tgggtgcagg tgaagaagtg 1140 

gtggtggaag ggtatcacaa gctgacaccg ggaatgaaag tgagagctac cttgccccag 12 00 

ccgtcagctg agaataaaga gactgagtga 123 0 

<210> 1152 
<211> 2946 
<212> DNA 
<213> B.fragilis 

<400> 1152 

aactgcacgt gggtgagcgg gtggatacgg gtttcattac ggatgtattg catagttacg 60 

ggtttgaata cgtggattat gtatacgaac cggggccagt atgccgttcg tggaagtatt 12 0 

atcgacgtgt tttcattttc gtccgagtat ccctatcgta ttgacttttt cggtaatgat 180 

gtggagagta tccgtacatt tgaggtcgat tcacagttgt cgaaagagaa gaaagagagt 2 40 

attgtgattg tgcccgatct tgccgtaacc ggaaaggtta caacttcttt tcttgatttt 3 00 

atcccgaaag ataccacttt ggcgatgcgt gatttccttt ggttgcggga gcgtattcag 3 60 

gttgttcacg atgaatcgct cacaccgcag gctcttgctt ctcaggaggc cgaagagaat 42 0 

ggaggcatta ctttggaagg aaaattgatc gatgggagtg agttcactgt tcgtgcgctc 480 

gatttccggc ggatggagtt tggtaacaag ccgaccggca caccggatgc taccttgaca 540 

tttcatacta cggcacaacc tatttttcat aagaatttcg atttggtggc ggagtctttc 600 

aaagagtatc tgaaccgggg atatgcactt tatatctgta gtgacagtac gaaacagacg 660 

gatcgtatca aagccatttt tgaggatcgg ggagaccgga ttcagtttac ggctgtggag 72 0 

cggactctgc atgaagggtt tgcagacgat accttgaaac tttgtttgtt taccgatcac 780 

cagttgttcg accgtttcca taaatataat ctgaagagtg ataaagcccg ttcgggaaag 840 

gtcgcccttt cgctgaaaga gttgaatcag ttcactcccg gtgattatgt ggtacatacc 900 

gatcatggtg tgggacgttt ctccggtctg gtacgtattc ctaacggaga tacgacacag 960 

gaggtcatga acctggtcta tcagaatgaa gatgtggtat ttgtttctat ccattcgttg 1020 

cataaagttt caaaatataa aggtaaagaa ggagaagccc cccgactgaa caaattgggt 1080 

acgggcgctt gggagaaact gaaggagcgt acaaagccaa agattaaaga tatagcccgt 1140 

gacttgataa aactttattc acaacgtcgt gaagagaaag ggtttgcata cagtcccgat 1200 

agttttttgc aacgggaatt ggaggcttcg ttcatctatg aagatacccc cgatcagagt 1260 

aaggctacgg cggatgtgaa acaggatatg gaacgggata tgcctatgga tcgcctggta 1320 

tgcggagacg ttggcttcgg taagactgaa gtggccatcc gtgccgcatt taaagccgta 1380 

gcagacaata agcaggtagc cgtactggtg cctacaacgg ttttggcata ccaacacttc 1440 

cagacttttc gtgatcgcct gaaaggactt ccctgccggg tagaatatct cagccgtgcc 1500 

cgtacggcgg cacaggcaaa ggcggtaatc aaaggattgg aagctggaga cgtgaatatt 1560 

ctgatcggta cgcaccgtat cttgggaaaa gatgtcaagt tcaaagacct cggactgctg 162 0 

attattgacg aagagcagaa gttcggtgtg tcggtcaagg agaagctgcg gcagatgaag 1680 

gtcaatgtgg atacattgac aatgacggca acccctattc cgcgtacttt gcaattctcg 1740 

ctgatgggag cgcgtgactt gagtgtgatt tcaactccac ctcccaaccg ttatccgata 1800 

cagacggagg tacatacgtt cagtgaagag gtgatagctg atgccattaa ctttgagatg 1860 
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agtcgtaatg ggcaggtttt tctggtaaac aaccgtatag ccaatcttcc ggaactgaaa 1920 

gcaatgattc ttcgtcacat tccggattgc cggatagcca tcggacacgg acagatggag 1980 

ccggcggaat tggaacagat cattttcggc tttgtcaatt atgactacga tgtactgatt 2 040 

gcaaccacta ttatcgagag tggaatcgat ataccgaatg cgaacacgat tattatcaac 2100 

caggcacaaa acttcggatt gagcgatctg caccagatgc gcggacgtgt gggacgtagc 2160 

aataaaaagg cgttctgtta cttgctggct cctccgttgt cttcgttaac ccccgaagcc 222 0 

aaacgtcgcc tgcaggcgat cgagaatttc agtgatctgg gtagtggtat tcatattgcc 22 80 

atgcaggatc tggacattcg cggtgcgggt aatatgctgg gagccgaaca gagtggattt 2 340 

atcgccgatc tgggttatga aacttatcag aagatattgt cggaagctgt gcatgaactg 2400 

aaaacggatg aatttgccga actttatgct gacgaattaa aaggagaagg tgtcattagt 2460 

ggtgaagagt ttgttgaaga atgtcaggtg gaaagcgatc tggaattgct gttaccggct 2520 

aattatgtga cgggtagcag cgaacgtatg ttgctgtatc gggaactgga cggactgact 2 58 0 

ctcgatagag atgtagatgc tttccgttca cgattggaag accgtttcgg ccctattccg 2 640 

cctgagactg aagaattgtt gcgtatagta ccgttaaggc gcttggctgc ccgattggga 27 00 

gtggagaaag tgttcttgaa aggaggacgt atgacactgt tctttgtcaa caatgcagaa 2 7 60 

agcccgtatt atcagagtgc tgctttcggt aagatgatcg actatatgat gaagtatacc 2 820 

cgaagatgtg atttgagaga gcagaacgga cgtcggtcta tgttggtaaa agatattccg 2880 

aatgtggaaa cggctgtcag tgtactactg gaaattgtgg cattaccggt gaaagagaaa 2 940 

gagtaa 2946 

<210> 1153 
<211> 342 
<212> DNA 
<213> B.fragilis 

<400> 1153 

agacctgctc gacctttgta tagatgtgat ttatcacaac aatatgaaag cattcgggtt 60 

gcagcaactc atcaagctct atctcacttg gttgtcaaag caggaagcag aagaagagga 12 0 

ggaggcatga ttaccgtcga tacctgtgga atgacgaact atagcccgtt gattccggcc 180 

ataaaagcga tgtgtaatgc caatcccggt gacaagatgg agattgtaac ggatcaggtg 2 40 

gctgcattcc aggatcttaa ggaatattta tcagaacaag gtatcggatt ccgtgaaata 300 

tatgatggag aacggatgac tttacagttt actattttat ga 342 

<210> 1154 
<211> 1179 
<212> DNA 
<213> B. fragilis 

<400> 1154 

ctcataacac gaatgttaac ccaattaatt aatgcacgta tactcacccc ccaaggatgg 60 

atgaaagacg gttccgtgct tatcagagac aataagattt tagaagtcac aaactgcgat 120 

ctggccgtta tcggagctga acttattgac gtcaaaggta tgtatgtagt ccccggtgga 180 

gtagaaatcc acgtgcatgg tggtggaggc cgcgacttta tggaatgtac ggaagatgct 240 

ttccgggcag cggtccatac tcacatgaaa catggcacaa caagtatctt ccccacactg 300 

tcatcatcta cagtccccat gattcaacaa gctgcagaaa cctgtaccaa gttgatggaa 3 60 

gagaaaaaca gcccaatcct gggactgcac ctcgaaggtc attacctgaa catgaaaatg 420 

gcaggaggac aaattccgga aaacattaaa aatcctgatc cgaatgaata cattccgatc 480 

gtagaacagt atcattgcat caagcgttgg gatgctgctc cggaacttcc gggagccatg 540 

caattcggta aatatattgc tgccaaaggc atactgcctt ccgtagcaca tacacaagcc 600 

gaatttgaag atatccgtac agcttatgaa gccggataca ctcatgcaac ccacttctat 660 

aatgcaatgc ccggcttcca caaacgcaga gagtacaagt acgaaggtac agtcgaaagc 720 

atttatctgt tggatgacat gacagtagaa gtagtagccg acggtattca cgtcccccct 780 

acgatcctga gactcgtata taaaataaaa ggtgtggaaa gaacctgtct gatcaccgat 840 

gcccttgcat gtgccgatag tgatagcaaa gaggctttcg acccgcgcgt aattatcgaa 900 

gacggagtct gcaaactggc cgaccattct gctttggcgg gaagtgtcgc caccatggac 960 

cgcctgatcc gcaccgttgt gcagaaagca gagatcccac tggaagatgc agttcgcatg 1020 

gcttccgaaa ctccggcacg catcatgggg gtatatgatc gcaaaggttc cttgcaaaaa 1080 

ggtaaggatg ccgatattct ggtactggac gaagacctca acgtaagagc cgtatgggcc 1140 

atgggtaagt tggtacctga aacaaatact ctgttttaa 1179 



469 



<210> 1155 
<211> 591 
<212> DNA 
<213> B.fragilis 



<400> 1155 

aatcatcgac gctatcgcga aactgaatta agaatgaaga aagcaatata tagctttatc 60 

tactatcacc tgttggggtg gaaaaccaat gtaacggtac cgaactatga taaatgtgta 12 0 

atctgtgcgg cacctcatac aacgaatatg gacctcttta tcggtaaact gttttatgga 180 

gcgataggcc gtaaaaccag tttcatgatg aaaaaagagt ggtttttctt tcctttagga 240 

atcttgttca aggccgtagg cggcattccc gtaaatcgag gacgcaaaag ctcactggta 3 00 

gaacaaatgg cagaggtctt tgccaaaaga cctaagtttc atcttgcaat cactcccgaa 360 

ggaacccgta aacgcaaccc caactggaaa aaaggattct actacatcgc attgaaagcg 42 0 

caagtcccta ttgtgctgat cggaatcgat tacaatacga aaacagttac ctccaccaaa 480 

gcaatcatgc ccagcggaga cattgaaaag gatatgcgtg aaataaaact ttatttcaaa 540 

gatttcaagg gaaaacatcc cgagaacttc tccattggag acgttgaatg a 591 



<210> 1156 
<211> 1383 
<212> DNA 
<213> B.fragilis 



<400> 1156 

gatatgaaaa gaacattaat acaaaacgct accatagtaa acgaaggacg ttctgtgcgc 60 

ggttcggtag ttatcgaagg ggaaaaaata gccgaagtac ttgaaaaagg acagaaacct 120 

gctatcccct gcgaagaaac aatcaatgcc aacggatgct atctgattcc gggtgtgatc 180 

gacgatcatg tacatttccg tgatccggga ctaacccaca aagccgacat ctctaccgaa 240 

agccgggctg ccgcagctgg aggtgtaacc tctatcatgg acatgcccaa tacaaatccg 3 00 

caaacgacca cactggatgc gctcaatgcc aagttcgatc tgcttgccga aaagtgtagc 3 60 

gttaactatt cgtgctattt cggggcaacc aataataact ataccgagtt cgacaaactg 42 0 

gacaagaacc gtgtatgcgg aattaagctt ttcatgggat cgagtaccgg aaatatgctg 480 

gtagacaaaa tgaacagtct actgaatatt ttcaatggaa ccgatctgct gattgccgct 540 

cactgcgaga atcacgaaac gattaaaaag aatacggaga agtatgtaaa agagtatatt 600 

gaaaaatatc ctcatcaata ttaccatgtt catcatgaga cccttccgat gggttatcat 660 

gctaaaatac gttcgattgc ggcttgttac gaatcgtccg aactggctgt acgcctggca 72 0 

cgcattgcag atgcacgcct gcatatcctg catatctcta cagccagaga actttcactg 780 

tttgacaatg atatcccgtt agaggaaaag agaatcacag cagaagcttg cgtttcacat 840 

ctgttattcg actcttccga ttatccggaa ctcggtgcac gcatcaagtg taatccttct 900 

atcaaaacaa aaaccaaccg ggatgcgctc cgccaggcag tcaactccaa cctgatcgat 960 

gtaatcgcga cagaccatgc cccacacctt ctcaaagaaa aagaaggagg gccgttgaaa 102 0 

gcaatgtccg gtatgcctat gatccagttc tctctggtca gcatgctcga actggtgaac 1080 

gaaggtatct ttacgataga aaaggttgtc gaaaagatgt gtcacgcccc tgcacaaata 1140 

tacaatattc acaaccgcgg ctttatccgc cccggttatc aggccgatct cgtattggtt 1200 

cgtccggatg cattatggac ggtaagcgcg gatcagattt taagcaaatg cggatggagc 1260 

ccgcttgaag gacgtacgtt cgagtggaaa gtagaaaaga catttgccaa cggacatcta 1320 

ttgtatactg acggacaggt agacgaaacc tatcgcggac aagagatcta ttttgaacga 13 80 

tga 1383 



<210> 1157 
<211> 789 
<212> DNA 
<213> B.fragilis 



<400> 1157 

agaagagtga ggagtttgga actcttcact cttcttttgt ttaaacagaa aagatacaaa 60 

ttatatatga aacaagaaaa attcttccgt cttcttccta tagagggagc ttataatatt 120 

cgtgatctgg gaggttatcc gacatcagac cataaacatg taaaatggaa aacattcatc 180 

cgttcgggcg atcttgacaa actgacagaa tccgatctgg actatcttac ctccttgcac 240 
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atccgaaccg acatcgactt caggagcatg 
ccctcaactg ttacacaata tattccctta 
gcacacttca acctgaacaa tataccggga 
caaaatgctc aggatactta tcgggaattc 
cctcttttat ttcactgctc agcgggaaaa 
ctgggagcat tgggtgtcga cagggaagtg 
tatataaaag ggaaatatga tgcaatcgta 



accgaataa 



caggaaaaaa 


aagcagcggc 


agacaaaatt 


300 


tctatcgaag 


caggcgacat 


gaccgacatg 


360 


atactcgaac 


aggcatacgt 


ttatatcatc 


420 


ttccggattg 


tttcggaaga 


acggaatact 


480 




crs at - t" ciccnc 


a erect* *r art a 


540 


ataatggaag 


attacatgct 


gtctgccgaa 


600 


caagctcatc 


ccggatttgc 


ccctctcacc 


660 


ttccaaacca 


ttgacactga 


ctatcaaggt 


720 


gtagataccc 


acagattgag 


aatgctgtat 


780 
789 



<210> 1158 
<211> 486 
<212> DNA 
<213> B.fragilis 



<400> 1158 

gttgaacgca tggatatatt cctgattatt ctgggtagta tctgcctgct tgtcggatta 60 

gccggatgta tcgtccctat gcttcccggg cctcctgtct cctatctggc actggtattt 12 0 

ctgcatttca ccgataaggt ttcttttacc attccacaac tattcttctg gttgttcatt 180 

gtggtactga tacaaatact cgactatttc attccgatgt tcggtgtaaa aagactcgga 240 

ggtaccccat ggggtaaatg gggttgcatc atcggtacct ttgccggcat ttttctgttc 300 

gccccctggg gcgtatttat cggcccgttt gtgggcgcag ttgtaggcga attattgggt 360 

ggaaaagaaa cgaaatacgc gctgaaagca ggattcggag catttgcagg gttcctgttg 42 0 

ggcaccgtac tgaaggtagc tgtatgcggt tggttcatct tctgctttat ccgtgccctc 480 

gtatag 486 



<210> 1159 
<211> 792 
<212> DNA 
<213> B. fragilis 



<400> 1159 

cggaaaagca ttacatttgt 



gccattgtaa agaccttgca gcaggagttt 
ggtaaactgg gattgggaac agcctacatc 
tacgaataca tctttgagat ggatgccgat 
ctctatgagg cttgtgccgt tcagggaggc 
ggagtgaatg tagtaaattg gccgatggga 
tatgtacgaa tcgttaccgg actgccgata 
cgtcgccaag tactcgaaac catcgatctc 
cagatagaaa tgaaatttac ggcctacaaa 
atctttatca accgcgaact gggtacttca 
gtattcggtg tcatcaaact gaaagtgaac 
aaaatgaatt aa 



agacacaata 


tgcagacatc 


ggacagcatc 


60 


aacatagaga 


acatcatccg 


cgccgttttc 


120 


atcgaggacg 


gttctcctga 


cgggaccgcc 


180 


cccgaccgcc 


ttttcatgat 


agaacgtaaa 


240 


accggattca 


agtgggcact 


ggaacattca 


300 


ttcagtcata 


atcccaacga 


tctgccacgg 


360 


gatgtagcta 


tcggctctcg 


atacgtaagc 


420 


cgtgtgctga 


tgtcttattt 


tgcatctaaa 


480 


cacgatacga 


ctgccggatt 


taaatgttac 


540 


gaccacatac 


gttttaaggg 


gtatgctttt 


600 


tgcggattca 


agattatcga 


ggtaccggtt 


660 


aaaatgaaca 


gtagtatctt 


tggtgaagcg 


720 


agctggtttc 


acacattccc 


ccagaaaaca 


780 
792 



<210> 1160 
<211> 2070 
<212> DNA 
<213> B.fragilis 



<400> 1160 

ttgattgttt ttatatatat ttgttcagag tttccaataa cgacaaagac tcagtcaaac 60 

ttaattaata acaaacttat gaagacaaat cttagttctc agattactct caacagggtc 12 0 

tcccccaggt attacagacc agagaatgca ttcgagagat cggtattgac ccgattagag 180 

aaaattccta cagacatcta tgaatctgta gaagaaggtg caaatcatat cgcttgcgaa 2 40 

atagcacagg ttattcgtga taaacagaaa gcaggacgtt tctgcgtact ggcattgccg 3 00 
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ggtggaaatt ctccgcgcag cgtatatgcc gaattaattc gcatgcacaa agaagaggga 3 60 

ctcagtttcc gtaacgtaat tgtattcaac atgtacgaat actatccgtt gtctcaagat 420 

gcaatcaaca gtaatttcaa tgcgttgaaa gagatgttcc tcgatcatgt agatatcgat 480 

aaacagaata tctttactcc ggacggtacg attgccaaag ataccatctt tgaatactgc 540 

cgcctgtacg aacagcgcat cgaaagcttc ggcggtatcg atatcgctct gctaggcatc 600 

ggccgtgtag gtaatatagc cttcaacgaa ccggggtcac gcctgaactc caccacccga 660 

ctgattttgc tggatagcgg ttcacgcaac gaagcatcca agatcttcgg caccatcgac 72 0 

aacaccccta tcagttctat tacgatgggt gtagccacaa ttcttgctgc taagaaaatc 7 80 

tatttgttgg catggggtga agaaaaagcc cacatggtga aagagtgtgt agaaggtaac 840 

gtaacagata ccattccggc atcttactta cagacccaca acaatgcaca tgtagctatc 9 00 

gacctgtcag cagcttccaa cctgacacgc attcaacgtc catggctggt cacttcctgc 960 

gaatggaatg acaaactgat ccgtagcgca atcgtgtggc tgtgccaatt gaccggcaaa 102 0 

ccaatcctga aactgaccaa taaagattac aacgaaaatg gcttgagcga gctgcttgcc 1080 

ctcttcggat ctgcttataa cgtaaatatc aagatattca acgacctgca gcacaccatt 1140 

accggatggc cggggggtaa accgaaagca gacgacacat atcgtccgga acgtgcaaaa 1200 

ccatatccga aacgagtagt cgtattttct ccgcatcccg atgacgatgt gatctctatg 12 60 

ggtggtacga tccgccgcct ggtagaacag aaacatgaag tccatgtagc ttaccaaact 132 0 

tcgggaaaca ttgccgtagg cgatgaagaa gtagttcgct tcatgcattt catcaacgga 13 80 

ttcaaccaaa tctttataaa cagtgaagat caggtgatca gtgaaaagta cgccgaaatc 1440 

cgtaaattcc tgaaagacaa aaaagacggc gatatggata cacgcgatat cctgaccatc 150 0 

aaaggtctga tccgccgtgg cgaagcgcgc acagcttgta cttacaacaa cattccgctg 1560 

gaacgttgtc acttcctgga cctccccttc tacgaaaccg gtaaaatcca gaagaatccg 162 0 

atcagtgaag ccgacgtaga aattgtccgc aacctgctcc gtgaagtgaa gcctcatcag 1680 

atctttgtag ccggtgacct tgccgacccg cacggaacac accgtgtatg tacagatgcc 1740 

gtatttgctg ccgtagacct tgaaaaggaa gaaggagccg aatggttgaa agactgccgt 1800 

atctggatgt atcgtggcgc atgggccgaa tgggaaattg aaaacatcga aatggctgta 1860 

cctatcagtc ctgaagaatt acgtgcaaaa cgtaactcta tcctgaagca tcagtcacag 1920 

atggaaagtg ctccattcct gggcaacgac gaacgcttgt tctggcagcg tagcgaggat 1980 

cgtaaccgag gtactgccgc tctttacgac agcttgggac tggcttctta cgaagcaatg 2 040 

gaagcattcg tagaatacat cccactataa 2 07 0 

<210> 1161 
<211> 615 
<212> DNA 
<213> B.fragilis 

<400> 1161 

tgggctaaaa ttgcatttat gacaattact gaactgcaac atcaatatgc ggggcatccg 60 

aatgtagagg ccctgaataa actgttgggg gagcccgcag tcagacatat ctattgcggc 120 

ggcttatatg cttccgctgc ttccttgttc gcttcggcgt tggttgaaaa gagtccttgc 180 

ccgtttgttt ttatattagg tgacctggaa gaagccggtt atttttatca tgatcttacc 240 

caagtgttgg gtacagagcg cattttgttt ttcccttctt cgtttcgtcg ttcggtgaag 300 

tacggacaga aagatgctgc taacgagatt ttgcgtaccg aagtgctcag tcgcttgcag 3 60 

aagggtgaag agggattgtg tatcgtcact tatcccgatg cattagccga aaaggtcgtt 420 

tcacggcagg agttgagcga gaataccctg aaactgcacg tgggtgagcg ggtggatacg 480 

ggtttcatta cggatgtatt gcatagttac gggtttgaat acgtggatta tgtatacgaa 540 

ccggggccag tatgccgttc gtggaagtat tatcgacgtg ttttcatttt cgtccgagta 600 

tccctatcgt attga 615 

<210> 1162 
<211> 198 
<212> DNA 
<213> B.fragilis 

<400> 1162 

ttatataaaa ttggttcatg tagtacgcta caaatgatta tccgggcaca aataatcaga 60 

ttagaagact ctattatgaa attagaaaaa tattatagaa cgaaaacagc cggattagag 120 

gggcaaccga ataagaagaa gaataagagg atggcagaca ttgtggcagt taatgttaaa 180 

tatgtcttgt ttggatag 198 
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<210> 1163 
<211> 1929 
<212> DNA 
<213> B.fragilis 

<400> 1163 

ctgtatgcgg ttggttcatc ttctgcttta tccgtgccct cgtatagcga aattatccgt 60 

atctttgtcc ccatgaaaga atatcgactt accgattggt tacccactac caagaaagaa 12 0 

gtagagcttc gcggctggga cgaactggat gttatcctct ttagcggcga tgcttatgtg 180 

gaccatcctt cattcggagc cgccgttatc ggtcgtatcc ttgaagccga aggcctgcgt 240 

gtagccattg tgccccaacc caactggcgt gacgacttgc gtgactttcg caagctggga 3 00 

cgtccccgac tctttttcgg catcagtgca ggttgcatgg actccatggt gaacaaatat 3 60 

acagctaaca aacgcttacg tagcgatgac gcttacaccc cggacggacg tcccgatatg 420 

cgaccggaat atccctcgat cgtatacacc caaattctga aaaaactcta tcccgatgtt 480 

cccgttgttt tgggaggtat tgaagcaagt atgcgccgcc tcagccatta tgactattgg 540 

caggatcggt taaagaaaag tatactttgt gaaagcggtg ccgacatgct gatttacggc 600 

atgggagaaa agcctatttg tgagttggtc cgccgattga cagccctgtg tgataatcag 660 

gatggagtga tttcatcatc cgacattcat tctccggcat tatcctctat cccgcagacg 72 0 

gcttatctga ccaggaaata tgaatccgac gagaatgata tcaccctcta ttcgcatgaa 7 80 

gaatgtttgg ctgataaaaa gaaacaggca accaacttcc gtcatataga agaggaaagc 840 

aataaatacg cggcagcccg aatcgtacag gctgtcgatg gtaaaacagt ggttgtaaat 9 00 

ccaccctacc ctcctatgac agagaaagag ctggaccgtt cgttcgatct cccttacact 960 

cgtttgcctc atcccaaata caaaggcaaa cgcattccgg cttatgatat gattaaattt 102 0 

tccgttaata tccaccgggg atgttttggc ggatgcgctt tttgtaccat ctccgcccat 1080 

cagggaaaat ttatagtcag ccgaagcaag gaaagcattc tgaaagaagt aaaagaggta 1140 

gttcaattgc ccgatttcaa aggaaatctg agcgatttag gaggtccttc tgccaatatg 1200 

tataaaatgg gcgggaaaga tctctccctt tgcaaacgtt gtaaacgccc ctcttgcatt 1260 

catcccaaag tgtgtccgaa cctgaatacg gatcaccgtc cgctattgga tatttactat 1320 

gcagtggact ctttacctga gatcaaacga agtttcatcg gaagtggagt gcgatacgac 13 80 

ttattgctcc atcaaagcaa ggatgctacc gtcaacaaaa tcacagcaga atatactcgc 1440 

gaactaatag cccgccacgt cagcgggcgc ctgaaagttg caccggaaca taccagtgac 1500 

cgggtactga gtatcatgcg taaacccgct ttcagccaat tcggagaatt taaaaagata 1560 

ttcgacagaa tcaaccggga acttggcttg cgccaacaat tgatccctta tttcatctcc 162 0 

agtcatccgg gctgtaaaga agaagatatg gcggaactgg cagtcatcac caaacaactg 1680 

gacttccatc tggaacaagt gcaggatttc acccctaccc ctatgaccgt agccaccgaa 1740 

gcttggtata caggctttca tccgtataca ctcgaaccgg tattcagtgc caagactcaa 1800 

cgggaaaagt tggcacaaag acaatttttc ttttggtata aaccggaaga acgacggaat 1860 

atcatcaatg aattgcgccg catcggacgg gcggacctga tagacaaact atacggaaag 1920 

aggaaatga 1929 

<210> 1164 
<211> 677 
<212> DNA 
<213> B.fragilis 

<400> 1164 

ttaccagtat taaaacagtc aaatatggaa aagcataacc cgtcatcatt caccgttgat 60 

agctcctccc ccgctcatcg ttcatcattc gccgttcatg atttaacccc ttatatcaac 12 0 

tggatttact tcttccacgc ctggggattc caaccgcgtt atgctgccat tgccaatatt 180 

cacggatgtg actcttgccg tgctatctgg ctgaccactt tcccggaaga agaacggagc 240 

aaggcttcag aagcgatgca actttataaa gaagctaacc ggatgttgaa cgaactggac 3 00 

agaaattttg aggtaaaaac tatttttaag ctctgtcctg ccaatgcgga tggagataat 3 60 

ctgattataa acggtatcac cttcccattg ctccggcagc aagtcaagaa gaaagaaaac 42 0 

gaaccgttct tatgcctcag tgatttcgta cgcccgctat cttcaggcat caccgatgtg 480 

gtaggagctt ttgcctcatc catcgatgcg gacatggaag gactttatga gaaagacccc 540 

tataagcatc ttttggtaca aacgctatcc gaccgcctgg ccgaggccgc taccgaaaag 600 

atgcacgagt acgtccgcaa agaagcctgg ggatatgcca aggatgaaaa tctttccata 660 

cccgatgtct tcaacac 677 
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<210> 1165 
<211> 408 
<212> DNA 
<213> B.fragilis 



<400> 1165 

gcggttggct cccgccgtca gtgtctcccc gtatcgaagg taacactcca tcggtatatc 60 

cactttgcct ttatcgagga tgcagctttg cggacgatta cagaggaaca acatgccgtg 12 0 

ggtggaaaca gcaatgcgta tcaccggatt aatgtaggca ttcttatagt tgatagcttc 180 

cacagccaga ctttttccga ttacatcacc acgggtattg acgatgggga catattccgt 240 

atgcgccatt acatgattaa agtaacggat acctatttga ttgagcagga tgctgagaat 3 00 

aaatatagta ggcggcaata cgtgataaag tacaagtatg gaagtccggg tgaggggatg 3 60 

tgctaccaaa accgtcaggc tgatgacagc aaaatggaga atacctaa 40 8 



<210> 1166 
<211> 792 
<212> DNA 
<213> B.fragilis 



<400> 1166 

tcactaccat ctttaagaaa taataaaaaa aacaaaaaga tagggaggat ggtttcagcc 60 

aacccctata atcgttacat ttgctttcaa attagaagat tacgatatat ggaaaccatg 12 0 

ttcgacacct tgttgcaact cccccttttt cagggacttt gtcatgagga tttcactaat 180 

atattagaaa aagtaaaact tcacttcact cgtcacaaac ccggagaacc attgataaaa 240 

agtggtgagg tctgtgatca gttacttttc ttgctaaaag gcaggctctc ttccgtcacc 3 00 

gtatcggaag acgacacgct gactgttatt gaatatttcg aagctcctgc cgtattagaa 3 60 

ccttactcca tgtttggaat gaatacccgg tatatatctt cctatattcc gcataatgaa 42 0 

gaagcgcaaa tggtaagtat cagtaaatcg tttgtcatgg gcgagttgtt caaatatgat 480 

atttttcgtc ttaactatat gaacatcgtc agcaatcgtg cgcaaaatct ctatacacgc 540 

ttatgggata aagctcccaa agacattgaa gacaaaataa tccgtttcat tttgggacac 600 

attgagagaa tgacaggtga gaagctgttc aaggtaaaaa tggatgattt ggcccgcatg 660 

ttggacgaca cccgcctgaa tgtatcaaaa gctctcaacg gactgcaaga attaaatttg 72 0 

ttggaacttc accggaaaga aatccgcata cccgatttat cactcctgac agaatggaac 7 80 

gagaaacgat aa 792 



<210> 1167 
<211> 1254 
<212> DNA 
<213> B.fragilis 



<400> 1167 
ctcatttggt 
gaaccctata 
aagtatgacg 
gatatggatt 
gaagtattgg 
aaagaacgtc 
cgtggattgg 
cctcctgttt 
agtccgttat 
gtaaagggct 
actcgggagg 
tcggatgaaa 
gtatcggaaa 
atgccgggac 
cagacttata 
gctgccgcgt 
aatattgatt 



ttcattatga 
tgaaatataa 
cagtatcgga 
tccgtactcc 
ggtatacttt 
acgggtggaa 
cttttgccct 
atcatccttt 
tgctgaaaga 
gtaaaatgtt 
agttggccga 
ttcatgccga 
aggctcgacg 
ttgccagttc 
tggaagccag 
atagcaacgg 
ttacagatga 



aacagatgaa 
ctttgatgaa 
acgttgggga 
tccttttgtg 
cgcttgcgaa 
tgttacccgg 
gcaatgtttt 
ctttctggtg 
cgggcaatat 
gattttgagt 
gatagcagag 
tctgacattg 
gaattcactt 
gtactgcatt 
tgaatttagt 
aacggagtgg 
atatttaaaa 



ttgtacattt 
gtgatagagc 
cggagcgacc 
atagaggcta 
gcttggtata 
gagatgctga 
acggctccgg 
acagaacaca 
cagattgatt 
aatcctcata 
atttgttttg 
ccgggataca 
gtgtttatgt 
atagaagatg 
gaaggccatt 
ctggatcagg 
acacatattc 



gcaggctgaa 
gccggggaac 
tgcttccgat 
tccgtcggag 
catccattat 
cgtttacgcc 
gggacaaagt 
accacaggga 
ttgaacgctt 
atccgggagg 
ataatcaagt 
cgcatcctac 
ctcccagtaa 
aggccattcg 
tatttgcata 
cattggaata 
ccgcaatccg 



cctaaaaata 
agactcggtg 
gtgggtggcc 
attggatcat 
caattggcag 
gggtattgtg 
gatggtgatg 
ggtagtctac 
ccgtgctgat 
gcgtgtctgg 
attggtcatc 
gtttgcattg 
agctttcaac 
tcatcgtttt 
tcttggagtg 
tattcaggag 
gatgattcgt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 
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cctcaagctt cttatctgat ttttctggac tgtcgcggta tgggtgtttc gcagaaagaa 1080 

ttagttgact tttttgttga cggtgcacat ctggcattaa acgatggcgc tatgttcggt 1140 

aaggaaggcg agggatttat gcggcttaat gtggcttgtc cgcggagtgt gttaagacag 12 00 

gcactggatc agataaagga ggcatacgag ttgaagcatg acacaatagc ataa 1254 

<210> 1168 
<211> 2589 
<212> DNA 
<213> B.fragilis 

<400> 1168 

tccccgataa ccttggactt cttgtctttt attactaact ttgttatcac tataaaccta 60 

acaaccagtg caattatgaa gaaaggattc aaaattacag caatcgtcat cggcgtcatt 120 

ctgatactga tgtttctcct tccctttgcc ttccgtggca agattgaagg tatcgtaaaa 180 

tcggaaggca acaaaatgct taacggtcat tttgatttta gtagccttga tatcagctta 240 

ttccgcaatt tccccaaagc ttccgtcacc ctgaatgatt tttggctgaa gggaaccgga 3 00 

gaatttgaaa acgacacatt ggtgaaggcc ggtgaggtaa cagcggctat taacctgttt 3 60 

tcgctttttg gagatgacgg atacgatgta tccaaagtat ctgttgaaaa cacccgatta 42 0 

cacgccatcg tacttcctga cggaaaagct aactgggata tcatgaaacc cgattcatcc 480 

actgccagcg aaacccaaga aagtggtgaa tcttctactt tccgaattaa actccagcgt 540 

tttgtcatca aaaacatgaa tgtggtatat gacgaccggc agtccgcgat gtacgccgac 600 

atacacaatt tcaatgcgct ttgttcgggt gatctgggta gcgaccagac tttactcagc 660 

ctggaagccg aaactgaagc cctgacttat aaaatgaacg gtatcccttt cctctcacaa 72 0 

gctaacgtct acgccaagat ggatgtagat gccgatctgg cacacaacaa atttacactg 7 80 

aaaaagaacg aattccgtct gaacgccatt aaagccggca ttgacggatg gatagagttg 840 

aaagaccctg ctatcgacat ggatttgaaa ctcaacacca gcgaaatagg attcaaagaa 900 

atcttgtcac tgataccggc catttattcc aaagagttca agaatctgaa aacagatggt 960 

acagcaactc ttgaagctac agcgaaagga atactgcaag gtgacacggt tccacagttc 102 0 

gatgtccgac tggctgtaaa aaacgccatg ttccgatatc catccctgcc ggcgggagtc 1080 

gatcagatta acatcgacgc acaagttcgg aatccgggag gtaacattga cctgaccgag 1140 

ataagcatac atcctttcag tttccgactg gcggaaaatc cgtttagtct gacagccgac 1200 

ataaagacac ctgtcagcga cccggacttt acagccgaag ccaaaggggt acttaatctg 12 60 

ggcatgatca agcaagtcta tccgctggat gatatggagc tgaatggtac tgttcgtgcg 1320 

gatatgacaa tggccggaca cttgtcatac atcgaaaagg aacaatacga tcgtttctcc 1380 

gcttcgggaa ccattgccct cagtgatatg aacttgaaaa tgaaagagat gccggatata 1440 

gaaataaaaa aatccctgtt cacattcact ccgaaatatc tgcaactcag tgaaacgaca 1500 

gtagccatcg gaaagaatga tcttactgcc gactgccggt tcgagaacta tatgggctat 1560 

gctcttaaag ggggcacact gaaaggtacc ctgaatgtcc ggtcgaacca tctgaatcta 162 0 

. aacgatttta tgacggccac aactgacagt gccgcccaaa catcacaagc atcttcgacc 1680 

gaagaaaccg ccagtatgat cgaagttccg cagaatatcg acttccagat ggatgccggc 1740 

ctgaaagaag tgttatttga caagatgact tttacaaaca tgaatggcaa acttattgtg 1800 

aaagacggaa aagtcgatat gacaaatctg tcaatgaaca ccatgggagg aagtgtcgtt 1860 

atgaacggat actattcgac cgccgatccg aagaagccgg aaatgaacgc aggattccgt 192 0 

atggaaaata tcggttttgc acaggcttac aaagcactgg atatggtgca gcaaatggct 1980 

cctatctttg aaaacctgaa aggcaacttt tcgggtaata tgcatatccg gactttactc 2040 

gacaaccaga tgagccctgt catggatacg atgcaaggaa acggaagcct ttcgacccaa 2100 

gatcttagcc tcagcggcgt aaaagtaatc gaccagatag ccgaagccgt aaagaagcct 2160 

gaactcaaag agatgaaggt gaaagatatg gcactggatt tcaccatcaa agacggaaga 222 0 

gtatctacca agccgttcga tatcaaactg ggtgactatg tcatgaatct ttccggaagc 2280 

acaggactcg accaaaccat cgactattcg ggaaagatca aattaccggc atcggcaggt 2340 

gatatcgcca aactgactac cctcgacctg aaaataggag gtaccttctc ctcacccaaa 2400 

gtatcactgg atacgaaaag catgaccaat caggcagtag aagcggtgac agacaaggca 2460 

atcagcgaaa tagggaaaaa gctcgggctg gattcggcta caacggccaa caaagactcc 252 0 

gtgaaggaga aggtaaaaga gaaagccgta gaaaaggcac ttgattttct taaaaagaaa 2580 

ataaaataa 2589 



<210> 1169 
<211> 1107 
<212> DNA 
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<213> B.fragilis 
<400> 1169 

atggaaaaaa cgaagcggga tagcagacat attgatcctt tgaaagacga agaaaagttc 60 

ttccagtggt taaaattcaa gagcgttcgg atgaatcaag aacacctctc gaaagatgac 120 

tatgaacgcc tgcgtcagcg aatccatgtt tcgctgcgta tgttacgtcg gaaaagaacg 180 

gtcaggaggg tagcttatta tggtgtatcc gtatgcgtca ttattgcatt aggaattgtt 240 

gcttatctga atcattatga gagagtggaa cctgtgccgt ctgttgtcga aaaaaaagca 3 00 

gagattgtct ggcaaccttt gaagtcggaa gatattcgtc tggttagcgg tgatagcata 3 60 

acttcgttcc ggcaaaacgt gcaattgctc ttgagcaagg atggatcagc tatggtatat 42 0 

catcctaata gcgggcagaa gcgaatcaga atggaacagg acgaggttaa tgaactggtg 480 

gttccttatg gtaaacgatc gaaggttaaa ctggaagatg gaacggaaat ctggctgaat 540 

tccggttcgg tacttaagtt tcctactcat ttctcgggag agaaacggga agtaagcttg 600 

aagggtgaaa tgtatgccga agttacggca gatagtaaaa aaccgtttat cgtgcatacg 660 

gcccattttg atatacaggt atatggaacg cgtttcaata tatcggctta tgaagatgaa 72 0 

ccgactcctt cgtgtgtact tgtagatgga atagtcggat tccgtccgga atccggtcct 780 

gaaatacgca tgaagacaaa cgaaaaagtg ctttatgatg gcaagcgatt tgaaaaaaga 840 

aaagtgagtg cgtccagata tacttgttgg aaagaaggct atctggaatt ggatgatgcc 900 

aatatcatgg atgtactgaa ccggataggg agatattata atctatcgtt tagcttcggc 960 

gataagaagc ggctgaccgg taggaagtgt tcaggaaaaa tttatttatc cgataatata 102 0 

gataatgtat tgacaactat atcattgctg tattctactg attacaggaa agaagaacgg 1080 

actattttta tatctgagaa cccataa 1107 

<210> 1170 
<211> 942 
<212> DNA 
<213> B.fragilis 

<400> 1170 

tattctatga ctaaaccggc tccgacgcct ttatataacg aatttacttt ctttctgaaa 60 

aaatactttc cctataaggt acagaagata tctcttaatg cgggttttac atgtcctaac 12 0 

cgagacggaa cgaagggatt gggaggctgt acgtactgta ataaccaaac attcaatccc 180 

gagtattgta aaacggagaa atccgtcacc cggcaacttg aggaaggaaa gcaattcttt 240 

gcccataaat atccggatat gaaatatctg gcttattttc aggcttatac caatacctat 3 00 

gccgagcttg aaggactgaa agggaaatat gaagaggctt tgagtgtgga cggggtggtg 3 60 

ggactggtca tcggcacacg tccggactgc atgcctgatc ccctgttgcg atatctggaa 42 0 

gaactgaata agcacacttt ccttttggta gaatatggga ttgaaactac ccgggatgtt 480 

actttgaaac gtatcaaccg tgggcatacc tatgccgata cggtagaaac tgtcaaccgg 540 

acggctgctt gcgggattct gaccggagga cacgtcatcc tcggtcttcc gggagagacc 600 

catgacgaaa ttatcgctca ggcggccgaa ttgtcccgtt tgccattgac cactcttaaa 660 

atgcatcagt tgcagttgat acgtggtacg aaaatggcac gtgagtttga gtgccgcccc 720 

gaggattttc atctctttag tgtggacgag tatatcgatc tggtaatcga ctatgtggaa 780 

cacctgcgcc ccgatctgat acttgagcga tttgtttcac aatcaccgaa agaacttctg 840 

attgcacccg attgggggct gaagaattat gaatttactg cccgcgtgca aaaaagaatg 900 

aaagaaaggg gtgcttatca gggaaaggca tacctggttt ag 942 

<210> 1171 
<211> 879 
<212> DNA 
<213> B.fragilis 

<400> 1171 

aaacagtact atatgggaaa cgataaacgg gtgcgcaaac ccgaatggct taaaatcagc 60 

attggagcca atgaacgcta taccgagacc aaacgcattg tcgaatcgca ctgcctgcac 120 

accatctgta gcagtgggcg ttgccccaac atgggcgaat gctggggaaa agggacagct 180 

acgttcatga ttgccggtga catctgtacc cgcagttgta aattctgtaa tacccagacc 240 

ggacgacctc tgcctttgga tcccgacgaa ccggcccacg ttgcagaatc gattgccctg 3 00 

atgaaactct cacacgcagt tatcacatct gtagaccgcg atgatctgcc tgatttgggt 3 60 

gcagcccatt gggcacaaac aattcgtgaa atcaaacgac taaatccgga aacaactacc 42 0 
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gaagtactga ttcccgattt ccaaggacgc aaagagctta ttgaccaagt gataaaagcc 480 

tgtcccgaaa tcatctccca taacatggag actgtgaaac gcatcagtcc acaggtgcgt 540 

agtgccgcca actatcacac cagcctggaa gtgatccgcc aaatagccga aagcggtatt 600 

acggcaaaat cgggcattat ggttggtctg ggtgaaactc ctgctgaagt agaagagtta 660 

atggacgacc tgatttccgt cggttgcaaa atcctcacca tcgggcaata tctgcagcca 720 

acccacaaac acttcccggt tgcagcatac attactccgg aacagtttgc tgtttataaa 780 

gaaacaggac tgaagaaagg gttcgaacaa gtggaaagcg caccattagt acgctcctcg 840 

tatcatgcag aaaaacatat ccgatttaat aataagtag 87 9 

<210> 1172 
<211> 450 
<212> DNA 
<213> B.fragilis 

<400> 1172 

caactggaac aaagcatgaa aagagaacgt atagtaagaa ctgaaaacat tgatagacag 6 0 

agaataaaat atagagtatt gtttggtttt attttcatta tgtgctgtaa cggggcagct 12 0 

ttaccgtcgg aggacggcaa gatgccggac tggaaatttt cttttagtct gaaacgtata 180 

ccgatgatcc ggatttttga tgaaatcgag caaaaaagtg attttgtctt tgcatggtcc 240 

tgcgatatcg acaacgagat tcatgaggaa atcagtattt gtgttacgga agaacccatt 3 00 

caaaaggtaa tggaaaaggt tttgaaagga tcgggacttg tttatcagag actcgacagg 3 60 

caaattgttg tttatcgttt gctgggacac aatgcctgta gagtagattc tgtgagagtg 42 0 

atgactaata tggaacagaa tgatcgatag 450 

<210> 1173 
<211> 1095 
<212> DNA 
<213> B. fragilis 

<400> 1173 

gtctgcgtgc aatggctact tacgaaaagg ggctggacga atataaagac gaaaacggta 60 

atatcattta cggttaaggt aatgaaaata cggaaagacg atattcttct tattttgctg 120 

agtctgcttt tcgcagactg ccgggtgggg aaagcggaac ctgtggcgga ccctatggaa 180 

gaagagggaa tttccgtagt cagatatgat aagctgttgg acgaatacgt tcgtttcaac 2 40 

agcttttctg ctttacaaaa aatgaacctg gagtatgcct tgcccaccaa actgttgatt 3 00 

gaagatgtat tggctattgg ccaggtgagc gatgaccata ttttccagcg attgaaaact 3 60 

ttctattcgg atacaacctt ggtccgtctt atagaagatg tggaggccaa atacccggaa 42 0 

ttggaatcgg ttgaaaaaaa tctgaccaaa gggttcggga aattacaaaa ggagattccg 480 

gatatcatga ttcctatgat atatacgcag atctcggcat tcaatgaatc cattgttctg 540 

tctgacagcg tgttgggtat tagtcttgac aagtatatgg gtgaagacta tccgctttac 60 0 

aagcgtttct attacaacta tcagcggcgt actatgcgtc ctgaccggat cgtccccgat 660 

tgtttggtgt tctatctgat gagccagtat ccttttccga tggattactc ccgtacatta 720 

ctcgatgtaa tgatgcatta tggtaaaatc aattatgtgg tacaacatct gttggactat 7 80 

tcctcatcgg aagaagcgtt gggatattcg gatttagaaa gggaatggtg taaagagaac 840 

caacagcaga tgtggagata tattcttgag caagatcatt tgcatgctac ggatccgatg 90 0 

gtggtacgtc aatatacccg tccggctcct ttcactaaca ctttaggcga gaatgcgcct 9 60 

tcgatggtag gtacctggat cggtacgaaa atcatcactt cgtatatgaa acatcataag 102 0 

aaaacaactt tacggcaatt gcttgaaatg agcgactatg aacgtatgtt cacggaatcg 1080 

cgttttaatc cgtaa 1095 

<210> 1174 
<211> 258 
<212> DNA 
<213> B.fragilis 

<400> 1174 

tacaggcata aaaataaaga agaggatagg ccggaagaac ttttggctta tcctcttctc 60 

tttatcagtc tgacacgaaa gtctgttgag gaactaatct ttcagaccgt ctttgatccc 12 0 

ttccattgct ttctcacctt ttttagcagc cttttgcgca ccttctttta cgtctttggc 180 
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tgcatctttg gctgtatctt tgactttctc aaatgcatct ttggtgcttt ctttcacgtc 240 

atctcctatt ttctttaa 258 

<210> 1175 
<211> 2712 
<212> DNA 
<213> B. f ragilis 

<400> 1175 

tgtatgacaa aaaaaatcaa cctgtttcca agccttatac ggtttcggga aaccaatcgc 60 

ctaaaaatgg caattgctgc atctatcatg ctatggtgta tggcacccca acaagcagtt 12 0 

gcagatacgt atgaaaaaca cgaagttgcc agtattcagc agcaaaaggt aaaagcgaac 180 

ggtactgtag tagatcagac cggcgaacct ctaatcggcg tttctgtaaa agtaaaagac 240 

gcgcctaatg gaacaatcac caatttagat ggtaaattct ccatcgatgt agccaaaggt 3 00 

gctacacttg aaatatccta tgtgggatat aaaacggtca ttgtaaaagc cgaatcaacc 3 60 

ccaatgcaca ttgtcttaaa agaagatagt gaaatgatag atgaggtagt ggtagttggt 42 0 

tatggctcac agaaaaaggt taatgtcacc ggtgccgtag gcatggtcaa ttccgaagta 480 

cttgaagctc gtccggtaca aaatgtatcg caagctttac aaggtgtggt accgggtttg 540 

aacctctccg tcaacaacgg tggtggttca ctggatagtg agatgagtat taatattcgt 6 00 

ggtacaggta ccatcggcga cggctccgga tcgtctccat tggtattgat cgatggcatc 660 

gagggcagcc tgaatacagt aaaccccaat gatattgaat cggtatcagt actgaaagat 72 0 

gctgcatcag cttctattta tggtgcacgt gccgcattcg gtgtggtttt ggtaaaaacc 780 

aaaagcggac aatcgggaaa acccagagtg acctattcgg gtaatgtgcg cttctctgat 840 

gcgactaata ttcctgaaat gctggattct tatactttcg cacaatactt taaccgtgca 900 

gcggcaaatg ataatggtgg tactgttttc agtaaagagc aactggaacg catcaaagca 9 60 

taccaggatg gtaccttaaa atcatcagct acctttaacg aacaatcacg ccgatggaac 102 0 

tactatacgg gatcgaatgc aaataccgac tggtttaagg aagtgtatga agattgggtt 10 80 

ccttccatgg atcacaatct tagcatcagt ggtggcacag ataaaactca atatattgtc 1140 

agtggaagct ttctcgatca gaaaggtttg atccgccatg gcaaagatac cttccagcgc 12 0 0 

tacactttaa atggtcgaat cacaagtaac atcacagact ggtttacatt gggatattca 12 60 

accaaatgga cacgtgaaga ttatgatcgt ccaagttacc tgacgggatt gttcttccac 13 2 0 

aatgtggcac gccgctggcc gactgtacct gtctatgacg ataacggata cctgaccgaa 13 8 0 

ccatccgaac tgatccagct ggaagacgga ggcagacaaa tcaaccagaa ggacctcttc 1440 

acacaacagc tgcaactgac cttcgaacca atcaagaact ggaaaattta tgtagaagga 1500 

agtttgcgtg taaccgccaa taaccaacat tgggaagtac tgcctgttta tcagcacgat 1560 

gtagacggta acccggtagg tatgacatgg gatgcaggtg taggcagcta tccggtaggc 162 0 

ggttcaaaag tgtctgaata tgcttataaa gagaattact attctaccaa tatctattcg 1680 

gattacttca agcaactgga taacgggcac tatttcaagg caatggtagg tttcaacgcc 17 40 

gagctgtaca aggaccgcag tgtaagtgcg gacaaatcaa ccttgattac tccatccgta 1800 

ccgacaatta ataccgcagt aggtgaaccc agtgtagcag gtggatacag acatacctca 1860 

gtggccggtt tctttgcccg tttaaactgg aactacaaag accgctacat gctggaagcc 1920 

aacggacgct acgatggttc ttcacgcttt atcggcgaca aacgttgggg attcttcccg 1980 

tcattctcag gtggttggaa catcgcccgt gaagcatttt tcgaagaaac cgccaacaag 2 040 

ttgaaaattg gtacactgaa actgagagca tcgtggggac agttgggtaa caccaacacc 2100 

aatgaagcct ggtatccttt ctatcagact ttaccgcaag gacagaacta cggatggtta 2160 

gtaaacggtg tacgccagaa ttatgccagc aatccgggta tcgtcagtag cgaaaagacc 222 0 

tgggaaacca tcgaaacatg ggatgccggt ctggactggg gattattcaa caaccgtctg 22 80 

accggttcat tcgactattt cgtacgttac acatacgaca tgattgccac cgctccggaa 2 3 40 

ctcccctcta ttctgggtac aggtgttcct aaaatcaata atgccgacat gaaatcgtat 2400 

ggtttcgagt tggaaatcgg ctggagagac agaatcaaaa acttctctta tggagtgaaa 2460 

tttgtcctct cggatgcaca acaaaagatt ctgaaataca acaatcccga caagagtctt 2 520 

agtaatcctt attatgaagg acagaagcta ggagagatat ggggatacaa aacaattgga 2580 

atcgcacaaa gcgatgaaga aatgaaccta catcttgcca atgccaagca gccgatgggg 2640 

cagaaatggg cagcaggtga catcatgtat gccgatctcg acaatagcgg ctcagtggac 2 700 

caaggtgtct tc 2712 

<210> 1176 
<211> 732 
<212> DNA 
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<213> B.fragilis 
<400> 1176 

tatatggaaa agaaagagtt ttcatctcct gctcggagat acggtaagtt ttttatcgcc 60 

tttattttta taacggcagg agtgcttttg ctggctcgca atctgggatg gatttcttat 12 0 

accttgtttg gtattttggt ttcctggcaa atgttactga ttcttttagg aatttacttg 180 

attttgcggc gtcagatttt gcggggcggg atactgcttg ctatcggtgc ctatctgatc 240 

agtccgtatt tggaatggat gcctgcagga gttcatgtca ctcttttccc gattgtcctg 3 00 

attgttatcg gacttgcttt tctgttcagg ccgaaacgtg cccggcacga gcgttcgcac 3 60 

cgagggaact ttgccagtag ccaatataac tcaacagatg gagtgctgca ctccgaaaac 420 

acatttagcg gcatcaggca ggtggtgctc gatgaagtgt ttaaaggcgg aactatacaa 480 

aactcttttg gcgggacggt tatcgacttg cggcgtacga ctcttcccga aggagaaacg 540 

tttctcgata ttgattgtac atttggtgga atagaaattt atgtgccttc cgattggaaa 600 

gtagtgtttc ggtgtactac ctgtctgggc ggttgtcagg acaaacgttt tggcgggggt 660 

atgatcgatc agaaccggat attggtgatc cggggtgatt tgacattcgg aggtattgat 72 0 

ataaaaagtt ga 732 

<210> 1177 
<211> 825 
<212> DNA 
<213> B.fragilis 

<400> 1177 

aaacaatcga acatgagact aatcattcag ccggactatc agtccgtttc tcaatgggcg 60 

gcacattatg ttgctgctaa gatcaaagct gccaatccca ctccggaaaa acctttcgtt 12 0 

ctgggatgcc ccacaggatc atctccactg ggtatgtata aggcactgat cgacctgaat 180 

aaaaaaggaa tcgtatcgtt ccagaatgtt gttactttca acatggacga atacgtagga 2 40 

ctgccgaaag aacatccgga aagctactat tcttttatgt ggaacaactt cttcagccat 3 00 

atcgacatca aaccggagaa cacgaacatt ttaaatggaa atgctgccga tctggatgct 3 60 

gaatgtgcac gttatgaaga aaagatcaaa tcgtatggcg gtatcgacct gtttatggga 42 0 

ggtattggtc ctgacggtca tattgctttc aacgagccgg gctcttcgct gagttctcgt 480 

acccgtcaga aaacactgac aacagatacg atcattgcga actctcgctt cttcgacaat 540 

gatattaaca aggttcccaa gacttcgttg actgtaggag tgggtactgt gctttctgcc 600 

cgtgaggtga tgattatcgt aaacggacac aacaaagcac gtgcattgta tcatgccgta 660 

gacjcjgtgcca ttacacagat gtggacgatc agtgcattgc agatgcacga aaaaggtatc 720 

atcgtttgcg atgatgctgc tactgccgaa ctgaaagttg gtacttatcg ttatttcaag 7 80 

gatatcgaag cagatcacct cgatccgcag tcattgctga agtaa 82 5 

<210> 1178 
<211> 963 
<212> DNA 
<213> B.fragilis 

<400> 1178 

tcaaattcac tgtatttttt gtggttttcc tcttttcttt gcacactttt cgtgctttta 60 

tgcagaaaat ctatatcttt gcaacctaat ttaacattaa aaggtatgag ttacaatttg 12 0 

ttgaaaggaa aaagaggtat tattttcggt gcattaaacg agcagtctat tgcctggaaa 180 

gtagccgaaa gagccgttga agaaggtgct gttattacat tatcaaatac tcctgttgct 240 

gttcgcatgg gacaggtttc tgctttatca gaaaagctca attgcgaagt gattgctgct 3 00 

gatgccacca acgtagaaga tttggagaac gtattcaaac gctcgatgga agttttgggc 3 60 

ggacaaattg attttgtatt gcactctatc ggtatgtcac cgaatgttcg taagaaacgt 420 

acttatgatg atctcgatta taatatgttg aatactacgc tggatgtttc agctgtttcg 480 

ttccataaaa tgattcaggc tgccaagaag caaaatgcaa ttgcagaata cggttctatc 540 

gtggcattga gttatgtagc tgcacagcgt actttctacg gatataacga tatggcggat 600 

gcaaaagcat tacttgaatc tattgcccgc agttttggtt atatctatgg tcgtgagcac 660 

aacgtgcgtg tgaatactat ttcccagtcg cctaccttta caactgccgg ttctggtgtg 720 

aagggtatgg ataaactgta tgactttgct aatcgtatgt ctccgctcgg taatgcttca 780 

gccgacgaat gtgctgatta ctgtatcgta atgttctccg atcttacccg taaggtaact 840 

atgcagaacc tgttccacga tggaggtttt tcaagtgttg gtatgagtct gcgtgcaatg 900 
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gctacttacg aaaaggggct ggacgaatat aaagacgaaa acggtaatat catttacggt 960 
taa 963 

<210> 1179 
<211> 474 
<212> DNA 
<213> B.fragilis 



<400> 1179 
ttggcggcac 
atttcgggac 
atcagtactt 
caatgggctg 
gagagtttca 
agaggtcgtc 
atcatgaacg 
ctacagatgg 



tacgcacctg 
aggcttttat 
cggtagttgt 
cacccaaatc 
tcagggcaat 
cggtctgggt 
tagctgtccc 
tgtgcaggca 



tggactgatg 
cacttggtca 
ttccggattt 
aggcagatca 
cgattctgca 
attacagaat 
ttttccccag 
gtgcgattcg 



cgtttcacag 
ataagctctt 
agtcgtttga 
tcgcggtcta 
acgtgggccg 
ttacaactgc 
cattcgccca 
acaatgcgtt 



tctccatgtt 
tgcgtccttg 
tttcacgaat 
cagatgtgat 
gttcgtcggg 
gggtacagat 
tgttggggca 
tggtctcggt 



atgggagatg 
gaaatcggga 
tgtttgtgcc 
aactgcgtgt 
atccaaaggc 
gtcaccggca 
acgcccactg 
atag 



60 

120 

180 

240 

300 

360 

420 

474 



<210> 1180 
<211> 1110 
<212> DNA 
<213> B.fragilis 

<220> 

<221> unsure 
<222> (1097) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 1180 
cataatttgt 
caaatattgc 
taccatactg 
cagaaagccg 
cgtgtgcaaa 
ataagcaatg 
ttatttggca 
cgcttctgtg 
gcgctttttt 
gccgacatag 
catgtattcc 
at ttcgggag 
gagcattccg 
aaactcagaa 
atgcagacac 
ctgaaactgg 
ttggaatcta 
atgttgacgt 
tt catacgta 



ggaatttata 
atactccttt 
ctgcctacga 
ggcatgctta 
gagtgaccgg 
ctttgatcac 
atacctattc 
acctgacatg 
tggaagttat 
cacataaggc 
atgcgactga 
gagctacctg 
ctaagttggc 
aagaggttca 
tcggacttga 
cacaatgttt 
atccgtttta 
ttgatcttcc 
agggcancga 



taaaaagact 
tgaaaaagag 
atttgagact 
ttcacgcata 
agcgctgagt 
actggcaagc 
atttctgaaa 
tcccgaagaa 
taccaatccg 
cggagtaccg 
cttcggagtt 
tataggagga 
tgctctgagt 
ccgtaatctt 
aacgatggaa 
gcaagaactg 
tgaattgagt 
gtcgcgtgag 
ctttatttga 



ttcaggatga 
gatgcatatc 
gcagaagaga 
acaaatccta 
gtgacagctt 
gcgggagcta 
agtacattgg 
gtgaagcaac 
cagttggaag 
ttgctggcag 
gatatcgaga 
ctgattatag 
gcagataccg 
ggagcatata 
gtacggtttg 
cctgagattg 
acccgccagt 
atatgttttc 



aaaaaaatag 
attctctttc 
tggaagctgc 
cggtacagta 
tgaactccgg 
atgtggtgac 
aagcttttgg 
agattgatgg 
tggccgatct 
atacgacagc 
ttgtgtccag 
attacggcac 
ggaaggaggc 
tgaccccgca 
ctcgtcaggc 
agtcggtgaa 
ttggttctct 
gtttcataaa 



ttttgagaca 
aatgccggtg 
tttctgtggg 
ctttgagcag 
gatggctgcg 
ttcgaaacat 
ggtagaagta 
agatacttgt 
gaaggctctt 
gattcctttc 
tacgaaatat 
ttttgactgg 
ttttactgtc 
ggtggcttat 
agaaacctgc 
ctataccgga 
tccgggagcg 
tcggttgaga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1110 



<210> 1181 
<211> 201 
<212> DNA 
<213> B.fragilis 



<400> 1181 

agtgtagcgc tggaaggtat ctttgccatg gcggatcaaa cctttctgat cgagaaagct 60 

tccactgaca atatattgag ttttatctgt gccaccactg atgctaagat tgtgatccat 12 0 

ggaaggaacc caatcttcat acacttcctt aaaccagtcg gtatttgcat tcgatcccgt 180 

atagtagttc catcggcgtg a 201 



480 



<210> 1182 
<211> 1125 
<212> DNA 
<213> B.fragilis 

<400> 1182 

aaacatatgg gtaacaaagg agtactttct tctgcattta atatgtcatt gggctttatc 60 

cctgttattg tttccatcct cttatgcgaa tttataacac aggacatatc aatctatatc 12 0 

ggtacaggta tcgggcttat ctattcgtat aggtctctgt ctcgtaaagg ggcccgcata 180 

cccaatttta tcctctatat ctctacggga atcttgacat tactgactct ggcaagcttt 240 

attcccggag atttcgttcc ggaaggagca ttgccgctta cacttgaagt cagcatactg 3 00 

attccgatgg ttattctatt cctgcacagg aggaagttca tcagccacta cctgcgtcaa 3 60 

aatgcccaat gcaaccggcg gttgtttgct caaggagcag aatctgccat tgtatccgca 42 0 

cgggtcgtac tgattttagg tattctccat tttgctgtca tcagcctgac ggttttggta 480 

gcacatcccc tcacccggac ttccatactt gtactttatc acgtattgcc gcctactata 540 

tttattctca gcatcctgct caatcaaata ggtatccgtt actttaatca tgtaatggcg 600 

catacggaat atgtccccat cgtcaatacc cgtggtgatg taatcggaaa aagtctggct 660 

gtggaagcta tcaactataa gaatgcctac attaatccgg tgatacgcat tgctgtttcc 72 0 

acccacggca tgttgttcct ctgtaatcgt ccgcaaagct gcatcctcga taaaggcaaa 7 80 

gtggatatac cgatggagtg ttaccttcga tacggggaga cactgacggc gggagccaac 840 

cgcttattga gcaatgcttt tccgaaggcc tctgatctga aaccgacatt taccatctcc 900 

tatcattttg aaaacgaaca gaccaatcgt ttggtctatc tgttcattgt cgatatggaa 960 

gacgactcca ttctctgcga tccacgattc aaaggaggaa agttatggac tttccaacaa 102 0 

atagagcaca acttaggtac tcacttcttc agcgagtgtt tcgagttgga atacgagcac 10 80 

ctgaaacagg ttattggtat aagagaaaaa tacaaggtat cttag 112 5 

<210> 1183 
<211> 2022 
<212> DNA 
<213> B.fragilis 

<400> 1183 

aacatttcgg gatcatctga cgtggaattt acaatagaca ccggtaataa ggagtttcag 60 

gatgcactga atctgattca gtatactcgt caatcagttt tcctcacggg gaaagcggga 12 0 

acgggtaaat ctactttcct gaaatacatc tgtaagaata ccaagaagaa acacatcgtg 180 

ctggcaccca ccggaatcgc tgccattaat gccggtggca gcactctgca cagcttcttc 240 

aaacttcctt ttcatccgtt acttcctgat gatccgaatc tgagtttgca acggggacgc 300 

atccatgaat tctttaaata caccaagcca caccgtaaat tgctggaaca ggtagaactg 3 60 

gtcattatcg acgaaatatc aatggtacgc gccgatatga ttgatgccgt agaccgcatt 42 0 

ttgcgtgtat acagtcgtaa tctgcgcgac cccttcggag gaaaacaagt tttactggta 480 

ggcgatgtat tccagcttga accggtaatc aaaggggatg aacgggaaat tatcaaccgc 540 

ttctatccca ctccttattt tttctcggca cgggtcttca acgagatcga attggtatct 600 

atcgagttac agaaagtata tcgtcagtca gatgcagtct ttgtcagtgt actcgaccac 660 

atccggagcg gagcagcagg ggcagccgac cttcagctgc tcaacacccg ctatggcgct 72 0 

caaattgatg cttcggaaga agatttatac atcactctgg ctacgcgccg cgatacggta 780 

gaaaccatca atgaacggaa acttaccgaa ctccctggcg atcctgtagt gtttgaggga 840 

gagatcaacg gcgactttcc cgaaagtagc ttgcccacct caaaagaact gaccctgaaa 900 

ccgggagcac aaatcatctt tatcaaaaat gactttgaac gccgttgggt aaacggtacc 9 60 

atcggtgtag taagcggcat cgacaacgac ggtatcatct acgtcatcac cgatgatggc 102 0 

aaagagtgtg atgtccaccg ggaatcatgg cgcaacatcc gttacaagta caacgaagag 108 0 

aaaaaggaga ttgaggaaga agaactgggt actttcacac aatatcccat ccgcctggca 1140 

tgggccatca ccgtgcacaa aagccagggt ttgaccttta gccgtgtagt catagacttt 12 00 

accggaggag tgtttgccgg cggacaagct tatgtagctc tcagccgttg cacatcgctt 12 60 

gaaggaattc aactgaaaaa gcctatcagc cgcgcggata tttttgtccg tcccgagatt 132 0 

gtcagttttt ccggaagatt caacaaccgg caagccatcg acaaagcatt aaaacaagca 13 80 

caggccgatg tgcagtatgc cgcagctgcc cgtgctttcg acaaaggaga tttcgagaca 1440 

tgcctcgagc agtttttcct ggccattcat tcccgttatg atatagaaaa gcctgccgcc 1500 

cgaagattaa ttcgcaggaa attgggagtc gtcaacctat tgcgggaaca aaaaaggaaa 1560 



481 



ctacaagcac 
ctaatgggaa 
gacaaagcga 
ctgttcaacg 
cgtccttcag 
atagaagggg 
gcacacgaac 
caatggcgta 



aaatggaggc 
acgaatgtat 
ttgagctcta 
aaaaagagtt 
agttcaaagc 
ctttat caga 
ttttcggaga 
tagcagagga 



acagaaaaaa 
cactcaggcg 
tcccgaatac 
cttcgatgcc 
actctataac 
cctcgacaaa 
tacattatta 
actacgaaaa 



agcctgcaaa 
catgacgttc 
atcgacgcct 
gagaattgcc 
cgaggcaaac 
gcaactagcc 
aaagtcggta 
aagaagaaat 



agtatgcccg 
gggccgccct 
ggatacgaaa 
tgaaccgggc 
ttcgcctgca 
tgaaacctga 
aagagacaga 
aa 



agagtatttg 
tgccaattat 
aggaatcacg 
agtcagcctg 
gacagaaaac 
acatcccgga 
agcagccatc 



1620 
1680 
1740 
1800 
1860 
1920 
1980 
2022 



<210> 1184 
<211> 624 
<212> DNA 
<213> B.fragilis 



<400> 1184 
cagacaatta 
attgtagacc 
atcggttgcc 
gacccgaaga 
gccaaagtgg 
tttatcctga 
atccgtatgc 
atgaccacca 
ctgatcgacg 
attgaacctt 
aaaggggaac 



tgatactcaa 
tactgaacga 
atggcctgaa 
aaaacaatct 
acaacaacat 
acggaactaa 
cggataacag 
cgttgcctca 
aaaaactcgg 
cgacagttgt 
tggaagaggc 



actttacgaa 
cgggggattg 
ggaacgtgcc 
ctctattatt 
tttcaagttg 
ccggttgccc 
tatcatccgt 
tgacgagcac 
tgacgtagtg 
gaactgcact 
ctga 



aaaaataata 
attatctatc 
atcgaacgca 
tgctatgacc 
atgaaacgta 
aaaatctttc 
gaaattgccc 
gaagacatag 
gacctcgtga 
gaaggagaag 



atcctcagga 
ccacagacac 
tctgccggat 
tgagcagcat 
acctgcccgg 
gaaaccggaa 
gtctgctgga 
agtacgtcac 
tcgacggagg 
ctgccatcgt 



cctgcaacgg 
catgtatgcc 
caaagagatt 
cagtgaatat 
acctttcact 
agaagtaggt 
tgctccgatc 
tgacccggaa 
tatcgggggg 
ccggcaggga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

624 



<210> 1185 
<211> 435 
<212> DNA 
<213> B.fragilis 



<400> 1185 
catttgatta 
agccaaacaa 
aacgctcaag 
aaagacagtt 
aaactggaat 
agcaccaaag 
aaagaaggtg 
ggtctgaaag 



tgaaaaaagt 
aagattcgta 
actatacaaa 
ataataactt 
ccacttatgc 
atgcatttga 
cgcaaaaggc 
at tag 



attattcttt 
tctggagggc 
agccgactgg 
cagtaaacaa 
tgccctaaag 
gaaagtcaaa 
tgctaaaaaa 



gcattggtgc 
tttaagcttt 
gaaaaggctg 
atgacttcgg 
ttaaagaaaa 
gatacagcca 
ggtgagaaag 



tgaccattgc 
ttatcgaaag 
acgaacaatt 
acgaaaaagg 
taggagatga 
aagatgcagc 
caatggaagg 



cacagcatgc 
cgtacagaaa 
tacgaaactg 
agaagtgata 
cgtgaaagaa 
caaagacgta 
gatcaaagac 



60 

120 

180 

240 

300 

360 

420 

435 



<210> 1186 
<211> 2238 
<212> DNA 
<213> B.fragilis 



<400> 1186 
aaacaaccta 
ctgtgctgcc 
tcgggcaagt 
tacacacaaa 
ccggtggaag 
agttaccaat 
tatcgtcatt 
gtgactacca 
ttctctccgg 
cttttgtatg 
aacggtattc 



taaaatcatc 
ttgccggctt 
tcagccccga 
gaaatgccga 
tggtattcga 
tttcaccgga 
cttatacggc 
acaatattgt 
acggaaacct 
gcaacagcga 
ccgactgggt 



taagccaatg 
tgcacaagga 
aaacatttat 
aggtacacag 
tgtgacaaaa 
cggatcaaaa 
agtccactac 
tgaaaagcta 
ggtggcgttt 
atcgcaagtg 
atacgaagaa 



aaaagaaaat 
ggcaaagcgc 
ggtgtggttc 
atcgtgaaat 
gcccgcgaat 
atactgattg 
ctgtatccgg 
tcggacggtg 
gtccgtgata 
actgaagacg 
gagttcggtt 



tcatttttct 
tcgacctgaa 
ccatgcctga 
attccttccg 
gcccctttaa 
caaccgaaac 
tgaaacggaa 
gtccgcagca 
ataacatctt 
gcaagttgaa 
tcaaccgtgc 



gtttttctgt 
agaaattaac 
cggcgaacac 
cacgggtgaa 
gaaattcgac 
caagcccatc 
cgacaaagga 
agccccggta 
cctggtaaag 
cagtgtactg 
cctggaattc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 



482 



aatgccgata ataccatgct ggcctatgtt cgtttcgacg aatcggaggt tccatcatac 72 0 

actttccccc tgtttgcagg tgaagcaccg cgttatgatg cactgcagga ttatccggga 7 80 

gaatacactt acaaatatcc caaagcaggt taccccaact ccaaggtgtc agtacatacg 840 

ttcgacatca aatcgaaagt gacccgtcag gtgaagctgc cgatagacgc cgacggatat 900 

atcccgcgca tccgtttcac tcaggatccc aacaaactgg ccatcatgac actgaaccgt 960 

caccagaacc gcttcgacat gtattttgcc gatcctcgca gcacagtgtg caaactggcc 102 0 

ctacgcgacg aatctcctta ttacatcaac gaaaatgtat tcgataacat tcagttctat 1080 

cccgaatatt tcagctttgt tagcgataag agcggatatc ctcacttgta ctggtatagc 1140 

atgaacggta acttgatcaa acaagtgacc agcggtaact atgaagtaaa aaactttatc 12 00 

ggatggaatc cggataccaa cgagttttat tacaccagca atgaagaaag cccgatgcgt 12 60 

caggcggtat acaagataga ccgtaagggc aagaagatga aactgagcaa tcagccggga 132 0 

accaacagtc ccatcttcag cagctcgatg aaatatttca tgaacaagtt taccagcctc 13 80 

gatactccga tgctgattac cttgaatgac aacacaggta aggtcttgaa gactctcgta 1440 

acaaatgata aactgaaaca gaaactggcc gaatatgcca taccgcaaaa agaattcttc 1500 

acgttcaaaa caacagaagg agtcgatctg aacggctgga tgatgaaacc ggtcaatttc 1560 

gatcctgcca aacgttatcc ggtactgatg ttccagtata gcggtccggg ttcgcaacag 162 0 

gttctggaca aatggggaat cagttgggaa acctacatgg cgagcctcgg ttacgtggta 1680 

gcttgtgtag atggtcgcgg cacaggtggc cgtggcagtg aattccagaa atgcacctac 17 40 

ctgaacctgg gtgtaaaaga agctaaagac caggtggaag ctgccaaata tctgggtgga 1800 

ctgccttatg tggacaaagg acgtattggt atctggggat ggagtttcgg cggatatatg 1860 

accatcatga gtatgagcga aggtacaccc gtgtttaaag ccggagttgc tgtggccgca 192 0 

cctacagact ggaaatatta cgatacagta tataccgaac gctttatgcg cacgccgaaa 1980 

gaaaatgccg aaggctataa agcagcttca gcattcagcc gtgcagacaa cctgcatggt 2040 

aacctgctcc ttgtacacgg tatggcagat gataatgttc acttccagaa ctgtacagaa 2100 

tatgcagagc acctggtaca actcggaaaa cagttcgata tgcaggtata caccaaccgg 2160 

aatcatagca tctatggtgg aaatacccgt aaccacttgt atacgaagct gacgaacttc 2220 

ttccggaata atttataa 223 8 

<210> 1187 
<211> 846 
<212> DNA 
<213> B.fragilis 

<400> 1187 

aacatggata tacatcctat acaagaatct tcccggcggt ggatgacggc attgatattg 60 

gccgttgtag cagcagggat acaaacaact ctactttggg gctatgccgg agccgacacg 12 0 

cttccggctg caatagacgg gattttatct gtcggattgc tttgcctcct ggcctatctg 180 

gcatggtatg tcattggcct tgtctctata ttgcagaccg acttactgat agccgctttg 240 

gctctgcttt tctggctggc gggaggcttt gctgtgcaat atgtgctgga acagaatatg 3 00 

ggacaagtat atgctccttt tggtgagacg cttcctttcc gtatattgtt tggagcattg 3 60 

gcctggggag tgatgatgtt gtggtatcgc ttgcagtcgc tgaataccgt tcaggaagag 420 

atattagaag aggctgtttc gagagaggaa gcccttcgtg aagaattgag gcagattgaa 480 

tgccgcgaag ataaagcgtt gccggaagag gcggaatgta ttgaccgcat cacggtgaaa 540 

gatggtacac atattcatct gatccgtacc gacgagttgc tttacataca ggcatgtggc 600 

gattatgtca cattggtgac cccttcggga caatatgtca aggagcagac catgaagtat 660 

tttgatgccc atctgccatc agcaggattt gtgcgtgtgc atcgttctac aatagtgaac 720 

gtgacgcaaa tatcacgggt tgaactcttt ggaaaagaaa attatcagct ctcgttaaaa 7 80 

aacggcgtaa ggctgaaagt gagtaattcc ggatataaat tactgaagga gcggttggaa 840 

ctttaa 846 

<210> 1188 
<211> 1209 
<212> DNA 
<213> B. fragilis 

<400> 1188 

aaaaaaagaa tggaacagaa aacaagaatt aaaggaaacg ttcattatgt gggagttaac 60 

gaccgtaaca agcacctctt tgagggaatg tggcctttgc cctatggagt ttcgtataac 12 0 

tcttatctga ttgatgatga aatggtggca ctgatcgata cagtggatat ttgctatttc 180 



483 



gaagtatatc ttcgaaaaat cagaaatatc ataggcgacc gtcctatcaa ctatttgatt 240 

ataaatcaca tggaaccgga ccattcaggt tctatccgat tgattaagca acactatccc 3 00 

gacattgtga tcgttggcaa taaacagact ttcggtatga tcgaaggttt ttatggtgtg 3 60 

accggcgagc aatatctgat taaggatggt gattttctcg ctcttggacg tcataaactg 42 0 

cgcttttacc tgactccgat ggtacactgg ccggaaacga tgatgacatt tgacgaaaca 480 

gatggcatac tcttctccgg tgatggtttc gggtgttttg gtacgctgga tggcggcttc 540 

gtggatacac gcatgaatat cgaccattat tggggcgaaa tggttcgtta ttattcgaac 600 

atcgtcggca aatacggttc accggtacag aaagctttgc aaaagttggg tggacttcct 660 

atctcggcta tttgttctac gcacggtccg gtatggactg agaatatcac gaaagtggta 720 

ggcatttatg ataaactgag ccgttatgac gcagatgaag gtgtggtaat tgcatacggc 780 

agtatgtacg gtaataccga acaaatggca gaagccattg cagctgagct ttcggcacag 840 

ggcatcaaaa acattgtgat gcacaatgtc agcaaaagta atccctctta tatccttgcc 900 

gatatattcc gttataaagg attgattatc ggtagcccta cttacagtaa ccagattttc 960 

ccggaagtgg agtcgcttct gtccaagata ttggttcgtg aattgaaagg acgttatctg 1020 

gggtatttcg gttcgttcac gtgggcaggt gccgccgtga aacgtatggc cgagtttgca 1080 

gagaaaagta aatttgaatt ggtcggtgat cctgtagaaa tgaaacaggc catgaaggag 1140 

atcacatatc aacagtgtga gaacctggca cgtgctatgg ccggccgttt aaaaaaagac 1200 

agagtataa 1209 

<210> 1189 
<211> 879 
<212> DNA 
<213> B.fragilis 

<400> 1189 

cagacagcag ttaaaattta ccatcattgt ggtattcagt tactctatat caggcttttt 60 

cagtacattt gtgctccatt aaagaaaaga cgtatgataa aagccctgtt tttcgatata 120 

gacgggacgt tggttagttt caacactcac gaaattcctt cgtctaccct cgcagccata 180 

gccgaagcaa aagctaaagg tatcaagata ttcatcgcta ccggacgccc gaaagcgatt 240 

atcaacaacc tcaccgccct tcaggaacgg gaactgatag acggctacat caccatgaac 3 00 

ggagggtatt gtttcgttgg agatgaggtg atttacaaac attccatccc ggtacaagat 360 

gttaaggcac tggctgcact ttcggacgaa agaaactttc cctgtatctt tgtagccgaa 420 

cataccgtag ctgtttgcaa taccaacaag ctggttaacg aaatctttca tgatttcctg 480 

catgtagata tcctgcccat ccaaactaca gccgaggcta cgcagcccga aatctttcag 540 

atgactccat tcatcactac cgaagaggag aaaacggtat tgcctttact cccgaactgc 600 

gaatccggac gctggtttcc tgcatttaca gatatcgtag ctaaaggtat ccgcaaacaa 660 

aaaggaatag atgaaatcat tcgtcatttc ggtatcggac aggaagaaac aatggcattt 720 

ggtgatggag gcaacgatat cagtatgttg cgccatgcag ccatcggagt agccatgggc 780 

aatgcgaatg acgatgtcaa agaaaccgcc gactatataa ccacttctgt agacgaagac 840 

ggaatacaaa aagcattaaa acatttcggg atcatctga 879 

<210> 1190 
<211> 615 
<212> DNA 
<213> B.fragilis 

<400> 1190 

aaagctaaat ttatcattta caaactttat acaatgctgc caaagaaaaa aaacctggaa 60 

gaagagagag cgaaacatac ggttgatagt ctttataagg actatgttga cgacttgttt 120 

tcgtatgctt tgggattcgg gtttgacaaa cagacagcga tggatgccat tcatgatgtg 180 

ttttgcaggg tatgtatccg agaaagagaa gtgcaggaga tacagaatcc taaattttac 240 

ttgttgcgtg ccctgcggaa ccagttgatt gacacctata aactcaaacg aaactactcg 300 

gaggttctta ccggtgagat taccgatgaa cttccatata aaatcaaaat taccgtagag 3 60 

gatgaaataa tcgcagcgga agagcaggcg gaagtatcac agaaagttga cgagattctg 42 0 

agtatactta ccgaacgcca gcgcgagatt atttatttac gttatatgca ggaatgctct 480 

tatgaggaaa ttgcagagat tatgcaaatc agtgttcctg cctgtcgtaa attgctctat 540 

cggaccttac ttaaactgaa gcacaataac acattagtgc tcttctatct cttactttct 600 

attaatgttg gttaa 615 



484 



<210> 1191 
<211> 738 
<212> DNA 
<213> B. f ragilis 



<400> 1191 

tccgtaaaat ttcaactatt aaatacccta ttcatggata ccgctcttta tcttttgccc 60 

gtcactttgg gcgacactcc gatcgagtct gtattacctt cttataataa agaaattatt 12 0 

cagggcatca agcacttcat tgtcgaagat gttcgctcgg cccgccgctt cctgaaaaag 180 

gtagaccgtg agattgatat tgactcactc actttttacc cgctgaataa acatacttct 240 

cctgaagata tttccggtta tctgaaaccg ctggcaggcg gtttgtccat gggagtgatt 3 00 

tccgaagccg gttgtcctgc tgtagccgat ccgggagctg acgtggtggc tattgcacaa 360 

cgtaaaaacc tgaaagtagt tccgttggta ggaccttcgt ctatcattct ttctgtgatg 42 0 

ggttcgggat ttaacgggca gagttttgcc tttcacggct atctgcctat agagccgggt 480 

gagcgtgcaa aaaaaataaa agctcttgag caacgggtat atgccgagca tcagacacag 540 

ctatttatag aaacacctta tcgtaacaat aagatggtgg aggatattct gcataattgt 600 

cgtccgcaga ctcgcttgtg tattgcagcg aatatcactt gtgagggaga atatatccgt 660 

actaaaacca taaaagagtg gcaaggaaaa gtacccgacc tgactaagat accttgtatt 72 0 

tttctcttat accaataa , 73 8 



<210> 1192 
<211> 1374 
<212> DNA 
<213> B.fragilis 



<400> 1192 



agttccccga 


cgccgagaga 


60 


acatcattct 


gggggtacgc 


120 


ttgtggacga 


ggaacacgaa 


180 


cccggaatgc 


tgccatcgtg 


240 


ccactccttc 


ggtggagact 


300 


tgaaagaacg 


ctataaggaa 


360 


tgcaccgcaa 


gaagcggatg 


420 


aggcactcga 


taataaacag 


480 


tgatagagtg 


ccgcacgtgc 


540 


cctatcataa 


aggtatcaat 


600 


cccgttcgtg 


tccggcttgc 


660 


agatagaaga 


tgatgtgaag 


720 


atactacacg 


tacgcggtcg 


780 


ccgatatact 


gatcggtacc 


840 


tggtggggat 


actcaatgcg 


900 


gtgccttcca 


gttgatggcg 


960 


gggtagtgtt 


gcaaaccaaa 


1020 


attatgaaga 


tatggtggcg 


1080 


attaccggat 


ggtgtatgtc 


1140 


cacacaccat 


ggccgagaaa 


1200 


aaccgcccgt 


tgcccgtatt 


1260 


aaaatgcgcc 


gatgagtcgt 


1320 


cgcggtcagt 


teat 


1374 



<210> 1193 
<211> 1533 
<212> DNA 
<213> B.fragilis 



<400> 1193 

ataacaaact caataaaaac aaacgcaatg ttacgaaaaa tcagattaac atgtggcatc 60 

atetgectga cactgatcac ettgetatte cttgacttta ccggaaccct tcacggttgg 12 0 

ttcggctggc tggcaaagat ccagttcctc cctgcagtac tggcattgaa cgtaggagta 180 
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gtagtccttt taatcatcct gacaggagta ttcggccgaa tctattgctc ggtgatttgt 240 

ccgttggggg tatttcaaga tgtagcagcc tggattggca aaaagcggaa aaagttaccc 3 00 

tactcctatt ctcccgctct ctccctccta cgctacgggg cattggcaat attcatcatc 3 60 

accctggtgg caggagtaag tttcatcgca actctatttg ctccctacag tgcttacggg 420 

cgtatcgcaa acaacctgtt ccaacccatt tggctgtggg gaaacaacct gttcgcccat 480 

ttagccgaac gggccggcaa ctatgcattt tatgaagtag acatctggat aaaaagtctg 540 

cctactttca ttgtagccgc agctactttt gtcatcctga tattattggc atggcggaac 600 

ggacgcactt actgcaatac aatctgcccg gtagggacgg tactgggatt tctttcacgt 660 

tactccttgt tccgcatcac aatagatacg gaaaaatgca ataagtgcgg actttgtgca 72 0 

cgtcattgta aagcggcctg catcaatgct aaagaacata cgatcgatta cagccgatgc 780 

gtggtttgca tggattgcct cggtaaatgc aagcagaaag cactcagtta ccaattgcgg 840 

acaaccaagg ctcggccagc aaaagcagaa gaaaatgctc ttgcagcctc atccaaggaa 900 

gtcaatgaag cacgtcgcaa tttccttacc gtaacggcaa tggccgctac ggcatcagcc 9 60 

ctaaaagcac aagagaaaaa agtagacgga ggactggcgg ccatagaaga taaaaagatc 1020 

ccgaaccgcc agactcccat tacacctccc ggttcgttga gtgcacgaaa catggcagca 1080 

cattgcacag cttgccaatt gtgcgtatca gcttgttcca accaagtatt gcgtccttcc 1140 

actaacctga tgaacctgat gcaacctgaa atatcatatg aacgcggata ctgccgtccg 12 00 

gaatgtaatg actgttcaca agtatgtccg acaggagcca tacaccccat tacagcagcc 1260 

gacaaatctt ccactcaaat cggacatgct gtctggatca aagcaaattg cgtgtcgctg 1320 

accgatggag tgaaatgtga caactgtgcc cgtcattgtc cgacaggagc tatccagatg 13 80 

attgtcgccg aaccggaaaa agaggcttct ccccaaattc cggcgatcaa caccgagcgt 1440 

tgcatcggct gtggagcatg cgagaatctt tgtccggcac gtccgttcag tgccatctac 1500 

gtcgaggggc acgaaaggca tcgtatcata taa 1533 

<210> 1194 
<211> 798 
<212> DNA 
<213> B.fragilis 

<400> 1194 

aaaagtgttc cgggtatatt tacctctaat atagtggctg 

gaagacgaaa ctgctgccta tgaaaattta acggatattc 

atccggatca tggcaaatac ggaaagtgtc acacaaaccg 

ccggctccgg atctgatatt catggatatt catttatcgg 

ttcgacagaa tagaactgga gactcccatc atattcacaa 

atcgaagctt ttaaggtgaa cagcatagat tatttattaa 

gtagaacacg cactggagaa atacagcaaa ctgacccgac 

tcacaactga ctctattgaa acctgcaccc agatacaaag 

aaagacaaac tgttaccggt aaatataaaa aatatttctt 

aacacgtatg tatgcttaaa agacggcaat cgttatccat 

attgcttcct cactgaaccc ggaagacttc atccgggcca 

agggatagcg taacggatat aaccatctgg ttcgacagcc 

acagaagtac cggaacgtat ttatgtcagc aaaaacaaag 

cttgtaaacg ataaataa 

<210> 1195 
<211> 843 
<212> DNA 
<213> B.fragilis 

<400> 1195 



ttatgaaagt 


attgattgta 


60 


tcacagagat 


aactcccgac 


120 


tcgggtggtt 


acaatctaat 


180 


acggatcggc 


ttttgctatt 


240 


ccgcttatga 


ccgatatgcc 


300 


agccggtcaa 


agtggaagat 


360 


aggacttatt 


acaatatttg 


420 


acaagttatt 


gattgcacac 


480 


atttttacgc 


aaccggcaaa 


540 


actccaagac 


tttggaacaa 


600 


ataagcagtt 


tatcgtagcg 


660 


gtctacttat 


cacgctcgat 


720 


catcggaatt 


taaaacatgg 


780 
798 



tgtatctaag 


tacacaatgc 


60 


acaaactgaa 


caaagagttc 


120 


aactgaatga 


atataaacga 


180 


ttctaagtga 


tatgcatacc 


240 


tgggcatagc 


ccgaaaagga 


300 


ttaaatatat 


ccatccggat 


360 


attttctgaa 


acagattcca 


420 


gtatgcgtga 


tccttccggg 


480 
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aaatatatcc ctatcctaca caggatgttc tatgtagcta cccattcaaa cgacagtatg 540 

tggctggctc tgtgtcttta caatttgtcg gtcgacccta ctatgagttg tagagtgatc 600 

aactcgacaa acggacaggt catagaactg gaaaagcaag attgcagcaa attgctatcc 660 

gatagggaga aaacaatatt acagttgatc gatatgggga aaacaagtca tgagatcgcc 720 

cgggaactgt ttataagcaa aaataccgtc agccgacacc gacaaaatat attggaaaag 780 

cttcaggtaa agaactccat tgaagcttgc agaattgcca aagaacttaa gttactcttc 840 

taa 843 



<210> 1196 
<211> 588 
<212> DNA 
<213> B.fragilis 



<400> 1196 
acgcaggtta 
tattacatca 
acgaaaacgt 
ttccatctca 
ggagcttata 
aactcattat 
acacttctag 
atcctcctcc 
gtcaacgatt 
caaatattcg 



ccccatgccg 
taattatgga 
atctgacaac 
ttccgcaagg 
aatacggatg 
tgttcggcat 
cgatagctgc 
tggtcgtgtt 
ttttcagtgc 
gaggctatct 



aaggctacat 
aacaacatct 
actgctattc 
tggtatcact 
gaaagtaggg 
gcctcaaccg 
cggttatgca 
atcctatcag 
cgtacaggat 
gtttataagt 



gtatgggagg 
atcaagcttt 
gtagtgggca 
tggttaccca 
ttactgacag 
gtgatcttac 
gcccaccgct 
gtggtcggca 
ttccgtatcg 
cgtttgattt 



aattaatatt 
attcattgaa 
atatggcact 
tctatttttt 
cacttctatc 
ccgccatact 
acaaacgcat 
ctttaggcga 
gtctgccggg 
ataaataa 



aacttttaat 
ttacaatgat 
cccccaactt 
tactctgatc 
gcctgtttta 
cttaaaatcg 
ttccatccct 
atggatcctt 
aatggctctg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

588 



<210> 1197 
<211> 264 
<212> DNA 
<213> B.fragilis 



<400> 1197 

aatcaaccta ctattcttta cattcttact tcattttact atttatctag tataaaatca 60 

attaatatgg atagtatagg aaaaagaatg tatcgtaatc ttggagacga taccaaagcg 12 0 

aaaattagtc aatcattaag aggtagaagc aagtcagctt cgcatatcca agcaatatca 180 

caaggcatga ctaattactg gaagactata ccagtcaaaa cagatgataa cccaagtgat 240 

aaaacaaaaa aagaggggca ataa 2 64 



<210> 1198 
<211> 639 
<212> DNA 
<213> B.fragilis 



<400> 1198 

acaagtaaca ccatgaaaaa agtagtaata tttgcaagag tatcgagcac caacggaaca 60 

caagactatg aacgtcaaat aaatgatttg cagacattag cctcagcaaa caactggact 12 0 

gttgaggctg tatttgcaga aaaggtatct ggagcgaaaa agaatactga acgcatagaa 180 

ttaatgaata tgataaacta tatcaactca cacaacatac ataaggtact agtaaccgaa 2 40 

ttgtccagac ttggacgtga tactttacaa gttttgcaag ctatagagat actcaatcaa 3 00 

aacaaagtat cagtattcat tcaaaattat aatattgaaa cgcttactcc agagggagaa 3 60 

atcaatccta tgagccagtt tcttattact atacttgccg aagtagcacg aatggaacgc 420 

aagactatta gagaacgtgt tgcaagtggt taccagaatt tccgtagcaa tggtggtaag 480 

gtagggcgaa aagttggata tacgaaaagc gatgaggtta tgagggaaga gtatgcagaa 540 

gaattaagat tactgaaaag agggtactca ctgcgaaata cctcaaaact gacgggaaca 600 

agtatcaaca ccctgcgaaa attaacccaa ttaacataa 63 9 



<210> 1199 
<211> 1344 
<212> DNA 
<213> B.fragilis 
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<400> 1199 

ttcacgggca cgactcatcg gcgcattttg ttctatttta acaactattt ttcgtatgaa 60 

caaagtttga atacgggcaa cgggcggttt atccggtccc agtatccggt ttccgaacaa 120 

tgcgcgcagt ttctcggcca tggtgtgtgc catcacatca agcaacgtct cgttccggtt 180 

tttcagatag acatacacca tccggtaata cggaggataa tgaaacatct gccgctcagc 240 

cagttgtccc gccaccatat cttcataatc gttcgtcatc acctgacgga tgatgggatg 300 

gtctatactt ttggtttgca acactaccct gccccgttta ttttttcgtc cggcacgtcc 3 60 

ggcaacctgc gccatcaact ggaaggcacg ttcgtacgaa cggaaatcgg gataattcag 42 0 

catcgtgtcc gcattgagta tccccaccac gctgacgtgg tcaaagtcga gtcctttgga 480 

taccatctgg gtaccgatca gtatatcggt ctttccctgt tcaaagtctg caataatttt 540 

ttcgtatgcc gaccgcgtac gtgtagtatc gagatccata cgtgctacgg aagcttccgg 600 

aaaaatcagc ttcacatcat cttctatctt ttccgtacca aatcctctgt gcatcagctc 660 

gaccccttcg caagccggac acgaacgggg caactggtaa gtatacccgc aatagtggca 72 0 

agtcagctga ttgatacctt tatgataggt caggctgaca tcacagttct tgcactttgg 7 80 

cacccatccg cacgtgcggc actctatcat cggagcaaat ccccggcgat tctgaaacag 840 

aatcacctgc tgtttattat cgagtgcctc gcggacatat tgcagcagca acggagagaa 900 

ctgccccgtc atccgcttct tgcggtgcaa ctcctttata tctaccggaa taatctccgg 960 

caattgaatt tccttatagc gttctttcag ttccacccac ccgaatttgc cggtagtggc 102 0 

attctgccaa gtctccaccg aaggagtggc agtacccagc aacgtctttg ccccatacat 1080 

cgaagccagc acgatggcag cattccgggc atgatagcgc ggtgcgggat cttgttgctt 1140 

atacgtattt tcgtgttcct cgtccacaat gaccagtccg agattccgaa acgggaggaa 12 00 

caccgaagag cgtaccccca gaatgatgtc gtaaccctcc tccgtcaact gcttttgcca 12 60 

gatctctact ctctcggcgt cggggaactt ggagtgatag attcccaacc cgggaaccga 132 0 

acacccgttt cagccgttcg gtga 1344 

<210> 1200 
<211> 198 
<212> DNA 
<213> B.fragilis 

<400> 1200 

ctaaaaacaa cgaccggacg agtaaatcag tataccatat ttcccactat gcgggagctg 60 

aatgtgggag gatttgcggc gtgcctttcg gtatatcccg aaaaagcacg ccacaagaaa 12 0 

aataaattta aaacggcatg taaccaaata cgcagcgcaa tgaacaggac tgaaacctac 180 
aactccacca cacaatag 19 8 

<210> 1201 
<211> 192 
<212> DNA 
<213> B.fragilis 

<400> 1201 

aagatatcga caacaaaatc gccttgttcg ataaatacgg atacactaat attgaaggtc 60 

aggaagagta tcttgatctg tcttgaccag gatatcgaga aagcccaaag aactgtagat 12 0 

gaaaagcaag cagcagtgga cggtttgaac gctaccttga agaaactctt ggatgcttat 180 
gcagctgaat aa 192 

<210> 1202 
<211> 1260 
<212> DNA 
<213> B.fragilis 

<400> 1202 

cccaatagag ccatagggat tattacgaat atgaataaat tgcttatgcc atttatgacg 60 

cttgtttgct tgctttttac ggcttgcaac aaggatgata ttttacccgg tggaccgatg 12 0 

ctctggacgt atgagattct gacaccggaa agtgtagagt atgaaggtgg caccgtaggt 180 

tggataccta aagaatgttt caaggcaaac ggtaatgagg gatatatcgt gatgacttgc 2 40 

aagaatttcg atatgctcaa tcctatttcg ggcggttctt acacatacga ttgtggatgg 3 00 
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gcaacgctca aggtagaagc caatcagttg aagattcatt ttccccgtca ggtttcggaa 360 

gccccggatg catacgagga gattacaatc tcgacaaatg atggaaagag aacagcaagt 42 0 

acaattatct gtttgtctag aacctttaag gacgaagggc aacccgatcc agagccgaaa 480 

cctctgcccg aagaagccaa gttcaagatg aaaaaggcat acttcactcc gtttatgcac 540 

cttgacaccc aattcccggc accgctcgat ctggtgacgt tcagaatcac ggatataaac 600 

gacaattaca ccccgctggg ctttcctgag tttacacaat attacgactc tattgtttgg 660 

agtgccgagg gtttccctca tacgttcaga gtctatgaaa gcaatacaac ggagggaggg 72 0 

atggaaacac atcttgctac ggaatggagt tcgcacttct tcaaaagcgg taccatcaaa 7 80 

aattacctga aaggctatcg caaaggaaag gttgaatatg agacctcgct cgctgtgaga 840 

ctgtacgaac gtgatttctt ggggattgaa tgggggacaa tcgtgttgca gagcccacag 9 00 

aaccttacaa cctattgcct gctggacaca gattatgagt atcaggtgta tgacatcgtg 960 

gcaaaggatt ataacccctt ttctaaaata atcccggtga accataagca actctcggat 102 0 

tcagacttcc cggcagcagc gcaaaaagcc atcaaaacac tgatggagaa taacattggt 1080 

gaagggcaaa atgctggtgg aaaagagaac ctgttcaaat gcctgcccga agagggtgtg 1140 

aaagctgaat tgtattggga aaacaagact acccgtatac tgatgttgca tcaactctcc 1200 

actgaccccg atgacctgac acaagagaag tattatctac acgttgaacc taaacaataa 12 60 

<210> 1203 
<211> 1296 
<212> DNA 
<213> B. fragilis 

<400> 1203 

tatgacatgg caaaaataca aattaaatct gagaaactca caccttttgg aggaattttt 6 0 

tcaatcatgg agaaatttga ctccatgctt tcacccgtta tcgactcaac actgggtcag 12 0 

agatgcagca gtatcttcgg atatcagttc agcgagatag tccgttcgct gatgagcgtt 18 0 

tatttctgtg gcggctcatg cgtggaagat gtaacgtcac aactgatgcg ccatctctcg 2 40 

tatcatccta cccttcgtac atgcagctct gataccatcc tcagagccat caaggaactg 3 00 

acacaggaaa acatctccta tacttccgac caaggcaaga cctatgattt caatactgca 3 60 

gacaaactca acacattgct tataaacgct ttggtttcta caggcgagtt gaaggaaatt 42 0 

gaggaatacg atgttgactt tgaccatcag ttccttgaaa cggagaagta tgatgcaaaa 480 

ccgacctaca aaaagttcct cggctacagg cctggcgtat atgttatcgg tgacaagata 540 

gtctatatcg agaacagcga tggtaacacg aatgtgcgtt ttcatcaggc agacacccat 60 0 

aagagattct tcgctcttct ggaatcccag aacatccgtg taaatcgctt cagggcagac 660 

tgcggttcct gctcgaagga aatcgtcagt gagatagaga agcattgcaa acatttctac 72 0 

atccgtgcca accgatgcag ttcgctctac aatgacatct ttgctctgag aggatggaag 7 80 

acggaggaga ttaacggcat ccagttcgaa ctcaattcca ttctcgttga gaaatgggaa 840 

ggcaagtgct atcgtcttgt catccagaga caaagacgca acagtggcga ccttgacctg 9 00 

tgggaaggcg aatacactta ccgttgtatt ctgaccaacg attacaagtc atcgacaagg 9 60 

gacattgttg aattctacaa tctgcgtggc ggcaaggaac gtatctttga cgacatgaac 1020 

aacggattcg gttggagcag gctccccaag tcattcatgg cggagaatac tgtctttctt 1080 

ctgcttactg cattgataca caatttctac aagaccatca tgagcaggct tgacaccaag 1140 

gcttttgggc tcaagaaaac gagtcgcata aaggcttttg tcttcagatt catctccgta 12 00 

cctgccaagt ggatcatgac tgcaaggcaa tacgtgctga atatctacac agagaaccga 1260 

gcttatgcaa aacccttcaa aacagaattc ggataa 1296 

<210> 1204 
<211> 498 
<212> DNA 
<213> B. fragilis 

<400> 1204 

ttaaactctt ataagatgaa aaagaatgta tttattttgt ttgtagttct tttaactact 60 

agtgtgttta tgtcttgttc cagtgacgat gacaatgaca atggaaaagt tgaaaatacc 120 

attattatca atggtaaaga gtatcttaat gatgaatctg catctgtgtc gtacaactct 180 

tatagccagt ctattagttt tgaggcaggc tttagtaatc cagaaagtct tatggatata 240 

agttacttta cgattgcaag taatgatgct gctagtgtag ataagctaac caacggaatg 3 00 

gaacttaatg ctaaagttaa agaatttgta aaaaatacag atttaggctc tagctatact 3 60 

tatactacgg taggtggaaa agtcgttgtg gataatgtta cctctgaaag tataacgctt 42 0 
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cgttatgatg attttaagtt tacaaaaaat ggtggtgaat atacgataaa aggtagagtt 480 
ctttatcata agaattaa 498 

<210> 1205 
<211> 198 
<212> DNA 
<213> B.fragilis 

<400> 1205 

ccaactgatt • atcagacgat ttatattttc catttttgcc attttacacc cttattttgt 60 

tacttagtac actttgaatt ctcattttct ttcgcacctt tgcaatatcg gaaaacaaaa 12 0 

gaaaatcaac tatttagcgt aatcacccgg ttatggtacc taaatcggga taaaaccagt 180 
agcgtatggc aaagatag 198 

<210> 1206 
<211> 195 
<212> DNA 
<213> B.fragilis 

<400> 1206 

aaaaaatgtt atgtaagtaa atcaatgttt tctcttgccg atgacggcct tgcctctatc 60 

ttatgtaata cctctatcta cagttacttt gtaggtaacc ccaaacagac tattgaagga 12 0 

tgcaaatggc gttttgccaa aaggaatagc agtaaaaggg aatccttatt tataataagg 180 
tataaaaaag aatag 195 

<210> 1207 
<211> 201 
<212> DNA 
<213> B.fragilis 

<400> 1207 

agtcttctat atcccagcat cttccttccc attttcctag aatatagtga ctttggctgt 60 

gattttggat actctacata taatagtgga tggagttgtt ttgtaattag taaatacgga 120 

aaacaataca tagctgaaaa ggacaaatgt atgaatggtc ctttcaattt gtttaaagta 180 
ttagaattac cacaacatta a 2 01 

<210> 1208 
<211> 579 
<212> DNA 
<213> B.fragilis 

<400> 1208 

tttatttcat acagcgaatt tataatggta aatgaattga ataaagaaat ggatgtggat 60 

aaatacaaaa taagaagttg gtctaaagac gatttttcca ctttagctaa atatcttaat 120 

aataaaaaga tatgggataa ttgccgtgat agcctaccat atccttattc tgaaaacgat 180 

gcgcaacaat tcatcctgtc cgtttcaagt caaaacgaac aaaataatta ttgtatcgaa 240 

gtaaatcagg aagcggctgg taatataagt tttgctcgtg gtatagatgt agagcgctac 3 00 

aatgcagaat taggttattg gcttgctgaa ccatattggg gtaaagggat tatgacccaa 360 

atgttagcac tggctattag cagctatttt catcatacag atgtgatgcg catttgtgca 420 

aatgtttatg ctggcaacat agcatcgatg agagtattag agaaaatagg ttttcgtaaa 480 

tgtggcatac atcgtaatgc ctgtttcaag aatggagtat ttacagattg ccattatttt 540 

gaattgctaa aagaggaatt taggaatttg gttaaatag 579 

<210> 1209 
<211> 708 
<212> DNA 
<213> B.fragilis 



<400> 1209 
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ttacttgcat ataacacttt acttctcaaa tggttttata tatctttgtg cgttgaacgc 60 

gatatatttc aatttattat gaacggattt tttcttttga aacgtccttt tatctggttg 12 0 

gcccgctttc gccaccgttg tggatatggg gttcattcac catttgcttt cgacttgata 180 

accaacgtca tttatgagcg taccccttac tatgcctaca gttcgttgga agccgaacag 240 

aaaaaaatgt cggcaaactc cggtaggaaa tggaagcatg aatcgaagaa ggtgaaccgg 3 00 

ttgctgtttc ggctggttaa ctatattcag cccgatacga ttgtggatgc cggaacatta 3 60 

tcggcatcgt ctttgtattt gcaggccgga catgctaaag ccgattatgt gggtgcttcc 42 0 

gatttgtcgg agctctttct ggagaaagat acgcctgtcg attttttgta tttgcaccat 480 

tatcggaatg aggagtttgt ggagcaggtg tttgatcttt gtgcatcgag aaccaccgga 540 

cgaggactct ttgttattga gggcatccgc tatacgaaga agatgaaagc actctggaaa 600 

aagatacagc aggacgaccg gacaggcatt acattcgatt tgtatgattt gggaattgtc 660 

tttttcgacc gtaccaagat aaaacagcac tatctcgtca acttctga 708 

<210> 1210 
<211> 204 
<212> DNA 
<213> B.fragilis 

<400> 1210 

ctaattttgt attcggataa aggcaactct ttccaagaca ttagagaaaa gaagaaacgt 60 

tatgtttatc ttgtaggtga aaacgtagga aattttcaga ctgatgtaag agatagcata 12 0 

gtggttctca cgcatatagc gtggaccgtt cttgttacat ctgcatcaaa ggtttcctat 180 

gaacttcact tagacgtggc atag 204 

<210> 1211 
<211> 723 
<212> DNA 
<213> B.fragilis 

<400> 1211 

tactacctgt attctcttat tacccattcc acagccattg aaggctcaac gcttacggaa 60 

cttgatacgc agcttctttt cgacgaggga gtaacagcga aagggaaacc gcttgtgcat 12 0 

catctgatga atgaggattt gaagcaagcg tatgaacttg ccaaaaccga atccagtagc 180 

cttgtacaga taactcctgc cttgctacaa agattgaatg caacactgat gcgcactaca 2 40 

agcagtgtac acagtgtaat gggcggttct ttcgattctt cgaaagggga ttttcgtcta 3 00 

tgtggtgtta cggctggtgt cggtggacat tcttatatga actatctgaa agttcctgct 3 60 

aaggtagatg aactttgcgc tatactgcaa gtgaagcaga agacagtggg gacacttcgg 42 0 

gaacaatatg aactgagctt caatgctcac ctgaatttag taaccataca tccgtgggtg 480 

gatggtaatg gcagaatggc tcggttgctg atgaactaca tccaattttg ctatcacctt 540 

ttcccgacga agatatttaa agaggataga gaagaatata tcctttccct acgccaatgt 600 

caggatgaag aaaccaatca ggttttcttg gactttatgg taaggcaatt aaagaaatcc 660 

ctctctttgg agattgaatg tttcaatgct tcacaaaaga gagggttcag ttttatgttt 72 0 

tag 723 

<210> 1212 
<211> 276 
<212> DNA 
<213> B.fragilis 

<400> 1212 

ctacaaaggt atcatccatg cgggagtagg aaacggcaac ttccacaaaa acattttacc 60 

ggtactgctg gaagcacgca agaaaggaat cctcgtggtt cgctcctccc gcgtacctac 12 0 

cggtcctacc acaatggatg ccgaagtaga cgatactcaa tatcagtttt tgcttctcag 180 

gaactgaatc cgcagaagtc acgtgtattg ctgattctcg gactgaccaa aaccaatgac 240 

tggaaacaga ttcagcaata ttttaatgag tattaa 276 

<210> 1213 
<211> 1380 
<212> DNA 
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<213> B . fragilis 
<400> 1213 

ttagcatatc atttaccgat tcagagacaa tttattcaat tatctttgcc aaaaataaaa 60 

aagaaagata tgattctaca attggctttt gtactgacag ctatcattat cggtgcccgt 12 0 

ctgggaggta tcggactcgg agtaatgggc ggcgtaggtt taggaatact tacttttgcc 180 

ttcggattgc aacccacagc tcctccaatc gacgtgatgt tgatgattgc cgcagtcatc 240 

tcggccgcct cctgcatgca agcagccggc gggctggatt atatggtgaa gctggcagaa 300 

aagttattgc gtaagaaccc gtcacatgtc accatattaa gtcccattgt gacctacctg 3 60 

tttacttttg ttgcgggaac agggcatgtc gcttactccg tattgcctgt gattgcagag 42 0 

gtagccaccg aaacaaagat tcgtccggaa cgtcccctcg gcatagccgt catcgcttcg 480 

caacaagcca tcacggcaag tcccatctcg gcagccacgg tcgccttact cggactgttg 540 

gccggtttcg acattaccct gttcgatatt ctcaaaataa cgattcccgc aaccattatc 600 

ggcgtactgg taggtgcact tttttctatg aaagtaggta aagagctggt agacgacccg 660 

gaataccaga aacgattggc tgaaggatac ttcaactcaa agaaaataga gattaaagac 720 

gtacacaata ggcgcaatgc aatgatatcg gtgttgattt tcatcttagc taccgccttt 780 

attgtatttt tcggctcttt cgacggcatg cgccccacat ttctgatcga tggcgaaaca 840 

gtcaccctgg gcatgtctgc cattatcgaa atcgtcatgc tttcggcagc tgcgcttatc 900 

ctgctgatca cgaagacaga tggtatcaaa gcgacgcaag gttctgtttt tccggcaggg 960 

atgcaggcgg taatcgctat ttttggtata gcctggatgg gcgatacgtt tctgcaaggc 1020 

aacatggggc aactgaccga atcgatcgaa ggacttgtcc gccagatgcc gtggttgttc 1080 

ggcattgccc tgttcataat gtccatcctg ctctacagcc aggctgctac ggtacgtgca 1140 

ctgatgccgt tgggtattgc tctcggcatt tcaccgtata tgctgatcgc catgttcccg 1200 

gctgtaaacg gatatttctt cattccgaac tatccgacag tagtggccgc catcaatttc 12 60 

gaccggaccg gtacaacgaa aatcggtaaa tacgtattga atcattcgtt tatgatgccc 1320 

ggactgatat cgaccgttgt agccatcgcg ctcggattgc tctttatcca gatattctaa 1380 

<210> 1214 
<211> 984 
<212> DMA 
<213> B. fragilis 

<400> 1214 

ttaaaaaccc aaacctggag atttatgaag aagttaatgt tactgaccct tttgagtacc 60 

tttatatttt acagttgctc ggatgatgat tcatgcacaa cctgtaagga ggataatgga 120 

agtttggtca cccccgattt gagcgttacc ctatccgata cacagagtcc gatgacgggt 180 

gtattggaag cctacccttg ccaggcagga ggtgccattt attacggcaa ttatatcgaa 240 

ggcaaactga cctcctttcc gggaatgtat tacctccaga acggagagat ctatggagat 300 

aagaacaggg aaatatctct cccggtgggc acttacaaca tgatatactg gggtaccccg 3 60 

aaatatgaag agctgattta cagcaacccg gtcgtcgtcg ccccccaaat cactatcgga 420 

ggagaccttt cacaacagta tttcgggctc cggaaagttt cggcggatac gacctattat 480 

ccagtattcg acttagtgta taccgtgaaa ccggcacata tcggcacgga agaactgagt 540 

gcagccatgc agcgtgttgt tgccggtctg aaagtaatcg tcaaaaacaa aaacaacggt 600 

atcctaagtt ccagtattgc cggcatggaa gtacatgtag gaggcattgc cgagaagctg 660 

aacatgtata cagccgctcc ggtcaaccaa accaaaacag tatctttccc gcttgtactg 72 0 

tcggcagacg gtacacagat gagcaatgcc acggtcatgc tttttccatc atccgccaaa 780 

ccaatgttca agctgatcat caagcttaaa aacggaaata ccaaagtcta ccagcaacca 840 

ctcaatgctc cgttaaaagc taataacaag ttgactctga cattaacctt gggtgatatc 900 

ttctcggaag aaacttccgg gggattcacc atcgataact ggcaagaaga gaacgaaaca 960 

atagatatac cgacactgga ataa 984 

<210> 1215 
<211> 252 
<212> DNA 
<213> B . fragilis 

<400> 1215 

gttcctgagc aacaaaaagt tgcccaggat tttgccatgt cagaattttc acttatctta 60 

gtgttgcaaa aagaaaacaa gcaaaactct aatatgacat ggcaaaaata caaattaaat 120 
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aaagtatgtt 


taagcctgct 


tatgggactg 


60 


gaaaaaatgc 


aatggttcaa 


cgaaccggaa 


120 


atgtccgtta 


ctccgcaaag 


tgattactgg 


180 


gacgcacctt 


tctattatgc 


cacttatggc 


240 


ggagagtata 


aagaacgttt 


cgatcaggcc 


300 


tacattaaag 


cgggtattga 


gtttgtcgat 


360 


cataaaacga 


gtgactggag 


tgtgataacg 


420 


aaggctgtca 


ggcggctgga 


tgcagtagag 


480 


acgctgatgc 


gtaatgcctg 


gttgcaggat 


540 


gcttgtcccg 


atggtagtgg 


attcaatgct 


600 


ccggaccagc 


gcagagtgga 


atggctgaag 


660 








675 



ctgagaaact cacacctttt ggaggaattt tttcaatcat ggagaaattt gactccatgc 180 
tttcacccgt tatcgactca acactgggtc agagatgcag cagtatcttc ggatatcagt 240 
tcagcgagat ag 2 52 

<210> 1216 
<211> 675 
<212> DNA 
<213> B.fragilis 

<400> 1216 

aataaacata atcaaaaggt agaaatgaaa 
atggtacaaa tgacctttgg gcagacactg 
caatgggaga taaaaaataa tgtattgtcc 
cgtatttctc actacggttt tacagtagat 
ggtgaatttg aagcgaaagt caaggttgtc 
ggtctgatgc tccgtatcga tcatgaaaat 
gggaaattta atttaagtac cgttgttact 
ttagataaaa cggtacctta tatctggata 
attttctatt catttgatga taaaacttac 
catattcctg tgaaagttgg actgatggcg 
aagtttgaat acttccaggt gaagcatctg 
aagaatgcag aataa 

<210> 1217 
<211> 690 
<212> DNA 
<213> B.fragilis 

<400> 1217 

attttaaata agtccagaat actacgaaaa tcaccccgtt ctgccctcgg gtacgcatgg 60 

ggtgattttt ttataaccca tagtaggatg ataaagaaga tgaaagggat ttggccagag 120 

gtatttcctg ccgtttttga agaagggggc ttgtatccat gccaccccaa gagggaactt 180 

ccacttaaga gggatggtgc caacccgaaa ctgaaaggca gaaccattaa tttgcagaat 2 40 

gcggttaaaa aatgcaaccg gttgtgccca ttgagatatg acagcattac agtcagtgaa 3 00 

ggacgtttta tgtttaatgg gaaagtgact gccccgcaag tgagggagct ctttattcag 3 60 

gagactgaca gtgaccgctt tcccgtcacc ctgcctgtgg tcttggatcc gggagagatc 42 0 

aacgcaaaaa taggagatat cgtactggtg gaaggatccg gactgaacga ggagatgatg 480 

caaaccttga tggctctcga cgaattcaga gggcgggact ttaccgggaa agagataaat 540 

gaaattaaag aggctttcgg cggattcgtg ctggaacaga ttgtgaaaca tgccggcagc 600 

cccgtaggaa actatctgta cgaggcctat cagaacaaac taaacgagaa gcaacaggcc 660 

gaggcacgga aaacgctcgg catcggttag 690 

<210> 1218 
<211> 372 
<212> DNA 
<213> B. fragilis 

<400> 1218 

ttaataaata aaggagttat gaagaattat caaaaaaaga gcgtagcaca agatgcacgt 60 

gtagagttgc atgacagtct tgccctgacg ggtgccgaag tatctatcaa tcatcttccg 12 0 

gccggtgccg gagtcccttt tgttcattca cataaacaga atgaagaaat ttacggcatc 180 

ctttcgggga agggctttat cactattgat ggcgaaaaga tagaattgca ggctggggat 2 40 

tggctccgta ttgctccgga tggaaaacgt cagatttctg ctgcatctga cagtcctatc 3 00 

ggttttattt gtattcaggt gaaagcaggc tccttggaag gttataccat gactgatgga 3 60 

gtcgtacaat aa 372 

<210> 1219 
<211> 945 
<212> DNA 
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<213> B.fragilis 



<400> 1219 

aacaaaaata agatgaaaga aaaagattta atccgtttta tggatcgcat gattgaagag 60 

cgaaaagcgg aatatgcatt aggtacggct cacatttatc aagcgagcag aaatgccctt 12 0 

tctgcttttc tgaaagcaca cgacattccg ttcaaaagag taaggcctga gttattgaag 180 

cagttcgaac ggtttctcag acggcgggga aacagctgga acacggtgtc tacttatatg 240 

cgggtgctca gggctgtcta taaccgggcg gtcgacaggc gtctggcacc tcacgtgcca 3 00 

catctgttca aagctgtata taccggtact caggccgata tcaaacgagc tttgaaagcc 3 60 

gaagaaatgg ggcagttgct cgacacgaag tgcacccgga agcaatcgga actattgcag 42 0 

aaaactcatc acctgttcgt gctgatgttt cttttaaggg ggcttccttt tgttgacttg 480 

gcttatatac aaaagaagga cctgaatggg aatatcctga cctatcatcg caggaaaacc 540 

ggacgtcaga tcaccattac agttactaaa gatgccatga atatcattcg ggaatatatg 600 

gatactacta cggagtctcc ctatttattc cctattctga gtgcagaggg aggagaggat 660 

accatctatc gggagtatca gcaggcattg cgcatcttca attatcaact gacaaaattg 72 0 

ggagaactgt tgggactgac taccgaattg acttcatata cagcccgcca tacctgggcc 780 

actttagcct attatttgga agtgcacccg ggtattatcc gggaggcgat ggggcactcg 840 

tctatcaaag taacagagac ttatctgaaa ccattcaata taaagaaact ggatgaaaca 9 00 

aatttaagta ttatcagtta tgccaaacga tcttttgagg gataa 945 



<210> 1220 
<211> 231 
<212> DNA 
<213> B.fragilis 



<400> 1220 

aatatgcaca atatggataa taaacaagaa agaacagttg ttcacgttga atataacgga 60 

cagcattact attttggctc actctctgca atttatacga aattcagtcc taaagacttg 120 

ggtatcgcat tggggacatt aagaaattat ggattgaaag aagaaaagcc gtaccagaac 180 

tctctgtgta ctataagaaa aggttttttg ataacgatgc ctaaaaagta a 231 



<210> 1221 
<211> 276 
<212> DNA 
<213> B.fragilis 



<400> 1221 

ccagtctgca aagataagcc ctttcctttg ttatcacaaa tgataactat agaaaaaatg 60 

aatgaacagc aactttttat taaaatagga gataaaataa aggaaataag gcttgaaaaa 120 

ggaataagcc aacaagactt ggcagctaaa tgcaactttg agaaagctaa tatgtcacgg 180 

attgaagcag ggcgcaccaa tctaacaata aaaaacgcat ataaaataag tcttgcttta 2 40 

ggagttagac taaaagacct attggatgta gaatag 27 6 



<210> 1222 
<211> 183 
<212> DNA 
<213> B.fragilis 



<400> 1222 

acagatctaa ataaaaagac cttggggcgt agggagtttc tcatcggatg ttgtctttat 60 

aggaaagaac ctataaaacc aacaaacatt atgaaagaat ttatgctgat cgcttctctc 12 0 

gtcttgtcat tctgcattct tattttatgt agagactata tcgtatttat gctgaaaaaa 180 
tga 183 



<210> 1223 
<211> 462 
<212> DNA 
<213> B.fragilis 
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<400> 1223 

cgtatggcaa agatagagaa taaaacgaaa gaaaacccca agttagagca aaataagctc 60 

tcggatggta gaatcagcct gtacttagag tattatttag gtagagaaga gaagcccgtt 120 

ttagatgcga atggcaatca ggtatattat gaagatggca aaatgcaagg caaacccaag 180 

ttttcggtga agcacaacag gcgaaaagag aacctgaatc tatatcttat ggataagccc 240 

cgtactcctg ccaaacgtca acaaaataag gaaacactgg agcttgccac aaagatacgt 3 00 

gccgaacgtg aacaagagtt taaagaaagt atgttgggat accgcctaaa gaaagattgt 3 60 

accatcaact ttcttgatta cttccaagcc tacatagaca gctatacaaa gaaagattgc 420 

gcatggtgca aattgcactt agccgtttca aagacttcct ga 462 

<210> 1224 
<211> 192 
<212> DNA 
<213> B.fragilis 

<400> 1224 

agtatccaaa atcacagcca aagtcactat attctaggaa aatgggaagg aagatgctgg 60 

gatatagaag acttcattat ttcattgaat tatcaaagca agttacagtt aacaatctat 120 

ccccaacaat atcttgccca gtcagaacgt gaagttcaat tacaaaccaa ctccaataac 180 

aatacaaatt ag 192 

<210> 1225 
<211> 2547 
<212> DNA 
<213> B.fragilis 

<400> 1225 

acaaatagaa aaatgaaaaa cgcaattgtt tccttactcc tgcttttgat ggtcacccag 60 

tatgtgacgg cacagaaaaa agtgattaag atagcctgta tcggcaatag tataacgtat 120 

ggtgtaggta cgcgcaatcc tgcgaaagac agttatcccg ctgtgctggg gcagatgctg 180 

ggcgacggtt atgaagtccg gaactttgga gtcagtgccc gtaccatgtt gatgaagggt 240 

gaccatcctt atatgaagga ggaacgctat cggcaggcat tggcttataa tcccgatatt 300 

gtgaccatca agcttggaac caatgatacg aaaccgcaga actggcggta caaatcggat 3 60 

tttaaaaagg atatggaaac gatgatacgg acgattcgcg ctttaccctc aaaacctgaa 42 0 

atctacctgt gttaccctat tcccgcctat gctgtacagt gggggattaa tgacagtacg 480 

attgtacacg gcgtgatgcc tgttatcgat cagctggctg ctaaatatcg attgaaagta 540 

atcgatctgc atactccgct gataggtatg aaagagtgtt ttgccgatca tgtgcatccc 600 

aatgaaaagg ccgctgcctg cattgcccgg gtcatttatc ggcaactgac gggtaaagaa 660 

gcacctgaac acgtctccca gcctttcccc ggtcataaaa gcaagtggca gggattcgat 72 0 

caatatactt ttacctatca ggatcgtcag gcgattgttg tttgccccga acgggcggcg 780 

gcaggtaatc cctggatttg gcgtcctgct tttttcggtg cttttgcttc ggtagatgag 840 

gctttgctga agcggggttt tcatgtggct tattatgact tgacccacct ttacggaagt 900 

ccgcgtgccc ggaagtcagg taccgatttc tattggaata tggtacagat gtacggtctc 960 

tctccccgtg tgacactcga aggctttagt cggggaggat tatttgctta taattgggca 1020 

gccgatcatc cggataaagt ggcttgtatc tatgtcgatg ccccggtttg cgatgtgttc 1080 

agctggccgg gacgttcgtc cggaaatgcc ggattatgga aaggaatgtt ggacgaatgg 1140 

ggattgacag aagcccggat gaatacattt cccggtaatc cgatcgaccg gttgaaacct 1200 

ctggcggatg cccgtattcc ggtgatttgt gtatgtggcg atagtgacag ggtagtgccg 12 60 

ttttccgaaa attcggcagt ggttcgtcaa cgttatacag caatgggagc tccgttcgaa 1320 

cttattctga aacccggggt ggatcatcat ccccacagtc tggagaatcc cactccggta 13 80 

gtcgatttta ttgttcgcca tcaggcaggc tatgaagccg gacaatgtta tacgctgaga 1440 

ggcaattatc agaattcata tcggaagttt gagaaagaac gggtgggtac ggttgctttc 1500 

ctgggaggct ccatcaccga aatgaaggga tggcgggata tgatttgcga agacttgaaa 156 0 

cagcgttttc cttatacaaa gttcactttt gttgcagccg gaattccttc gaccggcagt 1620 

actcccgggg cattccgcct gacggatgat gtgttgtcca aaggcaaagt cgatctgctt 1680 

tttgtagagg ctgcggtgaa cgatgacacc aatggattta gtgccattga gcaggtaaga 1740 

ggcatggaag gcattgtccg gcatgccttg gtctccaatc cgtcaatgga tatcatgatg 1800 

ttacatttca tttacgatcc ttttattccg aagttggaca aagggcagat gcctgatgta 1860 

attctgaacc atgagcgggt ggccaatcat tacctgcttc cttctgttaa tcttgcttct 1920 
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gagattgctg cccggatgcg gagtggtgaa ttcacatggg aacagtttgg cggcacacat 19 80 

cccaatcctt tgggacatgc ctattatgca gctaccataa acaaggtact cgatgaaatg 2040 

tatgcccctt gcgctactgc caaagatgct gccaagcctc atgctcttcc tgccgtgcca 2100 

ctggatgcat atagttatac aaatggcaga ttggtcgata tccggcaagc ccatataggt 2160 

aaaggttggc agttggttgc tccatggact ccccggcttg ctgccgaaac gcgtccgggt 222 0 

tttgtcgacg tacctatgct tgagaccaat cgtcccggag cgaagttaac acttgacttt 22 80 

gaagggactg ctgtcggtat cttttgtgtg agtggtccgg ctgccgggat actggaatat 2340 

agtgtcgatg gtgccccatt caaaaagttg gatacgttta cagcctggag tggcggactg 2400 

tatatccctt gggtgtatat gttcgatacg gagttaccga tgggaaaaca tcgtctgact 2460 

cttcggatgt cgaaagacca tcatccgcag agtaagggta cgtcctgcca gatcaggcag 252 0 

tttgtggtaa atgattcttg tgaatag 2547 

<210> 1226 
<211> 222 
<212> DNA 
<213> B.fragilis 

<400> 1226 

agaagaaccc ttgaactttg ggaccgggat gtggaacgtt ggattaactc cgaacgtgtg 60 

ccggtgtact ctcctattac ctacttcttg tatgacttgc ctcgttggga cgggaaggac 120 

tacatccggg cgctggcccg ctacgtacgt acctttcgac gaatcggcta taagcgaact 180 

gaagcgttat gcttcgttca tagcgaccag caatcacagt ga 222 

<210> 1227 
<211> 1194 
<212> DNA 
<213> B.fragilis 

<400> 1227 

acacttatga aaaaattaat ggccatgttg ctccttgcgg gcagcataca aggagtctat 60 

gcccaaaaga cggaaaagaa agagatgttt cttgaaaata aatcgttgta tgaagagctg 12 0 

accaacgtgc agaagaagac ggataagttc aatctgtatc tcaatatgca aggtagtttc 18 0 

gacgccaact tccgcgacgg tttcgacgaa ggagtattca agatgcgcca acttcgtatc 2 40 

gaagccaagg gcaacctcaa cagctggctc tcctatcgtt atcgccagcg tctgaaccgt 300 

tcgaacgagg gaggaggaat gatcgacaac ataccgactt cgattgacta tgccggtatc 3 60 

ggtgtaaagc tgaacgacca gttctctttc tttgccggta aacaatgcac cgcttacggc 42 0 

ggtttcgagt tcgacctgaa tccgattgac atctaccaat acagcgacat gatcgagaat 480 

atgagcaatt ttatgaccgg attgaacatc ggttataaca ttacacctac ccagcagctc 540 

aacttgcaga tcctgaacag tcgcaacagt tcgttcgaca agacgtatgg aatcaccgaa 600 

gactcggaag gcaaacttcc ggacctcaag tcgggcaaga tgcctttggt ctataccctg 660 

aactggaatg gtaactttaa tgaggtgttc aaaacccgct ggtcggcttc cgtcatgagt 72 0 

gaagccaaag gcaagaacct ctattattat gcagtgggca acgaactgaa tctggataag 7 80 

ttcaatatgt tcgtcgattt catgtattcg caggaaggca tcgaccgtaa cggtaccatc 840 

accgggattg tgggcaatgc cggcggacac aatgctttca acgccggcta cttgtcggta 900 

gtgaccaagc tcaattaccg tttcctcccc aagtggaatg ctttcgtgaa aggcatgtac 960 

gaaacggcct ccgtcaccaa agcagccgac ggcattgaaa aaggtaacta ccgtacttcc 102 0 

tggggctacc tggcgggggt agagttttat ccaatgaaga ctaatttgca cttcttcctg 1080 

acctacgtag ggcgttcata cgacttcaca catcgtgcca aagtactggg acaggagaat 1140 

tacagtacta accgattgtc tttaggcttc atctaccaac tgccgatgtt ctga 1194 

<210> 1228 
<211> 189 
<212> DNA 
<213> B.fragilis 

<400> 1228 

gtgacaagga agaacaatat tggccggaac atcctatctt cgcaggaaat aagtgact cc 60 

tatgaaagaa agaatattaa atatagagac cgtccatcaa tgcaactgct gcctgggctg 12 0 

caaaacactc catccgctgg taagtgtaat cgacctgtca aagagcgatc tggaacagca 18 0 
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aattattaa 189 

<210> 1229 
<211> 537 
<212> DNA 
<213> B.fragilis 

<400> 1229 

acagaaagga aaaaaaatat ggaaaatgtt attcagatac aatattatca atcgccatgt 60 

ggagaactga tattgggggc atatcgggaa aaactttgct tatgtgattg gaagatagaa 12 0 

gaacgcagga teat cat cga cagaagaata caaaaagagt tgeaagctte ttataaggag 180 

ggcatatctg aagtaatcac acgaacgatc ggtcaactgg atgaatattt tgeeggaega 240 

agaactacat tcgatattcc tttgettett gtaggtactg attttcagaa aactgtttgg 3 00 

aacgaactgt tgaacattcc ttatggaaaa acaatctctt atgcagggtt gtctcaaaag 3 60 

ttggggaatc ctaaagctat ccgtgccata gcttccgcca aeggagegaa ccctatctcg 420 

atacttgttc cttgtcatcg tgtgatcggc agtgaccgta aattagtagg gtatggcggt 480 

gggctgcctg ctaagaagat ettgettgae ttggagtctt ccgataggtt attctaa 537 

<210> 1230 
<211> 603 
<212> DNA 
<213> B.fragilis 

<400> 1230 

tgtgttaacc ttgacagaaa taataaacat actatcatta tgaaaaagag tcttgtatat 60 

acaaaaaegg gtgataaggg aacgaccggc ctgataggcg ggacgcgtgt tccgaaaacc 12 0 

catatccgtc tggaagcata tggaacggtc gatgaactga attcgaatct gggcttgctg 180 

gcaacttatt tgatggacga gcatgatttg aattttgtgc agtccgtgca ggataaattg 240 

tttgecateg ggtegcatet ggecactgat caggagaagg tgcaattgaa tgatgtcagt 300 

attattactc ccgctgaggt ggaggctatt gagegegaaa tcgatgccgc cgacgaaatt 3 60 

cttccacctt tacattcttt tattattccg ggagggagtc gtggctctgc ggtttgccac 42 0 

gtttgccgta ccgtttgccg gagggecgaa cgccggattc ttgeattate cgaaagctgt 480 

acaatctcag ccgatttact ggectatate aacegtttat eggattattt atttgtcttg 540 

tecegtaaaa tgaattttaa tgaaggaaaa gacgaaatat tttggaataa tagttgcaag 600 

tga 603 

<210> 1231 
<211> 237 
<212> DNA 
<213> B.fragilis 

<400> 1231 

gaaattaatg atcaccttac ataccacata cccgcaatct atgccctaca cagcaaatgt 60 

acagaatttc eggatgaaac cagceggaga gttactaaaa ttactttcta ttttttgttt 120 

ttacctaaag atcttaccac agctaaaaat attaatatcc gaattttctc tactgecata 180 

gacttccatt atatccaatc cctacagtcg ctcacgtttc ggagecatae teegtaa 23 7 

<210> 1232 
<211> 279 
<212> DNA 
<213> B.fragilis 

<400> 1232 

cgcagagccg acccacgcac gctacccgaa cttctgatag aaaagcatga cattctaatt 60 

gaaaaaattg geacaeggtt gaaaacccat gecacagaaa cagatatagc acggttgttt 12 0 

ategcattag tggaataccg tttcatgegg aagtgtccca tcaagacttt cagaaatgee 180 

tgtacaacca gtttaatgaa caagaaatcg ttcatgaaag aggtcttcag aaagcataca 240 

aaaatctcat ttcgccactt ggaaaeggea aaaagttga 279 



497 



<210> 1233 
<211> 519 
<212> DNA 
<213> B.fragilis 



<400> 1233 
atattattag 
tacttctcat 
gcagatctgt 
gataagaaaa 
ggtacgttgt 
tatattgccc 
gttgttccgt 
aaagcatatc 
gatctggtta 



aattagatca 
gtagcggcgt 
atgaaattaa 
gccgtagttc 
ttcatccgga 
ctactataat 
tcgctacatc 
cggatatcgt 
cggaatggtt 



aggagaaaga 
aactaaagct 
gccggaggtt 
ggtggagatg 
agagtacgaa 
taatacattt 
gggaggcagc 
gtggaaagat 
tgaaaagatt 



gttatgaatg 
gtggcagaga 
ccttatacgg 
agagatgctc 
gttctgtttg 
ttggaaagtt 
ggcataggaa 
ggaaagcttt 
aggttgtaa 



acagaaagat 
aattggctgc 
aggctgacct 
tctcacgtcc 
taggctttcc 
atgactttgc 
attgtgaaaa 
taaatggacg 



tttagtagcg 
aattactgga 
ggactggaat 
tgccatttcc 
ggtctggtgg 
cggtaaaata 
gaatcttcat 
gataacgcgg 



60 

120 

180 

240 

300 

360 

420 

480 

519 



<210> 1234 
<211> 1347 
<212> DNA 
<213> B.fragilis 



<400> 1234 
aatgtggcag 
gaatccccgg 
gtagatgtca 
ggggctgaag 
tatacagcca 
atctctaccg 
tggctggccg 
ggcctgaagt 
cctctttctg 
gtcaccaaac 
ctcgacaagg 
gaagcccggg 
gacgagctgg 
ggcgtgttgg 
tctgcttctc 
caggaaatcg 
gaacaccagc 
ctgctccacg 
acgttgaggc 
atcaccgaac 
ccccgacgcc 
cattctgggg 
ggacgaggaa 



atttccgcag 
cgcttgcgtc 
tattaccttt 
aggtgcaaat 
ttgttcgcaa 
tacttgacac 
attattatct 
tggagagcga 
aacgcgagca 
ttgagaaaga 
aagctctgtt 
tccggctggc 
agcgcgctcc 
gggacggtgc 
ccgctatttt 
ggcggttgaa 
aacgggctta 
gagtgacttc 
aaggcaggca 
ggctgaaacg 
gagagagtag 
gtacgctctt 
cacgaaaata 



cttacatttt 
ggggattttt 
gcctctgccc 
aggttgccgg 
tgtgcatcac 
ctctccgata 
ttgtacgcag 
gacgattgtg 
actggtgctc 
gagcggattg 
tgtaaaagaa 
ggcagacgcc 
gaaacagttg 
atccaaagaa 
caacggattg 
ccgtttggtc 
tcatgaattc 
cagcggaaag 
ggtgctttat 
ggtgttcggt 
agatctggca 
cggtgttcct 
cgtataa 



ctaagaagac 
tgtatctttg 
cgttgcttta 
gtagttgtac 
tatgcaccga 
ctgctgcccg 
ggtgatgttt 
gagtataatc 
gacctgcttg 
aaaaacattc 
gagctccgcc 
tccggtgaag 
gcgttgttga 
gtgtccaaga 
gtagaaaaac 
ggaaagacgg 
atgcagagct 
accgaagtat 
ctgttgcccg 
tcccgggttg 
aaagcagttg 
cccgtttcgg 



agaatcattt 
tcctcacaat 
cctattccct 
ctttcgggcg 
ccgaatatga 
gtcagttccg 
acaaagccgc 
ccgactttga 
ccaaagaacc 
tcaccgttat 
gcacttataa 
agaatctccg 
tgaagtatgt 
aagaacttct 
agatattcga 
tagaactgaa 
ttcaggagaa 
acatccacct 
aaatagcatt 
ggaatctatc 
acggaggagg 
aatctcggac 



tatgataaac 
gaaaaagtat 
tccggacgaa 
gaagaagtac 
agtcaaagag 
gttctgggaa 
attgccttcc 
ggcggatgct 
cgagcaatgt 
caagtcgctg 
acccaaaacg 
gcgtatcttt 
ggagctttcc 
gcaacgtgcc 
ggtctattat 
cgtgctgaac 
gaatgtctgt 
gatagaagag 
gactacccag 
actccaagtt 
gttacgacat 
tggtcattgt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1347 



<210> 1235 
<211> 987 
<212> DNA 
<213> B.fragilis 



<400> 1235 
aaaactaaag 
actgttgcac 
aaaaaaggtc 
acggagaact 
ggattgggca 
ggagatttga 
acagaccgtt 



aagaatatag 
ctgtcatggc 
agtggcaagt 
atctattgcc 
ataatacgtc 
attctaataa 
gggatgtcaa 



tatgataaag 
gcaacaatac 
atccatggta 
tacctactgg 
cggaaaccaa 
tttggtcaat 
cctgatgttt 



aaactatgca 
age age age g 
atgggtagtt 
aacggacaaa 
tcctccgatc 
ateateggta 
agcatgaata 



taatccttct 
atgatgcgtc 
cgcaaatgtt 
actttagctt 
eggctactta 
tacagggaaa 
teggegtaac 



gtctgtatgt 
gtttgetect 
caacaacaat 
tccgaatgtg 
ccttcaactg 
atatttcctg 
geccaaaaag 



60 

120 

180 

240 

300 

360 

420 
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gattatatcg agggagacaa gacggttacc gatatgcaga ttccggccct tcaatatctg 480 

gaaggcagaa tcaaaaacaa ctggtcggtt aatattggct caaactatta cttcaatacg 540 

aagaacgaac gcatcaatct gtatgtaggc ggcctgttgg gctggcaaat gggaagaata 60 0 

gaaaccacgc ttccgtatac tggaatcatg gtatctgata aggatatgaa cacagatggg 66 0 

acggataccg atttacagcc gactcccgat gagaacggcc aggacaatgg tccggtcgtg 720 

gacgataacg acgtcactgg tacacctctg gaggtttata tccccaacag cagagccgga 7 80 

cagatattcg gtttgcgtgc ggcaaccgtc gcaggcatcg agtacagtat aggcaaggga 840 

ctgattctgg gatttgaagt tcagccggta gcttaccgct atgacatgat tcagattatc 900 

ccgaaaggaa ccccggtcta caaggtgggg catcacaaca tcaacttgtt tgcattgccc 960 

aacctgaagc tcggatttag attttaa 9 87 

<210> 1236 
<211> 972 
<212> DNA 
<213> B.fragilis 

<400> 1236 

gtcatgaaag aattaaaaag actaagcttt gtagtggtca cactactact ttccacgatg 60 

atggctttcg cgcaaaagcc taatattcac atccttgcta cgggtggcac aattgccggt 12 0 

acaggcggtt ctgccacttc caccaactat acggccggcc aggtagcaat cagtacgctg 180 

ctcgatgcag tacccgaact caaggatatt gccaacgtga ccggtgagca aattgtacgt 240 

atcgcatcgc aggacatgag cgatgaagtg tggctgatac tcgccaagaa gatcaaccaa 3 00 

ctcctgaaac gcccggacat cgacggtatc gttatcactc acggaacaga tacgatggaa 3 60 

gagactgcct atttcctgaa cctgaccgta aaaagtaaca aacccgtggt acttgtagga 42 0 

gccatgcgcc cttctactgc gctgagtgcc gatggcccgc tgaacctcta caatgccgta 480 

gtcactgccg gagccaaaga atctatcggc aaaggtgtgc tgatagccat gaacggactg 540 

attctcggag ctgaaagcgc aataaagatg aatacgatcg acgtacaaac tttccaggca 600 

cccaactccg gtgcattggg ctatatcttt aacggaaaag tattctataa ccaggctccg 660 

ctcaagaaac atacgaccca atctgttttc gacgtaacca acctgaactc tcttcccaaa 72 0 

gtaggcattg tctacagcta ctcgaacatc gaccccgata tggtgacccc actgttacat 780 

catgactaca aaggtatcat ccatgcggga gtaggaaacg gcaacttcca caaaaacatt 840 

ttaccggtac tgctggaagc acgcaagaaa ggaatcctcg tggttcgctc ctcccgcgta 900 

cctaccggtc ctaccacaat ggatgccgaa gtagacgata ctcaatatca gtttttgctt 960 

ctcaggaact ga 972 

<210> 1237 
<211> 1179 
<212> DNA 
<213> B.fragilis 

<400> 1237 

aaaaaaggag aaaaaaagaa taaaagatcg cataaacact tctattttca gagaataaac 60 

cgtattttag cagaaacatt taaaacagtc aatatcatga caagcaaaga taattattgt 12 0 

gtcattatgg gcggaggtat cggcagtcgt ttctggccgt ttagccgcaa gacaatgcct 180 

aaacagtttc tggatttctt tggaacaggt cgttcactgt tgcaacagac tttcgaccga 240 

ttcaacaaaa ttattcctac ggagaacata cttatcgtaa ccaatgcgat atacgcagac 3 00 

ttggtaaaag aacaacttcc ggaattagat ccaaaacaaa tcttgctgga accggcaaga 3 60 

agaaatacgg ctccgtgcat tgcatgggca tcatatcata tacgtgcttt aaatccaaat 420 

gccaacatcg tagttgcccc ttccgatcat ctgatcttaa aagagggaga atttttagcc 480 

gctatagaga aaggactgga ctttgtatca aaatctgata aacttctcac tttaggtata 540 

aagcccaatc gtccggaaac cggatacgga tatatccaaa tagcagagca ggaaggagac 600 

aacttctaca aagtaaagac atttactgaa aaaccggaac tggaacttgc taaggttttt 660 

gttgaaagtg gagagttcta ttggaattca ggccttttca tgtggaatgt caatacaatc 72 0 

attaaagcag gagaaactct tctaccggaa ttagcatcta agctggctcc cggaagagag 780 

atttatggta cacctgaaga aaaagacttt atcgaagaaa acttcccggc atgccctaac 840 

gtttcgatag acttcgggat tatggaaaag gctgataatg tatatgtctc tttaggagac 900 

ttcggatggt cagaccttgg aacctgggga tcattatatg atttatcacc taaagacgaa 960 

caaagaaatg taactctaaa atgcgactca ttgatttaca acagcaatga caatatcgtt 1020 

gtattaccca aaggtaaact tgcagtgata gaaggtctgg aaggtttttt ggttgccgaa 1080 
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tcagataatg tattactgat ctgcaaaaag gacgaagaac atgccatacg caagtatgtg 1140 

aatgacgcac aaatgaaatt aggagaagat tatatttag 117 9 

<210> 1238 
<211> 594 
<212> DNA 
<213> B.fragilis 

<400> 1238 

ttatttatgg aaagtgaaaa agaaaaaatg ggtacaggca ggctttacga tgctaattat 60 

gatacagaat tgatagccga acgtcaggct tgcaaagagt tgtgttatac cttaaatcat 12 0 

ttgcctccct cgcagatagc tgaacgggag gccattatcc gtcggttgtt ttgcaagacg 180 

aaagaacgtt ttctgttgga acagcctttt tattgtgact atggctataa cattgagatt 240 

ggtgaaaatt tctatgccaa tatgaactgt gtcattctgg atgaggctaa agtaacgttc 3 00 

ggtgataatg tctttatcgc tccatcctgt ggcttctata ccgcgggtca tcctttggat 3 60 

gtggaacaga gaaatcgagg gttggaatat gcccgtccca ttcgtgtcgg aaataatgtg 420 

tggattgggg cacaagtgtg cgtattaccg ggcgtgacga ttggtgacaa cacagtgata 480 

ggtgcgggaa gtgtagtaaa tagagatatt cctgccaatg tgattgctgc gggtaatcct 540 

tgtcgcgtga ttcgggaaat tacggaagaa gataaaacaa aatatttatt atag 594 

<210> 1239 
<211> 1389 
<212> DNA 
<213> B.fragilis 

<400> 1239 

agacatatgg aaaaacagaa taatcatata gaccgaagag gattcctgaa aattgtgggc 60 

atcagtgccg ctacaacgac agcggctctt tatggctgcg gctccggaac taaaagcagc 120 

caaggacgga atgcctcctc tcctgttccg acagaccaga tgacttaccg ctcagtaggc 180 

ggaatcaaag ataaagtatc cctcctggga tatggctgta tgcgttggcc taccgttcct 240 

tccccggaag gaaaaggaga ccttatcaat caggaagctg tcaacgaatt ggtagactat 3 00 

gccattgctc atggagtgaa ttatttcgat acatcacctg tatacgtaca gggctggtcg 3 60 

gaaaaagcaa ccggtatcgc tctcaagagg catccgcgcg agaaacttta tatagccacc 420 

aagctatcga atttctctaa cttctcacgc gaaaactcac ttgcgatgta tcatcaatca 480 

ttcaaggata tgcaagtgga gtattttgat tactacctgc tgcacgccat tggcggaggc 540 

gggatgaagg tattcaacga gcgttatatc gataacggta tgctggattt tcttctcaag 600 

gaacgagagg ccggacgcat acgtcatctc ggctggtcat tccatggtga cgttgaggtt 660 

ttcgaccagg tacttgccat gcacgatacg gcgaaatggg attttgtaca gattcagctc 720 

aactatgtgg actggcgcca cgcaaccgga aacaatgtaa atgcggaata cctgtacggc 780 

gaactggcca aacgaaatat tgccgctgtg atcatggaac cgctattggg cggacggtta 840 

tcgaatgtac cggagcacat cgtgggacgg ctaaaacaac gacgtcccga agacagtgtg 900 

gcatcgtggg cattccgctt cgccggttca ccggaattgg tactgaccgt attgagcggt 960 

atgacttata tggagcactt acaggataac atccgcactt attcaccact ggttccgctg 1020 

accgatgacg acaaagagta tctggaagaa accgcgcaac tgatgatgca ataccctacc 108 0 

atcccctgta atgactgtaa atattgcatg ccctgtccat acggcatcga tattcctgcc 1140 

attctcgtac attacaacaa gtgtgtcaac gaagggaata ttccgcaaag ccagtcaagt 12 00 

gaaaactaca aagaagcacg acgcgctttc cttgtaggct acgaccgcag tgtacctaaa 1260 

ttgcgacaag ccagtcattg catcggatgc aaccaatgta ccccgcattg tccgcaatcc 1320 

atccacatac cggaagaact gcatcgcatc gatcgttttg tagaacagct caagcagggg 1380 

acactatga 1389 

<210> 1240 
<211> 186 
<212> DNA 
<213> B.fragilis 

<400> 1240 

aaagcaatgg aacaaggcgt ttggcaagag atagaacagt tataccaaaa gtttcagaaa 60 

cttggtatca atgaagcggt ggactatgat agtactacct gtattctctt attacccatt 12 0 



\ 
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ccacagccat tgaaggctca acgcttacgg aacttgatac gcagcttctt ttcgacgagg 180 
gagtaa 186 



<210> 1241 
<211> 201 
<212> DNA 
<213> B.fragilis 



<400> 1241 

cttgttattt gcgcacaaga acctgacaac ctgcaaaaga cgttaacaat gttaattgaa 60 

gagaggtata aggatgaaga caccggttca ggcggcgtaa actcacttcc gaaacttgag 12 0 

ctatcttatt cagccggtgt ctgttttttc ttattaaagc aagcaaaaag gacaattatc 180 

aacttgaaaa taaagaaata a 2 01 



<210> 1242 
<211> 1158 
<212> DNA 
<213> B.fragilis 



<400> 1242 

aaagactccc gttttatttt tgggtatccc ttgcgttggc ttcgtgtcag ggggggcaag 60 

aaagcagtga atcaggctct gcctgtgatt gatatgaatg aagattatcc cgaaaaggag 12 0 

atcgtgttgc aggatattgc tgacataagc tatattcctt tggagactaa cgacgaattt 180 

ctgttcgacg gttcggttga agtggtcacc gatcaatatg tgataaccaa aggacatcgg 2 40 

ggaaacgacg tctgcttctt cagccggcag ggaaaagcac tcaaccgcat ccacagggtc 3 00 

ggtaacggcc cgggtgaata caaggatatc ggttcgatgg atgtaaaccc ggcgaacggc 3 60 

gaactttacc tgaaggagat gaaccgtcag cagattcacg tctattctct ggacggaaag 42 0 

ttcaagcact cttttacttt ccctgaaggg aaacggatga gtcgcatgtg cctgttctct 480 

cccgactatt tgatagcgga acaggagtca aaggtgccgg acgatcagga tgccaacttc 54 0 

tatccttatc ttttggtctc tacccgggac gggcatctgg attcactgga ctatgtgcag 600 

aaaagaaata ttctcgtcaa gcttattgtc aatgcggaaa accattcata cgcttatctt 660 

ctggaaccct ctttgattcg taatggctcc cgcttttata tcggcaatcc cgactcggat 720 

accttgtttg caatgaatcc ggaccgtacc ttagagccat tgctcgtccg tactccttca 780 

cattcggagg agggaaacaa gtatggtttg tttttacggg gagcggcggg ggcttatttc 840 

tttctgacta agcaacctat ggaagtgccg atgaacagta tcgagtcatt ggatctgaaa 900 

agtgaagagt ggctgtacga ctgtcgcacg caggaagtct gccgttactt gttgaagaat 9 60 

aaagacgatg cttcgaaacg tgtggacggt atcatgttct tttgctatcc cgaagattgc 102 0 

ggcttggctg ttctgaagtc cgaagacctg atggatgctt acgaagccgg tcagctgagc 1080 

ggtgaattga aagagatagc agccggcctg aaagccgatg acaatccggt attgatgttg 1140 

attcacttca aaaagtaa 1158 



<210> 1243 
<211> 234 
<212> DNA 
<213> B.fragilis 



<400> 1243 

atacgaaagc ttatgtattg gacattggaa ttagcatcta aactggaaga tgctccttgg 60 

ccggcaacta aggatgaact gattgattat gccatgcggt cgggtgctcc tcttgaagtg 120 

attgaaaacc ttcaggaaat ggaagatgaa ggcgaaatct atgaaagcat agaagatatt 180 

tggccggatt accccagtaa agaggacttc ttctttaacg aggaggagta ttga 234 



<210> 1244 
<211> 786 
<212> DNA 
<213> B.fragilis 



<400> 1244 

tctaagatta gcttcttcac ttcacggatg gactcttcgg gcgaaagccg ggatacttgc 



60 
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ttttttgtca 
aaaagaaaac 
aacgttggat 
acttgcctcg 
ttcgacgaat 
cacagtgatt 
ggcaggatcc 
gaactccgtg 
aacaaccagg 
tctgccgatg 
caaaagaaaa 
cgtaaattgg 
ttgtag 



actataaaag 
ttaccaccaa 
taactccgaa 
ttgggacggg 
cggctataag 
tactctccga 
gtaatggtgc 
agggtgagtg 
aatttgagct 
agggggaaaa 
atggttttaa 
gagtgccttg 



agggcttaca 
atttgaagaa 
cgtgtgccgg 
aaggactaca 
cgaactgaag 
tccttcggga 
tgccatcaac 
cttttatttc 
gcaaaccccg 
gtgcgaaacg 
actgtcggct 
caagaaaatg 



cttccgaaaa 
gaacccttga 
tgtactctcc 
tccgggcgct 
cgttatgctt 
agccgccgct 
tatctgcaac 
tctcccgaag 
gttggacacc 
ctgctcgctg 
acgaaaattg 
aaaaatggga 



ctgaattaga 
actttgggac 
tattacctac 
ggcccgctac 
cgttcatagc 
tcatttgtat 
cctatgcgca 
aagaggccgt 
tctttcaaca 
tcgatatttt 
tgcatttggg 
atttctattg 



ttccatgagt 
cgggatgtgg 
ttcttgtatg 
gtacgtacct 
gaccagcaat 
caatatcacc 
ggcagtgcag 
tttgatggag 
gtattttcgc 
ggggcgcata 
caggatactt 
tgtggtggag 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
786 



<210> 1245 
<211> 855 
<212> DNA 
<213> B.fragilis 



<400> 1245 
ttacttccaa 
cttagccgtt 
cctaaattta 
gttggcgaag 
gaacacgatg 
atgttacgga 
gagaatgaga 
ttctgtgatg 
tttgaacaga 
gatggtttac 
tttgacctgc 
ggaattgata 
aacaatggtg 
acgaagaaat 
gagttaaagc 



gcctacatag 
tcaaagactt 
tcacaaagga 
gagcgaaaag 
tgatactaaa 
aagatgtgct 
acccgatggt 
tgaaagacct 
ataagaccaa 
tctctatcat 
cgacttatga 
agcacattag 
ctaatattaa 
acacccgtgc 
tttga 



acagctatac 
cctgaaagag 
tatgatagta 
tatctatcag 
agacccgtgc 
ttcactggaa 
aagacgtgcc 
tacttataag 
agggcattct 
cggtgaagct 
aagttgctcc 
ctggcattgt 
aacaatagcc 
agtggataag 



aaagaaagat 
caatatcctc 
cagtttgtgg 
cgtttcaaga 
aaagaagtaa 
gaaattcagt 
tttatcttct 
aatgtagact 
tctgccagtg 
cctgcgaatg 
aaatcattaa 
gcccgtcact 
ggtttgctcg 
ttgaaggaag 



tgcgcatggt 
tttatgaatg 
actacctgca 
aagtgatacg 
tttgcaaggt 
ctcttatcaa 
gcctgtattt 
actcaaacaa 
gcgtagtcat 
ggaacaaaga 
gacgttgggt 
catttgcagt 
gacatagcgg 
aggcaattaa 



gcaaattgca 
taacatcaaa 
atcccgtagt 
ctatgccatc 
agacgaccca 
ctgccactat 
cggtttacgc 
gctattgaag 
tcctttgaat 
tgttcttatt 
aaagagggca 
gaatatcctt 
gttgaagcat 
tagtttgccc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

855 



<210> 1246 
<211> 2427 
<212> DNA 
<213> B.fragilis 



<400> 1246 
attgcgacaa 
catccacata 
gacactatga 
ggaaacggaa 
tttcggcaag 
gcagcagcaa 
caacctgcac 
cctttcatcg 
atagaatcca 
aagaaaaacc 
caagtccgga 
accggcacga 
cgtgaaacca 
cccggattgt 
ggcggtatca 
gacgggcatc 
atgctggccg 



gccagtcatt 
ccggaagaac 
tggaaaagat 
cagagatacg 
acccgtcctt 
gtctaatggt 
ttgcactcct 
aaaaccggga 
tacaggaaat 
tgctcggcat 
aagatacgac 
ggaatgaaac 
tcgggcaacg 
tcaccaccag 
gcctgcgtgg 
cccaatacat 
agaaagtgga 



gcatcggatg 
tgcatcgcat 
aattgatctt 
aaccttcacc 
tatgaaagga 
attgggagga 
gtgcaatgcc 
taaaagcgga 
attccggatc 
tctattggta 
acaagcaggg 
cgatatccgt 
ctccgaacct 
tcgtggcgtg 
tataggtggc 
gggattgatg 
agttgtgcgc 



caaccaatgt 
cgatcgtttt 
ctacactccg 
caacggggcg 
gccggcatcg 
atccggcagg 
aatatagaag 
tggtgtccgt 
atcgaaaact 
tgcgcttttc 
cataattatg 
cacctgccaa 
tccctgctcc 
atgggatatg 
agccccacta 
ggacatccca 
ggccccgcat 



accccgcatt 


gtccgcaatc 


60 


gtagaacagc 


tcaagcaggg 


120 


gaggatattc 


ctgtgtcatc 


180 


tcgccgattt 


atatgactta 


240 


ccgacaaggt 


aattggaaaa 


300 


tatacgcgga 


tgtcatcagc 


360 


tgagttatgt 


acggctggtg 


420 


tggaaacagc 


ttgctatgga 


480 


tcctttccaa 


gatcaggata 


540 


tcagttcatc 


tttacaggca 


600 


aaatagacgg 


cgtggtagtg 


660 


tgacaatctc 


cgtggtgaac 


720 


ccacattgac 


agaacaggtt 


780 


gagtgtcaaa 


cggtgcggca 


840 


caggattatt 


ggtattgatt 


900 


ttgctgatgc 


ttaccaatcg 


960 


ccgtacttta 


tggctcgaat 


1020 



502 



gccatgggtg 
ggagtccgac 
aaaaaaggac 
cccgatatgg 
agctggaacg 
gtcagtacac 
ctggaaaaca 
catcacatta 
aaggaccgga 
ttgactgcag 
gacaaacata 
ttccgccagg 
accgtcaccg 
gcaagcctga 
atgttccgac 
tattcacaac 
ggggacaata 
aaagtcgaga 
ctgacagcta 
aaactatata 
tatgtgacag 
tggaatcttc 
aatctgctgg 
atgggaggaa 



gagtaatcaa 
tgggatatgg 
gtttcaacag 
aatttgagca 
tatcggccga 
cggtttttga 
gatacgaacg 
atgacggata 
tgatgggaat 
gaatagatta 
aagaaggcat 
caatcggcag 
gaacggaatg 
aagcaatggc 
cggccaatcc 
gcttactcaa 
tgattcagac 
attggggagc 
actacagttg 
caggcatcga 
gactctatac 
gcggcagtta 
cacaacgtta 
ttaatattaa 



catcgtaacc 
ttcgtatgac 
cattatcacc 
atatggggga 
tctgaacctg 
taatgactcg 
tacatcgggg 
tggtacggga 
ctcctggtat 
tatgcaattc 
ctcggataaa 
ttacctgaca 
gataccacaa 
cagtaaagga 
cgacttgttg 
aggtactttt 
cattcgcaca 
agaagccgac 
gcttcatatg 
tttcacacag 
ggcagtcgat 
tcgcatttgc 
tgaaataaac 
cttttaa 



cgccggcaac 
acatggatga 
ggatcttata 
tatacgaggt 
acacatttca 
cgcattacac 
gcactcaagt 
gaagaaccgc 
cagagtgcct 
ggcggtgaag 
tcggaaaatg 
atggatgcag 
gtcggtttgt 
ttccgtaacc 
ccggaacggc 
tattacggag 
gacggacgcc 
atagcatatc 
gagcatcccc 
aagaaatgga 
cctcaggaaa 
tctattgccg 
gcaggttacc 



gggaagacgg 
ccgaagctac 
accggacaaa 
tgggttatga 
atgcttccaa 
gcggcatgac 
ttttctataa 
tagactaccg 
ctttattttc 
cctggaaccg 
agattgccgg 
ggctccgtat 
cggtacaatt 
caacgattcg 
tgtggaatta 
tcaacctttt 
cactgaacgt 
acattcaccc 
tgatagccgc 
gtttctccac 
aaaaagaaaa 
acctgtttgt 
ccatgccgaa 



agtgagaacc 
caaccaagta 
cggtcatcgt 
actggatcac 
tccgggaact 
ttcttttgct 
ttggggcaaa 
cttcaattcc 
cggcaaccga 
tttcattgcg 
atatctcgat 
agatcaccac 
gccacaagac 
tgaactgtat 
cgaactatct 
ttacatcaac 
aaacacaggc 
catgtggaga 
acccgaacat 
cggaatacaa 
cttcctccta 
aaaaggggag 
ggctacatgt 



1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2427 



<210> 1247 
<211> 501 
<212> DNA 
<213> B.fragilis 



<400> 1247 
cacaagagaa 
gtcacattcc 
atatcacaaa 
tatcatattc 
tcccgtctta 
aagttgctta 
aatttcatag 
ttaggattca 
gcagaaacat 



gtattatcta 
tccttttaat 
aggcggactt 
cggctcaact 
aacttgtcaa 
tggtaaaata 
attatctcac 
gcgtatcagc 
ttcctaagta 



cacgttgaac 
tggtatagta 
gtcgaggtat 
gatggaatat 
tgatatacgt 
tggagtagat 
tggtcgcccg 
agacattaga 



ctaaacaata 
gcatcatctt 
gaatatgcct 
gaaatacaac 
attgacgaac 
attttagaag 
atagcttcat 
ggtgctataa 



aaaaaatgaa 
gtacaacctc 
caattatcaa 
tatttgatgc 
tttctccgaa 
aagaatctgt 
gccgtggcgc 
aacgtgttgc 



taaagtaata 
taaaagtgta 
caacgacact 
catagagagt 
tcaacaatca 
cgtaactgta 
atatacaacc 
taaacaaata 



60 

12 0 

180 

240 

300 

360 

420 

480 

501 



<210> 1248 
<211> 2151 
<212> DNA 
<213> B . fragilis 



<400> 1248 
tcgaattatg 
tttacaacag 
tcatttggac 
tacgatgaaa 
gatcgcagat 
tt tatccgcc 
cccggcgcca 
tt ctattttg 
ttcggtgtac 
agcctgattt 
actcccgaac 
ccggtcgaaa 



gcatgaacac 
gaatgataag 
agaagaagtc 
tgacccgcta 
ccgtttctcc 
ttgaagaaca 
ttacccgtat 
acggaaagga 
gaggactcaa 
atctacctgt 
ggacaaacag 
ccttctcccg 



aaaatcaaaa 
gttgagcatc 
accgatcacc 
tcccaccctg 
cgatcggccc 
caacggacgg 
atggctcacg 
cgaaccaggc 
gaaaggatta 
cacttatgcc 
acattttcta 
gaaagttgca 



caccccttac 
accttgctct 
gtagaaaccc 
ccctaccgca 
ggctggtttg 
aaagaaaaag 
acctttggca 
tgggtggtac 
atcgaaccgg 
gacggttgca 
ttcaactacc 
gaacgtatcc 



tatcgttccc 
tttccactct 
tcttggaaga 
gcatgcagca 
ccaatgacga 
ttcttttcga 
gcatacatac 
cctcttatga 
acgataaatg 
aagtaaccat 
gcaaatatcc 
ctcaatcggt 



gaaaccccag 
atgcattggt 
gatgacttca 
gagcagttac 
tggcgaagga 
agacgaaggt 
aatcttgcgc 
tctccagaag 
gatcaggggc 
ggaggaactc 
cacaggcact 
cgaaaagaca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 



503 



tcggctacat 
ggtgcactga 
ctccacaaag 
ctgaaacccg 
ttcgacggac 
gcatttgcat 
tggctgatgc 
accgatgtac 
ttccatacag 
aaatgcatgg 
ctgtctcttt 
gatcacgaag 
ggatacttta 
tatggttata 
ctaaaattcg 
gtattctggt 
gaagcaggtt 
gacttctgcc 
ctcagcggac 
aaaggtgact 
ctattctgta 
aacggtaaac 
ggaatgcaca 
gcattatcga 



tgtatagaaa 
tccaccggca 
ggaaaagagc 
gaacagatga 
agcaaaccgt 
cccggagttt 
cctatcgaca 
atatcgatat 
cctggaaaca 
actggaattt 
tcaatcatac 
cgtttccctc 
aaagccagac 
acagtttctt 
attttgaatt 
atggagactt 
tgctacccac 
aaatccaacc 
atccgggcaa 
acatcgaatt 
ccaaagcagc 
agctggactg 
agcccataga 
cgggcacttt 



tatagataag 
aaatctgtca 
gatcagcctt 
tttcgcccgg 
atgggcacca 
tttcttctat 
gaattgtgag 
tgtatcacaa 
agaacgcaga 
caccactatc 
gacagagtgg 
ccatttcggg 
tccatttgcc 
cagagtcaga 
actaggttgg 
aaactctcag 
ccccacccag 
aacctccaaa 
gtggaattta 
cgaattctcc 
cgactacggt 
ttacagccag 
tggtaagttt 
attcggactg 



ggggttgatc 
ttaaacaaag 
ctccaattca 
ttgatgcgca 
ttatccgatt 
tctgatggta 
atctctgttt 
ccttacaagt 
ctgcccgttg 
tccggcagag 
tatggagaag 
acaggtaccg 
ggacagcccc 
tgtttggacg 
gaaaatggca 
gcagcgggtt 
tcccccgtct 
tctgaaaggc 
aaagaccatc 
ggttttgagg 
aatataaagt 
gaagtcgagg 
atcctaagga 
gattgtatac 



cacaggcccg 
gagaaaagca 
acgtgaaaac 
gtctgatcat 
ttgcaggctc 
aaggaatcgt 
taaacctctc 
gggacaaccg 
tcacttggat 
gagtctatcg 
gcgatgaaaa 
aagactatta 
gacaagatat 
gcataccttt 
cagtagacta 
tttccggtat 
gtagcatagc 
tgcgttatga 
tggtgtgtca 
accgtgaata 
tttacgtaaa 
ccacaggtgc 
tagaattgac 
ggattgaata 



ctatggaaaa 
gcaactaaac 
ggatccggat 
atccataagc 
cggtatggga 
ttgcagtaaa 
accctacaaa 
ttccctctat 
ggaacatgaa 
aggagacctg 
aataacagta 
cagcttcgac 
gaaggacttt 
caaccagcaa 
ttcctcgact 
agaagaaata 
caatgctatc 
cagacaaagg 
tggcggtaaa 
cagtttgaat 
tcgtcaggaa 
cataaacctc 
cggacagaat 
a 



780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2151 



<210> 1249 
<211> 279 
<212> DNA 
<213> B. fragilis 

<400> 1249 

cacgattacg gattttgcaa gatatccgga cggagcttga acaacagacc tttcgacaaa 60 

ttcccgatag aagaatgtac atacggtaca cctcattaca taacacttga agaaaaggac 12 0 

aaaattttca atgccgacct atcagccact ccgcaactgg caatacagag ggatatattt 180 

atattccaaa cactgatagg ctgtcgagtg agcgacctat accgcatgac gcagagccga 2 40 

cccacgcacg ctacccgaac ttctgataga aaagcatga 279 

<210> 1250 
<211> 1443 
<212> DNA 
<213> B. fragilis 



<400> 1250 
agatatatta 
gaacgtgaag 
ttccgcatta 
accaagatgg 
aatgccattc 
gtagacatga 
gccaaccgcg 
aatgatcatg 
ggtatgtatt 
agaagaaagg 
gcggtgccga 
gtcaagaacc 
atcggaaccg 
aagattaccg 
tcttgtctgg 
tgtaatgact 



tgaaagaaga 
tacccgaaac 
gcaaatatca 
gagccgctat 
ttcgtgcatg 
tccagggtgg 
cactcgagtt 
ttaaccgttc 
atacgcacct 
gcgaagaatt 
tgacgctcgg 
tggactttgc 
gaatcaccgc 
gactggatat 
taggttactc 
tgcgcctttt 



attatcgaaa 
cgccttatat 
tctttgtgag 
ggctaacttt 
taaagaaatt 
agccggaact 
gatgggacat 
gcagtctacc 
caaactggtg 
tgcacacgtg 
acagacgttc 
tgctcaggac 
cgagccggaa 
ccgtttggcc 
ttctgctatg 
ggcatccggc 



gcaactcgta 
ggcgtacaaa 
tatcccttat 
gaacttggac 
ctggaaggga 
accaccaaca 
aaacgtgggg 
aatgacgctt 
aaacatttta 
atcaagatgg 
aatggttttg 
ttcctgaccg 
tatgcaagca 
gatgatctga 
cgccgtatag 
ccgcgttgcg 



cagagagtga 
cccttcgggg 
ttatcaatgc 
tgctgaccga 
aacatcatga 
tgaatgccaa 
aatatcaata 
atccgaccgc 
aggaggtgat 
ggcgcaccca 
ccagtatcct 
tcaatatggg 
aatgcatagc 
taggagctac 
ctgtaaaaat 
ggttgggcga 



tttaataggt 
gatagagaac 
actggccatt 
agagcaggca 
ccagttcccg 
cgaggtgata 
ttgttcgccc 
tatccatatc 
cgatgctttc 
gctggaagat 
tcaggatgaa 
agctactgcc 
ggctctccgg 
ttccgacact 
gaataagata 
aatcaatctg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 



504 



ccggccatgc 
gtgatgaatc 
gaagcggccc 
tctgccgatt 
acggccaatg 
ctgaatcctg 
ggtaagggag 
acgatcctta 
tga 



agcccggttc 
agattgacta 
agatggaact 
tgctgatgaa 
aggaaaaatg 
ttatcggata 
tctatgaact 
aaccggaaaa 



gtccattatg 
taaagtaata 
gaatgcgatg 
tggtttcgat 
ccggaaagat 
taaaaactct 
ggtgctggaa 
tatgattcat 



ccgggtaagg 
ggtaacgacc 
gaaccggtca 
acgctgcgta 
gttcacaaca 
accaagattg 
catgacattc 
ccggtgaaac 



tcaatccggt 
tttgtgtagc 
tggcccagtg 
ccttgtgtat 
gtatcggtgt 
ccaaagaagc 
tttcaaaaga 
tcgatattaa 



tattcccgaa 
tatgagtggt 
ctgtttcgaa 
agacggcatc 
ggtgactgca 
acaggagacc 
agacctcgat 
gcccaatcat 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1443 



<210> 1251 
<211> 1068 
<212> DNA 
<213> B.fragilis 



<400> 1251 
ggaatcatta 
aaatatatat 
cccaatatag 
ttcttttctt 
aactacaatc 
ctgtttgcac 
ggtttccata 
ttatgcactt 
gaaatagaac 
gtaaacccgc 
aacgatgaaa 
cagagcgacc 
cgatacgtaa 
gcacagaaag 
acggtacaca 
caatacgaac 
ggaacgggat 
atagaaactg 



ttttgctttt 
cccgaaacct 
cctgcacccc 
acttcatata 
tgcggcagat 
tgaccggtta 
cggattttct 
tggtaggata 
gtctccgttt 
acttcttctt 
acacgctgac 
gcaaaggagt 
tggaggtacg 
atgtactgac 
acaggataga 
tggtggtatc 
taagcaacct 
atgaaaaagt 



ctttgcagtg 
tccgctggtc 
gtgggagcta 
ccgtttcctc 
cccgacagca 
tctgttcttc 
gggaagcact 
catctccatg 
tgaaaacttg 
caactcactg 
gtatgtcaac 
ggttacactg 
ctttgccaat 
actgcctgta 
cagtgaacac 
caatcccatt 
tgaaaaccgc 
gttccgggta 



agaaagaaga 
agcctgatca 
aactctttgg 
ttcttttggg 
ctgttcagga 
gcatccgttt 
ttaatctccc 
ctttataccc 
caaagccgtt 
aacggaatat 
caactatccg 
agagaagaac 
aaactgagtt 
ctctcactcc 
aaaatggata 
taccccaaac 
ttcaacttat 
tatttacctc 



gcatcatgga 
gtgcggtttt 
aaccatcgga 
ggatgatcgg 
aacgcctgac 
cgtataccat 
agttcttcac 
ggcaaagaga 
gtgatgcact 
cttcactgat 
atattttccg 
tggagttcat 
ttactatcga 
tgccattggt 
tctccatccg 
tgtcacctcc 
taatgaataa 
taatatag 



ttcctttgat 
catcatatat 
ataccttggt 
tttcctgata 
tcacaacttc 
ctcttcgcat 
gctctgcttt 
aaaagaacag 
ggccaaccag 
ccgaaagaag 
gtatatcttg 
ccagtccttc 
cgtggatgaa 
agacaacgtc 
gctgaacgaa 
cgacaccaac 
acaaatccgg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1068 



<210> 1252 
<211> 906 
<212> DNA 
<213> B.fragilis 



<400> 1252 
gtgactccta 
ctgggctgca 
gaacagcaaa 
gatattttat 
ccgggagaat 
gcatttcatc 
ttcttttttt 
gtagaatgca 
attctgattt 
cagtttatca 
aaagattata 
gcaaaaatac 
aaaaatatcg 
gactctaata 
tt tagccgtt 
aattag 



tgaaagaaag 
aaacactcca 
ttattaaatt 
atgggcgcaa 
ctatcaaaat 
cggacttgat 
acaatccgga 
tatgcaacat 
cacggtacat 
cacgttgcga 
ttctatccgg 
ttcaactgtc 
atgaatattt 
atacggtaag 
tatttaaaag 



aatattaaat 
tccgctggta 
tgatttttat 
gtactatgat 
aaacaaaagc 
ttctcaaacc 
ggaggcactc 
cgaagaggaa 
agaattgtta 
agctaacaaa 
aaaattaaaa 
atctcattat 
cgagtcaaaa 
tgtagttacg 
aattaccgga 



atagagaccg 
agtgtaatcg 
accatcctga 
tattccaatg 
aaagcacttc 
tctttgggag 
cacctatctc 
cttcgtcatg 
ttagatcatt 
aagattatga 
tacaatacat 
ttcaatgacc 
cgtttggaaa 
gaaaaactgg 
gtggctccga 



tccatcaatg 
acctgtcaaa 
tgatggaagg 
catcattggt 
cctcaaaagg 
aacatataaa 
aacgggagaa 
cgatagactg 
gcaaccgttt 
agaaaacaga 
cgccttcatt 
tgctgaaatt 
tggctaaaag 
gataccccaa 
ataattacag 



caactgctgc 
gagcgatctg 
ggagattgat 
attcctcacg 
gtggctgctg 
agattactcc 
ggccaaggct 
ccacagccaa 
ctacgaacgt 
tgtattgttg 
gggatactgt 
tgaaagtgga 
catgctgctc 
tatacgatac 
actctcacag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

906 



<210> 1253 



505 



<211> 1764 

<212> DNA 

<213> B.fragilis 



<400> 1253 

aaaatgtata tagaaaaaat taattcgccg aaggacataa aagaattgtc agttgaacaa 60 

ctaagtgtgc tggctgagga agtaagaact gcgttaatca ggaaattaag tgaacacggt 12 0 

ggtcatatcg gcccgaatct tggtatggtg gaaactacaa ttgcactgca ttatgttttt 180 

aactctccca ttgataaaat agtttttgat gtatcccatc agagctatgt tcataaaatg 240 

cttaccggaa gaatggctgc atttcttgat ccggctaaat atgatgatgt caccggatat 3 00 

actaatccgg atgaaagcga gcatgatttc tttacgatag gacacacctc aacatctgtt 360 

tcactggcaa tgggacttgc aaaaggacgt gatctcacag gtggcaaaga aaacatcatc 420 

gctgtaatcg gtgacggttc gttgagtggg ggagaggctt tggaagggct caacaatgcg 480 

gcaatgcttg gctctaacat gattattata gtcaatgaca atgatcaatc cattgctgaa 540 

aatcatggag gactttataa aggattgaaa gagcttagag acacgaatgg ggagagtcct 600 

gataatattt ttaaggctat ggggttggag tattattacc ttggagatgg gcatgacgtg 660 

tcagcactca taaaattatt tacgcctgtc aaagatatag accgtgcagt agtattgcat 72 0 

atccatacga tcaaaggtaa gggattgaaa tatgcagaag aaaataaaga atactggcat 780 

gcaggaggtc cttttcacat cgaagacggt tctcccaaag gacccggatg gccggtgaat 840 

gaaactgtca gggagtctgt tttagacttg attgaaaaga ggtcggatgt agttgcaatt 900 

actgcgggaa caccgtctgt aataggattt acggaagact atcggaagcg tgccggcaaa 960 

cagtttgttg atgtaggtat tgcggaagaa catgctgttg caatggcaag tggtattgcc 1020 

aggaatggag gaacaccgat ttttggagtt ttcagtccgt tcttgcagag aacgtatgac 1080 

caactttcgt cagacttgtg tttaaataat aatccggctg taatcatggt gtttatggct 1140 

tcagtatatg ggatgaatag taatactcat ttggggatct atgatattcc gatgatttca 1200 

catataccca atctggtata tcttgctccg acgagcaagg aggagtatct tgccatgttt 1260 

aagtatgcca ctacacagaa ggcgcatccc attgctatca gaattccaat gatgatgcct 1320 

gagacgggaa ttgaggatac cacggattac tctttactaa ataaatatca ggtcgtacga 13 80 

aaaggttcag gtgttgcgat tattgcactt ggagatttct ttgaactggg cgtacaaatt 1440 

gccgataaat ataaaatcct gacgggtaat gatgtaacac tgataaatcc taaattcatc 1500 

acaggtattg atgaagagct gctggagtgc ttaaaaacgg accatgaact tgtgcttact 1560 

cttgaagacg gcatagtgga aggaggattc gggcaaacaa ttgcaagttt ttatggttta 1620 

tcggatatga aggttaaaaa ttatggaata aagaaatcat tccccacaga ttttcggcct 1680 

gaagaactga tgagagagaa tggattatcc gtagagcaaa tagtagagga tataaaatcc 1740 

gtatgcagag agcacgttat gtga 17 64 



<210> 1254 
<211> 666 
<212> DNA 
<213> B.fragilis 



<400> 1254 

aactatatct ttgtcccgaa ttttaaacac aacttagtta tgaaaaagat tattagtgct 
ttaatgatgg ctgtatgtat cggtatggct atgcctgctc aggcacaact tatcaaattt 
ggtgtgaaag gtggtgtaaa cctggcaaaa gctgatttaa atacgtctga ttttaaaaca 
gacaatttca ccggattctt tatcggtccg atggctgaag ttactattcc actgataggt 
ttgggggttg atgcttctct ccttttctct caaagaggtg tgaaagttag cagtcgggat 
tttattgatc ctcttgcaga tagtgatcca ataataggaa atcgtactat caggcagaat 
ggtcttgata ttccaatcaa cttaaagtat actatcggtt tgggtagttc attgggtatt 
tatgtagcag ctggtccgga cttttatttc aacttctcag gagataaagt ttatgaaaac 
tatggacggt tgaataaaaa aaatgctcag ataggaatca atgtaggtgc tggtgtgaag 540 
ctgttgagac acttacaagt aggtgccaat tataacattc cgttgaataa aacggcagaa 600 
tggaaagagg ctgatttctc ttataagact aaaatgtggc agatttccgc agcttacatt 660 
ttctaa 666 



60 

120 

180 

240 

300 

360 

420 

480 



<210> 1255 
<211> 1206 
<212> DNA 
<213> B.fragilis 



506 



<400> 1255 

aagaatgtca tgttccagca ccagttcata gactccctta ccggtctcct gtgcttcttt 60 

ggcaatcttg gtagagtttt tatatccgat aacaggattc agtgcagtca ccacaccgat 12 0 

actgttgtga acatctttcc ggcatttttc ctcattggcc gtgatgccgt ctatacacaa 180 

ggtacgcagc gtatcgaaac cattcatcag caaatcggca gattcgaaac agcactgggc 240 

catgaccggt tccatcgcat tcagttccat ctgggccgct tcaccactca tagctacaca 300 

aaggtcgtta cctattactt tatagtcaat ctgattcatc acttcgggaa taaccggatt 3 60 

gaccttaccc ggcataatgg acgaaccggg ctgcatggcc ggcagattga tttcgcccaa 42 0 

cccgcaacgc gggccggatg ccaaaaggcg caagtcatta catatcttat tcatttttac 480 

agctatacgg cgcatagcag aagagtaacc taccagacaa gaagtgtcgg aagtagctcc 540 

tatcagatca tcggccaaac ggatatccag tccggtaatc ttccggagag ccgctatgca 600 

tttgcttgca tattccggct cggcggtgat tccggttccg atggcagtag ctcccatatt 660 

gacggtcagg aagtcctgag cagcaaagtc caggttcttg acttcatcct gaaggatact 72 0 

ggcaaaacca ttgaacgtct gtccgagcgt catcggcacc gcatcttcca gctgggtgcg 7 80 

ccccatcttg atcacgtgtg caaattcttc gccctttctt ctgaaagcat cgatcacctc 840 

cttaaaatgt ttcaccagtt tgaggtgcgt ataatacata ccgatatgga tagcggtcgg 900 

ataagcgtca ttggtagact gcgaacggtt aacatgatca ttgggcgaac aatattgata 960 

ttccccacgt ttatgtccca tcaactcgag tgcgcggttg gctatcacct cgttggcatt 102 0 

catgttggtg gtagttccgg ctccaccctg gatcatgtct accgggaact ggtcatgatg 1080 

tttcccttcc agaatttctt tacatgcacg aagaatggca tttgcctgct cttcggtcag 1140 

cagtccaagt tcaaagttag ccatagcggc tcccatcttg gtaatggcca gtgcattgat 12 00 

aaataa 1206 

<210> 1256 
<211> 2421 
<212> DNA 
<213> B.fragilis 



60 



<400> 1256 

tttacttaca taacattttt tttattaaca cttaaaaaca gaggtatgag aaaatggact 
tatctcgtag ctgcgctctt agtgggagga gcaactacaa cattcaccgg atgtatcgat 12 0 

180 
240 
300 



aacgatgaac cggcgggaat cgaacaactt cgcggagcga aagcggaatt cattaaggca 
aaggctgcct atgaagacgc actgacccaa atccaattag tcaaggtaga aagagagaaa 
gttaaacttg aaaaagatca ggttaacttg gaactaaaaa aatgcagttt ggaagtagaa 

caagctaaaa cagcagaaca aaaagctttt tgggaagctg aagctcaaaa gagaactgaa 3 60 

gagttcaaag caaaaatact cgatctacag acacttactg cacaggctga gtataacaat 42 0 

aaaaaagctt taatggacat tgaagtggct ctactcacca tgaaggatga cgcatatact 480 

gctgaaatca ataaatatcg ggctgcgttg gttggttata catttaagtc agagtctacc 540 

actaatggag agacaactat tacgacgaat tctagttatg gcgcattggc tgatttagcc 600 

gaagcgaaat caagtttaat gaaggccgag gttgccagaa ttaatttctt atctaagaac 660 

aaatactatc cggaagaatt gcagctggag agaacgcatg ctgccaaaac gttggaaatc 72 0 

cagatggctc tgatggaaga atacaaagca ttggatgcga caggcactga tagcaaggct 7 80 

ttggcagata agctgaaagg ctataaaacc gatttgcaag cccttgacgc taaagaagac 840 

gaggcctata ctaaaatcga agaaatgaga aagcccatct tcccgattaa tcagcaaatt 9 00 

attgaagaga aaataaagtt ggacgcaaca tcctctgctt atacattggc caaagcagat 960 

gtagatcctg ctttagtaaa cggattatac agtgcattgt caaccggaga agatgaagag 102 0 

aaactcgttg aggatcttga taaaatcttt gtccaggata gctggaacgc agaattgctg 1080 

aattaccaat acaccatgaa gaaagatgtg gtgattaatg atcttagcct caataacaaa 1140 

gcaactaaaa taggagctat tgccgatgct atcaaagcat attatcaagg ggagaatagc 12 00 

gaagcattcg acgcaagcgg aaaattatta gatgcataca aaacaaaatt tgagaatgaa 1260 

cttaagcgtt tggaaattga taagaaacca gcttacgacc agttcaaatc ggactctacc 132 0 

gcttggatta atgcatacgt tgcttatgta gcagccctga aggcttataa taactataaa 13 80 

ggaaccaaca cctaccaagc gattacgaaa gaggtcacaa cttataacag cttaaaagct 144 0 

gaagataaga agttggaaac agccaacgca ttgcgtacct ccatattggg ataccttgga 1500 

aagagaaaag cggtggacgg ctttaacgca gaattcgcta ctacttataa agacgcttta 1560 

actgacgcta atctggctac tttcaatgaa gcaattgcaa ctgcagtaag tgacggtata 1620 

agttcattaa tcggcaatga aactcttgca acatcattca atgataaagt tgaaggtagc 1680 

acactctttg ccttcttgga ggctaatcaa gcattgttcg gtggttcaga actcaatctt 1740 



507 



gaaaagagta tcgaacctaa agaaatatca gcaaataaat atgatatgcc caaaaatgtt 
tctcagttga ttgataatcc cgaaaataat tcaggttcat ttgttaaata tacctatttg 



ggagccagct tagacgtaca agaagaacgc ctacggaaat attgtaacca aatgggatat 
aacatcatta acaatatccc atacagagag gatgagagtg ctaaaacatt tgaaaaacgc 



gtccaaaaac gcatcaacag cattgaagat aaatatctgg acggagattt aaccaaagaa 
gaatataaca gaatgttgga gagatacacc aaagaagcct caaccataca acaacaagtt 



1800 
1860 

gctcaggaat ctaattatct ggttaacctc gacaaatggc aggctttgta tgccaagatc 1920 
aaggcggaag cagatcctgt acttgcagaa gtggatgcta tcaacgagaa aatcgctact 1980 
ttggaatcgc aaattaaaga tgaaaacgca gcattgtggc aggcagaatt ggcttgttat 2 040 
ctgattaaag gcgataagtc gaaacgccaa aacgagatct tctcagaaag caacccttac 2100 
aaaaactact atacttctaa cgagcaagaa ccacttttgg gtaatgtcgt aactgaatat 2160 
tctaaacttg aagcagaaat cgccctcgtt caaagagcta tggaagatgg atacttctac 2220 
tatacgtatt atgatgccaa cagccatacc tataaggtaa gcgagaagag ccttaatttg 22 80 
acaagtttac ttgagactca ggaagaagct attaccgacg ctaagaaaac cgtagaagat 2340 
atcgacaaca aaatcgcctt gttcgataaa tacggataca ctaatattga aggtcaggaa 
gagtatcttg atctgtcttg a 

<210> 1257 
<211> 1572 
<212> DNA 
<213> B.fragilis 

<400> 1257 

aaaacaagaa ttatgaatgt aataatatat tgcagagtaa gttcagacga acaaacatta 



2400 
2421 



60 
120 
180 

cctgtcatac aaggaataat gagctacata cgtaagaaca aaggcaaagt gaataagttg 240 
ctattcttgc ggtgggatag atattcacga gacattatca gtgccagtga gaacctaaaa 
gaattactta aattgggagt tgaaccaaat gcaatcgaag cacctttaga cttcaattca 
gacacttggc ccctattatt gggagtacac ataggctcgg cacaatgcga caacatcaag 
aggtcaaaag ccacaatgga cggtattcat ggaacattgg caaaaggaaa atgtgctaat 
aaagcaccaa gaggatataa gaatgtacgt attagcaaac atgaaaccca tgtagaaata 
gataccaata cagcgccatt tatacaagcc atgtttaaag aagtggctaa aggtattgaa 
acaccttgct atatccgtag gaagtttgct agaaaaggat ataatatacc cgaaagttca 
tttcttgaaa tgctacgtaa caaattttat atcggtaaaa ttcgtgtgcc agcctacaaa 72 0 

780 
840 
900 
960 



300 
360 
420 
480 
540 
600 
660 



ggagaacccg aatattatgt aaacggtgaa catgaagcaa ttatagacga agaaacattc 

tataaagtgc aagagatttt ggacggtaaa agaaagaaaa cacccaaact atccaaagcc 

ataaatcccg atttgtattt gcggaagttt ttaatctgtc ccgtatgtgg ttgtgcccta 

actggtgcga caagttcggg caatggcggc aaatacacct actacttttg ttgtaacaat 

caaaaacata taagaatgag agcagagaac gtaaacgaag agttcgctcg atacacagcc 1020 

caattaaagc ccaataaaac ggtattgaac ttgtataacg aaattctaaa ggatttgcaa 1080 

aacgaacgca aaggagaaag taagaaagaa gtagtagcac tacaaaatga actttctact 1140 



1200 
1260 

gaaatgcgtg agaaccctaa ccgtagtaat atcgaaccaa aactaaacta ctctatcaat 13 2 0 
ctgataaaca atatagatag ctatataaga aatgcgtctg taggggtgaa gattaagcta 1380 
ataagttcga tgtttcccga aaaaatcgaa tttgacggaa aaacatatcg aaccaattct 1440 
tataacaaag tgcttgattt aatttatcag caaaccaacg agttacgagg agtagaaaag 1500 
aaaagcggag agagtttttc aactttctcc gcctcagtac ccagacccgg ggtcgaaccg 1560 
ggatggaagt ga 1572 

<210> 1258 
<211> 1020 
<212> DNA 
<213> B.fragilis 

<400> 1258 

aagaataact ttaatttttt aatgaaatcg caatatgaaa agaaacgatt aatcactttt 60 

gatagaatta aaatcaaatc caactataaa tatcttctca acactaaagt gaagttcaat 12 0 

gaaatgtttc attcacgtag tggggagaaa ataggtatat tttatagctc aaaagatgat 

ataaatatac cttataacct ctatatagct gtcagctata taaaacaaac cttaactctt 

gaattttcca gtaagatttt gaaagaaaaa tatccagact taatttcaag agatactatc 

aaagaatgtc taaccaatat aaaccaacta catatttgtg atattgatgt agatagtatc 



180 
240 
300 
360 



508 



ttatccaatg gagcaattac atcagtagat gtaacttatg atgcaaacct tattctaagt 42 0 

gataatcttc tgaacattct taattcacaa gtaaccaact atagacgctt caagtgggca 480 

cattacgata aagagggcat cacttttact aaagatgtga aatccaaaga ctgtacggaa 540 
acaatcacct tatataataa ggaaaaagaa atatgtacca gccacaataa agacttttta 
aatagcctat cacaaccaca atcagtaata gactatttca aagggaaaac cagatttgaa 
atcacattaa acacagttaa aaagataatg aactatttga atttgaccga taccaaaata 

ttcagtgtat taaattccga tacaaaccct atccttactc aatttgacaa ggttttcggt 780 

aattctacag ccaatatgcc aaacacaaca tttgatgatt acgaaaattg ggcaatgaaa 840 

atcattctcg aaaggtacaa tggtgattta aaactactgg agcaagatat tagaagcaag 900 

ttcaactcac gtagtggtgc tagtaagcga atgaaaaagt ttgaaacggt ttatcatgca 9 60 

atgacatcag cctcaaccag tgaaaaccct attgaaaaga tacgtaatct gttgctttga 102 0 



600 
660 
720 



<210> 1259 
<211> 264 
<212> DNA 
<213> B.fragilis 



<400> 1259 

atgatgtcag tattattact cccgctgagg tggaggctat tgagcgcgaa atcgatgccg 

ccgacgaaat tcttccacct ttacattctt ttattattcc gggagggagt cgtggctctg 

cggtttgcca cgtttgccgt accgtttgcc ggagggccga acgccggatt cttgcattat 

ccgaaagctg tacaatctca gccgatttac tggcctatat caaccgttta tcggattatt 
tatttgtctt gtcccgtaaa atga 



<210> 1260 
<211> 621 
<212> DNA 
<213> B.fragilis 



<400> 1260 

gaacgatcat tggtctttgc aagaattctc attacctttg tccgaaacca ctatcaacta 60 

agtaatggaa aaaaagacga agtctggcag gtagtcagca gtaaatatct atttcggcgc 12 0 

ccatggttaa ccgtacgttg tgacgacatg cttttgccca acggcaatca tattccggag 180 

tattacatcc ttgagtatcc cgactgggtg aacaccatcg ccatcaccaa agaaggaaag 240 

tttgtattcg tccgccaatt ccgtccggga ataggtaaac agctatacga actctgtgcc 3 00 
ggcgtatgtg agaaagaaga cgctt caeca cttgtttcgg cgcaacggga gctacttgaa 
gagaceggat aeggcaaagg caactggaaa gagtatatgg taatttegge caatccgagt 
actcatacca atctgacaca ctgctttctg gctactgatg tggagcaaat cgacacacaa 

cacctggaag acaeggagge gcttacagta catctgctca gecttgaaga ggtaaaagaa 540 

ctactggaaa aeggacagat catgeagtea ctgcatgcag ccccactttg gaaatacatg 600 

gcagaacata aacagatcta a 621 



360 
420 
480 



<210> 1261 
<211> 192 
<212> DNA 
<213> B.fragilis 



<400> 1261 

egggtcattt categtatga agtcatctct tccaagaggg tttctacggt gateggtgae 

ttcttctgtc caaatgaacc aatgeataga gtggaaaaga gcaaggtgat gctcaacctt 

atcattcctg ttgtaaactg gggtttcggg aacgatagta aggggtgttt tgattttgtg 

ttcatgecat aa 



<210> 1262 
<211> 594 
<212> DNA 
<213> B.fragilis 



<400> 1262 



509 



attatggaat tacaggttat tcagaataaa atatacgaag tcagaggtca aaaggtaatg 60 

cttgactttg atttagctga actttatggg agtgaaacca aacgactgaa agaagcagta 120 

agaagaaacc tcaaacgctt ccccagcgat ttcatgttcg aattaacaaa agaagagttt 180 

gaaagtttga ggtcgcaaat tgcgtcctca aacaaaagag gtggtacacg atatatgcct 2 40 

ttcgctttta ctgagcaagg cgttgctatg ctttcatctg tactaaacag cgaatctgcc ™ n 

atcgaaataa acatatctat catgcgtgct ttcgttacag tacgtcaata cttgtcttcc 

ttaaatagca caactaagga aatcgaagaa ctaaaacaac gcatgaaaat gttggaagaa 

ggcaacgaag acataatagc agcagtcaat gaccttagcg aagatacccg aaaagagctt 

gacgatattt acttagcatt gtcacagtta gcagagaaac aaaagcatgt taataaacaa 

acagaacgta gacctattgg atttgctcac tataaagaaa acaacaaaaa atag 594 

<210> 1263 
<211> 2439 
<212> DNA 
<213> B. f ragilis 



300 
360 
420 
480 
540 



60 



<400> 1263 

ttatcaccat cgccaaataa agaaaccccg acaatgaaag aagacttata cgacgatcta 
tacgaagaga aagaagaaaa aatagacttc catgccctcc tgttccgcta cgtcatccgg 12 0 

■ 180 
240 
300 



tggccctggt ttgtagcctc ggtcatcatc tgcctggccg gagcatggct acacctccgg 
cagaccactc cggtctacaa catctcggcc tcggtcatca tcaaggacga taagaaaggg 
ggcaacagtg gcggaaatct cgccgcgctc gagggcctgg ggctggtcaa ctcggtatcg 

aacatcgaca acgagattga aatactacgt tccaaaacgc tggtcaagca tgtggtcagc 3 60 

gagctgaacc tctacaccac ctactccgtc aaaggcagtt tcaacgaagt agagctttat 42 0 

aaaagttcgc cggtactggt ggggctcacc ccgcaagaag cagacaggct gccgggtccg 480 

gccgtattcg aactcaccct gtcgcccggc aaccggctcg acgtgaaagc caccgtcggg 540 

gaaacctcct acaacaaaaa attctccaaa ctgccgggcc tgctcgtcac tccggccgga 600 

acgttcacct tcacccttgc cggcgactcc gccggagtaa gtgaaccgca gacactcacc 660 

qccgttgtaa gcaaccccat gcagacggca aaaaggtatg cggcggcgct cagcgtagag 72 0 

' 780 
840 
900 



1020 
1080 



cctacctcca aaaccacctc catcgtcatc gtctcgctca aaaacaccaa caagcgtcgg 
ggcgaggact tcatcaaccg gctgatcgaa gtctacaacc ggaacaccaa caacgacaaa 
aacgaagtgg cggagaaaac ggaagagttc atcgccggac gtatccgcat catcaacgac 

gaattgttca gtaccgagaa ggagctggaa accttcaaac gggatgccgg actgacggac 960 
cttgccagcg acgcccaact ggctgtgagc gaaaattccg cctacgagaa acaacgggtc 
gagaacggca cccagctcaa cctcgtacgc tacctcgccg aatatatctc cgccccggac 

aagatcaacg ccgtgctgcc cgtcaacgtc ggcctgaccg accagtcact ctcctccctg 1140 

atcgggcaat acaacgaaat ggtgcttcag cgcaaccgcc tgctacgcaa ctcctcggaa 12 0 0 

agcaacccgg tgatcgtgaa cctggatagc ggcatccgtg ccatgcgcga aaacatcctg 12 60 

accaccatcc acagcgttca gaaaggattg ctgat caeca aggccgacct cgaccgtcag 13 2 0 

gecaacaagt tcaaccgccg catcagcaat gcgcccgcgc aggaacgeca gttegtcage 13 80 

atctcccggc agcaggagat caaagcegga ctctacctga tgctgttgca gaaacgggaa 1440 

gaaaactcca tcgcactggc cgccactgcc aacaacgcca agategtaga egaggecatg 1500 

gcggacaacg gtcccgtttc tcccaagacc aaaacgatct atatgatagc cctcgtcatg 1560 

gggatgggca tccccgtagc catcatctac gtcatggggc tgctccagtt ccggatagaa 162 0 

ggacgggccg atgtggaaaa gctgacctcc gcccccatca teggagacat ccccctggcc 1680 

gaagagggta aegggaaage gggaggcatc gctgtccgcg aaaacgaaaa cagectgatg 1740 

geggaaaect teeggggcat ccgcaccaac ctgcagttta tgctgggcga agagaataaa 1800 

gtcatcctgg tcacctctac catcagcggc gaaggaaaaa catttgtagc caccaacctc 1860 

qccatcagcc tctcgctgct gggcaaaaga gtagtcatcg tagggctcga catccgcaag 192 0 

1980 
2040 
2100 



ccgggactca acaaggtctt caacctctcg cagaaagaaa aaggaatcac ccagttcctg 

gccggcccgc agaccaccga cctgatgtct atggtgcagc cctccgggat atcccgcacc 

ctgagcatcc tccccggcgg aaccgtaccg cccaacccga ccgaactgct ggcacgccag 

gcattggtgg aagecatega tatcctcaaa aagcacttcg actatatcgt gctcgacacc 2160 

gctcccatcg gaatggtgac cgacacacag atcatcgcac gggtggccga cctctcggtt 222 0 

tatgtctgee gcgccgacta tacccacaaa gecgactata ccctgctcga agatctccgc 2280 

ctgggcaaca agctccccaa cctgtgtacc gtaatcaacg gcttggacat gaaaaagegg 2340 

aaataegget attactatgg ataeggaaaa tacggccgct attaeggata eggaaagaag 2400 

tacggttatg getaeggcta eggacaaaag cataattag 2439 



510 



<210> 1264 
<211> 306 
<212> DNA 
<213> B. fragilis 



<400> 1264 

gactatgtgt ccctctgtgg tgaattacac gaggaagtac acacgtgtta caatgggggg 60 

tacagaaggc agctagcggg tgaccgtatg ctaatcccaa aagcctctct cagttcggat 120 

cgaagtctgc aacccgactt cgtgaagctg gattcgctag taatcgcgca tcagccacgg 180 

cgcggtggaa tacgttcccg ggccttgtac acaccgcccg tcaagccatg ggagccgggg 240 

gtacctgaag tacgtaaccg caaggatcgt cctagggtaa aactggtgac tggggctaag 3 00 
tcgtaa 



306 



<210> 1265 
<211> 1767 
<212> DNA 
<213> B. fragilis 



60 



300 
360 
420 
480 



<400> 1265 

ctacgagctt ttataataat gacagatatg acagatataa aaaacgaaga agcaggcgaa 
aagaaaagcc tcaatttcat tgaacaggca gtagaaaatg atttgaaagc tggaaagaac 12 0 
gggggaaaag tacaaacacg cttcccaccg gaaccaaacg gttacctgca catcgggcac 180 
gctaaagcca tttgtctcga cttcggcatc gctgccgcac acggcggtgt gtgcaacctt 240 
cgtttcgacg acactaaccc gacgaaagag gatatggaat atgtagaagc catccaggaa 
gatatccggt ggctgggatt ccaatggggc aacgtatatt atgcctcaga ttatttccaa 
caattatggg actttgccgt cactctgatt aaagaaggca aggcctacgt agacgagcag 
acttcagaac agatagcgca acagaaaggc actcccaccc aacccggtgt cgagagtccg 
taccgcaacc gtccgatcga agagagcctt gccctgttcg aaaagatgaa tagcgacgaa 540 
gccaaggaag gttccatggt gcttcgtgcc aaaatagaca tggcaagccc caacatgcac 600 
ttccgcgacc cgatcatgta ccgcatcctg catgtggcac accaccgcac cggaacccaa 660 
tggaaagcct acccgatgta tgactttgca cacggtcaga gcgactattt cgaaggagtc 72 0 
acccactcac tctgtacact cgagttcgtg cctcaccgcc ctctttacga tctgttcatc 
gactggctga aagaaggcaa ggacctggac gacaaccgtc cccgtcagac ggagttcaac 
aaactgaacc tgaactacac gctgatgagt aaacgcaacc tgctgatcct ggtgaaggaa 
ggactggtga acgactggga cgatccccgt atgccgactc tctgcggatt ccgccgtcgc 
ggctattctc ccgaatccat ccgtaagttc atcgataaaa taggttacac cacttacgat 
gcactcaacg acttcgccct gctcgaaagc gccgtacgcg aagacctgaa tgcccgtgcc 
acccgtgtat ctgccgtact gaacccggtg aaactgatca tcaccaacta tcccgaagga 1140 
caagttgagg aactggaagc catcaacaac cccgaagatc cgacagccgg aagccatacc 12 00 
atcgaattca gccgcgaact gtggatggaa cgcgatgact tcatggaaga tgccccgaag 1260 
aaatatttcc gcatgactcc gggacaggaa gtgcgtctga agaatgccta catcgtaaaa 132 0 
tgtacaggct gcaagaaaga cgagaacggc accgtgaccg aggtatactg cgaatacgat 13 80 
cccaacacca gaagcggcat gcccgacgcc aaccgcaaag tgaaaggcac cctccattgg 1440 
ctcagctgca accattgcct gccggcagag gtgcgtctgt acgaccgtct ctggaaagtg 15 0 0 
gaaaacccgc gcgacgaaat ggcagccatc cgtgaagcca aaggttgcga cgccctcgaa 1560 
gccatgaagg aaatgatcaa tccggattca ctgaccgtac tgccccattg ctatatagag 162 0 
aagtacgtgg ccgacatgcc cgcgctctct tatctgcaat tccagcgtat cggttatttc 1680 
aatatcgaca aagattccac ccccggacat ctggtattca accgtaccgt aggactgaaa 1740 
gatacctggg gaaagatcaa taaataa 17 67 

<210> 1266 
<211> 675 
<212> DNA 
<213> B. fragilis 



780 

840 

900 

960 

1020 

1080 



<400> 1266 

gccttacagt tgtttgatga aaaagcaatt 
aaaaaagatt tcaacttaac caagcttttt 
accttgtctt cttgcaacaa cgatgacaat 



tatgtaacaa atttaatgtt taaaaaaatg 60 
tattcttttg cgattgcttt ctcagtggta 120 
tctccgcttc ctcctccatc caccaacgat 180 



511 



gtggcaggca cctataacgg aaaagtactg ataacccagg tgactcccgc cactgtaaaa 240 

gaaaatgccg gagaagctcc ccagggacaa gacgtaaacg ctacggtgaa aaatgacacg 3 00 

gtgttcttcg acaaattgcc ggtaaccgaa cttattacct ccattgtagg cgataaagac 360 

aaagcggaag ccattgtcaa agccatcggt gacgtaaaat acaaagtagg ctacaagccg 42 0 

gctctcaata cagagaagga cagcatctac cttgctttcg atccgaaacc gttgaccctt 480 

caactgcctg cagctgtaga aggccaggaa ggacagactg ttaccgtaac catttcgtct 540 

ccggacaaag gcagctttgc ttacaagaaa aatcagttga agttgaagct cagcgccgat 600 

aaagtggaac tggcaggcgt agcggtacct gttcctcaga ccctgttcaa cttcgatatg 660 



accaaaaaga agtga 



675 



<210> 1267 
<211> 519 
<212> DNA 
<213> B. fragilis 



<400> 1267 

aacttagagt tatcaattat ggcaacaaca aatttcaaag gacaaccggt aaagctgatt 60 

ggcgaattta tacaggttgg aaaggtggct cccgatttcg agctggtgaa aagtgattta 12 0 

tcttctttcg cactaaaaga tctgaaaggt aagaatattg ttctgaatat tttcccgagt 180 

ctggataccg gtgtgtgcgc cacttcggtg cgtaaattca ataaaatggc agccggaatg 240 

aaggataccg tggtattggc catttcgaaa gacttgccgt ttgcgcaggg acgcttctgc 3 00 

acgacagagg gtatcgaaaa cgtgattccg ttgtcggatt tccgcttttc ggacttcgac 360 

gagagctatg gcgtgaggat ggctgacgga ccgctggccg gactgctggc gcgtgcggta 42 0 

gtggtgattg ggaaagacgg gaaagtagct tatacagagc ttgtaccgga gattactcag 480 

gagccggatt atgaaaaggc attggctgct gtgaaataa 519 



<210> 1268 
<211> 1140 
<212> DNA 
<213> B. fragilis 



60 

120 

180 

240 

300 

360 



<400> 1268 

atgattaata caccacgaaa aaacggctat tcattcaaat acgggacact tttttacgtt 
ccttttttct catgccactt gggtatttct gaaactttca ttcgtctata catttatgct 
attgattttt tactaatttc cagcatattt tccaatctgt cactcaaaat cttttttatt 
ataaaccgtg ttcttgaaca cactaaaaag aacaagaaaa tggaaataga aaaattcatt 
aaatctttag caagaaaagc gaagttaggc gggcgttaca gcacagccaa tacctacctc 
tacactttgc acagttttca gaagtttgcg ggaaaagcct cactgacttt tgaagagatc 
actcccgaga gtatcaagga gtacgagcaa tacttaatcc tcaacgggaa acggtacaac 420 
acgatctcgc tctacatgcg catgttgcgt tccatctgca atcaggcatc ggagcagaac 480 
atagcttcgc tcaacacccg cgagctgttt gagaatgttt ttatcggcaa cgagcccact 540 
gccaagcggg ccatctcacc cgtcctcatt tcccgcctgc tcgaagcaga tttcagcaag 600 
aacagccggc tcgattttgc ccgcgacctc ttcttgctaa gcttctacct gaggggaatc 66 0 
ccgtttgtcg acctggtaca tctccgcaag accgatgtgc agggaaacat gctcgtttat 720 

840 
900 
960 



ttccgccaga aaacgggaca gcaacttacg gtaatcatag aaaactgcgc caaagtgatc 

ttgcgtaagt atgcctcgct ttgcaaagaa tccgtctatc tgctgcccgt catcagcgca 

gccggagagg aggggcacaa gcagtaccga agtgcattga gggtatacaa caaacgcctc 

aaccagatat ccggaatact gaaattgaag actccgctga cctcttatgt ggcacgccac 

agttgggcga ccacggccct gcagaaaggg gttccggttt cagtgatcag tgcaggaatg 102 0 

gggcatgctt cagagaaggt gacatacatt tatctggcat cttttgataa caaaacgctc 1080 

agtaacgcaa ataaaaaagt gattgccgcc gtgagattta agaaagagga ggaggagtga 1140 



<210> 1269 
<211> 468 
<212> DNA 
<213> B. fragilis 



<400> 1269 

aaagaaagta cgattatgag taaagaagtt acgtattcgg ragtggcgcg caagaacatg 



60 



512 



ctgaagaagg atgatcccgc caagtattat gcccaggcac aggccagcgg cgatgtgggg 120 

ctggacgaga tctcgacccg ggtggaaaag gcctgtacgg tacattcggc ggatgtcgtg 180 

gcggtgctga aggcgctgga ggatgaaatg gtggatggcc tgtcgagggg agagattgtc 

aggctgggga atatcggcac cttccaggtg ggcctgcgaa gcaggggagc ggagaaggcg 

gaggatttca aagcggcgaa tatcagcaag gcgagagtga acttccgtcc ggggcctgtg 

ctggcggatg cgatgaagac gctcaatttc agtaaggtga gtacgcgggc ggcgcagaaa 

ggcgatggcg gcggggacgg ggatattgtg gatgacccga cagcataa 

<210> 1270 
<211> 315 
<212> DNA 
<213> B. fragilis 



240 
300 
360 
420 
468 



60 

120 

180 

240 

300 

315 



<400> 1270 

aaacacactg cattccggat atttttccgt acatttacag tatgtaacaa tcagataatc 

aaagccatga atcaaagaaa agaagaagac acaaccgaag ccgatttcat catccgctcg 

tacaccaaag ccgaacttgc acagctttac tgcccgggac tcgaccccgt gctcgccctg 

cagaaactct accgctggat gcgtaaaaac accgccctga cacaggcact gtccgaagtc 

aattacaaca aataccgcca cagcttcctt aaacgggaag tccggctgat cgtgtattac 
ctgggagaac cttga 

<210> 1271 
<211> 639 
<212> DNA 
<213> B. fragilis 

<400> 1271 

acaatcggtg caagaacttt gttcacgatg gacaataatc taaaaaataa aattgatatg 
aaaactttat tcgacgagat ggaacacgca gtcaaaaact ggtggttatc tcttattctg 
ggtattctgt acatcatcgt ggctctctgt ctgctattcg caccgggaag cagttacatt 
gccctcagcg tcatcttcag catttcgatg ctgataagtg gtatcatcga aatcatcttc 
tccatcagca accggcgagg catctcgtcc tggggatggt acctcgcagg cggtatcatc 
gatctgatct taggcatcta cctggtagcc tatccgctgc tcagcatgga agtcataccg 
ttcatagtcg ccttctggat gatgttccgc ggtttctctg ccacaggcta ttctatggac 
ctgaagcgtt atggcacccg tgagtgggga tggtacatgg gattcggcat cctcgccatc 
atttgttcgc tgatcatcct gtggcagccg gccgtaggtg ccctctacgt tatatatatg 
ctggcattca ctttcctgat catcggattc ttccgtgtca tgttgtcctt cgaactgaaa 
agccttcata aacgatcaac ggtgatgaac ggtaaatga 

<210> 1272 
<211> 1449 
<212> DNA 
<213> B. fragilis 

<400> 1272 

ttgctaatta aattgggcag aaaaaactcc ccgtccgaaa gagagcggga attacagaac 
cttatcgcac aatacgaagc ggtaaaagcc aaaaacgaat cgctatacct ggacggtgac 
caattggctg atattgccga tctatatgcc tccgaacgta agtttaagga agcccaggag 
gtcataacat acggactcgg actccatccg ggacacaccg accttatggt cgaacaggct 
tacctgttcc tggatctcaa tcaaccccaa aaggccaaag aggtggccga actcatcacc 
gacacttact cctccaacgt aaaactactc ctggccgaac tgctgctgaa cgaaggtaaa 360 
ctggatgccg ccgaccaaat gctcgacagc atagaagagg aagagaaaaa cgacctgggc 42 0 
atcctggtcg atattgtcta cctgtacacc gacttaggtt atccggagaa aggcgtacaa 
tggctcaaac ggggtgcgga gatgtacaaa gatgacgaag acttcctggc agccaccgcc 
gattgctacg gtgcggccgg agcagaatac atagagcaag ccatcgttgt cttcaacaag 
ctgatcgaca agaatcccta taacccggct tattgggtag gcctggccaa atgccagttt 
gccaccaaag atttcgataa agccatcgag tcctgtgact ttgccatcgc agccgacgaa 
gaattcggcg aagcccacat catcaaggca cacagcctct ttcatcttga gaacatcgag 
ggagccatcg ttgaataccg gaaagccctg aagtataaaa cgctctcccc cgaatttaca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

639 



60 

120 

180 

240 

300 



480 
540 
600 
660 
720 
780 
840 



513 



tatatgttca tcggactggc ctacacccag caggaaaatt gggctgaagc caacgaaagt 9 00 

tacagcatgg ctctccgggc gatcgaagaa aacggtaacg gttcttcccc gctattgtcc 960 

gatatctatt caaacaaagc actctgcgct tcccgccagg gcgactcgga agaagcccac 102 0 

cggctttgcc gactggcaaa agagctcgcc ccccaggatg cagagcccta cctgctcgag 1080 

ggacgcatct atatggaaga ggacaatttc gacctggccc gtgcagagtg ggcattggct 1140 

ctccgctatg cccccgaagc agatacctgg atggaaatag gcaactacag cctcgagttc 1200 

cgcatgctcg agaacgcacg cttctgcttc gagcaggtgc tggaagaaga tcccgagtac 12 60 

cccaagatct gcgaacaact ggccgccgta tgcctcgtcc ttcaggatca tgaaggattc 1320 

aagaaataca acgcgatgtc cggcgattcc atcaatctcg actccctccg ggatacgata 13 80 

ttggaaatgg gtgtcgacgg agaacagatg cttcgcgaac tggacgattt tttaaaagac 1440 

gaaaaataa 1449 

<210> 1273 
<211> 762 
<212> DNA 
<213> B.fragilis 

<400> 1273 

caccgatgca aattaaagat gtctttgatg aagagatcac caactggaaa gagttggggg 60 

gtgaagatct cccgatccgg ggtgttccgt ctggaagaca tcacgcacta ctactcggaa 120 

gaagaactgg gagaagccta cgagaatgca ggggaaaaga ttatggagct gattcagaaa 180 

acacccggta tcgttgcttt tattcctcag aagtttgtgg tacggccgga tgcggtacat 2 40 

ttcataaagg ataacaccat ttctgtgaag gaggtctttg ccggagcgga atggtttccg 3 00 

acggctactc ccgcaccttt gttcggcttt ctgccgctga taaccggtac gctttgggtg 3 60 

agcctgtttg ccatattgat tgctttgccg ttcggacttt cggtatcgat ctatatgtca 420 

gaggtggccg attcgaaagt acgtagttgg ctgaagccgg tgatagagtt gctgagcggt 480 

attccttcgg tggtatatgg ctttttcggc ttgatcgtga ttgtaccttt gattcagaaa 540 

gtatttgatt tgccggtggg ggagagcgga cttgcgggaa gcattgtgct tgccatcatg 600 

gcattgccca ccatcataac ggtgactgaa gacgccatgc gcaactgtcc ccgtgccatg 660 

cgcgaagcca gcctggcact cggagcttcg cagtggcaga ccatttataa agtagtgatt 72 0 

cgtcttcacc acggggctgg aaggatccga cgactggatc ta 762 

<210> 1274 
<211> 1275 
<212> DNA 
<213> B.fragilis 

<400> 1274 

aatcatagag atgcttttct aattatgctt ttgtccgtag ccgtagccat aaccgtactt 6 0 

ctttccgtat ccgtaatagc ggccgtattt tccgtatcca tagtaatagc cgtatttccg 12 0 

ctttttcatg tccaagccgt tgattacggt acacaggttg gggagcttgt tgcccaggcg 180 

gagatcttcg agcagggtat agtcggcttt gtgggtatag tcggcgcggc agacataaac 240 

cgagaggtcg gccacccgtg cgatgatctg tgtgtcggtc accattccga tgggagcggt 3 00 

gtcgagcacg atatagtcga agtgcttttt gaggatatcg atggcttcca ccaatgcctg 3 60 

gcgtgccagc agttcggtcg ggttgggcgg tacggttccg ccggggagga tgctcagggt 42 0 

gcgggatatc ccggagggct gcaccataga catcaggtcg gtggtctgcg ggccggccag 4 80 

gaactgggtg attccttttt ctttctgcga gaggttgaag accttgttga gtcccggctt 540 

gcggatgtcg agccctacga tgactactct tttgcccagc agcgagaggc tgatggcgag 600 

gttggtggct acaaatgttt ttccttcgcc gctgatggta gaggtgacca ggatgacttt 660 

attctcttcg cccagcataa actgcaggtt ggtgcggatg ccccggaagg tttccgccat 72 0 

caggctgttt tcgttttcgc ggacagcgat gcctcccgct ttcccgttac cctcttcggc 780 

cagggggatg tctccgatga tgggggcgga ggtcagcttt tccacatcgg cccgtccttc 840 

tatccggaac tggagcagcc ccatgacgta gatgatggct acggggatgc ccatccccat 9 00 

gacgagggct atcatataga tcgttttggt cttgggagaa acgggaccgt tgtccgccat 960 

ggcctcgtct acgatcttgg cgttgttggc agtggcggcc agtgcgatgg agttttcttc 102 0 

ccgtttctgc aacagcatca ggtagagtcc ggctttgatc tcctgctgcc gggagatgct 1080 

gacgaactgg cgttcctgcg cgggcgcatt gctgatgcgg cggttgaact tgttggcctg 1140 

acggtcgagg tcggccttgg tgatcagcaa tcctttctga acgctgtgga tggtggtcag 1200 

gatgttttcg cgcatggcac ggatgccgct atccaggttc acgatcaccg ggttgctttc 12 60 



514 



cgaggagttg cgtag 1275 

<210> 1275 
<211> 189 
<212> DNA 
<213> B.fragilis 

<400> 1275 

gaaatgcgct ggaaggagaa aaagaagctt attaaaataa agaaaatgaa taaaagaaat 60 

tacaccacag aggaacacag agtttcacag agaatttttt tcccccatca cacaaaagat 12 0 

gaaaaccctg tgaaactctg tgtactctgt ggtgagccac cccatagtaa tattctaaaa 180 

ttaagataa 189 

<210> 1276 
<211> 462 
<212> DNA 
<213> B . fragilis 

<400> 1276 

cggcatccgt gccatgcgcg aaaacatcct gaccaccatc cacagcgttc agaaaggatt 60 

gctgatcacc aaggccgacc tcgaccgtca ggccaacaag ttcaaccgcc gcatcagcaa 12 0 

tgcgcccgcg caggaacgcc agttcgtcag catctcccgg cagcaggaga tcaaagccgg 180 

actctacctg atgctgttgc agaaacggga agaaaactcc atcgcactgg ccgccactgc 240 

caacaacgcc aagatcgtag acgaggccat ggcggacaac ggtcccgttt ctcccaagac 3 00 

caaaacgatc tatatgatag ccctcgtcat ggggatgggc atccccgtag ccatcatcta 3 60 

cgtcatgggg ctgctccagt tccggataga aggacgggcc gatgtggaaa agctgacctc 420 

cgcccccatc atcggagaca tccccctggc cgaagagggt aa 462 

<210> 1277 
<211> 789 
<212> DNA 
<213> B. fragilis 

<400> 1277 

atgacaaaaa gactactttt tttcacgtta acctgtattt tgctggcttc ctgccagtcc 60 

tacaagaaag taccctactt gcaagacccg ggagaggcgc aacgtgccgt tgcagaggcc 12 0 

aagctctatg atgcccgcat cctgccgaag gatctgctca ccatcgttgt atcgtgcagc 180 

gatccggaat tggcagaacc gttcaacctg accgtatcca cccctgtcag caatacacaa 240 

aaaagcctga ccagccaacc ggcccttcag caatatctgg tcgacaaccg gggcaacatc 3 00 

gacttccccg tgttgggcac cctccatatc ggcggactga ccaagggcga agcggaaagt 360 

ctgatcaggg aaaagctgaa aggatacatc aaagaaaaca ccattgtgac ggttcgcatg 42 0 

gccaattata aaatttccgt catcggcgaa gtgaacaggc cgggcacgtt taccatcagc 480 

aacgaaaagg tcaacctctt cgaagccctt gccatggcag gcgacatgac cgtgtacgga 54 0 

ctgcgcgaca atgtccgcct gatccgcgag gacgccgacg gacaccagca catcatcacg 600 

ctgaacatga accgggcaga catcatccaa tcgccctact actatctgca acagaacgac 660 

atcctctatg tcacccccaa taagaccaag gcaaagacag ccgacatcag cgccagcacc 72 0 

accatctggt tctccgtggt aggcacgctc gtgtcgcttg ccagtttaat tatcaccatc 780 

gccaaataa 789 

<210> 1278 
<211> 450 
<212> DNA 
<213> B.fragilis 

<400> 1278 

aaatcaaaag gtatgaagaa ggtttttgaa agaattatag aaggaatatt gacttgtagc 60 

ggttttgtaa cgagtattac gattctgctg attgtgcttt tcctttttac cgaggcattc 120 

ggcctgttca gcagcaaggt tattgaagag ggatatgtgc tggcgctgaa caaagataat 18 0 

aaggtaagcg aactgacacc gatgcaaatt aaagatgtct ttgatgaaga gatcaccaac 240 



515 



tggaaagagt tggggggtga agatctcccg atccggggtg ttccgtctgg aagacatcac 3 00 

gcactactac tcggaagaag aactgggaga agcctacgag aatgcagggg aaaagattat 3 60 

ggagctgatt cagaaaacac ccggtatcgt tgcttttatt cctcagaagt ttgtggtacg 42 0 

gccggatgcg gtacatttca taaaggataa 450 



<210> 1279 
<211> 1413 
<212> DNA 
<213> B.fragilis 



<400> 1279 

aagtacagta tgaaacaggt attgcggttc aataaagtca ttaaaaggat tgtattcacc 6 0 

ggagatctca ttctcttgaa tggcaccttt ctgtccttgt acaccctatt ggggagcaaa 12 0 

ttttttgcag atccattcat tcactcactt ccccaagtac tggtattgct caacttatgc 180 

tacctggtta gcaacatgtc ttcaggtatc atattgcacc gccgtgtagt acgtcccgag 2 40 

caaatcgtat ggcgtgcctt acgcaacagt gcgggacacg ccttgttttt ttcctgcgcg 3 00 

ctcacctttg gaaacttcgg tatcctttcc gcccgctttt tcttactgtt ctacattgcg 360 

ttcactctgc tgttggtttg ttaccggtta ttgttccgca agatcctgaa gtcctatcgt 420 

aagcatggag gcaactcccg cagcatcatt ctggtgggaa gcaatagcaa tataatcgaa 480 

ctctaccatc aaatgacgga cgacgtcact tccggattcc gtgtcatcgg ctactttgac 540 

gacc&gcccg gcagccgctt ccccgaaaag gtgaactatc tgggaaaacc cggtaagatt 6 00 

gtagaccgcc tgaagcaggg aggagtcgag caggtttatt gttgcctgcc ttcggcccgc 66 0 

agcgaagaga ttctccccat catcgactat tgcgaaaatc acctgatacg ctttttcagt 720 

gtccccaacg tgcgcagcta tctgaagcgg cgcatgtact tcgagctcct gggcaacgtg 7 80 

cccgtactct gcatccgcca ggagccgctc agttttgccg aaaaccgatt caggaagcgt 840 

gtgttcgaca tcgctttctc gctcttgttt ctttgcaccc tcttccccat tatctatgtc 900 

attgtcgggc tgaccatcaa aatcacctcg ccgggtccca tcttcttcaa gcaaaagcgc 960 

agtggagaag acggacggga attctggtgc tacaagttcc gctccatgaa ggtgaacacg 102 0 

cagagcgaca ccctgcaggc caccttgcac gatccccgca aaacgcgctt cggcaacttc 1080 

ctgcgtaaaa gcagcatcga cgaactcccg cagttcatca acgtactgat gggcgacatg 1140 

tcggtagtag gcccccgccc ccacatgctg aaacacacgg aacaatactc acaactgatc 12 00 

aacaaataca tggtccgcca tttcgtgaaa ccgggtgtca ccggctgggc gcaagtcacc 1260 

ggctt ccgcg gagagactca cgaactatgg caaatggaag gacgtgtgca acgcgacatc 132 0 

tggtacatcg agcactggac cttcatgctc gatctatata ttatatataa aaccgtaaga 13 80 

aatgcgctgg aaggagaaaa agaagcttat taa 1413 



<210> 1280 
<211> 597 
<212> DNA 
<213> B.fragilis 



<400> 1280 

attatggtaa tagcttattt gagagtaagt acggaaaaac agtttttggc taatcagaag 60 

gaagagatta tgcgatttgc agagaagaat gggttgtcga ttgacaagtg gtacacagag 12 0 

accgtaagcg gaagcgtgag cacaaaagac agaaagttat cagagttatt gaagagaatg 180 

catcccgggg atacactgat tgtaacggag atttcgagat tgagccgtac actgctcgag 240 

attatgacta tcctgaattt ttgtattaag aagcaggtag tgctctatag caccaaagag 3 00 

ggctatgtgt ttcaggacga catcaacagc aaggtgctgg gattcgcgtt cggactgatg 3 60 

gcggaaatag aaaggaacct gatttcgatg cgtaccaaag aagctctcgc acgcagaaag 42 0 

caggaaggaa tgactttagg ccgaaagaaa ggagatacgc ctaaaataaa attgctgcgt 480 

gccaataagc gcgtacttac caaagaactt gacaaaggaa ctacttactc ggaattggcg 540 

gagaagatgg gggtatccag aacaaccctg ttccggttta tgaaaacgat gtattag 597 



<210> 1281 
<211> 651 
<212> DNA 
<213> B.fragilis 



<400> 1281 
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attaaacaaa tggaatctgt agccttcatt cagtggtgct 

accattactt tattaatgac cattgaaagt tctttcattc 

gttcctccgg cagcctacaa agcggcagtc aacgaagagc 

ctcttcgcca ccctgggagc caacctcggt gctattatca 

ctgggacgtc ccatcgtcta caagttcgcc aacagccgtt 

gatgaagcca aagtgcagca tgccgaagaa tatttcgaca 

ttcatcggtc gcctgattcc cgctgtccgc cagttgatct 

cgcatgaagc tgcacacgtt cctgatctac accactctgg 

atcctcgctg ccatcggtta ttacctttcc accgtacccg 

ctgcttgcca aagtaacaga atacagccat gaactgggct 

gtcttcatcg taggtttcct tgtctacaaa ggaatgaaga 



tagaccacct 


caattactgg 


60 


cttttccatc 


cgaagtagtc 


120 


tgaacatcta 


tctggtagtg 


180 


attactatct 


ggcccgctgg 


240 


tcggacacat 


gtgcctgatc 


300 


agcacggagc 


actctccact 


360 


ccatccctgc 


aggacttgcc 


420 


gtgccggatt 


atggaacact 


480 


gcatcgaaag 


cgaagagcaa 


540 


actgcttcat 


cgtcatcgga 


600 


aaaagaaata 


g 


651 



<210> 1282 
<211> 492 
<212> DNA 
<213> B.fragilis 



<400> 1282 
tgtttctcag 
attgtgattc 
gtctgccatc 
gggcggattg 
gcgacgagta 
acccggacgg 
tatcccggca 
gagatcgagc 
aaattacatt 



tgggaggaat 
attgctcggc 
gcaggcgggg 
tgtctacccg 
tcggtatctg 
agtggcaggt 
gccgtgtgtg 
ccgaggaatg 
aa 



gtcgaaaaat 
aacgagagaa 
attcaacggg 
tccggtggag 
ttatgaaggc 
acattcgatg 
tggccaccgg 
gataaagcag 



gaagggggaa 
gaccggtgtt 
ccgggatacc 
aagatcgggg 
gggctggacg 
cgggtgttgg 
gatctgagtc 
tgtccgtgtt 



gtatgaggaa 
ttacggagtt 
acttctatat 
cgcatgccaa 
cgaggggccg 
tgaaaacgtt 
cggacctgaa 
ttaatgtgat 



aatcgatttg 
cgatctggat 
ccggaaggac 
ggggcacaat 
tcccaaggat 
attgaaacag 
cgccaatggg 
tgaagataaa 



60 

120 

180 

240 

300 

360 

420 

480 

492 



<210> 1283 
<211> 858 
<212> DNA 
<213> B.fragilis 



<400> 1283 

aactctgggg tactctgtgg tgaatcaaac tcaaaactta atcatatgaa aattagaact 60 

atcttattcg gccgcctctc actcgcagca ttgcatgcca acgctcagcg catcaaaggc 120 

agtgacaccg tactgcccgt agcccagcaa accgccgaac gcttcatgaa ccgtgaaccc 180 

gacgcccgtg tcacagtcac cggaggcggt acaggggtag gcatctccgc cctgatggac 240 

aacaccactg acatcgccat ggcttcacgc cccatcaaat tcagcgaaaa gatgaaagcc 3 00 

aaagccgcca aacgggatat agacgaagtg atcgtcgctt acgatgcact ggctgtcgtt 360 

gtacacccgt ccgatccggt gaaaaaactc acccgccggc aactggaaga catcttccgc 42 0 

ggaaaaataa ccaactggaa acaagtggga ggcgacgacc gcaagatcgt ggtttactct 480 

cgcgagacct cttccggcac ctacgagttc ttcaaagaga gcgtcctcaa gaacaagaat 540 

tatatgagca gcagtctctc catgcccgcc accggagcta ttatccaatc cgtcagccag 600 

accaaagggg caatcggcta tgtagggctc gcttatgtgt cgccgcgcat caagactctg 660 

tccatctcgt atgatggcga gcactatgcc accccgaccg tagagaacgc caccaacaag 72 0 

acttatccca tcgtccgccc cctctactac tattatgatg caaagaacaa aacacaaatc 780 

gctcccctgc tcgagtttat tctctctccc gaaggacagg atattataaa aaagagtgga 840 

tatatacccg tgaaataa 858 



<210> 1284 
<211> 444 
<212> DNA 
<213> B.fragilis 



<400> 1284 

cttatgatgt ttctaatttt taaatattat accattatga aaacaatgat tatttcaatt 60 

atcgcagtat tagcaagtgt ggccgtatcc ggtcaaagtc tgacattaac agttaaagac 12 0 

gtggaacatg tggagggaac tctttatgtg gctatctatt cgtccaaaga gaactttatg 180 
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aagaaacctc ttttcggttt tcgggtggct gtgaaagacc gtacaatgac aataccttgt 2 40 

aaaggaattc ctgccgggac ctatgccatc tccctctttc aggatgaaaa cggaaatgga 3 00 

aagttagaca ccggttcatt cgggcgccct ttagagaaat ttggattcag caatgatgcc 3 60 

gagggtatca tgggagctcc ttcgtatgaa aagtgttgtt tcgaatttaa acgagatact 42 0 

acggtggtca ttcatttaaa atga 444 

<210> 1285 
<211> 2046 
<212> DNA 
<213> B.fragilis 

<400> 1285 

ctattgatca caaaaggatg cagttcgtca cacgtgaggt gctgcttatc acatatccgg 60 

gctgtccgtc ccggatattt ttttgctttg tgtcaatcct ttatgtttgc cgatgaaaac 120 

gttagcaaac agatcaccat gaaaggattt tatgtaccag ccgcattgag cctgatgtta 180 

ctctcgctcc cgatcgctgc acaaagtgta gctacggata ccattctcct ggccactttc 240 

aatgatagta tatcgttgga cgaggtagta atcaaagcac ataagacgcc gagggccaac 3 00 

agtcgttgga gcgatctgca acctgtcgat ctggtaacag taggaggttc caacggagat 3 60 

ttgtaccggg ctttgcaaac acttccgggg gctcaactcc aaggtgaaag cggtcgttta 420 

ctggtacgtg gagggaacag caacgaaaca caaacctata tagacggcat gcatgtgttg 480 

aatccctata ctactaccgg gactgatact cctgcacgcg ggcgttattc tacatttatg 540 

ttcagtggtg tcaatctggc ttccggagga cagtcgcagg agtatggaga ggctttgtct 6 00 

gctgttcttc ctttggagac caaagactat agtacggtta ataagttcgg gatgaatgtt 660 

tcgactgtcg gaatgggtgg aggcggtaca cgggctttca atcggtcctc attgtcattg 72 0 

aatctcgatt atcagaatct ggctccttat gatcgggttt atccttcgcg tacagatttt 7 80 

aaacggccgt atcgcatgtt atctggtgcg acccaatttc gctacacacc gaatgaaaag 840 

atccttttta aattttatgc gggatatgac cgtacggatt tttcgaatta cacagatatc 900 

gatcgtcatc tttttggcct gggtgaaaac aatatatacc tcaatacaac attccgcaaa 9 60 

cgtacagctt ccgattggaa ttggtttatc ggaactgcct actcttttta tgatcggaaa 1020 

gtgaaaggag cagtgaagga ccgggatgta tggaacgaac gccaacaaga gttccatctg 10 80 

aaagcaaagt tctccaagct gtttacttcg cggttgcgtc tggatatggg tgtggagact 1140 

tttgtgcgtt cttatcggaa cgactatcag ttggaaacat tgcgtgatac gcatcaaatg 12 00 

tatcccacta tctatgccgg atttctttcg tctgcttttt atctgtcgga gaatcttaaa 1260 

actgaaatat ctctccgccc cgaatatact tcgttaaacc ggacaatgaa ctggtctccc 1320 

cgggctgctg tcagttatac gtggaatcat ctgctggtgt cggtcgtagc agggcaatat 1380 

acccaacttc cggaaaacga ctatctgata aggaatatct ctttgccttc taatgtttgc 1440 

agacaagttc tttttagtct ccaatatgaa cagggaggcc ggttttacaa agcagagttc 1500 

tattataaaa attataagaa actggaatta tcggttccgg acggtatcac tcctgatgga 1560 

tatggataca gtaaaggtat tgatctgtat ttttgtgaca atgtcctatg gaagaatttt 1620 

gagtaccgtt tgtcttactc ttataacctc tcgaaacgta aatatcggga atatacagaa 1680 

cttacggtgc cacagtatgc tacccgtcat cacgcatcgc tggtgttgaa atacagtgtt 1740 

ccccgattgc gaaccatctt tagcgtgacc gatggggtgg caagcgggcg tccctaccac 1800 

aatccggaac tgtccggact gatgaatgat gaagtaaaat cctatcattc tctcgatttg 1860 

ggtattacgg ttctggcggg caaaaaagtg attgtacatg cttctgccac caatctgctg 192 0 

ggacgtaaga atgaatacgg gcgtatcgac ggagaggctg tccgtacttc aagcgaccac 1980 

tttttctatc tgggagtgta tatcacattg ggtaagaagg tagcttatga tgtttctaat 2040 

ttttaa 2046 

<210> 1286 
<211> 1200 
<212> DNA 
<213> B.fragilis 

<400> 1286 

aatgaacatt ttaaaataaa gacgatagtt atgatagatt taggaagctg gatgaacaaa 60 

atccttatcg gttggggggt agaccctaaa attgcaaata cttttgatga aacgattatt 12 0 

gccatcctga tgatttttat agcggtcgga ctggattatc tttgtcaggc catttttgta 180 

ggcgggatga agcgtctggc acggaagacc tcttataaat gggatacatt gatggtcaaa 2 40 

cataaagtta ttcatcacct gatccatatc cttcccggta tcctgatgta tatgcttctt 300 
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cctatggcgt 
tacatgattt 
ctcagtgcta 
gt acttgttt 
gctacattat 
agtattctgg 
gactgggtga 
acggtgaaga 
aacggttcgt 
agtatctttc 
cggaaggaga 
tcgcaggtgt 
gatcttgact 
tacttctttt 
gatcactttt 



tcgtacatgg 
ttgctttgtt 
aggagaagtt 
tctttatcgg 
ttgcgggatt 
gcttcgtggc 
ctatcccttc 
tacagaattt 
ttcagaactg 
ttgacctgac 
tacctttgct 
tcagggtgta 
tgattatcag 
cccgcaataa 
ttgcgatgat 



aaaaacgctg 
gctggctctc 
gaaagatcgg 
aggcattgtg 
aggtgcatcc 
aggtatacag 
gactaatgcc 
cgataatacc 
gcggggaatg 
tacattgaaa 
ggccgattat 
tgtagaacgt 
tcaaaaagaa 
aatctggaaa 
accgaagttt 



ctgctggttt 
aacagtagtc 
ccgatgaagg 
attgtggcta 
gctgccatcc 
ttatcggcta 
aatgggattg 
atttctactg 
acggaatccg 
ttctgtactc 
cagccggagg 
tatctttgca 
gctacggaat 
gaatacgaac 
gaactaaagg 



cgcagaagat 


ctgtgttatt 


360 


tgttgatgtt 


gttggatata 


420 


gctttatcca 


ggtattgcag 


480 


tcattgtcga 


taagtctcct 


540 


tgatgttggt 


ttttaaagac 


600 


atgatatgat 


acgtcccggt 


660 


tagaagaaat 


aaccctaaac 


720 


tccctcccta 


ttcactggtc 


780 


gagggcgtcg 


ggtgatgaag 


840 


ccgaaatgct 


tgatactttt 


900 


aaggagtgat 


tcccactaac 


960 


gcctgcccgt 


ggtcaatcag 


1020 


atggagtacc 


tattcaaatt 


1080 


gtatccagtc 


cgatatcttt 


1140 


tatatcaata 


ttcggattga 


1200 



<210> 1287 
<211> 1863 
<212> DNA 
<213> B.fragilis 



<400> 1287 
aatactaacc 
ttatcgacgg 
tattatgtga 
gcggcacgtc 
tt cctgcatt 
cccgcactct 
gctggtttca 
accaagacaa 
gt acgcgagc 
ccttgggatc 
gaacaactga 
gccaatggtg 
accatccgtc 
gtaggcaacg 
ggtacgtatg 
gacttgggcg 
gtggatgtat 
agcctcaaac 
ctcaatattc 
gagtttgccg 
gcgtggactg 
aatgtggtga 
gaagcgttga 
cgcctgatac 
cgtctggcgg 
gctgcaaaag 
ccgttggtga 
aatgcggaag 
agagactgga 
ccgcagactg 
actccggata 
taa 



aatccatttc 
cccttcttgc 
agcacgtaga 
ttgtacctac 
ttggtatcaa 
tcaatcccac 
agatggccat 
ccgggcactc 
tgcgcgatgc 
ggaacgcctc 
ccgaactgtt 
aaggtcctaa 
gtctccaacc 
aacgtggatt 
cccgctgtga 
gacgcgacat 
cgatccgtcc 
acctgaccga 
ctcctgacca 
attatcgtaa 
cccggccggg 
tgctgcgcga 
ctgccgatgg 
gtattccggc 
ctaatatcag 
aaaactggaa 
ttgatctcgg 
cgaagccgac 
aagaggtgcc 
tctcgttcgg 
ctacgccggc 



accccaaaaa 
cgctatctgc 
gtttccacaa 
tccgcagcaa 
tacttttaca 
cgatttcgat 
cctgacagcc 
cgtggcggcc 
ttgtgataaa 
ttgctatgga 
gaccaactat 
tggaaagaag 
ccgtgccgtg 
aggacgtgaa 
agagcagaac 
gttggtcaat 
gggatggttc 
tatttatttc 
gcgcggacgc 
agagattttt 
tgatacgcgt 
agacatttcg 
atggaaagag 
tgtcgaagcc 
cgaagtggct 
tgatctgccc 
taaagctgtg 
gatggccttc 
gactaccgga 
taacaaagtc 
ccgggtagat 



cataacaata 
acagccggac 
ggagccactc 
ttggaatggc 
ggacgtgaat 
gcggagcagt 
aagcatcatg 
tctccctgga 
tacggcatca 
gactctccaa 
ggtgaggtgc 
caggaatatg 
actgccatta 
acggaatgga 
aaggcgttgg 
gccaaggaac 
tatcatcagc 
aaatctgtag 
atcagtgatg 
gccgataacc 
gtctaccagt 
aaaggacaac 
atagcgaaag 
cgtcaattga 
gcttactatg 
cgtactgcat 
gatatgaccg 
cgttataaat 
gaattcagta 
agtgcacgct 
ttgaaagaaa 



catttatgaa 
aagcccagga 
tggaacaaaa 
agcagatgga 
ggggagacgg 
gggtacgttc 
atggattctg 
aagacgggaa 
agtttggtgt 
aatacaatga 
acgaagtctg 
actggactgc 
tgggcgatga 
gtgccaccgt 
gtgtgaaggc 
tcttctggta 
aggaagacaa 
gctacaactc 
ccgatgtcaa 
gtgtcaaagg 
tgaagccgaa 
gcatggaggc 
gaactaccgt 
gggtgaaggt 
cccgtccgct 
ggaaacaggt 
gatttgtata 
tctatatcag 
acatcatgca 
acattaaact 
tagggatccg 



gaaattaatc 
gacgaacgac 
ggtagacatg 
gttgactgct 
caaagaaaat 
gctcaaagaa 
cctgtggccg 
aggggatgtg 
ttatctttct 
attctttatc 
gttcgacggt 
catcctttct 
tgtacgttgg 
gctgactccc 
gacttccaag 
tccttccgaa 
tcaggtgaag 
agtgttgttg 
tcgtctgaaa 
cggcttgaaa 
atcggaaatc 
tttcacagtc 
cggttataaa 
cgatgcttgt 
cgaagagtcg 
aactgccgct 
tgccccggct 
cactaatggc 
caatcctgta 
tgatgctacc 
cttgcagaag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1863 



<210> 1288 
<211> 969 
<212> DNA 
<213> B.fragilis 
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<400> 1288 

tctccctgtt tatcgtatct gtccgatatt ttatattact ttgcagggaa agcgaataaa 60 

atacaatata tgggaagtca ttggggtgaa aaaatacgaa gatacctgca aagcctttat 12 0 

ccggttgcat atcaaagatg gaaggttgtg ttgatttctt ccctggttgt atttctgata 180 

ttactggttt tacagccatt tgggatatcc ggcattaggc agcataagtt ctggatactt 240 

gtgggcttta tgggcgtgac ggcagtatct ctgagtattc cgatgtatgt ctttggcaaa 3 00 

ctgtttccga agttctataa agaagaaaca tggactgtgt ggaaacagat tgtcaatctg 3 60 

ttacagatac ttttttttat agctatcggt aactggattt attctactct cgtttttggt 42 0 

tggggattac gttgggatgt cttttgtgcc tttgcattat tcactttggt catcggactt 480 

tttcctactg tacttttcat tttgttgaat caaaatagac tgttggccat tcatctgaaa 540 

gaggctaccg agatgaatct ccatctgcaa cgttcggtat tgctggccga atctgtggaa 600 

acaacacaag acagcccttt tcttttgttt cagggaggca tccgggaatc cttggaattg 660 

gactctaaag acttgttgta tgtagaatcc aatggaaact acatccgggt aaattatcag 72 0 

aaagcaggta aaaacgttca gtgtctgttg cgtgcaacca tgaagcaggc agaagaagta 7 80 

acagccgttt gtccgttggt gctgaagtgt caccgggcct tcttggtcaa cgtccgtaag 840 

gtggtgaaag tgaatggaaa ctcccaggga tatcgtttgc ttctggaggg ttgcccggaa 900 

gagatacctg tatcccgagg ctattcgaaa caggttaaag aactgataga aggtatctct 960 

ggcgactaa 9 69 



<210> 1289 
<211> 276 
<212> DNA 
<213> B. fragilis 



<400> 1289 

tcagtacaaa agtctatatt tgcacccaca aaattaatag aaatgctcta tacaatactg 60 

ataactctgt taatagttgc tatttgcctt ggtttattag gcataaaagt cttttttaca 12 0 

aagggtggaa aatttcccaa cgggcacgtc agcggcaata aagcgttaag ggagagaggc 180 

ataagttgtg cacagtccca agaccgggaa gcacagaaga aacgacgttt ttctattgat 240 

gaaattgaaa aagccttaaa cgatagtatg aactaa 276 



<210> 1290 
<211> 630 
<212> DNA 
<213> B. fragilis 



<400> 1290 

ttaattaaaa aactatattt tacaagtatg aagagaatga attacctcat taacggattt 60 

gctgctcttg cattcctctt tcttttttca caatgcgctg gtaaagctga taatgctgct 120 

cctgccgctt ccggaaatgc gaacggcact tctggtttga aaatcgctta tgtagaagta 180 

gatactttgt tgtctcaata taatttctgc aaagacctga atgctgacat gatcagcaaa 240 

gaggagaaca gccgcatggt gttgaatcag aaagcaaatg aactgcgtaa atctcaacag 3 00 

gaattccaga agaaatatga aagtaatgct tttatttctc aggaaagagc acaacaggaa 3 60 

tatacacgat tagctaaatt ggaacaggat ttgcaggctc tgcagaacaa actggctaca 42 0 

gagatggcat cggagaatgc taaaaacagt cagattctgc gtgactctat taacgctttt 48 0 

ctgaaagaat ataataaaac aaaaggttat aacctgatta tcagcaatac cagctttgat 540 

aatctgctgt atgccgatag cacactgaac attactaaag agatcgtaga cggacttaac 600 

gcaagatata ctcctgtggc taagaaataa 63 0 



<210> 1291 
<211> 864 
<212> DNA 
<213> B . fragilis 



<400> 1291 

caaataattg aatccatctt attgatatat aaatctttat tattcatgat gcgaatccga 60 

attatcatta cgttgctgat tgctttattt gtggagagtc ggcaggaagc cttcgggcag 12 0 

actgtcgaca cactctccct gtccgataag gtaatccgta ccgcttcatt tgcgaccgga 18 0 
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tttcgtggtg 
acaaggttgg 
ggtgataagg 
gatcgtgtat 
gaaaatatag 
ttgaaaggag 
tggggaatca 
cgtaatacag 
cgtttgggag 
gcggataaag 
tttgccggaa 
ggatcagcgc 



aggtatggag 
acgtaaatgg 
atacccatat 
tcggttcagc 
actggaaact 
aaacttacta 
ctggtggtta 
cttccgatct 
tttctacaga 
gctctacttc 
atcagaccgg 
tatgtactct 



gaatccggca 
agcttatcat 
cggggtggat 
aggatacagg 
gattgctcct 
tttcaatgga 
tagggcttcc 
ttcttttgct 
tttccggctt 
agtataccat 
aacgaagcat 
tgcc 



ctttattatt 
gataaaggaa 
gttcactctt 
agtgagaaac 
tacgtgaccg 
ggttacgcca 
cataattatc 
ttgggagcag 
tatcagcaga 
atgttgggat 
caaggtacgt 



actatactcc 
aagcatcttt 
tcgtcattct 
aggagaatgt 
gtgattctat 
gtgagtcggg 
gggataaaga 
gatatcgttt 
aaagtgagat 
tgggtatgga 
cttcaccacg 



gtatacatgg 
gaaacaggaa 
ttcgggacgt 
gctatggaac 
tggaggtttt 
ttcttggaca 
tccgcgtccg 
gggagcgtat 
ttcgttcctt 
ttatgtgcgt 

ggggtggatg 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
864 



<210> 1292 
<211> 1071 
<212> DNA 
<213> B.fragilis 



<400> 1292 
acttcaaccg 
cttatttccc 
ttttggaata 
tttctgcccc 
gtagcccgaa 
gaagtggaaa 
ggttatcggt 
tatcaacgtg 
ggtcatcgcc 
ctggacataa 
taccggcttt 
ccctccgcca 
gtacaagtgc 
ctgctggcac 
ggactactgg 
accgaagaaa 
tacggaggca 
gatcacttac 



aacagcattt 
tctcttccct 
ccgaaaactt 
attcaatgcg 
cactcacagc 
acgatacagt 
acgtgatgac 
accggttcaa 
cgacaagaga 
tggtggcaca 
acgctgcaca 
aactgatcat 
tacaggcact 
gaaaggcaaa 
accatctcat 
aaaaggcaaa 
tgcaaccctt 
cggtgtatgt 



aaaaggaatc 
ttttctttct 
tttcgatacc 
tcactggaac 
catcggagaa 
tatgcgtgac 
ccactgctcc 
acttctatcc 
tattctgcac 
cctcccttcc 
aaagttaaag 
tatgggggac 
gtctcccgaa 
agaccgtaac 
cgtatcagga 
cgtagcacgc 
ccgaacctat 
cgattttgaa 



caatctatgc 
gcacagaatc 
cgacatgatt 
caccgacgat 
tggaattttc 
ctgacccttt 
gatctgcgcg 
tactctgccc 
gtcagcggat 
cgctcaggcg 
gatgcggcag 
tttaacgatt 
gtgagtaccc 
ttcggttcct 
acgctacttg 
ttgcccttcc 
gtcggaatga 
accaatcaat 



tccgtctctt 
gacctccttt 
cactgaaaaa 
ataaaaagaa 
ccgcattgat 
attccccact 
gcatcgatgt 
tctccgtcgg 
tactgctcac 
gagtacgaca 
atagtcttat 
atcctacaga 
atcatgaccg 
ataagtatca 
atatttcggg 
ttctgacaaa 
aataccagga 
ccgaatattg 



cttcatatta 
ccgtgtcgtc 
cgatatggaa 
attagataat 
aggtctctgt 
gaaagaagcc 
ggcattactg 
aaatttcaaa 
aggcgatacg 
gtcggaacct 
caatgtacgt 
taaatcagta 
actttaccac 
gggtgaatgg 
cacactcttt 
agatgaaaag 
aggatacagc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1071 



<210> 1293 
<211> 1227 
<212> DNA 
<213> B.fragilis 



<400> 1293 
aacaaaatag 
atgctgggat 
accactactg 
gaaaatataa 
gcagatggat 
tccgtggagg 
aagttggaac 
at atttattc 
cgtatctata 
caattccaaa 
gcagtaggtt 
gaatctttta 
gatttaagta 
gctgttccca 
cagggaaatc 



aaatgaaaat 
ttaccgcatg 
tagataccac 
atacatcggt 
tgtataatgt 
tcgatgtaca 
tgacggttca 
ccggaaccta 
acaactcgga 
ctacccaaaa 
ctgttgtagc 
ttctgtgtga 
aggccaattt 
atctggatat 
gtgcttatgc 



gaaaaagaat 
tagtgatgat 
tatcgagggc 
taagacagac 
gacttttata 
gggagtacag 
tgtactcaat 
taatgaagct 
taaagttttg 
ataccagtcg 
ggtgccgggc 
caatgcgatc 
tgagtggtac 
ttattattgt 
tatcggcaga 



caattgacaa 
gacaaagtaa 
ctgcagctga 
gttgcctatc 
ggtaaaggta 
cagaatgtga 
acaggcgagc 
ggaaagcaat 
tatgccgatg 
gtagatccgg 
agtggaacgg 
aatcataaag 
atagagtcaa 
tattccaaaa 
ttgccccaag 



tggcaatcct 
atatatccac 
ccggtggaac 
cgctccaaag 
cttattcaca 
ctgtttccgg 
ccggttttgt 
ataatggcga 
gattgatctt 
atattatgaa 
atcatccggt 
aggctaatcc 
aacaagatgt 
ctatttgggt 
gcatgacaaa 



ttttgcgacc 
tgtaagcatt 
ctatactttt 
cattgaattg 
aaatggaact 
aggatcatgc 
gattgccgag 
tcagtatgta 
tatggaatca 
tgaagccatt 
acaaccggga 
gaattcaata 
ggataatccg 
gttgaataaa 
agaaaagtat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 



521 



atttcagact atgcatacaa ttatacgtat atcatgcaaa acgggacagc cagcaaaccc 960 

caaagcaaat ataagttccc caatgaatgg ataattgatg ccgtaaacgt gggggcttcc 102 0 

aatgaatggc agtggaatgt tacttctacc ggattggata tgggacatac ttatgtcggt 1080 

gtgaacaata ctattgcaga aaacatcgga aaatgcgtaa tgcgtaaagt agcctataag 1140 

gatggtgagc gtgaagtatt gcaggataca aataattcta ctgttgattt tacacctgca 12 00 

gccactccga gcctatttaa caaataa 1227 

<210> 1294 
<211> 345 
<212> DNA 
<213> B.fragilis 

<400> 1294 

gttattagtt ttatctttgc aactggaatt aatgagaaaa tcatgggcaa attaatcact 60 

atctgggagt tcataggcag acataaatac tggattaccg tcgtggcatt cggtgtcatc 12 0 

atcggatttc tggacgaaaa cagtatgatc cgccgcatcg gctacgcacg cgaaatcagc 180 

cgcttgcagg gagagattga taaatatcgc gcagaatacg aagagaatac ggagcgcctc 240 

aacgaactga gtaccaatcc cgaagctatc gaacagatag cgcgcgaaaa atacctgatg 3 00 

aaaaagccca acgaagacat ttacgtattt gacgaggaag aatga 345 

<210> 1295 
<211> 2820 
<212> DNA 
<213> B.fragilis 

<400> 1295 

atagatacac tgttatacct aatttttggt attgcctctg ctgcggatat cttatacttt 60 

tgcgagctaa tagaaagtaa gatgattcgt aaaattgcat atacattttt ttcttttcta 120 

atttgctgca atgtatctct tgcgtgggga cagacatttg cattccgagg tactgtattg 180 

gatgaacaga ctcataaggc attggattat gccaccattc agttgtttgt ggaaaagcaa 240 

tttgcttatg gaggaataac agatgcaaac ggacattttg aacttcttca tatccatcct 3 00 

gggacctatc ggattatcat atcttattta ggatatgatt ctaccgagaa agaaattaag 3 60 

gttgtgggaa atacttctga tatattttat ttaaagccct cgaacatggc gctgaacgaa 42 0 

gtggtagtga ctgcttccga atcaaagcgt gcgaccagtg cttccatagt tgatcgtacc 480 

gccatgaaac atcttcaacc cagcagtttc agtgatttga tggaattggt gcccggagga 540 

aaatcggctg atcctcagat gggacaggct aatctgatcc gtatacgtga aacgggtaag 600 

acggaagata tctcttcatt gggagtcgga ttctatattg atggtatctt tcagaatacg 660 

gatgcgaatc tgcaatatat gccgagtagc acttctgccg taaatgcaac gagtacgatg 72 0 

tcaaaagggg tggacatgcg tactatcccg actgataata ttgaaaaggt ggagatcatt 780 

cgtggtattc cttcggtagc ttacggtaac gtggcaaacg gtgcggtgat tattcagcgt 840 

aaaaccagtg aaagtccgtt atccgcacgt ttcaaagccg ataagaccag taagctgttt 9 00 

tctgtaggta aaggattccg gttggatggg aacggacgtt atgttttgaa taccgatttg 9 60 

agttatctgg attcaaagat agatccccgc aatagtgtga aaaactatac ccgcctcaca 102 0 

gcttccgccc gtctggatgg aaagtggtta tggaatgaac gcaatattca ctggaatctc 1080 

agtaccgatt ataccggttc gtttgatgat gcaaagaggg ataaagatgc gacagtaaag 1140 

gaagactcct ataaatcaga ctttagcagc ttcaaaatgg cgggaaaatg gaatctgaaa 12 00 

ttttcgaatc attcgtggat tcgtgagatt catgcggcaa cctctgttag ttggcagtgg 12 60 

gagaagatgc gtgaaacaaa atccgtctct ctgaatcgtc cggctgccat tgccactcag 1320 

acagagaccg gagaatctga cggtatctat ctgccttata attatgtggc acagatggag 13 80 

attgacggta aacctttata tgtcacggtt tccgcacgta cacatttggc ttttccactg 1440 

ggtggactgc aaaataggat gaatttggga gtggaatgga actaccagaa gaatttgggg 1500 

aaaggccagg tatttgatgt gacacgacct atcagtgaag gtttaagtac gcgtccgcgc 1560 

cgttttaaag atataccggg actgcaacct tttgctttct atgccgaaga ggtgttgaat 1620 

cttcccgtaa aaaggcataa attggctttt acagccggta tccgtttgca gtctttattg 1680 

gggctggacc gcaaatatga gatgcagggt aagatctatc cagaccttcg tctggatctg 1740 

caatggagtt tgcctacatc caacggatgg aatattgcct tttcgggagg tctgggctgg 1800 

atcagccgta tgccgactac cgcacagctg tatccggact tcaagtatgt ggatttgatt 1860 

caattgaatt attatcataa ccatccggat tatcgccgca tcaatatgat gacgtataaa 192 0 

tgggataata ctaattatca gttggaaccg gcacgtaaca tgaaatggga agtgcgggcg 1980 



522 



gatattggtt acaagggaaa ccgtttgtcg gcgacttatt tccgcgaacg gatgaataat 2 040 

gcttttgatg acctcacata ctataaatca ttggcatata agttatatga tccggcaagt 2100 

atcgatggtt cggctttgac agcccctccc gaactttctc agttgactta tgccaatgaa 2160 

tacaacctgg atgtgtattc cacgcaagga aatggaatga aggtgcataa agaaggagtg 222 0 

gagtttcagt tcgcttccag acgaattgaa tcgcttaaga ctcgcgtcac agtgtatgga 22 80 

gcttggataa agacagttta tagttcggat tctcccaaat acaaggcctc ttctatattg 2340 

ctggacaaca aacagttgaa atatgtggga ttgtaccagg gggagaacgg gacagaaagt 2400 

caggcattca atacgaattt tatgtttgat acttatatac agcgtttggg acttactttc 2460 

tccacatcgg cacaatgtac ttggtatacc aacagacgga acttatggaa taatggcgtt 2 52 0 

ccggtcagct atatcgacca atccggtgaa acacatctat ttcgtgaaga agataaaaac 2580 

aatattcagt tacagcatct ggtagagaaa tattcggcta cctattttga gcgtaccacc 2640 

gtaccttttt atatggatat caatctgaag gcgagtaagc ggatcggtaa atatctgaac 2 700 

ctggcttttt atgtaaaccg gcttttaggt atttatcctg attataccct acggggtgtg 2760 

ttgcagcgga gaacctccga gtcgccttat ttcggtatgg agatgaacct gactttttag 2 82 0 

<210> 1296 
<211> 1701 
<212> DNA 
<213> B.fragilis 

<400> 1296 

cagcacacat ggatcgtaca catcgcatgt atgaacgttc gaaaaatcat cctgccatcg 60 

ttatctggtc attgggcaac gaagcccgga aacggaatca atttcgagcg tacctacgat 12 0 

tggctgaaat cggtagagaa aagccgtccc gtccagtacg aacgtgccga gcagaattac 180 

aataccgata tctattgtcg aatgtatcgc agtgtcgacg aaatcaaggc ctatctggcc 240 

cagaaagata tctaccgtcc gttcattctt tgtgaatatg tgcatgccat gggtaacagt 3 00 

gtaggcggcc tgaaagagta ctgggatgta ttcgagaata atccgatggc acagggtggc 3 60 

tgtgtgtggg actgggtaga ccagtcgttc cgtgagatcg actcaaacgg tcgctggtac 420 

tggtcgtatg gtggagacta cggaccgaag ggaattccga gcttcggtaa tttctgctgc 480 

aacggtctgg tgagtgccga tcgtgtgccc catccgcatt tactcgaagt gaaaaaaatc 540 

tatcagaaca tcaaatgcac cttgatcaat aagaacaatc tgaccgtaag ggtgaaaaac 6 00 

tggttcgact tctctaacct caacgaatat atcctccact ggcaggtggt gggtgacaat 660 

ggcaaattgc tggccgaagg taacaaagag gtgaactgtg cgccacacgc tacagccgat 72 0 

gtgactttgg gaaaagttgc cttgcctgcc aatgtccgtg agggttatct taatctgagc 7 80 

tggacccgca aagaagcttc accgatggtt ggcaccgatt gggaggtggc ttacgaccaa 840 

tttgtgttgc ccggaaccaa aggtagtaca gcctatctgc ctgctaaggc cgggcagaca 9 00 

gctttcacgg tggataaaga aaccggtgct ctcaactcat tgacactgga tggacaagaa 960 

ttgctggcaa ctcctgttac gctgagtttg ttccgtcccg ctacggataa tgataaccgt 102 0 

gatcgtaatg gtgcatacct ttggcgtaaa gccggactca atcagttgac ccagaaagtg 1080 

gtgtcgctga aagacggcaa gaaagccgct actgcgaaag tggagattct gaatgcgaaa 1140 

gggatgaaag tgggtgatgc cgatttcgcc tattcactaa actctgccgg agccttgaag 12 00 

acgaaagtga ctttccggcc cgatactgcc gtggtgaaat cgatggcacg cctggggctt 1260 

actttcgaga tgaatgatac gtatggcaat gtagcttatc tgggtcgggg cgacaacgag 1320 

acttattccg accgtatgca gtcgggcaag atcgctctgt atcagactac ggccgaacgt 13 80 

atgtttcatt actacgtcac tccgcagtct accgggaacc gtacggatgt ccgctggatg 1440 

aagctgacgg acgaaaccgg acagggcatc tttgtcgatt ccaaccgtcc tttccagttc 1500 

agtgtcatcc cctttgccga tgatgtattg gaaaaagccc gccacatcaa cgacctcgaa 1560 

cgtaacggtc atgtcaccgt acatctggat gccgaacagg ccggtgtggg aacggcaacc 1620 

tgcgggccgg gcgtacagcc gcaatatcgg gttcctgtga cggaacaaag ctttgagttc 1680 

acgctgcgta cagtgaagta a 1701 

<210> 1297 
<211> 1926 
<212> DNA 
<213> B.fragilis 

<400> 1297 

aacttatttc gcttcctttt ttatccttgc acttttttgc gtacatttgc acaaactcaa 60 

aaaatagaga ttatccggaa tatggaaaac tatatcgtat ccgcccgtaa ataccgccct 120 



523 



tccacttttg agtcggttgt gggacaacgg gcactgacca ctacactgaa aaatgctatc 180 

gccactcaga aactggcgca tgcttacctg ttctgcgggc cgcgcggagt gggtaaaacg 240 

acttgtgcac gtatctttgc caagaccatt aactgtatga accttacggc agatggcgaa 300 

gcctgtaatg aatgtgagtc gtgtgtcgct ttcaacgagc aacggtcgta caatattcac 3 60 

gagctggatg cggcgtccaa taactcggta gacgatatcc gtcagttggt ggagcaagta 420 

cgcattccgc cccagattgg caaatataaa gtatatatca ttgacgaggt acacatgttg 480 

tcagcttcgg cattcaatgc ttttctgaaa acacttgaag agcctccccg ccacgccata 540 

tttattcttg caacaacgga gaaacataag attcttccta ccatcttgtc gcgttgtcag 600 

atttacgact ttaaccgtat cagcgtagat gatactgtga accatttgac ttacgtagcg 660 

tctaaagaag gaatcacggc ggagcctgaa gccttgaacg tgattgcgct caaagcggat 720 

gggggtatgc gtgacgctct ctctatcttt gatcaggtgg ttagtttcac cggaggaaat 780 

atcacctaca agagtgtgat tgagaatctc aatgtattgg attacgagta ttacttccgt 840 

ttgacagact gttttcttga gaataaagtg agcgatgcct tgttgctttt taacgatgtg 900 

ctcaacaagg gattcgacgg aagtcacttt attaccggac tttcatctca tttccgcgac 960 

ttgctggtta gtaaagatgc cgccacgctc cagttgcttg aggtaggtgc cggtattcgt 102 0 

caacgctatc aggaacaagc gcagaagtgt gcgttacctt tcttgtaccg ggccatgaag 1080 

ctttgcaatg attgtgacat gaattatcgt gccagcaaaa acaagcgttt gctggtggaa 1140 

ctgactttga tacaagttgc gcagcttacc gtcgaggggg atgatggtag tggtgggcgt 1200 

ggccctaaac aagctataaa acccgttttc acgcaacccg ccgctgctca gcagcctcag 1260 

gtagcaccta ttgcttcacc atctcagagt atgaatgctg ctacacctgt tgctccgcag 132 0 

gctgtgtctc aacaggttgg tagttccccg gctgtcaatg tacgtccggg tggagcaatt 1380 

tctccatcgg gtgctatgcc cgatgcggta cggatggctc aatttaaaga ggaaaagaag 1440 

attcctgtca tgaaaaagtc cagccttgga ctttccatca agcatccgca aaaggaggaa 1500 

gaacagcggg gagcgggtgt tgtccatact gctcagatgt cgacgcagca gatcgaagaa 1560 

gattttattt ttaacgaacg ggatctgaat tattattggc aggagtatgc cggacgtatg 1620 

ccgatcgaac aaaaggcgat agccatgcgt atgcagaata tgcgtttgtc attgctcaac 1680 

gacacgactt ttgaggtagt ggtagataat gaaatcgttg cgaaagattt cacggctctg 1740 

attcccggta tacaagctta cttgcgtggt tcgctgaaga atcgtaaggt gacaatgact 1800 

gtccgcgtca gtgaagcaac cgaaaatgta cgtgctgtca gtcgtgtgga aaagtttcag 1860 

atgatggctc aaaagaataa tgcattgttg cagttgaagg aagaatttgg gttggaactc 192 0 

tactaa 1926 

<210> 1298 
<211> 1479 
<212> DNA 
<213> B.fragilis 

<400> 1298 

aaagaaaaga gaatggaaaa atcagaactg aaaccggccg gtgtatttca cttcttcaat 60 

gaaatctgcc aggtgccccg tccttcaaag aaggaagaga agatgatcgc ctatttaaag 120 

gcgttcggag aaaaacataa tttagaaacc aaagtagacg aagccggcaa cgtgcttatc 180 

aaaaaaccgg caacaccggg taaagaaaat ctgaagacag tgattctgca atcgcacgta 240 

gacatggtgt gcgaaaagaa taatgatacg gaccatgact tcctgaccga tcccatcgaa 3 00 

acggagattg acggagagtg gatgaaagcc aaaggaacaa ccttgggagc cgacaacggc 360 

atcggagtag cgaccgaact ggccattctg gctgacgaca gtattgaaca cggtcctatc 42 0 

gaatgtctgt tcactgtaga tgaagagaca ggactgaccg gtgctttcgc cttgaaagaa 480 

ggctttatga gcggagaaat tctgcttaat ctcgactcgg aagacgaagg tgaactttac 540 

atcggttgtg cgggcggtat tgatacagtg gccgaatttc aatatgaaaa tgaaatgaca 600 

cccatcagcc acctctgctt ccgcataacc gttaaaggtc tgaaaggcgg acactccgga 660 

ggggatatac atctgggacg cggtaatgcc aacaagatac tgaaccggtt tctctatcag 720 

atgatgacta cttaccagga ggacttccac ctctatgaat tcaacggagg taatctgcgt 780 

aacgccattc cgcgtgaagc ttcggctgta ttctccgtgc ccgaacatta caaacatgac 840 

atacgtacag ccttgaacgt attcaccgcc gaaatcgaaa acgaacttca tcgggtggaa 900 

ccggatctga acattcttct tgaaacagag ccgcaccgcg actggtccat cgactcgagt 960 

acttcctatc ggctgattac ttcgctatac ggttgcccgc acggagtata tgccatgagt 1020 

caagatattc cgggactggt agaaacttca acgaacctgg catctgtaaa aatgaagccg 1080 

gaaaacacca tccgtatcga aaccagccag cgcagttcta tcctttcttc tcgcgacgat 1140 

atagcaacaa cggtccgtgc cgtattcaga ctcgccggtg ctcaggtcaa ctggggtgaa 1200 

ggttatccag gatggaaacc caatccggat tcggaaatcc taaaagtggc ggaagagtca 12 60 



524 



tataaacgcc tgttcggtgt tgatgccaaa gtaaaggcaa tccatgcagg actggaatgt 13 2 0 

ggcttgttcc tcgacaaata tcctgccctg gatatgattt cattcggccc caccttgaca 13 80 

ggagttcact ctccggacga acggatgcat attccttcgg tagataaatt ctggaaacat 1440 

ctgctggatg tgttggcaca tattccggct aagaactaa 1479 



<210> 1299 
<211> 669 
<212> DNA 
<213> B.fragilis 



<400> 1299 

acactagtag gtatgaaatt ctttattgat acagctaacc tggatcaaat ccgggaagca 60 

catgatttgg gagttctgga cggagtgacc accaaccctt ctctgatggc gaaagaaggc 12 0 

attaaaggtg tcgaaaatca gcgaagacat tacgtggaga tatgcaatat tgtacaaggt 18 0 

gatgtcagtg ccgaggtgat tgcaactgat tacgaaggaa tggtcaggga aggtaaggaa 240 

ctggcagccc tcaatccgca tattgtggtg aaggtaccgt gcattgccga tggcataaaa 3 00 

gccatcaagc acttttcggg gaaaggcatc cgtaccaatt gcacattggt tttctccact 360 

ggtcaggctt tactggctgc taaagcggga gctacgtatg tttctccttt cgtgggacgt 42 0 

ttggatgaca tctgtgagga tggagtcgga ttggttgcca atatcgttcg gatgtaccgc 48 0 

ttctacaatt atcctactca ggtgctggcc gcctctatcc gtagttccaa gcacatcatg 540 

gaatgtgtgg aggccggtgc cgatgtagca acttgtccgt tgagtgccat taaaggactg 600 

atgaaccacc cgttgacaga tgccggattg aagaaattcc tggaagatta taagaaggta 660 

aatgaatga 669 



<210> 1300 
<211> 999 
<212> DNA 
<213> B.fragilis 



<400> 1300 

tcaaaaaata taatcgatat gaaagcaatt tgtaataaag ggatttgtgt atttcttttt 60 

ttgtcgttac tgatgtcggc aactatggta aatgcacaga gagtgatcac agcgagtggt 12 0 

aagtatataa caaagaatat caaagtgacc cggtttgatc agatttattt gaaaggaagt 180 

cccacgattg aatatacgca gtccccggga gcatccgaag tacaaattgc aggatcggat 240 

aatttggtcg atttggtgga gtgccgtgta gaaggaagta cgttgatagt gaatatgaag 3 00 

tcacgtacca atatttctta tggtaaagag ggacgactga aaatcttggt ttccagtccg 3 60 

atgctgaaga gcgcttcttt gcaaggttct ggcgatatcc atttaggaag tctgaaagtg 42 0 

gaagggctgg atgtatcatt gatcggttca ggtgatattg ttgcggaaaa tataacttgc 480 

aacggtgatt tttctgccct gttgcaaggt tcgggtgaca ttgacgtgaa ggggcagctt 540 

cgtgctaaaa gtgtgaatct gaatttgcaa ggctccggtg atttgaaagt agcaggtgtt 60 0 

accggaagcg aaatcagtgc gatgcttcag ggatcgggtg acttgaaagt cggaagtact 660 

aatatcacat cgactgtaac ggcaaagttg agtggctcgg gtgatatgga tgtattggat 72 0 

attcgtgcca atagcgtatc cggacagttg gatggctcag gagacatgac tttgtcgggt 78 0 

tctgcttgta atgccacgtt ggttttgaac aggtcgggag aactcagtgc gcgaaaactg 840 

gatgctgaaa atgtaacggc tcatgtcaat ggatcagggg aaatctcctg tacagccacg 9 00 

aagacacttg aaaccaatat ccaaggtagt ggagaaattt cttataaagg aaatccgagt 9 60 

atacggtcga caggtaagaa tcatctgaac agactctaa 999 



<210> 1301 
<211> 1509 
<212> DNA 
<213> B. fragilis 



<220> 

<221> unsure 

<222> (12) , (13) , (14) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 1301 
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ttgacggtct tnnnatggag cggctccttc cagccccgtg gtgaagacta tgaacagtca 60 

ccctattatc tcaacctcaa cggtaaatgg aaattccatt gggtgaaaaa tcctgatctc 120 

cgtccgaaag acttttataa accctcattc tataccggag gctgggcaga tatcaacgtt 180 

ccgggaaact gggagcgcca gggatacgga actgccatct acgtaaatga gacttatgaa 240 

tttgatgaca aaatgttcaa ctttaagaag aatccccctc ttgtgcctta taaggagaac 3 00 

gaagtaggat cttatcgccg tactttcact gtgcctgccg gatggaaggg ccgccgggta 3 60 

gtactctgct gcgaaggtgt aatttctttt tattatgtgt gggtgaacgg acattttctc 420 

ggttacaacc aaggttccaa gacagctgcc gaatgggata tcaccgatca gttggaagaa 480 

ggtgagaata cgattgccct cgaagtatat cgctggagtt caggttccta tctggagtgt 540 

caggatatgt ggcgtctgag tggtattgag cgtgatgtgt atctgtatag tactcccaaa 600 

cagtatatag ccgattataa ggtaaacgca actcttgaaa aggaacgtta taaagatggt 660 

attttcggac tcgacgttac ggtcggaggg cctgcagacg gtgtggcatc cgtatcttat 72 0 

acactgaacg atccactcgg acgtcctgta ctgtcgggtg agatgcctgt caagtcgcgc 780 

ggactgagta acttcatcac attcggagaa cagcgcctga aggatgtgaa acgttggaat 840 

gccgagcatc ccaatctcta caccctcgtg ttggagttga aaaatgcagg aggacaggtg 900 

accgaagtca ccggttgtga agtcggtttc cgtacttcgg agatcaaaga cgggcgtttc 960 

tgcatcaacg gtgtgcctgt attggtcaaa ggaaccaatc gtcatgaaca ttcgcagttg 102 0 

gggcgtaccg tcagcaaaga gctcatggag caagatatac gtctgatgaa actgtataat 1080 

atcaatactg tgcgcaactc acattatccc actgatccgt attggtatcg gctgtgcgat 1140 

cgttacggac tttatatgat cgatgaagcg aatatcgagt cacacggtat gggatatgga 1200 

cccgcttcgc ttgccaaaga cagcacttgg ctgacagcac acatggatcg tacacatcgc 12 60 

atgtatgaac gttcgaaaaa tcatcctgcc atcgttatct ggtcattggg caacgaagcc 132 0 

cggaaacgga atcaatttcg agcgtaccta cgattggctg aaatcggtag agaaaagccg 13 80 

tcccgtccag tacgaacgtg ccgagcagaa ttacaatacc gatatctatt gtcgaatgta 1440 

tcgcagtgtc gacgaaatca aggcctatct ggcccagaaa gatatctacc gtccgttcat 1500 

tctttgtga 1509 



<210> 1302 
<211> 354 
<212> DNA 
<213> B.fragilis 



<400> 1302 

cgaggaagaa tgaaacaact gatacccgca cttttcgccg taggcgcagt aatggccctc 60 

ataggggccg ctgtctttat caccggatgg gtctatgcac cttatatata taccatcggg 12 0 

gcaggttttg tcgcattggc tcaggtgaat actccgcttc gggctaaaag caagacgctc 180 

cgccgactgc gtatccagca gatcttcggt gcattagcac tgatattgac aggagctttt 240 

atgttcacca cacgtggcaa tgaatggatt gcctgcctta ctatcgcagc catactggaa 3 00 

ttatacacgg cattccgtat tccgcaggaa gaagaaaaag aactttccaa atag 354 



<210> 1303 
<211> 1068 
<212> DNA 
<213> B.fragilis 



<220> 

<221> unsure 
<222> (231) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 1303 

gccttcagga cctggtggcc atttatacga atgaagcggg agagcgtatt gtttcaggca 60 

gcccgcaaac accacttaaa cttacttttt ccattgtttc gggcaaatac cagtgcaagt 12 0 

tatccggtaa gcaggtttat atcgaggcct atcacagtcc tttctaaagg atttgcaaaa 180 

ggtttctttt gcattaaaaa agaaaccgtg gataagattc agcaattttt nttcgataag 240 

tgggccattg aggaaaggag ataccaccag ctcctttcca ttctgttgcc cggtctgaaa 3 00 

aacggcaacc tagcgtcggt ggaacaatat ctgggggcca agcatataga ggcctatgcc 360 

gccgtcccct atgtagccga ccgatgggaa ctggatgacg cctccctccc tcaaggagcg 420 

gtagtggtgc ttacctgtga aggcgtgctg tatagctggg agacctaccg gctggagaga 480 
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tatatttccg ccgcgatagc caacgaccgc atatcgggtg tcgttctgtt tgtgaacggg 540 

cccggaggta tgattacgcg tgtggatgtc ctggaaaagc ttatacggca gtcccccaaa 600 

cccatagtgg cctatatcac gggcgtatgc gcttcggcgc atttctggtt cgtttccgca 660 

tgcgcacgca gattcgtctc ctcgcccatg gatgaaatcg gctcctgcgg ggtggtctac 72 0 

actttccaga gcttcaagga gtattacgcg caaatgggga ttgagatcga ggacatttac 7 80 

cccgacagtg cggacctgaa gaaccgcgcc tatcgcgaca aggaagaaaa gcaggatgac 840 

accttaatta aagagaacct gtcgttttac caccatcttt ttgcacagac catcgcccga 9 00 

aatctgggag tgaagtatga cgcgcaggat cccctgttca gagggcagac tttctttgcc 9 60 

gatacggcac tggccaaggg gtatgtggat gcctacggaa gcctggagga tgccatcctg 102 0 

tgggtatccg cccagaaaac cgtaaagcgg gctaacaaga tgatttaa 1068 



<210> 1304 
<211> 474 
<212> DNA 
<213> B. fragilis 



<400> 1304 
cggaaaaatg 
ccctataccc 
gtcatcctgc 
ccgggatcat 
aatgtagggc 
gccatgagcc 
tcaggcagcc 
tgcaagttat 



cggagtattt 
ccgtttgtga 
cgcgcgcttt 
tcactcccgg 
atacgttcga 
ttcaggacct 
cgcaaacacc 
ccggtaagca 



gaaaatcaat 
tctggagttg 
tattgccgtg 
agtcgaatcc 
ggttgccttg 
ggtggccatt 
acttaaactt 
ggtttatatc 



aagttataca 
gttccggtgg 
cgggatggtt 
gagcaggcgg 
acaggaccgg 
tatacgaatg 
actttttcca 
gaggcctatc 



atatgaaaac 
agtgtatcag 
cttatcgcat 
attcaggaac 
acagccagga 
aagcgggaga 
ttgtttcggg 
acagtccttt 



aataaaaaga 
tgattttgca 
tcctgttatt 
tatatattat 
gttgttatct 
gcgtattgtt 
caaataccag 
ctaa 



60 

120 

180 

240 

300 

360 

420 

474 



<210> 1305 
<211> 825 
<212> DNA 
<213> B. fragilis 



<220> 

<221> unsure 
<222> (752) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 1305 

caagatgatt taatactcac tataaaagta aatttgttta tgaaaaatta tttcgcatca 60 

ttcattccgg ccgtaaaggc cattctgggt atcgaggcct ggagtaagga cgccgacaag 12 0 

aaagacgcgt tactggaaga gcaaaagcag aaacttaagg cattgaattt caatgacacc 180 

tttatcaatg gtttttgtga ggccctgaag gatggattcc cggaggattc ttcccgcaag 240 

gacggggagt cgggcacgaa aggcagtggt cctgacccca atacctccaa cgcagtaata 3 00 

caaggattac tggctgatat gactgccaag ctggttacgg cccaggagga aatcgctgtg 3 60 

cttaccaaag agaaagggga actttcacag gaggtatccg ccaaacaaac agaaatcacc 42 0 

ggtttgcaga ccaagattca gaccctttcc ggccttgcgg agcaagacgg ggggaaaggc 480 

ttccagcatg cacgtctgga accggacgct aaagacattg tcatgaattg ggatgacgaa 540 

aaacaactgg gcggcctctc gggggagatg ttcgcaatgg gaccgcctta taaccagcgc 600 

ctgcgcgcaa agatgcttta ccgcaagggg ttgacccgtc aggtgcccac tgccagttcg 660 

atcgattact cccgcctgaa agaagacctg ggagccttct accgcatccc ctggcaggag 72 0 

cgtttacagt ctttcctgac cctgcttcct tncatcgaga gtattttccc cgctgaattc 7 80 

gggatatcag gacctgggcg tgcttacaaa catttgggtt ggtga 82 5 



<210> 1306 ' 
<211> 507 
<212> DNA 
<213> B. fragilis 



<400> 1306 

aaaaaacatc aacttatgat aaccacgaaa ataacagtag agccgcacct ggctcaatat 6 0 



527 



tgctacgcca aatattcttc cgatccggaa ggcagcatgc ccgtccgctt tgcggaccat 12 0 

ctggatgtat accatctggt ttataacctg ctggaaaaac gcccggttaa ctgtccccgg 180 

gataatggca atcttgagat cgtcttgccg gaccgcaggc agggtgacgt ccccggtggc 240 

aaatccccgg agcgtttcaa ctatctgggc cagcgcagcc agggtatcat caataagaag 300 

ctaaagctga tgatgcgcgc cgagctccat gactttattg acgagaacaa gcaccggttc 3 60 

ggtatcgacc agcttcagtc agtccactgc tttatgaaga agtactgcat tgacagctta 42 0 

agcgaggatg ctcttctgaa agactaccaa cgttggcgtg accgggtaag acgttccagc 480 

cttaagcggc cctacaagaa aaagtag 507 

<210> 1307 
<211> 618 
<212> DNA 
<213> B.fragilis 

<400> 1307 

aaaaagatgg aagtagaact agtaaaaacc accctgcatg cggttctgag cccgtctcag 60 

ttacagaaac cctgtgtccg aaagaaggag ctgacgcctc tccagatctc gttaaaaact 12 0 

ggatcgacgg cctctcaatt ggtggatgaa tggggcggga caattgccca actgaacatg 180 

ggcgccccac tttacgatgt cgccgcaaac ggagaaatcc ctacattggc tgatgtgggt 240 

gtggtcttcg gtaattcgac atccgttcgg attatcacaa gccatctgga atccgttctg 300 

aagtacgccg gcgttgaatt gagccgcgag cagatggcgg aaaccgcgct ggcgatactt 3 60 

tcaggatact ggttcctgaa cctggccgag ctctgcattt tctttacccg ccttaagaac 420 

ggaagttgtg ggcagcttgt ctggggaaag agcctaaaca atcaggcggt catggtcgcc 480 

ctatcggatt tctgcaagga acgccgtgaa gtgatcattc gcaaagagac agagcggatg 540 

ggcccggggc tgtggaaaaa ggcttttcca gaacggagga ttttgccgcc ggtattgtgt 600 

tgggcgtaca gggtatag 618 

<210> 1308 
<211> 882 
<212> DNA 
<213> B.fragilis 

<400> 1308 

aagtcccgta gaaatcttcc aaggctttgc gtccggcttc gaactccata cctaatttgc 60 

ctgctgtgta atatctatcg agataaaggg tattacatcg agtgggatga agatttgcct 120 

tttgtggtgg ctgacaccat tggtaccacc gagggcgcag tagaggaagt agtaaagaaa 180 

gccgtgcaag tgggattctt cgacaagtca ttgttcgacc aatacaggat ccttacctca 240 

aacggtattc aaaaccgctt caaaagcgcc gtttccagac gtgaaggatt tgagtatatt 3 00 

cccgaatatc tggtttctgt atgcaataac cccattcaat cgaatttctg tatacagaaa 360 

ccctcctcaa ccgagtttct gtatgcagaa acccagccca accgagtttc tgcatgcaaa 420 

agtacacaaa gtaaagtaaa ggaaagaata tctccccctc ctcacgcgcg tgaaggaggc 480 

atttccggaa tcagactttt ttcagacaag tctttaaccg agtgttacgg ggagctgaaa 540 

gcgaatatcc cctggatgga gcaattctgc atgaacatcc gtctggatta tccggatttt 600 

accccggagc tgttttatgg ctttctggac aggttcttcc gtaaactcca gaatgaaggg 660 

gaaatagtca agtcacccaa ggacgccatg tcgcattttg caaactggtt gaatattgaa 72 0 

cttgaaaaat taaaaaaaga tggaagtaga actagtaaaa accaccctgc atgcggttct 7 80 

gagcccgtct cagttacaga aaccctgtgt ccgaaagaag gagctgacgc ctctccagat 840 

ctcgttaaaa actggatcga cggcctctca attggtggat ga 882 

<210> 1309 
<211> 807 
<212> DNA 
<213> B.fragilis 

<400> 1309 

aaaaacaata taccaatgat agtagcatgg ttttcttgcg gtgtaacatc cgcagtcgct 60 

tgtaagattg cacttagtct atacgatgac gtgcagctct attatattga aactggctcc 120 

gggcatccgg acaacgctcg ttttctatct gattgtgaaa gatggtacga tcagcctatt 180 

cacattatcc gaagcgacaa atacacttgc gtagctgatg tcctacggaa aggttttatc 240 



528 



aatggtgcgc atggtgctgc ttgcactctt gaacttaaaa agaaagtccg gtacaagttg 3 00 

gaaaaggaac ttggttcttg ggacggtcaa gtttggggat tcgattatga accaaaagag 3 60 

attaaccgag ctatccgatt aaagcagcag tacccagaca caaagccact gttcccgctt 420 

attgaaaagc agattacgaa gccggatgcc atggggatac tttggaaagc agggattgaa 480 

atccctgcta tgtacaagat gggctacaat aacaacaact gcatcggttg cgtgaaaggt 540 

ggtatgggat actggaataa aatccggaag gatttcccgg aagtgtttgc tcaaatggcg 60 0 

cagattgagc gtgatgttgg agctacctgt ctgaaagata aagatgggcg tatcttcttg 660 

gatgaactac cgacatggcg gggcgatcca gtggaagaga ttataccgga ttgctcgctt 72 0 

atctgccaaa ttgaatttca agagatcatc gacaggcagg taaaacgagt tttgaaagga 7 80 

gaaattagta ttaacgatgt agcttga 807 

<210> 1310 
<211> 189 
<212> DNA 
<213> B.fragilis 

<400> 1310 

accatgaaag tcgtcatcta ttggcagaag aaatccaccg tccaccatcg ccgccggatc 60 

cgtgacagat tcaggcttcc cgatggtatg accattaacg gtgaaactcc cgccgatgtg 12 0 

aggccggagg atatgaagga actacagacc ctggaagaaa tgggttatat taaattaaga 18 0 

aacaagtaa 189 

<210> 1311 
<211> 348 
<212> DNA 
<213> B.fragilis 

<400> 1311 

agtgatcatt cgcaaagaga cagagcggat gggcccgggg ctgtggaaaa aggcttttcc 60 

agaacggagg attttgccgc cggtattgtg ttgggcgtac agggtatagc cgtgaaacgt 12 0 

gaacgggcca aggccgactt taatgctttt ttggagtttt tcccctgtct gccatcagga 180 

tatgacccga tagccttatg gaaggcctgg ggcggtgatc cggatgccat caacttactc 240 

ttcggcaaca atcctcccgg agtggaagcg gcggcggaat ctgtcggcag atacctgtgt 3 00 

gattacaatg tctatcaggc ccgtgtaaag gccaaagcct ccttgtaa 348 

<210> 1312 
<211> 192 
<212> DNA 
<213> B.fragilis 

<400> 1312 

gaaacgagtt cacatttcaa cactttttca ggagattcga ccgtacgtcc acatccccag 60 

aacaataaca cccaaaccgg acaaaataac aacaagcctt tcagtctctt cctgtctaat 12 0 

aaattaaaag aagcgctcat aatcaacatt ctgttctttt accatcacat tccacccacc 18 0 

gagtctctgt aa 192 

<210> 1313 
<211> 243 
<212> DNA 
<213> B.fragilis 

<400> 1313 

ataaaacaat caattaaagt attttatcac caagcaacca cttatttaaa taaaatcaca 60 

gactgtaata acagctttcc agcttatttt ccttgtatat ctccattatt tcacatacct 12 0 

ttgtctaaaa ttaaggtatt aaaacatccg gatgttatat atcaaaacac ccggatgttt 18 0 

tatatcagaa catccggatg ttatctatta aaacatccgg atgttttcag acataactta 240 

tag 243 



<210> 1314 
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<211> 195 
<212> DNA 
<213> B.fragilis 



<400> 1314 

cagtttttac cctgtttaaa gcttaatttt cccccgtcac tgtctaaaaa atgccgtaaa 60 

cttgcatcat cgaaaaacaa cgaatattca caatttaaaa agcaacgtta tgaaaagttt 120 

aagcttcaga aaagatttaa ttggagttca ggaagagcta cttcgctttg catacaaact 180 

aacaaccgac cgtga • 195 



<210> 1315 
<211> 1467 
<212> DNA 
<213> B.fragilis 



<400> 1315 

gtctataata cgaaagggaa taaaatagga ttttatatgg 

gctattgaac tgggttcatc gaagatagcc ggtatagccg 

agtatacagg tattagctta tgccagggag gattcgtctt 

atctataatc tggataaaac ggcacaaagc ctgacttcaa 

gctctcaata actcaattgc caagatctat gtgggtatcg 

gtgcgcaatg tggtaagtcg tgatcttgaa gaagaaacca 

gactcaatct gtgatgagaa cctcgagata ccactgatcg 

gctccacaag aatacaaaat aggaaacaat cttcaagccg 

agccacattg aagggcgttt tctgaatatt gtagcacgtg 

gaacgctgct tcgaacaggc taaaatagaa atagcagacc 

actgccgatg cagtactgac ggaaagtgaa agacgctccg 

ggtgccgaca catctaccat ttccatttat aagaataata 

ctgccgttag gaggaaacag tattacccat gacctcgtct 

gaggccgaac gcctgaaaat cagatatggc aatgctttct 

gaacctgcta cttgccaatt ggaagacgga aatagaacga 

aatatcatcg aggcacgtac cgaagagatt atcgcgaacg 

tcgggatatg acgacaaact tctggccgga ctcatcatca 

aaagacctgg acgaggttct acgtaaacgg agtaaaatag 

ttcgtacgca ataccatcca tgcagacgaa gacgttgtga 

accttattcg gactgcttat tgcgggcaac gaaaactgtt 

ccacagccgc atatacaacc tcagccccag cccgaaccgg 

gaaagtctga aggaacagga agccgctgcc cgcgctgcca 

gagaaaaagc ggaaagaaga agaaaagcaa cgcaagctgg 

gaagagagaa gaaataaacc taactggttt aaatcgactt 

attttttctg acgaagatat gaaataa 



caacaacaga 


ttttat cgee 


60 




f- 3 rrf- ffR t" CTCfFi 
u~ a. w gl vj y 


120 


ctttcatccg 


gaaaggagtg 


180 


teat caataa 


actggagggg 


A 4U 


geggacaate 


gctccgtacg 


300 


ttatttctca 


ggaactggtc 


360 


atatggatat 


actggacgtt 


420 


accctgtcgg 


tgtagccgga 


480 


cttcgctcaa 


gaaaaatctg 


540 


tattgatctc 


acctctggtt 


600 


gctgcgcact 


gatcgacttt 


660 


tcctccgctt 


cctcactgtg 


720 


ctcttcagat 


ggaagaagaa 


780 


acgaagagga 


agaaggegaa 


840 


tagagttagg 


taaactgaat 


900 


tatggaatca 


gattcaactt 


960 


ceggagggge 


cgccaacctg 


1020 


agaaggtgag 


aaacgcacgt 


1080 


agaaagaegg 


tacacaaaac 


1140 


gtttattgga 


aacacccgct 


1200 


tgaacatgtt 


tgaagaagac 


1260 


agaagaagaa 


agaagaagaa 


1320 


aagagaagaa 


aagaagggaa 


1380 


tcgacaagct 


ctctaatgaa 


1440 
1467 



<210> 1316 
<211> 1470 
<212> DNA 
<213> B.fragilis 



<400> 1316 

aaacattgea cacctggctt taceggatte tgctgtcgtt 

attagegcaa caatcatgaa tatagaaacg attcaatctg 

ggtatcggaa tgagtgccct cgtccgctat tttctttcta 

tatgacegta ctcccagtga actgactcaa catcttatag 

tacgaagaga atatcgatct cataceggag gettgeaaag 

gtcctgaccc ctgccgtacc tcaggaacat gecgaattaa 

ttcgaaatac agaaacgtgc acaagtactg ggcaccatta 

tgtgtagccg gcacacatgg taaaaccact acctcaacga 

caatcacatg taggttgtac tgcttttctg ggaggtattt ccaaaaatta 

ctactactct cttcaaccag cccttatacg gtgattga 

ttccattggt tgtctcctta tatgtctgtc attacege 



attgeaaaag 


aagttataaa 


60 


tatattttgt 


eggggcagge 


120 


aaggaaaagt 


agtggcaggc 


180 


aagaaggagc 


acagatccat 


240 


acaaagctac 


cacattggta 


300 


cttacttccg 


tgataatgga 


360 


cccgttccag 


caaaggactt 


420 


tgacagccca 


cttgtttcat 


480 


ccaaaaatta 


eggaacgaat 


540 


cagacgaatt 


tgaccgttca 


600 


ccgatccgga 


tcatctggat 


660 
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atttatggca ccgaacaggc ttatctggaa agctttgaac actacaccac actgattcag 72 0 

cccggaggag cactgattat ccgcaaaggc atttccctac agccgaaagt gaaagaagga 780 

gtgaagatgt atacttactc acgtgacgag ggagactttc atgctgagaa cattcgcatc 840 

ggaaacggag aaatcttcat tgacttcgta gggcctgaca ttcgtatcga caacattcag 900 

ctaggagtac cggtaagtat aaatatagag aatggtgtcg ctgcgatggc acttgcccac 960 

cttaacggag tcacacctga agagatcaaa cagggaatgg ccagtttccg gggtgtggac 102 0 

cgccggttcg actttaaaat caagaataac cggattgtat tcctgagtga ctacgcacat 1080 

catccatccg agattaaaca aagcgtgatg tccatgcgtg agttgtaccg ggacaaaaag 1140 

atcactgcgg tttttcagcc acacctctat acccgtaccc gcgacttcta caaagatttt 12 00 

gccgacagtc tgtctttact cgatgaagtg atactggtag atatctatcc ggcgcgcgag 12 60 

caacctattc cgggagtaag cagccggctg atatatgaca acctacgtcc gggtattgaa 132 0 

aaaagcatgt gcaagaaaga agaaatactc gatgtactga aagcaaaaca tatcgaagta 13 80 

ttaattacat tgggagcagg agacatagac aactatgttc cgggtatttg tgacttattg 1440 

tcccgaagaa tggttccttc ggacaattaa 147 0 

<210> 1317 
<211> 765 
<212> DNA 
<213> B.fragilis 

<400> 1317 

atcgtcaaaa caataaattt ccttatgata aaaagaattc ttctgaccat cgtcatgtta 60 

cttctcatag cctaccttgt agcagctgtg acagtgttca acgacaaacc tgcccatcag 120 

gtatgccgtg acatggaatt agtgatcaag gatacactca atgccggttt cgttaccaag 180 

aacgaggtgg ctgccatcct gcagaagaaa ggcatttatc ccgttggaaa gaaaatggac 240 

cgggtacaca ccaaaacatt agaaaaagag ttggataaac atccactcat caatgaagct 3 00 

caatgctata aaacgccaaa cggcaaaatt tgtgtggaag taacccaacg cgtaccgatt 360 

ctccacatca tgagcagcaa cggtgaaaac tactatttgg ataacaaagg aaaaatgatg 42 0 

ccgcccgatg caaaatgcgt agcacaccgg gcaattgtca ccggaaatgt agaaaagtcg 480 

tttgcaatga aggatttata taagtttggt gtatttttgc aaaacaatcc gttttgggaa 540 

gcccagattg tacagattaa cgtgctgccc ggaaaagaaa tcgaattggt tccccgggta 600 

ggcaatcata ttatctattt gggtaaactg gaacattttg aggataaact gaaacgcttg 660 

aagacctttt acgaaaaagg gctcaaccag gtgggatgga ataaatattc gcgtatcagc 72 0 

ctggaatttg gaaatcagat tatctgcaca aaaaagaaac aataa 765 

<210> 1318 
<211> 2010 
<212> DNA 
<213> B.fragilis 

<400> 1318 

ctcagatatt ttcactatat ttgctctgtt attataatta ggaatcaaat tatgagcgaa 60 

gaacagaatc ccaccaataa cgggtcttat tcagcagata gtatccaagt attggaagga 120 

cttgaagcag ttagaaaacg ccctgcgatg tacattggtg acatcagcgt aaagggactt 180 

catcacttgg tatatgaaat tgtcgacaac tctatcgacg aagcattggc cggttattgc 240 

gaccatatcg aagtaactat caacgaagac aactctatca ccgtacagga taatggacgt 300 

ggtattccgg tagatttcca cgaaaaagag cagaaatctg ccctcgaagt tgccatgacc 3 60 

gtactgcatg ccggaggtaa gttcgataaa ggttcgtaca aagtatccgg aggtcttcac 420 

ggtgtaggta tgtcctgtgt gaatgcattg tctacacaca tgactaccca ggtattccgc 480 

aacggtaaaa tctatcagca ggaatatgaa atcggtaaac cgctttatcc cgttaaagaa 540 

gtaggaatag cggaccacac aggaaccaaa cagcaattct ggcccgatga cagtatcttt 600 

accgaaacca tttatgatta taagattctg gcttcacgtt tacgtgaatt ggcttatctg 660 

aatgccggtc tgcgcatctc gctgacagat cgtcgcgtag tgaatgagga cggcagtttc 72 0 

aaacacgaaa ctttctattc ggaagagggt ttaagagaat ttgtacgctt catcgaatcg 780 

tcacgcgaac acttgattaa cgatgtgatt tatctaaaca cagagaaaca aaacatcccc 840 

atcgaggtgg ctatcatgta caataccgga ttttcagaaa atatccattc gtacgtcaat 900 

aacattaata ctatagaagg tggtacgcat ctggcaggtt tccgccgcgc cctgacccgt 960 

acactgaaga aatatgcaga agacagcaaa atgctggaga aagttaaagt agaaatctcc 102 0 

ggcgatgact tccgtgaagg tctgacagct gtgatctctg taaaagtagc tgaaccccaa 108 0 
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tttgaaggac agactaaaac taagttggga aacaacgaag taatgggtgc tgtcgatcaa 1140 

gcggtaggcg aagtactaaa ctattatctg gaagaacacc cgaaagaggc taaagcaatt 1200 

gtagacaaag tgattttggc tgctactgca cgccacgccg cccgcaaagc gcgtgagatg 12 60 

gtacagcgta aatctcctat gtcaggtggc ggtcttccgg gtaaactggc cgactgctcg 132 0 

gacaaagacc cgcagaagtg tgagttattc ctcgtcgagg gagactctgc cggcggtaca 1380 

gctaagcaag gtcgtaaccg tgcatttcag gctattcttc cactacgcgg taagattctg 1440 

aacgtagaga aagccatgta tcacaaagcg cttgaaagcg aagaaatacg caatatatac 1500 

acggcactgg gtgtcactat cggaacggaa gaagacagca aagctgccaa tattgataag 1560 

ctgcgctatc ataaaatcat tatcatgacc gatgccgacg tcgatggatc acacatcgac 1620 

acactgatca tgactttttt cttccgctat atgccacaga tcatccagaa tggctatctg 1680 

tacattgcca ctcccccgct ctacctttgc aaaaaaggaa aaatagaaga gtattgctgg 1740 

acagatgcgc aacgccagaa gtttatcgac acttatggtg gcggttcgga aaatgcaatc 1800 

catacacagc gctacaaagg tttgggtgag atgaatgccc agcagttgtg ggaaacgact 1860 

atggatccgg aaaaccgtat gctgaaacag gttaatatcg acaacgcagc agaagccgac 192 0 

tatatcttct ccatgttgat gggtgaagac gtaggtccac gccgcgagtt cattgaagag 1980 

aatgcaacgt atgcaaatat cgatgcataa 2010 

<210> 1319 
<211> 1308 
<212> DNA 
<213> B.fragilis 

<400> 1319 

aaacttaaag cagtggatct attaaagaat atattcaaag gtgataaggt aatctggatt 60 

attttccttt gcctctgcct catctctatc atagaggtgt ttagtgctgc cagtacgctg 120 

acttataaaa gtggcgacca ctggggaccc atcacacaac attccatcat cctgatggta 180 

ggtgcggtcg tagtggtcct gatgcacaac atcccttata agtggtttca ggtgtttccg 240 

gttttcctct accctatttc ggtagtattg ctggctttcg taactttgat gggagtcatc 3 00 

acaggtgacc gtgtgaacgg agccgcccgc tggatgagtt ttatggggtt acagttccag 3 60 

ccttcagaac tggccaagat ggcagtaatc atcgcggttt ctttcattct atccaaaaag 42 0 

caggatgatg aaggggccaa tccgaaagct tttaagtata tcatgatact gaccggactg 480 

gtatgtatgc ttatcgctcc tgaaaacctt tcgacagcta tgctgttgtt cggagtagta 540 

gtattgatga tgttcatcgg acgtgttgca ttcaagaagt tagccatgtt attgggcggt 600 

ctggcattgg ttggctgtct gggagcagta tttttgctgg ccataccgaa ggataccgac 660 

atcccgttcc tccaccggtt tgacacttgg aaaagtcgta ttaccaactt tacggagaaa 720 

gaagaagttc cggcagccaa attcgatatt gacaaagatg cccagatagc tcatgcacgc 780 

attgccatcg ctaccagtaa cgtgataggt aaggcaccgg gaaattccat tcagcgtgac 84 0 

ttcctgagcc aggcattctc cgatttcatc tttgccatta tcattgaaga gttggggctg 900 

gtaggaggcg cctttgtagt catactctac atctggctat tggtccggac aggccgaatc 960 

gcccaaaagt gcgaacgtac attcccggca ttcctcgtca tgggtattgc cttgatgttg 102 0 

gtatcacaag ccatattgaa catgatggta gccgtcggac tgtttcctgt aacaggacaa 1080 

cctttaccgc taatcagtaa aggaggtaca agtacactga tcaactgtgc ctacatcggc 1140 

atgatactga gtgtcagccg ctataccgct tatctggaag agaaaaaaga aaatcctgct 1200 

cctctgctca cccagagtga aggaaatgag gcgattgcaa gcgaggcaca gactgcggcc 12 60 

gaacctacag cagaggtttt aaacagtgat gctaaatttg aagagtaa 130 8 

<210> 1320 
<211> 408 
<212> DNA 
<213> B.fragilis 

<400> 1320 

aaaaggttgg cacgcgtacg tggcattgat tttatccatg actccaaagc acccaatgta 60 

aactcttgct ggtatgcctt gcagagtatg actactaaaa cggtattgat tctcggagga 120 

aaagacaagg gaaacgatta tacggaaata gaagaactgg tacgggagaa atgctcggca 180 

ctggtctacc tgggattgca caacgaaaag cttcatgagt ttttcgaccg tctcggactc 240 

cctgtagccg aagtacagac cggcatgaag gatgccgtag aagcagctta caagctggcg 3 00 

aaaaagggag aaacagtatt gttgagtcca tgttgcgcct cctttgacct tttcaagagc 3 60 

tatgaagacc gtggcgaaca gtttaagaag tatgtaagag aattataa 4 08 
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<210> 1321 
<211> 201 
<212> DNA 
<213> B.fragilis 

<400> 1321 

gttggccatt tacatcaaca agcttgctta agaaataaga tatataacag actctatata 60 

atattagaag ccggactttt cagttcggct ttttatgctt tcaattaccc ggtcgggttt 12 0 

atttttaatt cttctttttt cactcttcat tcttctctaa ctcagatatt ttcactatat 180 

ttgctctgtt attataatta g 201 

<210> 1322 
<211> 546 
<212> DNA 
<213> B.fragilis 

<400> 1322 

cagtggagtt atcacatttt aattaaagta aaatatcgaa tcattaaccc ggatttacca 60 

attgtaaacc gtaaatcgtt aaatcataat aacatcatgg atttattcga aagagtcagc 12 0 

gaagacatta aaaacgcaat gaaagcgaaa gataaagtag ctctcgaaac tctcagaaat 180 

gtaaaaaagt tctttttgga agctaaaaca gctccgggag ctaatgacac ccttacagat 240 

gcagatgcac tgaaaatcgt gcaaaaactg gtaaaacaag gtaaggatgc cgcagaaata 300 

tatataggac aaggtcgtca ggacttagct gatgcagaat tggctcaggt gcaagttatg 3 60 

gaaacttatc tgcctaagca gatgagtgcc gaagaattgg aagccgcact gaaagaaatt 420 

attgctgaag taggtgctac cagcggcaaa gacatgggaa aagtaatggg agtcgcttct 480 

aaaaaactgg caggattggc cgaaggacgc gcgatctcag ctaaagtaaa agagttattg 540 

ggataa 546 

<210> 1323 
<211> 204 
<212> DNA 
<213> B.fragilis 

<400> 1323 

cgaacgtcct acacaagata cttcaaggct atgtgtcagt cggttgtgaa caaaaatact 60 

gcccggaagg ggaaaaactt gggttttgtt ctgaagcctt ctgaaagggg cggagaagac 12 0 

aaggcggtca tagtcccgct gaaactccga acggttttcc tgacgctctt cgtgaaactc 180 
ttccatgccg aaacgcttag ctga 2 04 

<210> 1324 
<211> 1032 
<212> DNA 
<213> B.fragilis 

<400> 1324 

gatagaaatc tgggcgaaag aacgtgcgga gaaactcttt atggaacccg aagcattcgg 60 

agcagccttg gaagagatta tgaaagaaga acggagaaca acgaacaacg agctaaaatg 120 

aaagaagaag aaacaacata tcacgtaccg gtactgctaa aagaaagtgt agatgccatg 180 

aacatatctc ccgacgggac ttacgtagat gtcacctttg gcggtggcgg acattcccgc 240 

gagatacttt cacggctcgg agacggagga cgcctgctag gattcgacca ggacgaagat 3 00 

gccgagcgca acattgtaaa tgatccgcat tttacttttg tacgaagcaa ctttcgttac 3 60 

ctgcacaatt ttctacgtta tcacgatatc ggagaggtag acgctatatt ggctgatctc 420 

ggcgtctctt cccaccactt tgacgacagc gaacggggat tctctttccg ctttgacggg 480 

aaactggaca tgcgcatgaa caaacgtgca ggcattacgg ctgccgatgt ggtaaataca 540 

tatgaggagg aacgccttgc cgacattttc tacttgtatg gcgaactgaa gaacagccgc 600 

aaactggcat ccgtcattgt gaaggcacgt accggacaga aaatagaaac gatcggtgag 660 

tttcttgaaa tcataaagcc tctcttcggc cgcgaaagag agaaaaaaga gttagctaaa 72 0 

gtttttcagg cactccgcat tgaagtgaac caggagatgg aagccctgaa agagatgctg 780 
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atggccgcga cagaagcatt aaagcccgga ggacgactgg tggtaatcac ttaccactca 840 

ctggaagacc gcatggtgaa aaacatcatg aaaaccggca atgtagaagg caaggccaca 900 

caggactttt ttggcaattt acagacacct ttccgcctgg taaacaataa agtgatcgta 960 

cccgacgagg atgagataac acgcaatccc cggtcgcgca gtgccaagtt gagaatagcc 102 0 

gagaagaagt aa 1032 



<210> 1325 
<211> 543 
<212> DNA 
<213> B.fragilis 



<400> 1325 

agccagaatc ttataatcat aaatggtttc ggtaaagata ctgtcatcgg gccagaattg 60 

ctgtttggtt cctgtgtggt ccgctattcc tacttcttta acgggataaa gcggtttacc 12 0 

gatttcatat tcctgctgat agattttacc gttgcggaat acctgggtag tcatgtgtgt 180 

agacaatgca ttcacacagg acatacctac accgtgaaga cctccggata ctttgtacga 240 

acctttatcg aacttacctc cggcatgcag tacggtcatg gcaacttcga gggcagattt 3 00 

ctgctctttt tcgtggaaat ctaccggaat accacgtcca ttatcctgta cggtgataga 3 60 

gttgtcttcg ttgatagtta cttcgatatg gtcgcaataa ccggccaatg cttcgtcgat 42 0 

agagttgtcg acaatttcat ataccaagtg atgaagtccc tttacgctga tgtcaccaat 480 

gtacatcgca gggcgttttc taactgcttc aagtccttcc aatacttgga tactatctgc 540 

tga 543 



<210> 1326 
<211> 1329 
<212> DNA 
<213> B.fragilis 



<400> 1326 

ttcataatac aatataatat ggacgagata gtacaattcg atttccctac agattcaccg 60 

aaaatcatca aagtgattgg tgtaggaggt ggtggaggta acgccgtcaa ccacatgtac 12 0 

cgggaaggca tacacgacgt aacattcgtt ctctgcaata ccgacaacca agcattggct 180 

gagtctcccg taccggtcaa actgcaactg ggacgttcca tcacacaagg actcggtgcc 240 

ggaaaccgtc cggagcgtgc acgtgatgct gccgaagaga gcatcgaaga catcaaaact 3 00 

ctgctgaacg atggtaccaa aatggtgttt atcactgccg gaatgggtgg aggaaccgga 3 60 

accggagccg ctcccgtcat cgcccgtatc gctaaagaga tggacatcct gactgtcgga 42 0 

atcgttacca tccctttcat ttttgaaggc gaaaagaaaa ttattcaggc tctggacggt 480 

gtagaacgca tcgcacaaca cgtagatgct ttgctggtaa tcaacaacga acgcctgcgt 540 

gaaatctact ccgacctgac ttttatgaat gcattcggca aggcagatga tacgctatca 600 

atcgcagcca agagcatagc cgaaattatc accatgcgag gtacggtcaa cctggacttt 660 

gcagatgtga aaacgattct caaggacggc ggtgtagcca tcatgagtac cggattcggc 72 0 

gaaggagaaa accgtgtgac caaagcaata gacgatgcac tgcattcacc tctgctcaat 780 

aataatgata ttttcaacgc caagaaggta atgctgaacg tctccttctg tcctgcttcc 840 

gaattgatga tggaagaaat gaacgaagta cacgagttca tgagcaaatt ccgcgaaggt 900 

gtggaagtga tctggggtgt agctatggac aactcactgg atacgaaagt aaagatcacc 960 

gtattggcta ccggtttcgg tgtagaagac gtaccgggca tggacgacct gcacgaaaaa 1020 

cgcagtcagg aagaagaaga gcgacagttg caactggaag aagagaagga gaagaacaaa 1080 

gagcgcatcc gcaaagcata cggtgaaagt gccagtggaa tcggaacacg caatctgcgt 1140 

aaacgccggc atatctatct cttcaatgca gaagacctgg ataacgatga catcatcgcc 1200 

atggtagagg actctcctac ttacttacgc gacaaaacaa ctttgggtaa aatcaaagca 12 60 

aaagccgcac tggaagaaga gatagcaaca gaagaggcta tagatgacag tggagttatc 13 2 0 

acattttaa 1329 



<210> 1327 
<211> 516 
<212> DNA 
<213> B.fragilis 



<400> 1327 
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aagtttatta 
tttctgttcg 
ctgctggaac 
gagatgaatc 
tttgacggtc 
gaaaggacct 
aaaataccgg 
tcgggcaatg 
ggagaacatc 



aaatgaaaac 
tgtcatgctc 
ctgaagaagg 
ttcacgataa 
attcgcatac 
atacggataa 
ctgacgccac 
aaacttatgt 
accatgatga 



aaaaaattat 
taaagaagac 
cgctatatta 
tgaagcgatt 
cagggcatcg 
ggccggacag 
tccgggaaac 
agtacgtaat 
acatcatcat 



ttgtcaataa 
gaaggtgata 
aggatcggaa 
gcttcctata 
gaagcgggag 
aaagatgcac 
tatcacctga 
attgtgttaa 
gattaa 



tcagtatctt 
cgataaaacc 
gttctcatgg 
agattaatat 
taaccaagcc 
acgtacataa 
tggtttactg 
gcgtggaagg 



atttttttct 
cgtcattgat 
agtacatttt 
tcacaacaac 
ttttacattt 
ccatgatatt 
tttagatcag 
aggtgaagag 



60 

120 

180 

240 

300 

360 

420 

480 

516 



<210> 1328 
<211> 987 
<212> DNA 
<213> B.fragilis 



<220> 

<221> unsure 
<222> (928) , (942) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 1328 

agattagata acagaagtga cgagcaacag gctatgagct acaagtgcct gccgctaatc 60 

acaaagtatg aacctaaaac ttgtagcata aagaaaaaga tctatcgacg gatcgacaaa 12 0 

ttccggttga taaatcgaaa cctaaaaatc atgaaaagaa ttgtagtatt aggagccggt 180 

gaaagcggtg cgggagcagc cgttctggcc aaagtaaaag gattcgatac tttcgtatcg 2 40 

gatatgtctg ctatcaagga taagtataaa actctccttg acggccatgg cattgcctgg 300 

gaagaaggcc gacacacaga agaacagatt ttgaatgctg acgaagttgt gaaaagcccc 360 

ggaattccta atgacgcccc actgattctg aaattgagag aacagggcac acctatcatc 420 

tcggaaatag aatttgccgg cagatacacc gatgccaaaa tgatctgtat caccggctcg 480 

aacggaaaga cgaccacaac ctcgcttatc tatcacattt ttaaaagcgc aggactaaat 540 

gtgggacttg ccggaaacat cggtaaaagc ctggcattgc aagtggccga agagaaacat 600 

gattattatg taatcgaatt gagttcattc cagttggata acatgtataa cttccgtgcc 660 

gatatcgctg tattgatgaa cattacgccg gaccatctgg accggtacga ccattgtatg 720 

cagaactata ttaatgcaaa gtttcgtatt acgcagaatc agacttcgga agacgcgttt 780 

atcttctgga acgatgaccc tatcatcaaa cgtgaactgg acaaacatgg cattcgtgcc 840 

cacctgtatc cattcctcgg catcaaagaa gaaagatcta tcgcctatgt ggaagaccat 900 

gaagtagtaa ttaccgaccc gatcgctntc aatatggaac angaacaagt ggccctgacc 960 

ggccaacaca atcttattac tctttag 987 



<210> 1329 
<211> 1359 
<212> DNA 
<213> B.fragilis 



<400> 1329 

ataatgtgca aataccaaat tgtaaagatc atgatggact ggaaaagatt gatttcagct 60 

aagcgtttcg gcatggaaga gtttcacgaa gagcgtcagg aaaaccgttc ggagtttcag 12 0 

cgggactatg accgccttgt cttctccgcc cctttcagaa ggcttcagaa caaaacccaa 180 

gtttttcccc ttccgggcag tatttttgtt cacaaccgac tgacacatag ccttgaagta 240 

tcttgtgtag gacgttcgtt aggcaacgat gtatcgaaag cgatcctcgc ccgacagccc 3 00 

gaactgcaag actctttcct gcccgagatc ggttccatcg tctctgccgc ctgtctggcg 3 60 

cacgacctgg gtaaccctcc tttcggtcac tccggtgaaa aggccatttc taccttcttt 420 

tcagaaggaa aaggagttca gctccaagag aagctctcac cgatggaatg gaatgatttg 480 

acacattttg aggggaacgc aaatgcattc cgattgttga cacaccaatt cgaaggacgt 540 

cggaaaggtg gatttgtcct gacttattca accttggcct ctatcgtaaa atatcctttt 600 

tcatcaagcc tcgcaggaaa taagtccaaa ttcggattct tcaccaccga agaagaggga 660 

tttcgccgta tcgcaacgga actgggtctt attcagctca gcgaccgccc tttaaaatac 720 

gcacgccacc cgttggtcta tctggtagaa gctgccgatg acatctgtta ccagatgatg 780 

gatatcgagg atgcccataa attgaaaatc ctcactacag aagaaaccaa agaactgttg 840 
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ttagcttatt tcgcagatga acgccagaca catatccgaa aaacgttcga tattgtcaaa 900 

gacacgaacg aacagatcgc ttacctccgc tcatcagtta tcgggttgtt gataaaggag 9 60 

tgcactcagg tatttttaaa taacgaaacc gaaatacttt caggaacttt tgaaggggct 102 0 

ctgatcaagc atatatcaga acgtccggga aaagcataca agcattgttc ggaggtatct 108 0 

ttcagcaaga tttaccgttc acgggatgta ttggacattg agctggcggg gttcagggtt 114 0 

atcaatacac tccttgaatt gatgatcgat gccgtcactt cgccggagaa ggcttactca 12 0 0 

cagctgctca ttaaccgcgt atcgggacaa tataatataa aagcgcctgc actctacgag 1260 

agagtacagg cggtgctcga ttatatttcg ggcatgaccg atgtctttgc cctggatctt 132 0 

tatcgcaaaa tcaacgggaa cagtcttcct gcggtgtaa 1359 



<210> 1330 
<211> 186 
<212> DNA 
<213> B.fragilis 



<400> 1330 

tttcaaaaaa agccctacct ttgcactcgc aatcgggaaa caacgataca gaacgaaagc 60 

aaagtgaaac aaaaagagca gaaatgctca aaacaggatg gcccgttcgt ctatcggtta 12 0 

ggacgcaaga ttttcattct tgaaaggggg gttcgattcc cccacgggct acaattaaaa 180 

aaataa 186 



<210> 1331 
<211> 627 
<212> DNA 
<213> B.fragilis 



<400> 1331 

atgaaaaaaa agtaccttgt aattgtttta ttgttcttgg tggctaatac ttgctatatt 60 

tatcatcagc atgtaggctt aaagaaagtg cattcgtttc tttctgaact gagacaagat 12 0 

acaggagaac ggttgggcat attagaaatg caaaaagagg atagagtgta tgaaatacag 180 

ttcaatggtc aattaattga taaagaactg actgttattg atactgatgg caaacagaaa 240 

aaaataggtg atttaataat agacaatccc aaattagtat ttaggttttc cgaacttaat 300 

tgtgataaat gtattgatgc tcaaatacgt aatttgaatg agtatgttga ttcaatagaa 3 60 

cttcaaaata ttattttatt aacagatttc caaagtcttg aatatatgcg tagctttcag 42 0 

aaatcaaata aagtgaaatt tgctatttat aacatggagg cggagatcga ttctgttttg 480 

gtgaatattg atttacctta tttttttgtt ttgactcctc aagaagaacg gattcaatgt 540 

atgtatattc ctcataaaga aatacctttt ttgacagagg tatacttatc ttctgtcaag 600 

aggaagtttt ttactgattt ggaataa 627 



<210> 1332 
<211> 423 
<212> DNA 
<213> B.fragilis 



<400> 1332 

ggttggtgtt tatattacga attatgcatc gatatttgca tacgttgcat tctcttcaat 60 

gaactcgcgg cgtggaccta cgtcttcacc catcaacatg gagaagatat agtcggcttc 12 0 

tgctgcgttg tcgatattaa cctgtttcag catacggttt tccggatcca tagtcgtttc 180 

ccacaactgc tgggcattca tctcacccaa acctttgtag cgctgtgtat ggattgcatt 240 

ttccgaaccg ccaccataag tgtcgataaa cttctggcgt tgcgcatctg tccagcaata 3 00 

ctcttctatt tttccttttt tgcaaaggta gagcggggga gtggcaatgt acagatagcc 3 60 

attctggatg atctgtggca tatagcggaa gaaaaaagtc atgatcagtg tgtcgatgtg 42 0 

tga 423 



<210> 1333 
<211> 342 
<212> DNA 
<213> B.fragilis 



536 



<400> 1333 

atgggtgtga agagaaatac tagacatagg atttggcttg catggatgtt actgatgaca 60 

tttatgccct tgtctgtcgt gaaggttttt cataatcatt ccgaagaaac ttcgataacc 12 0 

tgtacagacg cacattccgg aaagtcccat cacacatgtg agacttgtcc catctgtcag 180 

tttatgcttt ctccatttat tgagacccct tctactcttc tgacttatac gcccctttac 240 

gtaaaatggg agagtggaac ttttcaggat aaaaagcttt ctatcgcttt ctatccgcat 3 00 

tatcttcgcg gacctcctcc tgttttttat catatcgttt aa 342 

<210> 1334 
<211> 2643 • 
<212> DNA 
<213> B. fragilis 

<400> 1334 

cacaccgtcg atccttccag ccccgtggtg aagactcaca atctttcgtt ggaagaggta 60 

gctgtatatg cctaccggaa taaagccggg aaagccaatt gggaggtaac ccgtgcgtcg 12 0 

gccgatacga taccggccga tactgcatcg accgatttta atagtgagat tgatatccgg 180 

aatatagaac tcaaacatgc caatcttgtt ttcgatgatc ggaatacgga tatttactca 240 

cgcatcgatg atgccaatct gaagttgagg ctttcgctga caaagggtat ttctacttta 3 00 

gggttgaaat ttgacaacaa gaatattctt ttctggcagc agggagaact gttggtcaat 3 60 

aagatagcta cttctttacg gacagatatt atggtggaca ggcagaccgc cgtctggaaa 42 0 

ctgaaggata cggaactcga tgtgaatggt atccggttgg atgtaaacgg agctttccgg 480 

cgggataccg tggcgaagac aatcggtatg gatctggaat atggtttgca tgccccttcg 540 

atggagacgg tgttgcggat gattccgaaa tcgtatgtga aggacactaa agtctcggct 600 

aaaggtgaag ttaccgttag cggtagggtg aggggtgtgt atggtgacaa aaagttgcct 660 

gccgtttcac tcaagatcgg tatcaaagag gcttcggcac aatataaggg tttaccatac 72 0 

ggtattgatg aggtaacggc agattttgat gcgtatgtcg acttgatgcg tcatcagcct 7 80 

tcgtatctaa acctgaaaat attccatttt aaaggggcgc atactgaagt tttggccgat 840 

gcgaaggtag acgatttgct ggatgatccg ttgattactt tccataccaa gtcgactgtc 900 

gacctggatg cactggctaa aacctttcca ttgcaggaaa gcgtgacaat cacgggaaaa 9 60 

ctggatgcgg atatggggat gaagtgccgc ctttctgctt tgaagaagca ggatatcggg 102 0 

cgcatgaagt tgggaggcaa acttgaattg aaagattttg aattgaagga tactgccaag 1080 

gatttcgatt ttctaggtaa tgctactttc cgtttccgtg ataacgaaac cttgcaggcg 1140 

cagatggatg tccgtaaact ggtgttgaga agccgttttc tctcttctga catcgaacgg 1200 

ttggttgcca atgtttcttc gactaatccg caggatacca accgcattgt ctctttgcag 12 60 

tgcgatatgg aggtcagtaa gctccgtgct tcgatgggcg attctataaa gttatacagt 1320 

gcccgtgcaa aagcacaagc tgcactgggg cctcaggggg tggatgtaac gaagccggcg 13 80 

attgattttt cacttcgtgc cgattcgctt ttcttcagtg cggcaggaac tcgcatggct 1440 

atgaatgtgg cgggcatcaa gatgaaggct gataagctga atgactccct gtggatgcct 1500 

aaagggattg ttgggttcaa tcgcttacgc ttccgtacgc cggaattcgg cttgcctatt 1560 

cgcatgtcaa aaacagcggt gacggtggat ggcccgaaga ttactttaaa gaatgcttct 162 0 

gtccgtatcg gacgctccaa tatgacggct acaggcgata tgatgggtgt ttacagggca 1680 

atgacgaaag gagagaagtt gacggcacat ttgtctctta cgtctgatct gatcgattgt 1740 

aatcagttga ttaattctct ttctttcccc gaggatacta cggaagtgct taccgacagc 1800 

gtaccttcgg agatgaaatt gtttgtgatt ccccgaaata tagattttga attgcaaaca 1860 

gatctgaaga aggtcatttt tgagaaaatg ctgtttgaga atgtacatgg agcggtagat 1920 

attaagaatc aggccataca tctggaagat ctttcaatgc gtgccctcga tgccgatatg 1980 

aaggctgtga tggtctataa ggccggtagt ccccgcggcg gatatgccgg ttttgatttt 2 04 0 

aagatccgaa acatcaatat tgcgaagctg gtcgactttg ttcctgcact cgatacgata 2100 

gtgcctatgc ttcgttcttt caagggccgg gttatgtttg atgttgctgc cgatgcccgt 2160 

ttggattcgg caatgaatat ccgtatcccc actttgcgtt cggccattca catcaaagga 2220 

gacagcctgg tcctgatgga tggtgaaacc tttgctgaga tctcaaagat gttgatgttt 2280 

aagaataaaa aagagaatgt attcgatagt atctctgtca atgtgacggt acacgacggt 2340 

aatgtgaccg tctatccttt cctggtagag atagaccgtt ataaagctgc tgttggaggt 2400 

gagcaaggat tagatatgaa ctttaactat cacatctcca tcttgaagtc tccgttgccg 2460 

tttaaagcgg gagtgaatat ttcgggcaat ctggacaaaa tgaagttccg tataggtaag 2520 

gccaaatata aggatgcggt tacccctgct gcggtacatc gggtggatag tacccgcatg 2580 

aatatgggca atgagattgt taatcgtttc cgacgagtag tattgggacg acaacctcga 2 640 

taa 2643 
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<210> 1335 
<211> 654 
<212> DNA 
<213> B.fragilis 



<400> 1335 

aaacagagcc tgaattacaa ttggacgtta 
ctttttgtgt tactcctgat tatatttttg 
aattggcttt acctgtttcc gattagtctt 
ataaaagagt atgatgccaa agagcgaacg 
gaaaaagata ttacgtggct ggtaggaaag 
aatgaattat tgcatgaaaa agttttgttc 
attaagacgg gatttgctgt tgtcaatagt 
aaggtgattg gttcgatgat ttcagattta 
ggatatattt atgtagacac aaaggattta 
ccaatgctat tgcgtgtaga tactgataat 



tttatgcgct 


tccaggtgag 


atctatactt 


60 


caacaaacta 


tagcaatgaa 


cgtattgaga 


120 


ttaatgatac 


ttctgtttag 


ctgtaatcag 


180 


agcagaaaca 


caagagttca 


acatttgatt 


240 


aaattaaata 


gtgtcgattc 


tattttgcct 


300 


ttgtttaatt 


atcatgactg 


cggtacttgt 


360 


atagacaggc 


agaagggtaa 


ggaatatgta 


420 


acatctgtgc 


agcgtttcaa 


tgagtatcat 


480 


ataagaagag 


aactcaaata 


tgctcccact 


540 


cggattcttg 


aagcattgat 


tccaactacc 


600 


attgcatcct 


gtctgaaaag 


atag 


654 



<210> 1336 
<211> 1044 
<212> DNA 
<213> B.fragilis 



<400> 1336 

cagattaagg aacaactact aacaaagtat atggaagaag ttattgaacc ggtaagcaag 60 

gagctgatta tagctgagtt gactgaagac aagcggttgc gtatgaccaa taagagtaat 120 

aatcagatat acattattac ttatcaggac tctcctaata ttatgcggga gatcgggcgt 180 

ctgcgtgaga ttgcctttcg ggctgcagga ggtggtacgg ggctgtcaat ggatatcgat 240 

gagtatgata cgatggagaa tccgtataaa caattgattg tatggaatcc tgaggcagaa 3 00 

gaaattttag gaggctatcg gtatattctg gggacggatg tgcgttttga cgagcat ggt 3 60 

gctccggttc ttgctacttc gcacatgttc aatttttcgg atagatttgt gaaggaattc 42 0 

ctgcctacca ctattgaact cgggcgttcg ttcgttacat tggaatatca gtcaacccgt 480 

gccgggagca aggggctatt tgctttggat aatctgtggg acggattggg ggcattgacg 540 

gttgtgatgc caaatgtaaa atatttcttt ggtaaagtaa cgatgtatcc cagttaccac 600 

cgtcagggaa gagacatgat cctttacttc ctgaagaagc attttggaga taaagacgga 660 

cttatcactc cgatgaaacc gctggaaatg gagacggatg aggctgaact ggcaaggatt 72 0 

ttctgcaaag attcatttaa ggatgactac cggatactta acggtgagat ccgtaaactc 7 80 

ggttttaata ttccaccgtt agtgaatgct tatatgagtc tcagtccgac catgcgtatg 840 

tttggtacgg ctatcaatta cggttttgga gatgtagaag agaccggtat cctgattgcc 900 

gttgacgaaa tccttgaaga gaaacggatg cgtcatatcg aatcgttcgt gaaaaacgat 960 

ccggaagatt gccagataac ttccggggtg aataaggttt tcacaccgaa agtcgttaca 102 0 

ccgcaggaag actgttcccg ttga 1044 



<210> 1337 
<211> 1461 
<212> DNA 
<213> B.fragilis 



<400> 1337 

ggtatgaaac taaaagagat tctaacatct atccaaccgg tgaaaattac cggaaatcag 60 

gatatcgaga taaccggggt tgacatcgac tccagacagg tagagtccgg tcatctgttt 12 0 

atggccatgc gcggcacaca gaccgacgga catgcctaca ttccggcagc ggttgaaaaa 180 

ggtgccacgg ccattctttg tgaagagtta cctgcagaac ttgtagaagg agttacctac 240 

attcaggttg ccgacagcga agatgccgta ggaaaagcag ctacgacttt ctacggaaat 3 00 

ccgagctcaa aattggaact ggtaggcgtt accggaacaa acggaaagac aacgattgcc 360 

accttattat ataatacgtt ccgatacttc ggctataaag tgggattaat ctccacggta 42 0 

tgcaattata tagatgatga agccattcct accgaacata ccactcccga cccgatcaca 480 

ttgaatcgtt tattgggacg catggcggac gaaggttgca aatatgtttt catggaggtc 540 
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agttcacact ccatcgcaca aaaaagaatc agcggactga aatttgccgg cggcatcttc 600 

accaacctga cacgcgatca tctggactat cataaaacag tagagaacta cctgaaagca 660 

aagaagaagt tcttcgacga tatgcctaag aactctttca gtctgaccaa cctggacgat 720 

aaaaacggac tggtgatgac acagaacacc aagtcgaaag tatatactta ctctctccgc 7 80 

agtctgagcg acttcaaagg aagagtactg gaatctcatt tcgaggggat gcttctcgac 840 

tttaacaacc atgaactcgc agttcagttt atcggaaaat ttaatgcatc aaacttactg 900 

gctgtatttg gtgctgccgt attgctgggc aagaaagaag aggacgtgct ggttgctctc 960 

agtacgctgc atccggttgc cggacgcttc gatgccatcc gttctccaca aggatataca 1020 

gccattgtag actatgcaca cacaccggat gccctggtca atgtcttgaa cgccatacat 1080 

ggagtactcg aaggaaaagg caaggtaatt accgtagtag gtgcaggcgg taaccgcgat 1140 

aaagggaaac gccctatcat ggcaaaagaa gcagcccggg caagcgaccg ggtcatcata 1200 

acttcggata atccccgttt cgaagagccg caggatatca tcaacgacat gctggcagga 1260 

ctggataccg aggataagaa aaaaacgcta agcatcgcag accgtaaaga agctattcgc 132 0 

acggcttgta tgcttgcaga aaaaggagat gtgattctgg ttgccggaaa agggcacgag 13 80 

aattatcagg acatcaaagg agtgaagcat cactttgatg ataaagaagt tttaaaagag 1440 

attttttcat tgactgttta a 1461 

<210> 1338 
<211> 249 
<212> DNA 
<213> B. fragilis 

<400> 1338 

aaaccgtcat gcaattggca tcacttctat tataaagccg gaaaaagaaa gggtgtaaaa 60 

caacgcctct ttaagcgggc cccgatacac gaccatttcc gtacttcgat gagtctggta 12 0 

gagccgggat gcagcgtaaa gtttacgaag cccgatcagt tgttccacga atcaaaaatc 180 

acagtccgct tctggattgt aaccatcgtg ctggcagcta ttacgattat aacgctgaag 2 40 

attagataa 249 

<210> 1339 
<211> 1788 
<212> DNA 
<213> B. fragilis 

<400> 1339 

atgttcggtc gaatcaataa ttgtagtaaa atgaaaataa ataaattgct tttcggaatg 60 

ctgctgcttc tgttggcttc ctgcggttca tcccgcaagg tggagaaaca atcggagcaa 120 

gttgccgttc aggaaataaa tctcactccg gagcagcaac gtaaatatga ctatttcttt 180 

cttgaagcat cgcgcctgaa agtgaaaaag gagtatacgg ctgctttcga cctgttgcag 240 

cattgcctgg ccatcaatcc gaccggttcg gcagctctat acgagattgc ccagtattat 3 00 

cttttcctta aacaagtgcc gcaaggacaa gaggcattgg agaaagcagt cgcttatgcc 3 60 

cctgataatt attggtatag tcaggcattg gccggtctgt accaacagca ggatcagaaa 42 0 

gaaaaagcga taggaatact cgaaaagatg gcaacgcgtt ttcccgctaa acaagatccg 480 

ttgttcaacc tgctcgattt atataatcag aaggaagact atggtaaagt tatttctacc 540 

ctaaaccgta tagaggaaaa aacgggaaag aatgagcaga tcaccatgga gaagtttcgt 600 

atctatctgc aaatgaagga caacaaaaag gcttttgaag aaatagagag cctggtcaat 660 

gagtatccga tggactaccg ctatcaggtg attctgggag atgtctatat gcagaatggc 720 

aagaagcagg aagcttatga tacttataag aaagtacttg ccgcagaacc ggacaatccg 7 80 

atggcattgt tttcattggc ttcgtattac gagcagaccg gacagaaaga actgtttgaa 840 

caacagatgg atactttgct actgaaccgg aaagttcctt cggatacaaa ggtgaatgtg 900 

atgaggcagt ttatcgtcca gagcgaacag gagggaaaag acagtacaca ggttatcgga 960 

ctctttgacc ggatgatgca gatggatatg gatgatgtgc aaattccgat gctttatgca 102 0 

cagtatctgc tatccaaagg aatggaggca cagtctattc cggtattgga gcaagtagta 1080 

cagatagatc ctaccaataa agcagcacgc atgactttat taggttccgc catccggaag 1140 

aatgattatg agcaggtgat taagatctgt gagccgggga ttgaggcaac tccggatgcc 12 00 

ttggagtttt atttttatct ggttattgca tataatcagg ccgaacattg ggacgatgta 1260 

ctggaagtca gccggaaagc actagaacat gtcactccgg agagtgacaa gcaaatggtc 132 0 

tccgatttct atacaattat aggtgatgta tatcatacca agaagttgat gaaggaggca 13 80 

tatgtagcgt atgattcagc tttggtttac aacccgtcca atataggtgc actaaataat 1440 
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tatgcttatt atttatcggt agagcgcaga gatctcgata aggcagagga gatgagctat 1500 

aaaaccgtaa aagccgcacc gaacaatgct acttatctgg atacttatgc ctggattctt 1560 

tttgagaagg gaaactatgc tgaggcccgt atctatattg acgatgccat caagaatacc 162 0 

aaaccggaag aagagagcag cgtggtgttt gaacattgcg gggatatcta ttttatgacc 1680 

ggtgacgtag aaggtgcttt gaaatattgg aagaaagcac tggaactggg cactgaatcg 1740 

aaaacactta aacagaaaat agaaaaaaag aaatacattg cagaatga 17 88 



<210> 1340 
<211> 1170 
<212> DNA 
<213> B . f ragilis 



<400> 1340 

acagtgatgc taaatttgaa gagtaatatg aataaagaaa ataataaaga aggacagggt 60 

gatgccttaa gagtcatcat cagcggtggt ggtaccggag ggcatatctt tccggccgta 12 0 

tccattgcaa acgccataaa agagttacgt cccgatgcac aaatcctgtt tgtaggagcc 180 

gaaggcagaa tggaaatgca acgagtaccg gatgcaggct atcagattat cggattgcct 240 

gtagcaggat tcgatcgtaa acatctgtgg aaaaatgtcg ccgtattatt aaaattggta 3 00 

cgcagccaat ggaaagcacg aaatattatc cggcaattcc gccctcaggt agcagtagga 3 60 

gtaggcggat atgcaagcgg tcctacttta aaaatggcgg gaatgatggg agtacctact 42 0 

ttaatacaag agcagaattc atacgccgga gtcaccaata aactattggc acagaaagca 480 

cgaagaattt gtgtggcgta tgacggaatg gagaaattct ttcctgccaa taaaatcatt 540 

atgacaggta acccggtacg tcagaatctg ctggcggaaa aaccggaacg tgaacaggca 600 

attcgttctt tcgggctgaa tccggaaaag aagaccattc tgattttggg tggaagcctg 660 

ggggcacgca ccatcaataa cacattgatt gcgggactgc aactgattcg ccggactaca 72 0 

gacgtgcagt tcatctggca aacgggaaaa atttatcatc aacaagtgac agaagctgta 7 80 

aaagcagcgg gagagatact caatctgttt gtaacggact tcatcaaaga tatggctgcc 840 

gcttatgctg ctgccgacct ggttatttca cgtgccggtg cagggtctat ttccgagttc 900 

tgcctgctga ataagcccgt tatcctggtt ccgtctccta atgtggcaga agaccatcag 960 

accaaaaatg ctttggcttt ggtaaataaa caagcagcca tctacgtaaa ggatgcggaa 102 0 

gcagaaaaca aactattacc ggtagcactg gaaacgatcg ccaatgccga gaagctgagc 1080 

gaactcagtg aaaacattgc acacctggct ttaccggatt ctgctgtcgt tattgcaaaa 1140 

gaagttataa aattagcgca acaatcatga 117 0 



<210> 1341 
<211> 621 
<212> DNA 
<213> B.fragilis 



<400> 1341 

acagaaaata gaaaaaaaga aatacattgc agaatgaaag gaagtaagtt gaaacaaact 60 

gtcattaaac agtcgtacct gctgcctttg ctacttatgg tagttctgct tgcaggttgt 12 0 

aaaacatcaa aggtggtcaa gactacaccg gtagaaccgg cttatctgtc atctaaactg 180 

caactgacag tgcccaacaa aaacggcagt atgaccgtaa gcggcagcat gaagatgaaa 240 

agcggtgaac ggatccagtt atctgtcctg atgccggtat tccgctcgga agtaatgcgt 300 

atggaagtta ccccggatga ggtgttactg attgaccgta tgaataaacg ttatgtgcgg 3 60 

gcaacccgtg atgagctaaa gggaatactg cccgagaatg ctgattttga ccggttggag 420 

aaacttttgt tcaaagcttc acttccgggt gagaaaaagg agctcacagg acgtgaattg 480 

ggaattccat ctctggaaaa ggcaaaggtg agactatctg atttctcgac tgccgaattc 540 

gaattaatac ctactgaggt atcgtccaga tacactcaag tagcattgga ggatctgcta 600 

aaaatgctga tccaactatg a 621 



<210> 1342 
<211> 453 
<212> DNA 
<213> B.fragilis 



<400> 1342 

atagtaaaca gactgaatat gaacatacaa gtaatcaata aatcgaagca cccgcttccc 



60 



540 



gcatacgcga 
tctttggctc 
ggatttgaag 
ctgaattctc 
aatctttcgg 
gcgcgccacg 
gcgggtggtt 



ccgaactttc 
ccatgcaacg 
cgcaaattcg 
ccggtaccat 
ccgagacgtt 
aacaagctgt 
tcggacatac 



tgcaggaatg 
atgcctggtt 
tcctcggagc 
cgatgcagat 
tgtcatagaa 
atggaaagaa 
cggaagagga 



gatatccgcg 
cctacaggac 
ggactggctt 
taccgtggtg 
gatggggagc 
gttgaggtac 
tag 



ctaatatttc 
tgttcatagc 
tgaagaaagg 
agatttgcat 
gtattgcaca 
tggacgaaac 



cgaacctatc 
tctgccacag 
gattactgta 
catcctggta 
gatggtcatt 
ggaacgcggc 



<210> 1343 
<211> 2172 
<212> DNA 
<213> B.fragilis 



<400> 1343 
gacgatttaa 
tctaccttgc 
gtgacggaga 
gggaaagagt 
ccgggagtac 
ggattcaacc 
gccgatcatg 
cctgcttcct 
ccattgcctg 
ggtacattgg 
aggtattcgg 
acccaacgta 
gtcagttggg 
gtttttcaga 
caggatgacg 
gtttctaccc 
cagaagaatc 
ccggacaaag 
aagctgaagt 
cagaggaata 
ggggctttct 
cgctatgatt 
ctgcgggagc 
gttcgtcgcc 
ggtcatcttc 
gcatcgaacg 
tccgaacgcg 
agtttgtcac 
tggtcgatac 
gcgggcggtg 
ggagaatatg 
gcttcactcc 
cagcatattg 
ttgttgaatg 
ttatcagccc 
gtagaaatcc 
ttattaaaat 



gtatgcgtat 
atgctcaaga 
gttatcagca 
ttctgcgtga 
attccatgga 
ggatttcggt 
gactcgagct 
tattgtatgg 
cgggtaaccg 
gaggttcact 
aacaacactt 
tgcctgttta 
ctgccggatt 
aaacagggtt 
gggacagccg 
ggcagagcct 
atcgcgagga 
atccggataa 
tgtttgcttc 
cgattgccgg 
ggatgaccac 
acggaaagat 
agggttatgg 
attttggcga 
ttcaggtaaa 
gagttcatca 
gttggcagtt 
catttgtcag 
tgccccatgc 
aggctgcagt 
tctatactta 
gcaatacact 
ccgcacaaca 
ccggagtatc 
ggaatttatc 
ccgaaccggg 
ga 



cggatttttt 
gcccgattca 
tctgaagaat 
acacttcaca 
tatcggttcg 
tgtagaaaat 
ggatgcattt 
tagtgatgca 
gttattcggt 
gatgcttggg 
tggggattat 
tcaccgaagg 
tagaaaagaa 
ctttcccggt 
taacatagaa 
tttgtacgat 
gtggagccgg 
ggaactggct 
tgccgtatgg 
ttattcattt 
ttatcgtccg 
agatgcatcc 
tgatgaattt 
ttattccggt 
tgtcggccat 
cggcactttc 
tgatgcttca 
ctggttcagc 
cggacagatt 
gggcattgac 
taattgcgat 
gacgtggcga 
ccgggtggca 
tgccaacctc 
cggtgccaaa 
acggaacttt 



tcgggtatgc 
ctgaaagccg 
aagaactcca 
gggaacttga 
ggcttttcta 
gggattaagc 
aatgccggtc 
atgggagggg 
gaagcgagct 
atcaaaaaag 
cgtattccga 
cttaaaaata 
cggtatgtct 
gcacatggca 
cttccgtaca 
aaatgggcgc 
tttcatactc 
tttacgttga 
caacacacgg 
cttttgcctg 
ggacccactc 
gcttatactg 
atccggaagt 
tcactggggt 
agcttccggt 
cggcatgaac 
tatacctatg 
aattatattt 
taccggtaca 
ttcctccgac 
gaacatattc 
tataaagagt 
cgaaatgaag 
cggatagggg 
tactttaatc 
cagattttaa 



tgggagggtt 
tgtccttatc 
catggcgtat 
tacaaactct 
aacctatgat 
aggagggaca 
aggtaagtat 
ctattgaact 
tgttggggaa 
atgcctggta 
cggatactat 
cagccggttt 
cttcttattg 
ttcccgatgt 
gccaggtgaa 
tgacgtggga 
attatgatgc 
acacttatag 
caggatggga 
cttaccggag 
tgtccttcag 
atccttactt 
atgaatggcg 
tggtatggag 
tgccgggagc 
agggagatgc 
aaaatgggcc 
ttcttcggcc 
ccggtgcgga 
actttaacta 
ctctaagttt 
tcagcatcta 
atcccactcc 
gtatatgggc 
atcttagttt 
ttaaagtacc 



atgtttaatc 
tgaagtggtg 
ggaggtggta 
tggaacactt 
tcgcggaatg 
acaatgggga 
tcgcaaaggt 
tgttccatta 
atccgtcaac 
tacctgggca 
tgtctacctc 
tgagagagat 
ggtgagtaat 
gtcgcgtttg 
tcacttgaaa 
tattggcttt 
tcagcctgta 
ttctgctgtt 
tgtacaatat 
atttaccaca 
tggaggactt 
ggcaatatat 
tagttacccc 
tccttccgga 
caatgaattg 
agcgcttgct 
gttgtcggta 
taccggagaa 
agccctgttt 
tcgtgtcagt 
ctcgcctcct 
tggtgaggta 
cggagcgcag 
cgaggtgact 
ttaccggaaa 
atttaaaagt 



<210> 1344 
<211> 357 
<212> DNA 
<213> B.fragilis 

<400> 1344 

aacaagagaa gtgaaatgga agataaagaa gcaaaaaaga agaaaagcaa ctccctgaaa 



120 
180 
240 
300 
360 
420 
453 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2172 



60 



541 



agtattctgg gaggtgatat tctggctacc gacttttttc gccgccagac taaattgctg 12 0 

gtactgatta tggtgctcat cattttctac attcataatc gctacgcaag ccagcaacag 180 

caaatcgaaa tagataagtt gaaaaaagaa ctgatcgaca taaaatatga tgcactgaca 2 40 

cgaagttcgg aattgatgga aaaaagccgt cagtcgcgga tagaggatta tatatcgacc 300 

aaagaaagtg acttgcagac atcaacccat ccaccttatt taatcagtac gaaatag 357 



<210> 1345 
<211> 597 
<212> DNA 
<213> B.fragilis 



<400> 1345 

agcttaattt tcccccgtca ctgtctaaaa aatgccgtaa acttgcatca tcgaaaaaca 60 

acgaatattc acaatttaaa aagcaacgtt atgaaaagtt taagcttcag aaaagattta 120 

attggagttc aggaagagct acttcgcttt gcatacaaac taacaaccga ccgtgaagaa 180 

gcaaacgatt tgttgcagga aacctctctt aaagcgttag ataacgaaga taaatatact 240 

cccgacacta actttaaagg atggatgtac accatcatgc gcaacatctt catcaataat 3 00 

tatcgcaaag tagtacgcga tcagactttt gtagatcaga ccgataatct ttatcatctg 360 

agccttccgc aagaatcagg actcgacagt accgaaagtc gttacgactt aaaagagatg 42 0 

caccgcatcg tcaattcatt acccaaagaa tataaagtgc ctttctctat gcacgtttcc 480 

ggattcaaat accgtgaaat agctgagaaa ctggacttgc cgctcgggac agtaaagagc 540 

cgtatctttt tcacccgcca gcgtttacag gaagaactga aagactttag acaatag 597 



<210> 1346 
<211> 507 
<212> DNA 
<213> B.fragilis 



<400> 1346 

ataattttct tttcgccttc aaaaatgaaa gggatggtaa cgattccgac agtcaggatg 60 

tccatctctt tagcgatacg ggcgatgacg ggagcggctc cggttccggt tcctccaccc 12 0 

attccggcag tgataaacac cattttggta ccatcgttca gcagagtttt gatgtcttcg 180 

atgctctctt cggcagcatc acgtgcacgc tccggacggt ttccggcacc gagtccttgt 240 

gtgatggaac gtcccagttg cagtttgacc ggtacgggag actcagccaa tgcttggttg 3 00 

tcggtattgc agagaacgaa tgttacgtcg tgtatgcctt cccggtacat gtggttgacg 3 60 

gcgttacctc caccacctcc tacaccaatc actttgatga ttttcggtga atctgtaggg 42 0 

aaatcgaatt gtactatctc gtccatatta tattgtatta tgaattatca ctttatttca 480 

tatcttcgtc agaaaaaatt tcattag 507 



<210> 1347 
<211> 369 
<212> DNA 
<213> B.fragilis 



<400> 1347 

tataaatcta taagtctttt tattatgaaa cggaaaatct ttagtttgat tgtggtgctc 60 

gttgttacga tttcttttat caatgtttgg atttcagttg gttcaactaa tgcaaaagtg 120 

aagttgcgtt tggctgcgat agacgcaatg gcaatgattg aagcagaaac acctggagat 180 

actgaacttt cgctttcagg atcatgtaag attactttta cttgttatga tagctggacg 240 

ggagcggcgg atggtagtat tacttgctgg ggagctgaat attgcaaacg aggaattaaa 3 00 

aaggaaggta tagttattat tacagagact cggtgggtgg aatgtgatgg taaaagaaca 3 60 

gaatgttga 369 



<210> 1348 
<211> 1245 
<212> DNA 
<213> B.fragilis 



<400> 1348 



542 



ggcaatcata tgacaaaggc tattgacttt aggtatgagt gtaagataat tgtaataaat 60 

gaaaaacaaa ttatgaaaat agtgaatatc tttgtagcct tattctgtct cacggcatgt 12 0 

tcgttgtcta atgatttgaa ggacaggctg attttgaaac ctcaatccat attgattgat 180 

ccggacaaag tgaaagactt tattgatttg acccctttgt tgagagattc ggttgagatt 240 

attcctttgg agactaaaga tgagtgtttg ttatccgaaa ttgaacggat tgaattctat 3 00 

aaagatcgta tatttgtact tgatagaact cgcaaaggag tttacatgtt tgatcaatcc 3 60 

gggcgattta tcggtaagat tggttgtcaa ggaagtggtc cgggagaatt tacctctgtt 42 0 

ggattctttt gtgttacagg agattctgtt ttaatttcag atcagcatca atctaaatgg 480 

gtagtttata atcttcaaga taagagaacc acggaatttt cttgtggtga gtttacttat 540 

ttgaatgggt tcttgatggg gaggaattta tatttagtct ccaattataa caagtcccaa 600 

tcagatcgtt ttaatcttta taagtttgat gtatctactc gtaagataga agaggcttta 660 

attccatttg aagaaaagat ggataagtat agtactaccg catttactat ttatgctagt 72 0 

caatatcaag atacagcttt tctgatttat ccctttaacg atactattta tgaggtaagt 7 80 

tctaaaggag ctcaaccttt ttatacgatt gattttactc aacggaatct tcctgatgat 840 

atagagccga taaataatag ctttcgtctg gctgttgcaa aaggaaattt tgtgaaaggg 900 

ttgagttata tgcagatgtc tgggaattac atattaggac gttatgcaga taagggatac 960 

tttcgctatc tctctgttga ccggagtacg cttaagtcta cagtaggcaa cagttttgtt 102 0 

gtgagggatt taggatatct tcctgttact tctttctata ccatcggtga tgctttggtc 1080 

tctgtgtact ctgcaagcgc actgatgcag atgcttgatg ttatattgtc tccggattct 1140 

cctataaaag aaaaatatag aactaaattt gagtctttga aacagataac taattgcgaa 12 00 

ggtaatcctg tattattgaa atttcaattt gagagtgctg aatga 1245 

<210> 1349 
<211> 798 
<212> DNA 
<213> B.fragilis 

<400> 1349 

gatagtagta ttcgattgtc tatttcctct ttttttgtag ctttgttacc ggaaagaaaa 60 

gttagagcta tgttgcaaaa aacggttgga atcgttcttc atgtattaaa gtataacgat 120 

acatcgaata tcgtagagat gtatactgag ttgtcagggc gtgcgtcctt tttggtcacc 180 

gtgcctcgtt ctaagaaggc tactgtcaaa tcggtgttgt ttcaaccatt ggctttaatc 240 

gaatttgagg cggattacag accgaatact tcacttttta gaattaagga ggccaaatct 3 00 

ttcagtccct ttacttctat tccctatgat ccttttaaat cggctatcgc tctttttctc 360 

gctgaatttc tttatcgtgc cattcgtgag gaggctgaga accgtccgtt gtttgcctat 42 0 

ttacaacatt ctatattatg gttggatacc tgtaagatta gttttgctaa tttccatctg 480 

gtatttctga tgcgtctttc acgctttttg gggctgtacc ccaacctgga cgattatcat 540 

gcgggtgact attttgatat gttgaatgct acttttacgt ccgttcgtcc tcaactgcat 600 

tcttcttata tacagcccga tgaagccgga cggttgttgc agttgatgcg tatgaattat 660 

gaaaccatgc atctttttgg aatgaaccgt acggaaagag cccgttgtct ggctattatt 720 

aatgaatatt accggctgca tcttcccgat tttcctatac tgaagtcact ggatgtattg 780 

aaggaactgt ttgattag 798 

<210> 1350 
<211> 516 
<212> DNA 
<213> B. fragilis 

<400> 1350 

aaacctaaaa tactaatgtc catcaattac gcagttacca agaaagtaga caagagcaaa 60 

ggtatcgcca aagaacgata ttatgccact acacgcgctt tacagaaaaa acccgtaaac 120 

agtgtacaaa ttgctaatca actcgcagaa agaagctctc ttcaaaacgg agacgtactc 180 

tctgcactta ctcaattatc ggatattatt gccgctcacc tgaaggaagg gcgtactgtt 240 

tccatcgatg gattgggcaa tttctacccc agtatcacca gtgaagcagt ggacaaaccg 3 00 

gaagaatgca ccgccaacaa agtatgggta tcccgtattt gctttaaggc cgcacccgct 3 60 

ttcctgaaca atgtgcggaa aaccgatttt gtcagcctgc aacttaaata cggacgcaag 42 0 

tctacaaagt cacaaaacgg ttccgacaag gagacaaccg atgttatccc ccaccagcaa 480 

agcatctctg aagattcttc attatcagac gaataa 516 



543 



<210> 1351 
<211> 1059 
<212> DNA 
<213> B. fragilis 



<400> 1351 

aagaacagaa tgttgattat gagcgcttct tttaatttat tagacaggaa gagactgaaa 60 

ggcttgttgt tattttgtcc ggtttgggtg ttattgttct ggggatgtgg acgtacggtc 12 0 

gaatctcctg aaaaagtgtt gaaatgtgaa ctcgtttctt acataaagag ttatcctgat 180 

tcttcatttt tctctcaggt gggtacaatg caatatcagg atggtaagat ttatttgttg 240 

gatgaggctc ggagagatgt ggctgttatg gatttggagt tttctgattt tagtttgatc 3 00 

ggtaaacctg gagatggacc tggagagttg gttcgccctg taggatttta tgttgagaag 3 60 

gatacagtct atatattgga tggaggaacc gtgaacgtaa aaagatattt tgattcggaa 42 0 

tttatatctt ctttttcagt tcctgctgcg aacgattatc gtttctttat gaataaagat 480 

actattttct tatcagcagt tactgactcc actttttata cgaagagtgc tgaaagttgg 540 

caaagagggg atttgtttac acttgttttg gcagggaacg tccatgattt tggtaatgct 60 0 

aggaggaata tggtgcttaa tcagaggcat ttagtaaaag acagtacttc tctatatggt 66 0 

attacaagca gttcttcttt attaggaaaa tatgatttgt catctaataa acaagtagct 720 

acttttgatt tatcttctgt ttccttgata aaagataacc tgacttacga aggaagtcaa 780 

ccatatgatc ctaagagtta ttatactttt atttcggatg cttatgcgat gaatggctat 840 

ttgtatttgc tttgttcaga actgaaggat cgggataagg gaggctttcg ggtaaataag 9 00 

atattgtgtc tgaaaacaga gcctgaatta caattggacg ttatttatgc gcttccaggt 960 

gagatctata cttctttttg tgttactcct gattatattt ttgcaacaaa ctatagcaat 102 0 

gaacgtattg agaaattggc tttacctgtt tccgattag 1059 



<210> 1352 
<211> 483 
<212> DNA 
<213> B. fragilis 



<400> 1352 

ggtgaaatga tacgtttttt aggcaatatt gaagcaaagg cggacgcgaa aggaagagtt 60 

tttatccccg cccaattcag acggcaacta cagtccggct ctgaagacaa gctcatcatg 12 0 

cgcaaagacg tatttcaaga ctgcctggtg ctctatccgg aagaggtctg gaatgaagaa 180 

ctggacgaac ttcggcagcg actgaataaa tggaacgcca accaccaact tatcttccgc 240 

cagttcgtca gtgacgtcga aatcatcacg atggacggca acggacgtat actgataccg 3 00 

aaacgctatc tgcaaatcac cggtatacaa agcgacgtac gctttatcgg ggtagacaat 3 60 

aagatagaaa tctgggcgaa agaacgtgcg gagaaactct ttatggaacc cgaagcattc 42 0 

ggagcagcct tggaagagat tatgaaagaa gaacggagaa caacgaacaa cgagctaaaa 480 

tga 483 



<210> 1353 
<211> 2127 
<212> DNA 
<213> B. fragilis 



<400> 1353 

aagatggctg taaacaagaa aaatataatg acccgctact tcttcgtcat cctgttgatg 60 

ggactgatag gagtagccat tgttgtcaaa gcaggcatca cgatgtttgc cgaacgacaa 12 0 

tactggcagg atgtggccga ccgtttcgtc aaggagaatg taacggtgaa acccaaccgc 180 

ggaaacatca tttcgtccga cggcaaactg atggccagtt cgctgccgga ataccgtata 240 

tatatggact tcaaagccgg tggagtaaaa aaagacacca tgctgatgaa tcatctggac 3 00 

gagatatgcg aaggacttca taaaatattc cctgataaaa gcgcttcgga atttaagact 3 60 

caccttaaga aagggcgcaa acagggaagc cgtaactatc tgatttatcc gaagcgtatt 420 

tcatatattc aatataaaga agctaaacgc cttccggtgt ttaacctcaa caaatacaaa 480 

ggcggattcc atgaattggc ttataaccaa agaaagaaac cttttggttc acttgccgcc 540 

cgtacgttgg gtgacttata tgccgatacg gcccagggag ctaaaaatgg tatcgagttg 600 

gcttttgatt ctatcctcaa aggacatgac ggaattactc accggcaaaa ggtgatgaat 660 

aaatacctga acattgtgga tattcctccg gtagacggtt gtgacctgct ttctaccatc 72 0 



544 



gacgtaggca tgcaggatat ctgcgagaag gcattgaccg ataaactaaa agagctgaat 780 

gccagcgtag gtgtggccgt attgatggaa gtggcaaccg gcgaagtaaa agccattgtc 840 

aacatgacga aagccggaga tggcaattat tacgaaatga ggaataacgc tatcagcgat 900 

atgctcgagc cgggatcaac atttaaaaca gcttctatca tggtggccct tgaagatggc 960 

aagatcactc cggaagacgg tatagatacg ggaaacggta tcaagatgat gcacggtcgg 102 0 

cccatgaaag actggaactg gtataaagga ggatatggct acctgacggt tacgcaaatt 1080 

ctggaagtat cttccaatat aggaacttcg agcattatcg aaaaatatta tggaagtaat 1140 

ccgcaaaagt ttgtcgacgg actgaaacga atgagtatcg accagcccct ccaactgcaa 12 00 

atagcaggag aaggcaaacc caacataaaa ggtcctaaag agcgctattt tgcaaagacc 12 60 

actctgccat ggatgagtat cggctatgaa actcaggtac ctcccatgaa tatactgaca 1320 

ttctataacg ccattgccaa caacggagtt atggtacggc cgaagtttgt gaaagcagcc 13 80 

attaagaacg gagaaatagt gaaagagtat cctacggaaa tcatcaatcc gaaaatctgt 1440 

tcggagcgga ccttgaagca gattcaggaa attctttata aggtagtaca cgaaggtctg 1500 

gctgctccgg caggttccaa gcaatttgcc gtttcgggta aaactggtac ggcacagatc 1560 

tcacaaggtg ccgccggata taaatcggga cgggtgaact atctggtcag cttctgcgga 162 0 

tatttccctt cggaagctcc gaaatacagc tgcatcgttt ctatacagaa accgggactt 1680 

cccgcttcgg gaggtttaat ggcaggtagc gtattcagca aaatagccga aagagtgtat 1740 

gccaaagatt tacgcttgga catcaggaat gcaatcgata ccaatacggt agtgattccc 1800 

gatgtaaaag caggcgaaat gatagaagca agacaagtat tggaaggcct aaacatccag 1860 

acacaggctg aatttaaggc taaaaagaac aaagaggtgt ggggacatgc acaggcagcc 192 0 

cccaaagcag ttatcctgca gggaaaagaa caattacgca actttgtgcc cagcgtaata 1980 

ggtatgggtg ccaaagacgc tgtatacctg ctggaaagta aaggattgaa agtaaccctg 2 040 

tcgggagtcg gcaaagtaaa gagccagtcg ttgccccagg gaactaccat caagaaggga 2100 

caaaccatca gtatccatct gaattga 212 7 

<210> 1354 
<211> 1131 
<212> DNA 
<213> B. fragilis 

<400> 1354 

gcaccggatg ttattcaatc gaaaagcgta aatcgtctaa ccgtaaataa agttatgtta 60 

tactatctgt ttgaatggct acacaaactc aactttccgg gtgccggaat gtttgggtac 12 0 

acctcgttcc gtgcattgat ggctatcatc ctggcactgc ttatttccag tatctgggga 180 

gataagttca tcaatctgct gaaacggaaa cagatcaccg agacgcagcg tgacgccaaa 240 

atcgatccgt tcggcgtcaa taaagtagga gtgcccagca tggggggtgt catcattatc 3 00 

gtagcaatcc tgatcccctg tctgttattg ggaaaactgc ataatatcta tatgatactg 3 60 

atgctgatca ccaccgtctg gctgggatct ttaggatttg cagacgatta tataaagata 42 0 

ttcaaaaagg ataaagaagg gctccacggt aaattcaaaa ttatcggtca ggtgggtctc 480 

ggcttaattg tcggactgac tctatatctg agtccggacg tagtgattcg tgaaaacata 540 

gaagttcaga aatcggaaaa cgaaatcgaa gtaatacatg gcactcacga tctgaaatct 600 

acccagacca cgattcctgt cttcaaaagt aacaacctgg agtatgccga ccttgtaggc 660 

tttatgggag aacacgctca aacagccgga tggattttgt ttgtcattat caccatcttt 720 

gtcgtgacag ccgtgtcaaa cggagccaac ctgaatgatg gtatggatgg tatggcagca 780 

ggcaattccg ccatcatcgg actaacgctg ggcatattgg cttatgtatc gagccacatc 840 

gagtttgcgg gttacctgaa tatcatgtat attcccggaa gtgaggaact ggtaatcttt 900 

atatgcgcct ttatcggagc attgatcggt ttcttatggt acaatgccta tccggcccag 960 

gtattcatgg gggatacggg cagtctgacc attggaggta tcattgcggt atttgccatt 1020 

attattcaca aagaattgct aatcccgatt ctctgcggta tatttctggt tgaaaaccgt 1080 

catgcaattg gcatcacttc tattataaag ccggaaaaag aaagggtgta a 1131 

<210> 1355 
<211> 270 
<212> DNA 
<213> B. fragilis 

<400> 1355 

aacagattag tcgagatggc aaatcataaa tcatcaatca agagaatcag acaagaagaa 60 

acaagaagac ttcgtaacag atattatggt aaaaccatga gaaatgctgt tagaaaactt 12 0 



545 



cgttcaacta ctgacaaagc agaagcaact gctatgtatc cgggcatcgt taagatggta 180 
gacaagttag ctaagacaaa cgttattcat aagaataaag ctaacaatct gaaatctaag 240 
ttggccattt acatcaacaa gcttgcttaa 270 

<210> 1356 
<211> 861 
<212> DNA 
<213> B.fragilis 



<400> 1356 
aaaagaacag 
tttttaatag 
ccaaagtttg 
ctgcgcgatt 
gatgcaaagt 
gtgtccaatc 
catttcgatg 
gcaccattgt 
gtagaagccg 
cggcggcaga 
gttcagtttc 
tataatttgg 
ctggcggacg 
at tccgtggc 
gatatcgtgt 



ttaattttgc 
atattgataa 
tcgtctccta 
cgaaagataa 
tggaggtgaa 
atccgctggg 
ggaaggtgaa 
gcattcctat 
gattcaagtc 
acggagtgat 
agcgtgatgt 
ctaatttatg 
agatgctgaa 
aaacattcga 
ataaactgta 



gcaatcattt 
gatacttcag 
tctgaagaga 
agtgggagtc 
aggtctggaa 
aggacaggac 
gtatctggtg 
taataaaaca 
ggacgatcaa 
tcgcgatctt 
gattcctgtc 
caaggcactg 
aaatcgccat 
taagtcgaaa 
a 



catctaaaaa 
acgaaagctc 
attgtacatc 
gattttctgg 
aatattccga 
ggtgtttctc 
aatgatttgc 
ggcaaacagg 
ctgataatgt 
gactggaaga 
cattttgaag 
gggattaagt 
aaaacattca 
actccggcag 



gaatggctga 
cgaagcacta 
aggaagaatt 
gggcttgtct 
aagacggttt 
tcggatatat 
tgatgaacct 
caaaagattt 
ttcccgctgg 
aaacatttat 
ggcgt aactc 
ttaatattgc 
ccgtcacttt 
agtgggcaca 



cgactcttta 
caagtacata 
aaatgttttt 
tgagtttctg 
atatacgttc 
tttgggacgt 
tcatggcttg 
tcctaagatg 
actttgttcc 
cgtaaaaagt 
tgatttcttt 
tatgctttac 
cggaaaaccc 
atatgtgaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

861 



<210> 1357 
<211> 216 
<212> DNA 
<213> B.fragilis 

<400> 1357 

tgttttcact cttccggacc agtattcgtc ctccggttat tttccgtact tctggttcct 60 

ctgccggtac gggtaacatt cccaggagtc actctgttgt tatcccgtga agctgagctt 12 0 

tctctttggg acactcctct tctcacctca tctctccgtc ccgaattgga acgtatggtt 180 

accgatccga aatcggaacg gcgatgcgtg ttataa 216 

<210> 1358 
<211> 348 
<212> DNA 
<213> B . f ragilis 



<400> 1358 
ggcagcggct 
cggttcaatc 
catttcacag 
cccaaaggat 
ttttttctgg 
ggaaaggaat 



ccttcagccc 
aaatgcagtt 
cagccgtaga 
tattcagtaa 
atgcaaattt 
ttccgcatgg 



cgtggtgaag 
tcacccttac 
caaaccctgg 
cctggaaggt 
tgccctgttg 
aaaactacgg 



accgccgaac 
ctgaaagccg 
tttccggcag 
ataaagacaa 
aacagtctta 
ggtcacaaat 



tggatagtac 
aaaagaaaaa 
accaattgtt 
gtggcgaact 
aaatggaaat 
ctggttaa 



tactgtggta 
acagcaatgg 
cggttcactt 
ggcctatcat 
cgaacctgaa 



60 

120 

180 

240 

300 

348 



<210> 1359 
<211> 846 
<212> DNA 
<213> B.fragilis 



<400> 1359 

agagtgaaat caccggttcc tctattgaat ccgggtgggt atcaaacaaa tcgtgtagga 60 

ctcttgttcc ttctgcataa taaaataacc tataaaatat cagcgcttat gaaaatggct 12 0 

ctttctgatt ttgctttgcg gaagaaaggg atttacggca ttttacatgt catcatcctg 180 
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ctgctttccc tgtttctggt catcagcatc tcgatagata cgtttaaggg tatccctttt 240 

tatacccaat cggtttatat gaaagttcag ctatggattt gtgtcttatt tctgttcgat 3 00 

ttcattctcg agttgtttct ttcgaaaaat aagtggcact atcttagtac gcatttcatc 360 

tttttgttgg tggcgatacc ttaccagaat attatatcct atatgggatg gactttttca 42 0 

cccgaagtga cttatatgat tcgttttgtt cctttggttc gaggcggcta tgcgatggct 480 

attgtggtgg ggtggcttac ttataataag gcttccggac tgtttgtttc ctatctgact 540 

atgttgcttg ctactgttta cttttcaagc ctggcttttt ttgtactcga acacaaggtc 600 

aatcccctgg tgaccggtta cggagatgcg ctttggtggg cgtttatgga tgtgactacg 660 

gtaggttcca atattattgc tgtcaccgtg acgggacgtg tactttcggt gttgctggcg 72 0 

gcactgggta tgatgatgtt cccgatcttt acggtttatg tcaccagcct gattcaaaaa 7 80 

aagaacaaag agaaagagga gtattataaa caattggagg cagctgacga aagtaagcca 840 

aaataa 846 



<210> 1360 
<211> 978 
<212> DNA 
<213> B . fragilis 



<400> 1360 

cagctcctac cattggattg ggagatacgt tgcaaagacc gttttttgcc agttggatgc 60 

aacgcctcat gtatttttct tctttcatct tttaggctga tagctatatt atttgtactt 12 0 

ttgcacccca aaataaggag aatgaaccac gtcaccactt atatccgcca ggctttacac 180 

gatatttatc caccgggaga actcaggagt ctcacaaaaa tcatttgttg tgatctgctg 2 40 

ggtcaggatg ctattgatta ttatctgggc aaagatataa cattatctgc aaacgagcag 3 00 

tgtgatttag aaagcattgt cgaacgattg aagaaaaacg agccgatcca atatattcag 3 60 

ggcgaaacct gtttttatgg gtctatgttt cgggtagctc cgggtgtgtt gattcctcgt 420 

cctgaaactg aggagctggt tgatctggta gtgaaagaag ctgcaaccgg tacccgtttg 480 

ctggatatag gaaccggtag cgggtgtatt gccatcagtc tggctaaaca tattccgcag 540 

gctgtggtca ccgcatggga cgtatcggaa gaggctcttg ccattgccgg ggagaataat 600 

cgggaattga aggccggagt gcattttgag aaaatggatg ttctgtctgc agaacctgtt 660 

ggtgatgatc aatatgatat gattgtcagt aatcctcctt atgttacaga gagcgaaaaa 72 0 

aacgaaatgg aacccaatgt gttagattgg gagcccagac tggccctttt tgtgccggac 7 80 

aatgatccgt tgcgctttta tcggcgtatc gcatctttag gaagaaaaat gttacgcctg 840 

cacggcaggc tctattttga gatcaatcgg gcttatggtg aagaggttct ccaaatgctt 900 

cacgaacaag ggtacgaaga actccgtttg ataaaagata tatcgggtaa tgatcgaatt 960 

gtaaccgcca aacgatga 97 8 



<210> 1361 
<211> 576 
<212> DNA 
<213> B. fragilis 



<400> 1361 

tcactaaaca ctaaaattga tatggacgac caaattaaac aaatagcaga acgtctgcgc 60 

ggattacgtg acgtactcga actgacggcc gaagacatcg cccgtgattg cgagatttca 12 0 

gcggaagaat atcgcctcgc agaaacggga gattacgaca tttcagtgag tatgctgcaa 180 

aaaatcgcac gtaaatacgg aatcgctctc gacgctctga tgtttggcga agagcccaag 240 

atgagtagtt acttcctgac ccgtgcagga aaaggaacca gtattgagcg cacaaaggct 3 00 

tataaatacc agtcactggc agcaggtttt atgaaccgga atgccgaccc gttcattgta 3 60 

actgtcgaac ccaaacccga catcgagccg atacactata acagtcatag cggacaggaa 420 

ttcaacctgg tacttgaagg ccgcatgatg atcagtatag atggaaaaga cttgatatta 480 

aacgaagggg acagcctgta cttcaattca aaactacctc atggaatgaa agcactcgac 540 

gggaaaacag tacgtttcct ggcagtaatc atgtaa 57 6 



<210> 1362 
<211> 185 
<212> DNA 
<213> B. fragilis 



547 



<220> 

<221> unsure 

<222> (166) , (167) , (168) , (170) , (172) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 1362 

tatggagtaa cggtggtgcc aacgaagtat ggcgcggacg actcaataaa tttggttttg 60 

aaagacagaa caacaactaa cgaaatgtcc tataaagaac aaatagattt aaaccggata 12 0 

cctaagcatg tagtcgtcac cgccgagtgc aatgcatgct ccattnnncn cnggcctgcc 180 
tcccc 185 



<210> 1363 
<211> 927 
<212> DNA 
<213> B.fragilis 



<400> 1363 

caagcagatg cggcaatatg tttttcggcg gctaatcatt cttacgtctg ttttttattt 60 

atctttgcag taaacagaat gaagattatg agcattgaat taggaaaatt caaccagctt 120 

gaggtagtca agcaggtcga tttcggtatg tatctggatg ggggagaaga gggagaaatc 180 

ctgttgccca cccgctatgt acccgaagat tgtaagttgg gagactggtt gaacgtcttc 240 

ctttatctgg ataatgaaga acggttaata gctactacat tgacaccttt ggtacaagta 3 00 

ggggagtttg cctgcctgga agtatcgtgg gtcaaccagt tcggagcttt tcttaactgg 3 60 

ggattgatga aggatctgtt tgtccctttc agcgagcaga agatgaagat gcaggtaggg 42 0 

aataaatacg ttatccatgc ccatattgat gatgaaagtt tccggatcgt agcttcggcc 480 

aaagtagacc gttacttatc taaagagaaa gcttcttatc agcctggtga agaagtgaac 540 

atccttatat ggcagaagac agacctcggg tttaaggcta ttattgagaa tatgtatagc 600 

ggcttgctgt atgatagtga aatatttcag actttacata ccggcgatgt actgaaagca 660 

tacgtcaagc aggtacgcga agatggcaag atagatctga ttctccagaa gccgggcttt 720 

gaaaagatag atgatttttc aaagacactt catcgctaca tcacagagca tgggggatgg 7 80 

attggactta cagataagag tcctgccgag gagatttatg acacgttcgg tgtcagtaag 840 

aagacattca agaaggccgt tggcgatttg tacaagaagc gtctgattct tcttcatgaa 900 

gacggcatcg agttggtacg tccctaa 927 



<210> 1364 
<211> 213 
<212> DNA 
<213> B.fragilis 



<400> 1364 

agttgtccgg ccgccgatta cccgcatcgc tgccgtgtaa tattcacatg gaacggaaag 60 

caatatgctg cctatacctt tgtctttcat agccacttcg atagatgcgg ctttcaggag 120 

gcttttgaca cttgctactt cccatggact gcccgcgaac acttcaatct gagaaatctt 180 

tctgcttact atcatttgct cttaattgtt taa 213 



<210> 1365 
<211> 1374 
<212> DNA 
<213> B.fragilis 



<400> 1365 

tatgttatct atgatacaat attaatagct atggcacaga aactttggga aaaatcagtt 6 0 

gaggtaaata aggatataga gcgatttacc gttggacgtg accgtgagat ggatctttat 12 0 

cttgcaaagc atgatgtact tggttcgatg gctcatatca cgatgctcga aagtatcgga 180 

ttgctcacaa aggaggaatt agctcagttg ctgaccgaac tgaaagatat atatgcttct 240 

gcggagagag gcgagtttgt aatagaagaa ggagttgaag acgtgcactc gcaggtagaa 300 

ctgatgctta cgcgtcgttt gggtgatgtc ggtaagaaga ttcatagcgg gcgttctcgt 3 60 

aatgatcagg tgttgcttga tctgaaactt ttcactcgta ctcagatcag agaagtagca 42 0 

gaggctgtag agcaattgtt tcatgttctg attcgtcaaa gtgagcgtta caagaatgtt 480 
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ctgatgccgg gttatactca tttgcaaatt gcgatgcctt cttcgttcgg gctttggttt 540 

ggagcgtatg ctgagagttt ggtagatgat atgcttttct tgcaggctgc ttttaagatg 600 

tgcaataaga atcctttggg ctccgctgcc ggatatggct cttcattccc gctgaaccgc 660 

acgatgacta cggaattgct gggattcgat tctttaaact ataatgtagt gtatgcccag 72 0 

atgggacgtg gaaagatgga acgcaatgtc gcttttgcct tggctacgct tgcaggaacg 780 

atttctaaat tggcttttga tgcttgtatg ttcaatagcc agaattttgg tttcgtgaag 840 

ttgccggatg aatgtacaac cggatcaagc attatgccac ataaaaagaa tccggatgtg 900 

ttcgaactga cacgtgccaa atgtaataag ttacaatcgc tgccgcagca gat tat gat g 960 

attgccaata atctgccttc cggatatttc cgtgatttac agattataaa ggaagtcttt 102 0 

ttaccggctt ttcaggagtt gaaagattgt ctgcagatga ctacctatat catgaatgaa 1080 

attaaggtga acgagcatat cctcgatgat gataaatacc tttttatttt tagtgtagaa 1140 

gaggtgaatc gcctggcacg tgaaggtatg ccattccggg atgcttataa gaaagtaggg 12 0 0 

ctggatattg aagccggtca cttttcgcat gacaagcaag tacatcacac ccatgaagga 1260 

agtattggca atttgtgtaa tgatgagatt tccgcattga tgcaacgtac catcgagggt 132 0 

ttcaactttc aaggtatgga acaggcggag aagaccttgt tggggcgtaa atga 137 4 

<210> 1366 
<211> 486 
<212> DNA 
<213> B.fragilis 

<400> 1366 

tcgaattgta accgccaaac gatgaataca ataacagaag aagaagcgtt aaatcgcatg 60 

gctgcctatt gttccgcagc cgaacattgt aaagccgaag tgaatgaaaa actccagaaa 12 0 

tggggcttac cttatgaagt gattaaccga atcatcgatc gtcttgttgt cgagaagttt 180 

attgatgaag aacgttattg tagagcgttt gtcaacgata agttccgttt tgccaaatgg 240 

ggtaaaatga agattacaca agctctgtat atgaaaaaaa ttcctcgtga ggtaacttac 3 00 

aggtatctga atgacattga ccgggaagaa tatcttgcga ttttaggaga tctgatagca 3 60 

gcaaaacgta aaagtataca tgccaaagat gaattcgagc tgaatgggaa attgattcgt 42 0 

tttgccatga gtagaggatt tgaaatggac gatatccgtc gctgtgtgca ggtagaagaa 480 

gagtaa 4 86 

<210> 1367 
<211> 1248 
<212> DNA 
<213> B.fragilis 

<400> 1367 

aaaaagaaca agaaagaaat aacatttaaa atatacagca ctatggaaca gaaaaagaaa 60 

gtagtggttg cattcagcgg aggcctcgat acctctttca ctgtaatgta cctggccaag 120 

gaaaaaggat atgaagtgta tgcagcgtgt gccaacacag gtggcttcag cgaagaacaa 180 

ctgaaacaga atgaagagaa tgcctacaaa ctgggtgctg tgaaatatgt cacactcgac 240 

gtcactcagg aatattacga aaaaagtttg aaatatatga tattcggtaa cgtactacgt 3 00 

aacggtacct atcctatttc tgtcagctcc gaacgtattt tccaggcatt ggccatcgca 3 60 

cgctatgcga aagagattgg tgcggaagcc attgcacacg gttcgacagg agccggtaac 420 

gaccagattc gtttcgacat gacattcctt gtcatgactc cgggcgtaga aattattacg 480 

ctgacccgcg atatggcact cagccgtcag gaagaaatcg actacttgaa caaaaatggc 540 

ttcgaagcag actttacaaa actaaaatac tcttataatg tcggactatg gggtacttca 600 

atctgcggcg gagaaattct agacagtgca caaggattgc ccgaaacggc ctacctgaag 660 

caagtaacga aagagggaag tgaacttctg cgccttgaat ttaagaatgg tgaacttcac 72 0 

gccgtgaatg gagaagtgtt cgaagataaa attgccgcca tccaaaaagt ggaagagata 7 80 

ggtgctgctt acggtattgg ccgtgatatg catgtaggtg atactatcat cggtatcaaa 840 

ggacgtgtag gattcgaagc cgccgctcca atgttgatca tcggtgcaca ccgtttcctt 900 

gagaaataca cattgagcaa atggcaacaa tattggaaag atcaggtagc taactggtat 9 60 

ggtatgttcc ttcatgaaag ccaatacctg gaaccggtga tgcgtgatat cgaagcaatg 102 0 

cttcaagaat cacaacgtaa tgtgaacggt acagccatcc ttgagctccg tccgttgtca 1080 

ttctctactg tcggtgtaga atcacaagac gacctggtaa agaccaagtt tggtgaatat 1140 

ggagaaatgc aaaagggttg gacggccgaa gatgcaaaag gcttcatcaa ggtgacttct 12 0 0 

accccactac gtgtttacta tgctaaccat aaggacgaag aggtatga 1248 
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<210> 1368 
<211> 501 
<212> DNA 
<213> B.fragilis 



<400> 1368 

ggtttctcaa aagggattta ccaatacgat gaacgaattg ggattgagtg cttttgtgat 60 

ggggcccgta tcagatttgc tcccgatctc ttattggaaa atatgaagac tttgataaaa 12 0 

acggccggaa ctaattttct gattatcgac gggcatcatt gtactgagaa aactgctgtt 180 

atagagacgg taaactcaat gatgctccaa acggtggagg gtgtcatcta tctttttcca 240 

tgttggacac aaacaccggc tgcgtttacg aggcttagag caaaaggagc ctttctcgtt 300 

tctgccgatt atgacggaac gtcagtgggc ggtctgaaaa tctttagtga gaaaggaggt 360 

atatgcagac tgagcaatcc ttggagggga agaaaacttc gggtcaccga gaatggaaaa 420 

cccgtctccg tgaaagaaca aaacaatgtc tgttcattta ttacccgaaa aggaagcact 480 

tatacgatag taggtcttta a 501 



<210> 1369 
<211> 1602 
<212> DNA 
<213> B.fragilis 



<400> 1369 

cttatggcaa atattaaaca agcagtgaaa ctaggggtat tcaccctggc gatcatgaac 60 

gtaacggcag tagtatccct acgcggactt cctgccgagg ccgtatatgg aatgagttcg 120 

gccttctatt atcttttcgc agctatcgta ttccttattc cgacatcact cgttgcggcg 180 

gaattggctg ccatgttcca ggacaaacag ggtggtgtgt tccgttgggt aggcgaagcg 240 

tacggaaaga aattgggatt ccttgccatc tgggtacaat ggattgaaag tacgatctgg 300 

tatccgactg tattgacatt cggtgctgta tctatcgctt tcatcggaat gaatgataca 360 

cacgacatga cactggccag caacaaatac tatacactgg ccgttgtgct tatcatttat 42 0 

tggctggcta ccttcatctc actgaaagga atgggatggg taggtaaagt agctaaaatt 480 

ggcggtatgg tgggaaccat catccccgct gccctgctga ttatcctggg tattgtttac 540 

ttggcatccg gagggcattc caaccttgac ttccatagca gcttcttccc cgacctcacg 600 

aatttcgata acgtggtatt agcggcaagt atcttcctct tttatgccgg tatggaaatg 660 

ggcggtatcc acgtaaagga tatgcaaaac ccttcaaaga actatccgaa agcagtattt 72 0 

atcggtgcac ttattactgt aatcatcttc gtcttgggta cattctcact aggtatcatt 780 

atcccggcca aagatatcag cctgacacag agtttacttg ttggcttcga caactatttt 840 

agatatatcc atgcatcctg gttatcaccg atcatcgcca ttgctcttgc attcggtgtg 900 

ttggcaggtg tattgacatg ggttgccggt ccgtccaaag gtatctttgc cgtaggtaag 960 

gccggttata tgcctccgtt cttccagaaa accaataaat tgggtgtaca gaaaaatatc 1020 

ctgttcgttc agggtggtgc tgttaccgta ttgagccttc tgtttgtggt tatgccttcc 1080 

gtacagagct tctatcagat cttgtcacag ctgacagtta ttctttatct ggtgatgtac 1140 

ttacttatgt tctccggtgc catctacctg cgctataaca tgaagaaagc taaccgtccg 1200 

ttccgtatcg gtaaaaaagg taacggcttg atgtggattg tcggcggcct cggcttcctc 1260 

ggttcattac tggcgtttat cctcagcttt atcccgccca gccagatttc tacaggtagc 1320 

aacacggtat ggttctctgt attgattatc ggtgctttgg ttgttgtgat tgctccgttt 13 80 

atcatttatg cagctaaaaa gccatcatgg gctgacccga atagtacttt cgaaccgttc 1440 

cactgggaaa cacaagctaa accacaagtt gctccggcaa caacaactac cgccggtccg 1500 

gcaacaagca gcgctaccac tatcggtagt acaacttctg ccccatcgac aggttccggc 1560 

tctgtttcat ccgataagga caccccacag aaacaaagtt aa 1602 



<210> 1370 
<211> 567 
<212> DNA 
<213> B.fragilis 



<400> 1370 

atgcaaagtg aagacggagc tttctacttc caccggggat tccttcctga agccatgcgc 
aaagcgttgt atcaagatct gaaagtgaaa cgttttgccc gcggagggag taccatcacc 



60 
120 
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atgcagttgg tgaaaagcgt atttctgagt cgaaacaaaa acatagcccg caaactggaa 180 

gaagctctga ttgtctggct gatagaaaca gaacgcctta cctccaaaga acgaatgtac 240 

gaagtatatc tgaatatagt agaatgggga ccgctcgttt atggagtgca ggaagcagca 3 00 

acctattatt ttaaaaagcg cccatctcaa ctgacagccg aagaatctat ttttctggct 3 60 

tccattattc caaagccgaa gcatttccgg aattcgttta acaatgatat gcaactgaag 42 0 

gagagcctgg aaggctatta ccgtttaata accgaacgat tagtgaaaaa aggaatcatc 48 0 

agtgaagtgg cagccgacag catccgcccg gaaattaatg taaccggcga ggcaaagaaa 540 

gatctgcaaa gagacagcat acaatag 5 67 



<210> 1371 
<211> 666 
<212> DNA 
<213> B.fragilis 



<400> 1371 

aaactaaacg acatgcgaaa agtaatcata actctttgtt tcttgtttgt tgcatttgtt 60 

gcacaagccg gaagaatcag tggaataaat atccaaagct caggtgaggc gattcttgtc 12 0 

tttgtggatg gcgagcaaat ctgcactccg acggagactt gtttcattgc taactattcg 180 

ggcaggcacc ggatagaagt atatgcagta cgttatatac cacgtaccgg acaaagtgtg 240 

aaaggcgact tgctgtttca ggaatgggtc tcaaatcccg gtatgaatat cagggatatt 3 00 

cgggtgggct ataatgatcg tcctgatttc tgtcccgatc gtccggtgcg tcccggctat 360 

gatgtagtga tgaaccgtac agagttcgac cgttttctga gaagtgtgaa agacaaacat 42 0 

ttcgactcag accgtaacaa gctgattgaa actacacttg tttcgacagg cttcacttcc 480 

gaccaatgtc tccaattagt aaatctgttc agtttcgata gtgaaaagat aaaactgatg 540 

caggctatgt atccacggat tgttgataaa cccaatttct atctggtcat cgaaagcctc 600 

acttttcagt cggataaaaa caagatgaac gaatttgtga gaaaatacca taatcaacgt 6 60 

aactaa 666 



<210> 1372 
<211> 1044 
<212> DNA 
<213> B.fragilis 



<400> 1372 
aagatgaaag 
aacgtatctc 
gaaggctatc 
aaggatctgt 
catggaaaaa 
atcggatgcc 
gccggatgcg 
tttatcactt 
gatggtttca 
acttccatgc 
accgcactac 
cgaatagtga 
gtatctacgc 
acactcaatt 
ctacagtcgc 
atatgggatg 
cctgaaataa 
cattacttga 



aagaaaaata 
ccaatccaat 
acatccgttg 
ctttactgaa 
ctcccccatg 
aagacccatt 
aagtcattgt 
tccataccct 
t cgacctgga 
tggtacacaa 
tggacaatcc 
tggaccgtaa 
tcgtttttac 
accagacaga 
tgatgataga 
aagtcatcat 
gcgataaaat 
agagaaatac 



catgaggcgt 
ggtaggagct 
cggagaagca 
acacagtact 
cgctgattta 
ttccaaagta 
cggagttctg 
tcaccgccct 
acgtacggaa 
aaaaagagca 
ggcactcacg 
tcattcactc 
ggaacatccc 
tattctgcca 
aggagcjciagg 
agaaaagagc 
tagttattcg 
ctaa 



tgcatccaac 
gtcatcgtat 
catgccgaag 
atatatgtaa 
atcatagaga 
gcaggcaaag 
gaaacggaat 
tacatcgttt 
ggacaacctg 
gagtcggacg 
gtacgcaact 
cctcaaacct 
cgtgccggaa 
caaatattgt 
attcttctgg 
gataaactgc 
gaagaaaaac 



tggcaaaaaa 
gtgaaggaca 
tcaatgcgat 
gtctcgagcc 
aacaaattcc 
gaatccaaaa 
gtcgcgaact 
tgaaatgggc 
tcatattatc 
ctatcatggt 
ggcacggaca 
cccatttgtc 
aagaaaacct 
ctgccctcta 
agtcatttat 
tttattccgg 
atttctgtac 



cggtctttgc 
aataatcggt 
ccgctctgta 
ttgctcccac 
taggattgta 
gttacgggat 
tatacggaaa 
agaatcagcc 
gactcctctc 
cggtacgcga 
caatccggtg 
ggataacagc 
ggaatacatc 
tcaacgcaac 
ccgttccgga 
tgttaaagca 
gaccttcagg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1044 



<210> 1373 
<211> 759 
<212> DNA 
<213> B.fragilis 



<400> 1373 
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gatatgaaac gaatattgat attctttttg gtcataggaa tcactgctat aagcagtgtg 60 

agtatggcag ccatgagcaa tagccgcatt cgcaaggaga ctcgtttcct gaccgataag 12 0 

atggcctatg aactgaacct gagcacaggg caatataatg atgtatacga aatcaattac 180 

gattttattt actccattcg ttatctgatg gacgatgtga taaggggaga agagtgggca 240 

ctcgataaat actatcgtac cctggacatt cgtaatgatg atttgcgttg ggtgctgact 3 00 

gcttcacagt atcgccgttt tataggggtc gattattttt atcgaccggt ttatgccagt 360 

ggtggcagtt ggagttttcg tatctatatt cggtatacaa accataatca tttctacttt 42 0 

ggcaaaccgt accactataa cagctattgc ggtggacact atcgtactca ttatcataac 480 

agctattatc gcggacgcta tcgacatgat ttctattcgg gttcgcacag tataagagat 540 

catcgaaatt ataacacgca tcgccgttcc gatttcggat cggtaaccat acgttccaat 600 

tcgggacgga gagatgaggt gagaagagga gtgtcccaaa gagaaagctc agcttcacgg 660 

gataacaaca gagtgactcc tgggaatgtt acccgtaccg gcagaggaac cagaagtacg 72 0 

gaaaataacc ggaggacgaa tactggtccg gaagagtga 759 

<210> 1374 
<211> 492 
<212> DNA 
<213> B.fragilis 

<400> 1374 

ttaaaatata gaaagaagag ccattcggac agaatctgtt cggattgttc ttcttttttc 60 

tttataagta agataaataa gatgactaaa tttgaaagta gtgtcaaggt gataccttat 12 0 

agccaggaac gtgtgtacga gaaacttgcc gatcttagta acctggaagc tattaaagat 180 

cgtttgcccg aagacaaagt gaaaaatatg agtttcgata ctgatacact tagtttcaat 240 

gtggatcctg taggacaact gaccctgaga attattgaac gggaacccag taaatgtatt 300 

aagtttgaga ctaccaattc gcctctacct tttaatatgt ggattcagct tgtggctgta 3 60 

tccgaagaag aatgtaaact aaaggtaact attgggctgg aaatcaatcc gtttatgaaa 420 

gcgatggtac agaaaccttt gaatgaagga ttggaaaaga tggctgatat gttatctatg 480 

atacaatatt aa 492 

<210> 1375 
<211> 981 
<212> DNA 
<213> B.fragilis 

<400> 1375 

ggacgaagag gtatgataaa agcaggaatc attggtggag caggatatac agcaggcgaa 60 

cttatccgcc tgcttatcaa tcatcccgag actgaaatcg tatttatcaa cagtaccagt 12 0 

aacgccggaa acaaaattac tgatgtacac gagggacttt acggagagtg tgacctggct 180 

tttacagacg aacttccgtt ggaagacatc gatgtactgt tcttctgtac agcccatggg 240 

gatacgaaga aatttatgga aagccataat atcccggagg aactgaaaat tatagacctt 3 00 

tcaatggatt atcgcatagc ttcaccggat catgacttca tatacggtct gccggaacta 3 60 

aatcgtcgtg caacctgcac agcaaagcat gtggctaatc cgggatgttt cgcaacttgc 42 0 

atccagctgg gactgctccc actggcaaaa cacctgatgc taaatgagga cgtaatggta 480 

aacgccatta caggaagcac gggagcggga gtaaaacccg gtgcaaccag tcatttcagc 540 

tggcgtaaca acaatatgag tgtatacaaa gctttcgaac accagcacgt tcctgaaatc 600 

aagcaatcgc tgaaacaact ccagaacagt tttgatgcgg aaattgattt tatcccttat 660 

cgcggcgatt tcccccgcgg catctttgcc actttggtag tgaaaacaaa agtagcattg 72 0 

gaagagatcg tacgcatgta tgaggaatat tatgccaaag attcgtttgt ccacatcgtt 780 

gataaaaaca tagatctcaa acaggtagta aataccaata aatgtctgat tcacctggaa 840 

aaacacggcg ataaattact gatcatttct tgcatcgaca atttattgaa aggtgcatcc 900 

ggacaggctg tccacaacat gaacctgatg tttaacctgg aggaaacggt aggcctgcgc 960 

ctcaagccct ctgcattcta a 981 

<210> 1376 
<211> 687 
<212> DNA 
<213> B.fragilis 
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<400> 1376 

ctaaaaccta taagaattat gaaaaaaata catgtttcgg cgatattaat cttgcttgtt 60 

gttatgagta gttgtgcagg cttgatctta aacttcaaaa acagtcagct aatgagtatc 120 

cagaaaggaa tgacacaaca ggaagtgaag gcgattcttg gaaagcccaa ttacagacgc 180 

tttgatggag caatggaaga gtgggaatat cgcgggtatc tttccaaagc agggcattca 240 

gtgatttgtg ttaactttat cgacaaccgt gttgttgggt tggattcgtt tagagacggt 3 00 

gcaccgactg ctcctcctgc cccttccttt tctttaggca taggtggtac agtcactgct 360 

tcggacatag ctcccgcttg tgactataga gccatgagaa acgatgagtt tgcccgcttt 42 0 

ttaaatgatg taaagagtaa aacttttgat tcggaccgga cagatttcat tgagaaagca 480 

acccgctcta ccggatttac atcagagcaa tgctgcagat tgataaaact ttatagcttt 540 

gatgatgatc ggactaaggt actgaagata ctttatccga gcgtagtgga taaagataat 60 0 

ttttccgcag caatagacgg attggatttt ctgtcgaatc aggatacggt gaagaacttt 660 
gtgaggaact ataatagaat taaatag 687 

<210> 1377 
<211> 783 
<212> DNA 
<213> B.fragilis 

<400> 1377 

ttatataata tgaaagtagc tatcattggt gcaggaaata tggggggctc cattgcctgc 60 

ggactggcaa aaggtaagct gattccggct tccgatataa tagtatccaa ccccagcatt 120 

ggaaagctgg aagcactaaa gaaagaattt ccttctattg ctatcactcg caataatgct 180 

gaagctgcta ctggtgctga tatcgtgatt ctagctgtga aaccttggct gatcagaggt 240 

gtactccgag aaatgaaact aagaagcaaa cagattctgg tctctgttgc cgccggtatc 3 00 

agtttcgaac aattggctca tgatgtagta gaacctgaaa tgccaatgtt ccgtattgtc 3 60 

cccaacacag ctatcagtga attacagagc atgacactga ttgcttcgcg aaatgccggc 42 0 

caagaattag aaacactgat ggtcaatcta ttcagcgaga tgggtatggc aatgattttg 480 

cctgaagaca aattggaagc ggctaccgcc ctgacttcct gcggtatcgc ttacgtgctg 540 

aaatatattc aggctgccat gcaagcgggc atcgaaatgg gaatccgacc atcggatgcc 60 0 

atggatatga ttgcccaatc tgtaaaaggt gccgccgaac tgatactgaa caatgacacc 660 

catccaagcg ttgagatcga caaagtgact acacccggcg gaattaccat taaaggcatc 72 0 

aacgaactgg agcataatgg attcacctct gccatcatta aagcaatgaa agcatcaaga 78 0 
tag 783 

<210> 1378 
<211> 693 
<212> DNA 
<213> B. fragilis 

<400> 1378 

tctcgttcca tttccgtatt tttgcatcct gtaattatca tagttagttt agttatgaaa 60 

aatttagaga gattattcgc cgagaagttg ttgaagatta aagctattaa gcttcaaccg 12 0 

gcaaatccgt ttacatgggc ttccggatgg aaatcaccgt tttactgcga caatcgtaaa 180 

accctttctt atccttctct tcgtagtttt gttaagttcg agattacacg tttggttctg 240 

gaacgtttcg gacaggtaga tgctattgcc ggagttgcga cgggggctat ccctcaaggg 3 00 

gctttagtgg ctgatgcatt gaatcttccg ttcgtgtatg ttcgctctac cccgaaagac 3 60 

catggtctgg aaaatcttat cgaaggcgaa cttcgtccgg gaatgaaagt cgttgttgtg 42 0 

gaagatttaa tctctaccgg tggaagcagt ttaaaagctg tagaagctat tcgtcgggat 480 

ggttgcgaag ttattggtat ggtagctgct tatacttacg gatttcctgt tgccgaacag 540 

gcctttaaag atgctaaagt gcctttggta acattgacta attatgaagc tgtgttagat 600 

gttgcacttc gtaccggtta tattgaagaa gaagacattg caacgttaaa cgaatggcgc 660 
aaggatccgg ctcattggga aaccggaaaa taa 693 

<210> 1379 
<211> 1377 
<212> DNA 
<213> B.fragilis 
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<400> 1379 

ataaacacaa tctttcaaat gagaataaag tttttatcag taattgtaag tttcttcctt 60 

gtatcgtttg ccgttacttc gtgccttgac acagaagaaa ttgaatatag cccggatgct 12 0 

accatacacg catttgcact tgacaccata catggggtaa actataaatt tacaattgac 180 

caacttggtc ccgatggagt aggacttatt tataaccagg attcactacc tgtaggctcc 240 

gatacaatta ttgaccgtat tcttatcaag acactgacca caacttccgg aataatcact 3 00 

gccaaaaacg cagagggtca ggatactctg ttcaactatt ccgattctat cgacttcaga 3 60 

ggcactatgc aaaaaccgat gagaataaaa gtgtgggctg ccgacatgca atataccaaa 42 0 

gagtatacta tttcggtacg tgtacatcaa caggacccgg attccatgaa ctggaccaaa 480 

atgacagata acttcgcaaa ctatagcgga tatcagaaat cagttaccct gaatgaagat 540 

ctgttgatct atacatcgaa tacgactgca taccaatcat ccggagatgt tatcagtaaa 600 

ggaagaagct ggacaccagt atccataaca gggcttccgg acaacatcaa gctttcctcc 660 

attatttctt tcggcggaaa actatatgcc acaaacggtg aaagtgcata cgtttcatct 72 0 

gatggagcat tatggaatgt tgcaaccgat ttgaataaaa acggtaaagt agagatgctg 780 

atcgcccctt tcccgaaaaa tgaaggtaat ctgttgggta tctccggaat tgccggtatt 840 

attaataatg gtgatcaatc tacatttgcc ataactaatc ctgaagcaac agcgtggaac 900 

attggttccg aaacagtagg tgcggacttc cccttggaga atttgtctgc aacttcttac 960 

ctgacagcaa caggaatcca gacaatagcc gtaatgggta acaatcgtaa tgcaaacgat 102 0 

acgacttcca tcgcatggac ctcacaagac ggtttgcttt ggataccttt aaaaacttcc 1080 

tcgagtaccg cctattgtcc gaaactggac aatccgtctt tcttctatta tgataatgct 1140 

tttctggcat tcggaggaaa ttttgaaacg atctatacat cggaagcagg tattgcctgg 1200 

tataaagcca acaagaaaat cttcctgccg gccgaattca aagacagagg aaacaattac 12 60 

tcaactgtag tagacaaaaa taactttatc tgggtaatat ggagtaacgg tggtgccaac 1320 

gaagtatggc gcggacgact caataaattt ggttttgaaa gacagaacaa caactaa 1377 

<210> 1380 
<211> 612 
<212> DNA 
<213> B.fragilis 

<400> 1380 

ctaacaacaa ttatactcta tactaaatca acaatggaca ctcaacaaat agatgttatg 60 

gtagccgacg cctcgcatga ggtttacgtt gacactattt tggagacaat cagaaacgca 12 0 

gcaaaagtac gcggaaccgg aatagcagaa cgtacacacg agtacgtagc caccaaaatg 180 

aaagaaggaa aagcaatcat agctctttgc ggagatgtat ttgccggatt tacctacatt 240 

gaatcatggg gaaacaagca atacgttgct acttcaggat tgatcgtaca ccctgacttc 3 00 

cggggattag gactggccaa acgtatcaaa caagcctctt tccaattggc tcgtttacga 3 60 

tggcccagag ctaaaatatt cagtctgacc agcggcgcag ccgtgatgaa aatgaatacg 420 

gaattgggat atgtaccggt cacttttaac gagctgaccg acgacgaagc cttttggaaa 480 

ggatgtgaag ggtgcataaa ccatgaaata ctgatggcga aggaccgtaa attctgcatc 540 

tgcaccgcta tgctatatga tccgacagat ccgcataaca taaaaaaaga acaagaaaga 600 

aataacattt aa 612 

<210> 1381 
<211> 1134 
<212> DNA 
<213> B.fragilis 

<400> 1381 

gaagaaacaa tgaacttatt tgatgtatat ccacttttcg atataaacat aataaaagga 60 

aagggttgtc acgtctggga cgagaatgga accgaatacc tggatcttta tggaggccat 120 

gccgttatct ctatcggaca tgctcatccc cattatgttg acatgattag caagcaggtg 180 

gcaaccctgg gcttctattc aaactccgta atcaacaagc tgcaacaaca ggtagccgaa 240 

cgtctcggaa aaatatccgg gtatgaagat tattccctat tcctgataaa cagtggtgcc 300 

gaagcaaatg aaaacgcgtt gaaactggct tcgttccata acggacgtac caaagtgatc 3 60 

tcttttggga aagcttttca cggacgcact tcactagcag tcgaggccac agataatcct 42 0 

aaaattatag ctccgatcaa tgccaacgga cacatcacct accttccgct aaatgacata 480 

gaggctgcga aaaccgaatt ggcaaaagaa gatatttgtg ctgtcatcat tgaaggtata 540 

cagggagttg gcggtatcaa aataccgact cccgaattcc tgcaagagct ccgcaaagcc 600 
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tgtacagaac acggaacaat cctgattctg gatgaaatac aaagcggtta cggacgtagt 660 

ggcaaattct tcgctcacca atatgccgga atcaaaccgg atattattac agttgccaaa 72 0 

ggaatcggaa acggattccc gatggccgga gtactgatca gtcctatgtt tacacccgta 780 

tatggcatgc ttggaacaac tttcgggggc aatcatctgg cttgctcggc tgcattggca 840 

gtaatggatg tcatcgaaca ggagaattta gtagaaaatg cagccaacat cggctcctat 900 

ctgctggaag aactgaaaaa attcaaggaa atcaaagaag tgcgcggatg cggattaatg 960 

atcggaatgg aatttgacca accggtaaag gaaatccgga gccgcctgat ccacgaacaa 1020 

aaagtattta ccggtgccag cggtacgaac gtaatccgat tgttgcctcc tctctgcctc 1080 

agcaaagagg aagccgatga attcctcgcc cggctaagaa aagtactcgg ttaa 1134 

<210> 1382 
<211> 1242 
<212> DNA 
<213> B.fragilis 

<400> 1382 

aagtatatga tgatctacat tatctttagc gtctttataa tcattattct ttttatttgt 60 

gccagatatt ggtatttgtg gagaaagata agtgtgcaaa agaacgaatg ggtggctcaa 12 0 

accaaggaat cagatacgat tttacgtagt atgaacgctt gctttatctt gataaatagt 180 

gacttggtgg tgataaggac caattattat gatttgagtg gaatctcaga agagcctgag 240 

tcctccggta gagtcggtga tctactgaat tgtaaaaacg ctgttcgtag tggcggagga 300 

tgcggggcac ataaaaattg cgaaaattgc atgattcgcc atactattga gaatgcattc 3 60 

tgccataaaa agggttttca taaattagaa gcatccatga ggctgctcag ttcggatcat 420 

caacagatta ttccttgtga tgtttctgtc tcgggtactt acttgaataa tgaaggtcac 480 

gaacagatgt tactgactgt ctatgatatt actgaattga agaatatgca gcggttgctg 540 

aatattgaaa aagagaacgc tgtttctgcc gaaaagttga aatctgcttt tatagctaat 600 

atgagtcatg aaattcgtac cccactgaat gcgattgtcg gtttttccgg tttgttggct 660 

tctgcggatg atgatacgga aaaaaagatg tatctggata ttgtagcgga aaacaatgat 72 0 

cgtttattgc agatagtgac ggatgtactc gacctttcaa aaatagagtc gggtagtctg 780 

gatttccatt attccgaatt tgatgtaaac gatttattat gtgctctgca tggcatttta 840 

aacatacgtc tgaaagacaa gcccgaaatt aaactgattt gtaaggcagg aacagatgaa 900 

tggatcattt attccgagca acatcgtatc gtacagataa tcacaaatct ggtacataat 960 

gcgatgaagt ttacccacag tggggagatt tgtttcggat gtcgtcccca aggagaggac 1020 

gagatttact tttatgtttc tgatacaggt attggcattc ctgccggaga acaggataaa 1080 

atattcgacc gatttaccaa attggaccat gaagtacccg gaacagggct gggactgact 1140 

ctttcacaaa ctatcgtaca gaatctgggg ggagaaatgg gagtcgaatc ggaagtgaca 12 00 

aaaggatcta ccttctggtt tacccttcct ttgaaatcct ga 1242 

<210> 1383 
<211> 1980 
<212> DNA 
<213> B.fragilis 

<400> 1383 

ttatcagtga tttgtattac tgagaagtat aatcgactgt taaacacaca cgcattaaat 60 

atgaagttaa agtttctctt tctgtttcta agctatagct tatcgataca ttcgcaaaat 120 

aacttattgt atgctgattc gtcaaaggat tcgctcttct ttgaacaaaa aaaggcactg 180 

cttacggcaa catggcataa acctttcagt tatagcagta ttcagagcaa tcatgctcct 240 

ttaggcccct atatggggaa tggagatgta ggggtagtgg catttacatc ggacaatagt 3 00 

cagacattaa agatatcgaa agtggatttt gttacggatg gctggacaga ctgggcaggc 3 60 

agtggtccgg ctgctttgcc cattggcggg gtgaatatca ctgtaaactc tccggtatat 42 0 

tccggctttg tcacagtcaa ccgggcggac ccttccggat ttagttatca gatggaccag 480 

ttaaatagtg agttacgaat gactactgct actgctcagc aggttaaaat ggtcacttgg 540 

atgggagtga atgaaaatat gataataact gagttgacca cctcttcaaa aactccggtt 600 

cctatttcgg tagatactta tgctgataat caatctgctt cttatacaac tacagcacag 660 

gtgaatggac aaatagctca agtcactcgg caaacgaaga ctgatgcggt gagatggatt 72 0 

tcttgtgccg gtatatccac taaaatagtg ggagttatgt ctaaaccgga atgcctgtcc 780 

gagtcgatgg tccgatcaaa ttttcagctg acagcttccg atacgcttct cgttgtggta 840 

tatgtttcgg gaggtggaaa aggaaatgat ccgcaactgc caacggccta taacaaactg 900 
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ttgactttaa ataaagcgga tgtgactcaa ttgaaaatgg cgaaaaaagc atggtggaaa 960 

gatatgtgga cccgttcgta tgtggaaacg aatgacgagt tgttgaatcg tcactacttg 1020 

tcgtccattt atttactagc ttctgcctat aacgaacatt caccggtatc cggtggcatg 1080 

tatggagtct ggaatatgga tgacaaaatg atgtaccatg gtgatattca cctgaactat 1140 

aacagtcaag cgggattcta tagtgctttt tcatccaatc gtccggaaat agccttgcct 1200 

ttctataaaa cgatagagtt gctgataccc gaaggaaaac ggcgggcaaa ggaagaaatg 12 60 

ggaattatgc atccatcatg ggaaggaaag tcttgcaggg gaatactatt tcctgtgggg 1320 

gccttgggta tcggggtgtt ttataattat tattggcaac aaaccatgaa tgctccgttc 1380 

aatgttcctc tgttcagttg gtattatgaa tacaccggtg atttgaactt tttacggtat 1440 

cgggcgtatc cttatattcg tctttgtggc gatttttatg aagactatat gcagaaggag 1500 

acatacggca aatcatatcg ttataccata acgacaggag gacacgagga ttcgtgggat 1560 

ctgaaccctc cttccgattt agcttttgtg aaacagacgt tcggtttgtt agtgaggtac 162 0 

agtaagctgt tgggagtaga tcaaaaacga cggaagaaat gggacgacat tttgtctcat 1680 

cttccggagt ataaggtgat aatgccgacc aaaacaccta atcaaggctt gcctgtctat 1740 

gcgaagaacg aggccggatg ggatttgcca agccatgcca tacagttgca tgctgcctat 1800 

ccctgtgaaa tactgaattt acattcggac tctacagcct tgcagatagc ccgaaacaca 1860 

ttgtattatt atgaggtttc tcaaaaggga tttaccaata cgatgaacga attgggattg 192 0 

agtgcttttg tgatggggcc cgtatcagat ttgctcccga tctcttattg gaaaatatga 1980 

<210> 1384 
<211> 483 
<212> DNA 
<213> B.fragilis 

<400> 1384 

ttatacaaga tgaagaagaa agcaaatcgg ttagacgcca tcaaaatgat tatctccagt 60 

aaggaagtag gttcacagga agaactcttg caagaattag gtcaggaagg atttgaactg 120 

acacaagcta ctctttcacg tgacctgaaa caactgaaag tggccaaagc cgccagcatg 180 

aacggaagat atgtatacgt attacccaac gacatcatgt acaaacgtgt cggcgaccag 240 

agtgccagtg aaatgttgat gaacaatggc ttcatttccc ttcagttttc cggaaatatc 300 

gcagtaatca aaactcgccc cggctatgcc agcagcatgg cttacgacat cgacaaccgt 360 

gaatctgaca ccattttggg aacaattgcc ggagacgata ccattatgtt ggtactacgt 420 

gaaggggcaa cgcccactgc cgtacgacat ttcctgtctc tcattattcc gaatatcaac 480 

taa 483 

<210> 1385 
<211> 1665 
<212> DNA 
<213> B.fragilis 

<400> 1385 

ataaacaaga tgatagatag atttttatct caaacttcct tcaattcaca ggaagacttt 60 

gtgaaaaact taaagataca cgttccggac aacttcaact tcggatatga catagtggat 12 0 

gcctgggctg ccgaacaacc agacaaaccg gccttattat ggactaatga caaaggtgaa 180 

caccaccagt tctcttttgc ggatatgaaa caatatactg accggacagc ctcttatttt 240 

cagagcctgg gtatcggaca tggtgatatg gtcatgctga tattgaaacg acgctatgaa 3 00 

ttctggttca gcatcattgc cctgcacaaa ctgggggcag tcgttattcc ggctacacac 3 60 

ctgctaacca agaaagacat tgtataccgt tgcaatgcag ccgacataaa aatgattgtg 42 0 

gctgccggag aagaagtggt caccaaacac ataatagatg ctatgccgga ctctcctact 480 

gtaaagcatt tagttagcgt agggcccgaa atacccgaag gatttgatga cttccatcag 540 

ggcatcgagc atgcggcgcc tttcgtaaag ccggaacatc cgaacaccaa cgatgacatt 600 

tcactgatgt acttcaccag cggaacaacc ggagagccta agatggttgc acacgacttc 6 60 

acttatccat tggggcatat cgtaaccggt agtttctggc acaatctgaa agaaaacagc 72 0 

ctccatctca ccattgccga caccggctgg ggaaaagcag tgtggggaaa gctctacgga 7 80 

caatggattg ccggagcaaa tgtattcgtg tatgatcatg aaaagttcac tcctgccgat 84 0 

atattggaaa agatacaaaa ttaccatgtc acttcactct gtgcccctcc caccattttc 900 

cgtttcctca tccacgaaga cctgacgaaa tatgaccttt cgtcattgga atattgcacc 960 

attgcaggtg aggccttgaa tccggctgtt ttcgacacat tcaaaaagtt aacaggtatc 1020 

aaactgatgg aaggcttcgg acagaccgag actacactga cagtagccac tttcccatgg 1080 
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atggaaccca aaccgggaag tatgggggtg cccaatccac aatacaatgt tgatctgatc 1140 

gattacgaag gacggtcggt agaagccgga gagcaaggac agatcgtaat ccggactgac 12 00 

aaaggaaagc cactgggact gttcaaagaa tactatcggg acgcctcgcg tacacacgaa 12 60 

gcatggcacg acggaatata ctacactggc gacgtagcct ggaaagatga agacggttat 1320 

ctatggttcg tgggacgcgc cgatgacgta atcaaaagtt ccggttaccg tatcggtccg 13 80 

ttcgaagtgg aaagtgcatt gatgacacat cccgccgtag tagaatgtgc cataaccggc 1440 

gttcccgacg aaatacgcgg acaggtagtg aaagctacca tcgttctggc caaagagtat 1500 

cgtgaacgca aaggggaaga cctggtaaaa gagctccaga atcatgtgaa gaaagtcaca 1560 

gctccttata aatatccacg cgtcatcgaa tttgtcgacg aattgcctaa gaccattagc 1620 

ggtaaaatcc gacgagtgga aattcggaag aatgacgaga aataa 1665 

<210> 1386 
<211> 1005 
<212> DNA 
<213> B.fragilis 

<400> 1386 

attatggaat tcttaaccaa cgaaaaactt acgattgtag gagctgccgg aatgattggc 60 

tctaacatgg cacagactgc cttaatgatg aaactgactc cgaacatctg cctgtacgac 12 0 

ccctatgcac ccgctttgga aggcgtggct gaagagttgt atcactgtgc gttcgaggga 180 

gtaaacctga cctacacttc agacatcaaa gaagctttgt cgggagccaa atacattgtg 240 

tcctccggtg gtgctgcccg taaagcaggc atgacccgtg aagatttact gaaaggtaat 3 00 

gcagaaatcg ccgcccagtt tggtaaagat atccgccaat attgcccgga cgtaaaacat 3 60 

gtggttgtcg tattcaatcc tgccgatatc accggattga ttgtcttact ctatgccgga 420 

ctgaaaccgt cacaagtatc aacattagca gctttggaca gtacacgtct gcaaaacgaa 480 

ctagtgaaat accttcatat tcccgcatct gaaatagtga attgccgtac gtatggcgga 540 

cacggagaac agatggccgt attcgcttct accaccaaag tacaaggtga agcgcttact 600 

aaaattatag atactccacg tatgcctatg caggattggg aagacctgaa agtacgcgtc 660 

atccaaggtg gaaagcatat catcgacctg cgcgggcgct cttcattcca aagtccggcc 72 0 

tatctgtcta tcgaaatgat tgcagcagcc atgggcggac aacctttccg ctggccggca 780 

ggaacgtacg tatccgacaa aaagttcgat catatcctga tggcaatgga gacttctatc 840 

acgaaagaag gtgtgagcta taaggaaata cagggaactc ccgaagagca aaaagaaatg 900 

gaagagagct acgctcactt atgcaaatta cgtgatgaag tgatcgctat gggtatcctt 960 

ccggaaatca ataaatggca tgaactgaac aagcatatta actga 1005 

<210> 1387 
<211> 2283 
<212> DNA 
<213> B.fragilis 

<400> 1387 

atgaataact cgaaaattat caatgtgaga ttgatgaaaa aggtgttagt gcttgttcta 60 

tcttttttgt ctgttactgc ttttgcgcag aatataacag tgaaaggaat tgtaaaagat 12 0 

ggaaccggtg aaccgattat cggagggagt gtacttgtta aaggttcatc gatcggtaca 180 

gtgacagatg ttgatggcaa ttacacttta tctaatgttc ctgcagacgg agttctggag 240 

ttttcttaca tcggcatgaa gaaacaggat gtaaaagtaa gcggtaaaac tgttattaat 300 

gttgtgcttc aagaagatac ccagatactg gacgaagtag tggtgacagc cttagggttg 3 60 

aagcgtgaac agaaagcttt ggggtatgca gtgaccgagg tcaaaggcga tgacctgaaa 420 

gctgccaata cgatttctcc ggtagccgcc ttacaaggaa aagtagcggg tgtcgagatc 480 

cgtcagtcag acggaggtat gttcggagcg acgaagattc agattcgcgg tgcttctact 540 

ttgaaaggaa acaatcagcc gatttatgtg atcgacggag ttattctcga taactcgact 600 

tcgggaaata ccacgatgga ctgggatgcc ggaaacaata atgccaatga ctatggtaat 660 

gaactgaaga acctcaatcc ggatgacttt gagacagttt cggtcctgaa aggtgctgct 72 0 

gcgactgccc tttacggttc acgtggtctg aacggagctg tggtgattac caccaaatcg 780 

ggaaaaggct tcaaaggctt cggagtttct gtatcacaaa catttggtat cgatcatgcg 840 

taccggacac cggatatcca gactgaatat ggggtggggt tgatgcctgg ctggaaagac 900 

acggacaaca atggttctgt atgggatcct tttcagttca aactcgatga taaaggagac 960 

cggacactaa taggcgcagg cagttatgga tggggaccta aatacgatgg tcagccgatc 1020 

cgcaactatg atggtacctg gaccaattat tcgccccata aaaacaacat gctggatttg 1080 



557 



tatcaattgg ggctgaactc caatacgaat gtggctattc gcggtggcaa tgataaaaca 1140 

tcgtattaca cttctctttc ttataagaaa gcaagatcta ccagcgaaaa gaatacattt 12 00 

gagcgttatt cgtttttatt gaagggttcg cataaaatca gtgatcgggt ggaagtttcg 12 60 

gctgctatga gtttcaccaa ctcaaatccg aagaattctc cgcgaacagt aggagagcgt 1320 

ttcgtcaatc cgaacggaac cattatgact ccgatgctgg atgtgaatta tttccgcgat 13 80 

aaatatctgg gtgagcatgg tggactggca tctacaagtt atggtgacaa gtatggttca 144 0 

gttccgggac gtgatttgtt ttttatgatc gataaatacg attattccca gaaagagact 1500 

gtggttcgtc cccaaatgga agtgaatgtg cagattctgg attggctgag atttaaagcc 1560 

gatgccaata tgaattacta ttacactaag tttgaagaaa aacaactggg tagtggatat 162 0 

gcaaatgaag gcggtaagta tacaatgggg caaaccacaa aagagcaggc tacctttggc 1680 

ggaacattta ctgtcaataa gcagatacaa gatttcagtg taggtggttt cgcgcgatat 1740 

gaatactata caagtcgttc ggaagcatat aaagtatata cagatggcgg tatggtagta 1800 

ccaggacaat ggtttgtcga caactcaaag aaccctaaaa agtcggaagc gagcatttca 1860 

aatacaaaga gaatgatgtc tgctgtcttt gctttgaatc tgggatggaa aaatcaggtt 192 0 

tatttagatg taacaggacg taatgactgg tcgtcttctt tggtatatca aaacgggatg 198 0 

ggtacatatt cttacttcta cccgtcagta tccggttcat ggctgctcaa tgaaacattc 2 040 

gatttgccgc attggattac atttgctaaa gtacgcggat cgtgggcaca ggtcggtaac 2100 

gataccgatc cctattatgt gaactcggta tatggctttg aaactaaaga aatgtatgat 2160 

ggcaatatct atgtgaacac tctcgataag acaatgaaga gtttgaagct gaaaccggag 222 0 

cgtaagaatg cctgggaagt tggtttggat ttacgtttgt ttgacagcag tttgacactt 22 80 

tga 2283 

<210> 1388 
<211> 345 
<212> DNA 
<213> B.fragilis 

<400> 1388 

atatgtatat cagtaccaca attaattgca tgttgcggat tagattgcga aaattgcgat 60 

gcccgtatag ccactgtccg agatgataat gaattaagag agaaaaccgc ccaaaagtgg 12 0 

agcataatga acaatgcacc ggaaattaca ccggcaacca taaattgtat ggggtgtcgc 180 

acggacggag cgaaatttgc gtattgcaat gactattgtc caattcgaaa atgtgtaaat 240 

gagaagggat ataatacctg tggtgattgt aaggaactgg atgattgtca gatagtaggt 3 00 

gctatttttc agcatgcccc tgatgcgaaa gaaaatcttc tatga 3 45 

<210> 1389 
<211> 966 
<212> DNA 
<213> B.fragilis 

<400> 1389 

attgtaacat cattaaaatt ggaggacaga cttatgaaag acccgttcaa acttaatgtt 60 

aaatcaatca tggctcttta cgaactattc cctacggaag aagcctgtat caaacatctt 12 0 

gaggccatta attggcatga caaacctgtt tcaccatttg acaagacctc cagagtttat 18 0 

aaattgaaaa gcggcaaata tcgctgtaag aatacaggta agaactttac cgttcgcaca 2 40 

ggtacgatgt ttgaaaagac aaagataagt ttgcgtaaat ggtttatcgc tatttggctt 300 

gttaccaatc acaaaatagg tctttcttcc taccaactcg caaatgatat agaagtcacg 360 

caaaagacag catggtatat gttgcataaa atacgtcatg caatgcgtct tgctaatgaa 42 0 

aacatcttgg aagaggcagt agaaatagat gaaacggttg tcggaggtaa aaatgggaat 48 0 

cgtcataagg ataagaaagt ccctcactcg caaggacgtt cacacaagga taaatttcca 540 

gtcgtgggaa tgatacagcg agaaaatcta atgaatgccc gagcaacccc tgatacgaaa 600 

tctgacactc tgtccgcatt cattaaggaa tacatacatc cggatgcaat catttatacg 660 

gatgagtaca atgcttacga ccaaataggg ttcagttata ctcgtttcta tgtcgaccac 72 0 

agcaagaagt tatatagtta tgaccacata acgacaaaca gaatagaggg tgcttggacg 7 80 

catttcaaac gtatggttaa aggtacatac agaactctgc ttaaaaagta tttgcagaaa 840 

tacgttgacg agttcgtgta taggtataat ctgagggaca tcagcaattc cgacagactt 900 

aactgtttcc tttgttgcgc tgacacacgt tatacataca agcaaatcag aaaatcagcg 960 

gcttaa 966 
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<210> 1390 
<211> 183 
<212> DNA 
<213> B.fragilis 



<400> 1390 

agaacacaat cttttaaaaa aacaaaacgc acgaatcatg ctattgcaaa atccatgcgt 60 

ttttattcta ttccaatctt tcaggccgga aaaaatcgtt tctccgcatt ttcaaaatac 120 

aaaatgacaa atcgattact tatccgttat tccactcatg attttcaatg ctgcattcaa 180 
taa 183 



<210> 1391 
<211> 417 
<212> DNA 
<213> E. fragilis 



<400> 1391 

ccaacaagat cacacacaat gaaactccgg aaaccctttc gcattctgcg caatctgatc 60 

ctttttttct ttatctcctc gatcggtgcc gtcattttct atcgattcgt gccggtatat 120 

gtcactccac ttatgattat ccgctctgtc cagcaactcg tttcgggaga aaaagtggta 180 

tgcaagcata cgtgggtacc atttgataaa atctctccca gcctgcccat ggcggtgatt 240 

gcttcggagg ataaccgctt tgcctctcac aacggattcg acatgataga aatcaaaaaa 3 00 

gcgatgaagg aaaacgaaac ccggaaaaaa gaaacggggt gccagtacca tcagccagca 3 60 

gacagcgaaa aacgtctttc tttggccaca atcttcgtgg atacgaaaag gatttga 417 



<210> 1392 
<211> 1002 
<212> DNA 
<213> B. fragilis 



<400> 1392 

gagtccccgt caacaccctg cagttgctta tcaagaagat caacgaggaa tatgtattgt 60 

aaaccgcata ggccgggaaa cactccgcag gtaccagata acaagggaaa atgcacccgg 120 

ctggtagact acctgagtaa ggagtcccag gtcgagcgtc cctattatga caattttttc 180 

tcacagcaaa aagattatgt tatacccctg actgtcaaga atcatataga caacaaccac 240 

aggaccctga aaagcaacga tgacaaattc tatatgcttt ccatcaatcc cagcggtgac 3 00 

gaacagagac atctgataga aagggtgacc ggacggaagg tcggggagtt ctcggaactg 3 60 

actcccgggg agcaggagag tgtgctggca cagatgaaga aattcacccg cgaatgtatg 420 

gatgaatatg cccgtaactt ctaccgtgag aaaataaaat caggggatga cctggtctgg 480 

tacggccgcg tggaaacgga acgccactat aagaatgatg atccggaggt taaggccggc 540 

agggcaaagg cgggagataa gaagcccggg ctccagcttc atgtgcatgt gatcgtttcc 600 

cgcatggaca ggacgcagac cgtatcactc tccccgctat caaaaagcag gggaaaccga 660 

caggtacttg aaggcaggga agtcgtggta ggttttgacc gttcccaatg gtcctcccgg 72 0 

tgcgcttcac gcttcaatca gttgtatgat tatttcccta attactattc cagggatgaa 7 80 

agtttgagga agtactccga gaactggcag gccaaaaacg aactgaagaa cgaggcggta 840 

tcaaagctca aacaggaagt tctcaaaggg gagctgaagg aagaaaggcg gctgtatgca 9 00 

aacaccttcc ggatttaccg gtttgtggta aatcccagga aggcaattat tcaggaactt 960 

aaaaggctgg ggacggatct tctttccgga agggatctgt ag 1002 



<210> 1393 
<211> 969 
<212> DNA 
<213> B.fragilis 



<400> 1393 

acggccactc cgtacaaaaa taaagactgg 
aataaaaata ctattatctc attacatgaa 
tatgctaatg attacgatta ccatatcggt 
ttattgatgt ccgacattct tccggctaaa 



cagtgggatg tcgcaatgac atatacaaag 60 

aatgtagcag actacatcgc attgagcggc 12 0 

tcggttgcca aagtaggtgg cgactacgga 180 

aatgaaaaag gagaaacatt gttggagtgg 240 
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gacgacagtt ggcggggagc ttacgaagca cgtagcggaa aagtacagga agtgggtaaa 3 00 

atgactcccg actttttagg ctctttatct actactctgt cttggaaaaa tttgagcctg 3 60 

catattgcta ctgacatgcg ttttggagga ttggtagctt cttactctaa cttgtatggt 42 0 

acacaggccg gatggatcaa gagttcttta aaatggcgtg atcccgaaca tggtggtttg 480 

tcttggacca gccagtacgg tgacagtaaa ggaatctctt atggcgatgg tgttatcccc 540 

gacggagtat ttaagaatgg tacgttcgca acacttgtag acggaacgaa aatggatgta 600 

agcggtatgt cctacaaaca gttggtggca gaagggaaac tggaacccac acatgccgga 660 

acttatcatg taaaccgtgc ggcctgggga cagaatacga tattcgacac ttgggtacac 72 0 

gagttgaact atattgcttt gcgcgagatc accttatcgt atcgcttccc gaaatcagtg 7 80 

gcaagtaagt ttggtgctca gggattggga ttgagcttct ctgcacgtaa tctgggatat 840 

ctgtataact cgttgcctaa ccatctgaat cctgagagtg ttcgtggtaa tacggcttct 900 

gagttccgta tccgtggcta tgaaccttac acagctaatt atatgatgac tattaatgta 9 60 

gatttctaa 969 



<210> 1394 
<211> 867 
<212> DNA 
<213> B.fragilis 



<400> 1394 

gaattatatc cagatatcat gaatcgaatc gaaaaaaaca agacccaaac cgatcaggcg 6 0 

tggaacaaat tacacaatcg cctggagaca gacggactgt taccaacggt gacagaacgt 12 0 

cgttttgcca cacgtccgac cgtatggatc ggcatagcag ccattgcagc tattataagc 180 

ctatgtgttt acctgcctac ggtgttacgg accgaccgcc atctttccgg cggtgaacta 240 

ttggtcaaag caaacaaaga agagagcata ctggtcacca cgcttgagga tggttccgtt 3 00 

gtctatctgt cagagcagac ttcactggaa tatcccaaac atttttctaa aaaaagaaga 3 60 

gaagtgagtt tgaaaggaaa cgcccttttt gacatagcgg gcaatcgtgc acgtcctttc 42 0 

ttcatcgaaa ccggaaaagt acagatcgag gtgataggta ccgcctttca tgtgagaaac 480 

agtggcaact ctccatttga attagcagtt cagagaggtg aggtaaaagt tactcaaaag 540 

cagaatggcc aggaaataca tgtcaaggcc ggagagaccg ctactttatt gggcgatgaa 600 

tggcagttga ctgtgaccga gaattccgaa cagtttaccc gatacatgca aaatatgcgt 660 

ttcaaagacg aacagttgga tcacatcctg catgccatca accttcgcca gacggaaata 72 0 

catctgcaaa gttccccgga actggggaaa catgtactga cggtttcgtt ctcagaggat 780 

tctcctgaga aaatggccga gctaatcggc cttgcactga acctgaagtg cacacgcaat 840 

caaaacataa tcaccctctc cgagtaa 8 67 



<210> 1395 
<211> 447 
<212> DNA 
<213> B.fragilis 



<400> 1395 

gcacttttcc cattagcacc atctttaccg ttagcgccgt cttttccagc agccccgtcc 60 

ttaccgttaa taccatcttt tccgttggca ccatcctttc cattggcacc atccttaccg 12 0 

ttaatgccat ccttaccatt ggcaccatct ttaccatatg cacgaatacc ggtatctttc 180 

ccattaatat accagttccc attggtaccg attattacct ctggcgatct tccatcttta 240 

cctatcgcag cctttccggt atcttttcca ttgattaccc aatttccatt ttctccaatt 300 

gtaacagtag gtgttgttcc tgcaactcct tcttcacctc tggaaggttt tcctgtatct 3 60 

actccatcaa tcaaccaatt accattatcc ccgatcgtaa ctaccggtgg aacagcatct 42 0 

tctccattct gtccatccct accctga 447 



<210> 1396 
<211> 291 
<212> DNA 
<213> B.fragilis 



<400> 1396 

tatggcccgt ttgagggtct tctccgtccc ggtgaacacc gtgcggaatg gtttccggtc 
ttcggtcagc tcctgttcga cgccacggtt gtaaacagca cggagtatgc gcatgtaaaa 



60 
120 



560 



ggatatggag ttcggcacaa tccctgttgc agtcaggtac gcctgataat cttccatgag 180 
tatatggtct atcgagccaa tgtaaatgtc cctgttatcc cggaattgtt tgaaactgcc 240 
aagcgcagca ttatagttct tggcggtgcc ggagtggttc aattgccgta g 2 91 

<210> 1397 
<211> 1401 
<212> DNA 
<213> B.fragilis 



<400> 1397 
ggattgcagc 
caaggaatca 
cccattatgg 
gggcgtttgg 
tcgggttcca 
ggagcacaaa 
attatttcac 
tacgaactgg 
ggattacctt 
agtaagatac 
cttttcatct 
caagcagcag 
cgattctatt 
ttaccggtag 
gcatccgaac 
gctatcacct 
aactatgcag 
acttcaattt 
gctctctttg 
ggttattcac 
ggacgcacca 
gccattctgt 
actactgttg 
ttgtcgcgcc 



cggaagaaag 
agaatctgac 
ccacctcgtt 
ggagtgaagc 
tttccttact 
atcatgaaga 
tttgctgggg 
aagcacatat 
tcatttttct 
cattctacat 
tcggtttcgg 
tattcggcat 
tctttacccg 
ctacccttaa 
aaggagggca 
ggaatacatc 
ccggacgaac 
tcggtacgct 
tacccgaaca 
aactctttat 
ttccacctgc 
ttgtccgcat 
ccaaagggct 
cgatactctg 



tcgtatcttt 
tcaaggtcct 
tatccaaatg 
cgtcgccgcc 
gaacaaagta 
tgcccgtaat 
tggactgctt 
caccgaaaat 
ttcggcagcc 
cagtggtacc 
gctgggtacc 
ttttatctat 
gttaaagaaa 
caccttgttt 
catcggattg 
acaaggattc 
agaccgtgtg 
ctgcaccctg 
ggccgcttat 
gatgctcgag 
cattatcagc 
gggcatggga 
tattctggcc 
a 



gccactcatt 
atcaataaac 
gcatatagcc 
gtcggttcgg 
ggttccgagg 
ttcgcctcac 
ttcctgcttg 
gcaatcgctt 
tttaccggta 
ggattggtat 
aacggagcag 
caattacgtt 
aaatataccc 
gctttcgtca 
atgaccttta 
tctactgcct 
atcaaatcgt 
ctttttgtat 
gaagcaggag 
atcacgacac 
atcacctgca 
gtagaaggta 
ggttggtttg 



tcttattaaa 


taatatcatg 


60 


aattgtttaa 


cctggcgatg 


120 


tgacagatat 


ggcttgggta 


180 


taggcatcct 


gacttggatg 


240 


tcagtgtagg 


tcagtccatc 


300 


acaacattac 


tat cgccctt 


360 


cacgtcccat 


cat cggtatc 


420 


atctgcgcat 


aatctctacc 


480 
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540 


tgaatatcct 


cctcgatccc 


600 


cctatgctac 


ttggatttcg 


660 


gcagagacgc 


tttactggga 


720 


atcgtatcct 


taaattaggt 


780 


atatgttcct 


ttgccgtaca 


840 


ccaccggagg 


gcaaattgag 


900 


taagcgcatt 


cattgcccag 


960 


ggcatatgac 


tttgttgatg 


1020 


ttttcggaaa 


cgaaatcttt 


1080 


gcgtgtttct 


ccgcatcgac 


1140 


aaggtgtatt 


ctacggtatc 


1200 


actacatgcg 


tattccgctc 


1260 


tctggtgggc 


tgtttgcgtc 


1320 


cattgattaa 


acggaaagtg 


1380 
1401 



<210> 1398 
<211> 237 
<212> DNA 
<213> B.fragilis 

<400> 1398 

ttaccttcta ctacgcatga tatagatacg cacatgatca ccgccacaat ggaaggtaat 60 

gctgcagcac aggactttct ctcacatctc cctctgaaag cgacattacc ctttataccc 12 0 

cgtggagtag cgttgcaatt ttctgctttc cttactccta cgcatctatc ttccgaaaag 180 

ttcgttgcag tcgtactcat gaccaattcc tttgttacgg tctgcgaatg gcagtaa 237 

<210> 1399 
<211> 1206 
<212> DNA 
<213> B.fragilis 



<400> 1399 
agtatggcta 
atcgtctatg 
ttcccccatg 
acgaccgtca 
atcatagagc 
ttccggcatt 
ctacggcaat 



cagtaaaagc 
ttgtaattca 
aatgggacga 
tgcagtcgat 
gttttgacaa 
ccgaagaggg 
tgaaccactc 



aaaattccgc 
ccgacgtacc 
agaacgctcc 
aactcggaag 
gcagtgccgc 
ggacagcttt 
cggcaccgcc 



ccctcgacgg 
gtcagacaga 
aagccggttc 
ctgcaatcgg 
agttattcat 
ttcaatttca 
aagaactata 



tcaaagaccg 


tccgggcact 


60 


tcacgaccgg 


ttacaaggtg 


120 


tggccgacaa 


tgtcaggcgg 


180 


atatgaaacg 


gctgtacaaa 


240 


cggatgatgt 


ggtggcggag 


300 


tggagggtgt 


catcgaacgg 


360 


atgctgcgct 


tggcagtttc 


420 
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aaacaattcc gggataacag ggacatttac attggctcga tagaccatat actcatggaa 480 

gattatcagg cgtacctgac tgcaacaggg attgtgccga actccatatc cttttacatg 540 

cgcatactcc gtgctgttta caaccgtggc gtcgaacagg agctgaccga agaccggaaa 600 

ccattccgca cggtgttcac cgggacggag aagaccctca aacgggccat atcaatcaat 66 0 

gacatcaggc ggatcagaaa ccttgacctc tcgctaaaac caggccttga atttgcccgt 720 

gacctcttcc tttttctttt cctctgcagg gggatgtcat tcatagacgc ggcgttcctg 780 

aagaaggccg acatccagaa cggcgtgctg acctaccgcc gccacaagac cggccagctg 840 

ctgcacataa aggtcatcaa acagatagag gaaatagttg accgccactc ggacaaggaa 9 00 

tcgccgtacc tgtttccggt cataacccgc cccggagaga acgagcgcaa gcagtatgag 9 60 

acggcgttgc accacgtaaa caaatccctt aaaatcatag ccggaatgat aaagctgccc 102 0 

gttgcgctca caacgtacac tacccgccat gcttgggcga ccatcgccaa gtcgaagaac 10 80 

gtaccggtca atgtcatctc ggacgcgctc gggcatgatt ccattaccac cacgcagata 1140 

tatctcgctt cgattgacgt ttccgttatc gacaaggcga atgaactgat tattaaagat 12 00 
ctgtag 1206 

<210> 1400 
<211> 582 
<212> DNA 
<213> B.fragilis 

<400> 1400 

cgaactctaa ctcccgatat atcccccaat atgctgaacg acgtatttat acttactcaa 60 

ataaaagaag gcaatataaa ggcgtttgaa acattattcc gccaatatta tactccgctt 12 0 

cgcctatacg ccgccagcat aacgggtgaa ccggacgtag ccgaagaaat cgtcgaagaa 180 

ttgttctatg tattctggaa ggatcgggaa aaacttgaga tttttcattc tgtcaagaac 240 

tatctctatc ggtctgtacg taaccgctct atccagtatt gcgaacatca ggatgtcaga 3 00 

aggcgctatc aggatgcgat tttatccgtt ccggtaaata tagcttcacc cgatccgcaa 3 60 

gagcagattg agtacaaaga actgcaacaa attataaacc gaactcttga gaaactaccg 42 0 

gagcgccgtt tgcatatctt ccgactacac cacacagaag gaaaaaagta ttcggaaata 480 

gcctctctcc tctcactatc ggtaaaaaca gtagagaaag aaatgacccg ggcactccgg 540 

actttacgaa aagaaattga gaattatatc cagatatcat ga 5 82 

<210> 1401 
<211> 282 
<212> DNA 
<213> B.fragilis 

<400> 1401 

agaaaggcaa cgccatatac tattgacacg gacggcggga gccgctgcct tttcatttcc 60 

gaaaaggtca tggaagtgtt tgacgacggg ttcgattacc gggaggtggc gaaccataaa 12 0 

gagctgacag atcttaccgt ggccttgttt gcagggatga tatctgcaca gcctccccgg 18 0 

acggcggcga tggaagttcc agtggcctgc cttagggaag acgcgaggac ggggacgaac 2 40 

gggaatgggc acgccgttgc acccgtgaag cggaaagggt ga 2 82 

<210> 1402 
<211> 891 
<212> DNA 
<213> B.fragilis 

<400> 1402 

atagcagaaa caatggaaat attttcttta aaggagcatg tctcctgcta caattatgca 60 

aaatgcatcc gggaagggtt ttcatattac gaaagttcta aaacagaaac ggacgaagga 12 0 

cctcatgaaa ccgattgtat tttgtttgta atggaaggag aactggaact gtcctgtaac 180 

ggggaaagaa taaaactccc ggccggtaat atgatttgtt gcagtaggga aagcatgtac 240 

agggtattct cacaaggaaa aatgagtatt gtgattgcac aatttgataa tgccgtgcaa 3 00 

agttgtgaaa aggtttcatt ttcacaactt aacagcctga actcctcggg tgaaaaagga 3 60 

atatatcctc ttgaaatcag agacaggtta caattgtttc tcaaactttt aataggttat 42 0 

ttgggagacg gagcaagttg cgttcatttt catgaaacaa agcttaaaga actgttctgg 4 80 

aacatacggt tttattatac aagaacggaa caggcctcct ttttccgtcc tatattgggg 540 



562 



aatgaccacg 
gagcttgccc 
ttcagagagc 
ttggctgatg 
cctcaatttt 
ctgctgaaaa 



aattcaaaaa 
agatgtgcga 
ctgcaagcga 
aggatatacc 
gcaggtactg 
atagagataa 



gaaagtgctt 


gataattata 


gaaatgccag 


gacagtaaag 


600 


aagttctctt 


tctactttta 


aaaggaagtt 


ttttaaagag 


660 


gtggttacag 


aaacaaatga 


atagtataat 


taaatataaa 


720 


gataaggaat 


atagccgatg 


aactgcattt 


ttcttcacaa 


780 


caaacgaaac 


tttggataca 


ctcccggaga 


atggagaaaa 


840 


aactccaaga 


agaccctccg 


gcagagtata 


a 


891 



<210> 1403 
<211> 372 
<212> DNA 
<213> B. fragilis 



<400> 1403 

ccctacagaa caagacagtc cttgcaacag atcgagtgta gattgatgat aaaatgcgat 60 

cccttgtgtg ggaattccac tattagtaag gtgtcgggtg gtatctgtaa aacgactaag 12 0 

gtcaactttc ctgtttatgg tctgtcggaa aacaaacgca cccgttatcc aatgatagcg 180 

gctcttattg accgatttaa tcgtaaattc ctggcaatac atatgttgcc gtatcttatt 240 

ttgaccatag aatatattgc gaggcgagaa atcttgatcg attcccatct tgtcctgtat 3 00 

atattgatag gatgtttggc tgttgaagct gatatggggg ccgttatacc gtatgttgat 3 60 

tccggaggtt ag 372 



<210> 1404 
<211> 489 
<212> DNA 
<213> B . fragilis 



<400> 1404 

ataattatga ccagcaagga ttttatagaa gctgaagcag gtaataacgg aagtatcatt 60 

ctttatcgtg aggggctttt ttggaaagct tatgaaaagt ctgcttacgc tgtctgtacg 12 0 

cagatcaagc ctttgaaggc tataaaaaga aggttaaagt cactcggtgg cggtgagata 180 

gtttcagtgg gatttccatg taagcatgaa caaaagtata taggttcttt ggagcatatg 240 

gagactatgc ctgaccgtct tgtgttgcga acgctaaaac ctatagacgg acagaggttt 300 

gaagaatgga aacaggaact ttcatcggag cattcagttg tagggaggag agatgcgtgt 3 60 

gtgcagaatc tgtcacggag taatattccg catggagagt taatcatgcg aatccggatg 42 0 

ttcaatctgg cggaaagtac tccgatggat tgtatgttat ttgtaaatga gttgaaaaag 480 

atgctctaa 489 



<210> 1405 
<211> 192 
<212> DNA 
<213> B. fragilis 



<400> 1405 

gctgtcattg ctttttgtaa aatagaaaca attataaaaa tctcgtatct tcaggttaca 60 

ggggaagaag tctatagaag cctgtctaca tatcctcacg gacttgttgg cttagtagat 12 0 

gagccaaatt ggatacaaaa gttcagagac ttctgttgtt tatataatta caaaaaccat 180 

gccatatttt aa 192 



<210> 1406 
<211> 1287 
<212> DNA 
<213> B. fragilis 



<400> 1406 

tgttttgttg ctttcatgcg atttttgcaa cctttttcat acaaacagtt aatttatata 60 

atcatttgta attcatttat agttatggca acagtaaaaa cggtattagt aaaggggcgg 12 0 

tgtaatagtc gcggggccta tccattggct gtgcaggtcc ttcataaacg gaagaaaaaa 180 

gtcttttata cgggatacag tattgagccc tgtcagtttg attccattag cggacgggta 240 

atctttaatg gtatgtatac aatggaaact atccggcgca tgaaccgcgt gtgtaggaaa 300 



563 



attagtaaag tcttggacaa ggctattgac atattggaaa agggggatag tgaatatacg 3 60 

acatgtgaca tatccagaat ctatgaaacc ctgacaggaa aggtgggttt ttatagctat 420 

ttccaagaga gaatccgtgt gcttttcgat accggacatg aggggacggc aaaagcatat 480 

gagtcgaccc tgcattccat gcaaagacat ttgtgtgaga gtgattttcc atttccccat 540 

ctttcttccc gtcttgtcat taagtatcgt gacgctctgc aggaagccgg tgtaggcaag 600 

aatacgatag gcttttatct acataatatg aaggcagtgt ataggagagg atgtcttgaa 660 

ttgaatctgg tattcccttc accttttagt gatattagaa tccggtctga aaagactgtc 72 0 

aaacaaagcc ttcctatgaa tcaggtaagg tctcttgccc gtctgtctct ttcggccgga 780 

acaccggaat gcttggccag agatatcttc atattcagta tctatacccg tgggatgtcg 840 

ttcgtggata ttgccctgct gaaaaagagc gatgtttttc ccggagtaat tcgctataga 900 

aggcacaaaa cgggtcagtt gcttgaaata ggtattaacc ggcagataca gagtttgctg 960 

gacaggtatg gagatactgc cggtaactac cttttccccc tggttgacga gtcggaaact 102 0 

ccttattcgg gttataaaaa agcttataat aagatgagat atgcactgaa gaaggtttca 1080 

aagagtatcg gtatgaaccc tcctttacgt ctgcatgccg cccgtcactg ttgggcaacg 1140 

atggcccgcg aaaacggtac tcctctccat accatcagtg aatgtttggg gcattcttcg 1200 

gagaaaatga ccagaatcta tctgaaggaa cttgaccggt cggtactgga cgaagtgaac 12 60 

aatcacatag cagataatat atgttaa 12 87 

<210> 1407 
<211> 1572 
<212> DNA 
<213> B.fragilis 

<400> 1407 

acagatactg caatggaaat aaggaaacaa tattttaagg agaaacaata tatgtggcat 60 

aaagtaaggg agcttcagtt gaaaggactg aacaaaacac aaattggaat atatctgggt 12 0 

gtgaaccgta agactgtacg acggtatctg aatatgacta tggaggagtt tgttaaaaaa 18 0 

caaagttctc accgcaagta caggctgaaa ctggaaaact acgagcaata tgtacgtgca 240 

aacctggaag aatacccgta tatatcggcc gccaggatac atgactggct aaaggaatgc 3 00 

tatccggact tcccccgtgt atgtaacaga accgtatccg gtttcgtgga aagggtacgc 3 60 

aaaaaatacg gcatcgggaa aaaggttgaa acgcacaagc gaaactacga gaagcagcct 42 0 

gatactccct acggggaata cgcacaggcg gattttgggg agaaatggat gcgcactgaa 48 0 

aacgggaagt ccatcaaagt gtacttcttt gctattgtcc tgtcccgttc acgatataaa 540 

tttatctatt ttagccggag gccctttgac accgggcttg cggtttatgc ccatgaactt 600 

gccttcgaat acttcggagg caggccgcaa aagatcattt acgaccagga taaagtactt 660 

atagcacggg cgaacatggg ggatttgata ctgaccggca aatttcaggc atttgtaaaa 72 0 

gagcagcatt tccatcccgt gttctgtcac aaggccgatc cggaatcaaa ggggaaggta 78 0 

gagaatgttg tgaaatatgt gaagacgaat ttcctcacgg cgcgtatttt tcagaatgta 840 

gacagactta atgaagaagc acgtctctgg cttgaaagaa cgggaaacgg gaaggaacat 900 

ggtaccacac accggattcc ccttgaggaa tttgcacagg aaagagaata ccttgtaccc 960 

tatcacggta ctccgcactc acccggtgga gaaatgaagg aatatcatgt acgtaaagac 102 0 

aataccgtac agtacagggg aaactactat agcctgccat gcggaaccta tcggggagga 1080 

gaaacgacag tatggctcca tgaaacagac ggatgcctgg agctttataa taaggagacg 1140 

ggaaagcttg tctgccggca tgatctgtgc gaactcaagg gaaagactat ctatggtgaa 1200 

ggacacagaa ggcaaagaaa tatcggagca caaaagctgg ctgaacgcat tcttatctat 12 60 

gtatcgtaca atagagaggt cgccttatgg cttgagaacc tgcagagaag gaaggaacgt 1320 

tattacaggg agaatctgga ggtaattcta cgcataattc ccggatatga caaagccatc 13 80 

ctgacagaag cggttagtgt atgtctggac aagggaatct ataatggtga gtccgttaaa 1440 

agcctgtgcg gacatatatg gaagaagaaa atgggagaat cggatgtagg aaaaaatcct 15 00 

gcctcccgga cacagtcaac cggattggta aaaacatata atgaaatctt tagaaacaat 1560 

ggcaaggtat aa 1572 

<210> 1408 
<211> 1437 
<212> DNA 
<213> B.fragilis 



<400> 1408 

caaactgatg ctccactctc cacactcata ttcaaaacct ttataacaat gaaccagaca 



60 



564 



gtaacagaaa atcatcgttt gcgagatacc atcatcatcg gtgcaggtct aacaggtctg 12 0 

actaccgcct attgccttac ccgtaaggga tgtgatatcg aggtaattga acagagcccc 180 

tgcgtaggcg gccagatacg tacctaccac gaaaatggtt ttacgtttga aagtgggccc 2 40 

aataccggtg taatctccca ccctgaagta gccgaactct tggcggaact atcccccacc 3 00 

tgccgtttag aaacggcccg tgaagcgtcc aggcaacgcc ttatatggaa aggagatcgc 360 

tttcattctc tcccctcagg actgttcagt gcaatcacca ccccattatt cagtacaaaa 420 

gataaattca atatactggg tgaacccttc cgtgcaaaag gaaacaatcc cgatgaaacg 4 80 

attggtgaac tggtacagcg gcgcctgggt atctcttatc tacattatgc agtagaccca 540 

ttcatttccg gggtttatgc cggtgaccct atgcgattgg ttacccgtca tgccctgcct 600 

aaactttatc aactagaaca aacttatggc agtttcattc tcggtggtat tgcaaaatca 66 0 

ttttcacatc gaagtgaaca cgaccggttg gctacacgaa aagttttctc tacttatggg 720 

ggacttagta acctgacaaa agcattggaa caggccattg gaatcaaacg attctccctg 780 

ggagctactt ccacttccct aatgccatgt gaacaaggat gggtagtttc tttcacagat 8 40 

tcctgtggaa tagttaaccg aatccactgc cgaaaggtga taaccaccac acctgccttt 900 

gtcctccctt cacttcttcc tttcgtccct gacaaacaga tgaatcggat aagtaatctt 9 60 

acctatgccc ctgtgatgca agtctccgtg ggattgcgta atacttatgg aaaagagttt 102 0 

catgcctttg gcggattggt cccttcctgt gaacaaaaac cagttttagg tattctgttt 1080 

ccatcagcct gtttcgataa ccgttccccc gaaggaggtg cattatattc atacttctta 1140 

ggcgggacga gacaccctga acttctagag aaaagtgacg acgaaattat caggttgata 12 00 

accaccggcc tgaatgaaat gttagattat cctgccggaa tagtacctga tcttatacgt 1260 

atcttccggc ataagaaagc cattccccaa tacgaaagca gcagtaccga cagatttgct 132 0 

gcgatcaatg aactgcaaaa acagtatccc ggattggtgg tagcaggaaa tctcaaagga 13 80 

ggaatcggca tggccgaccg catcaaacag gctgttgaaa tcgcccgaga aagatga 1437 



<210> 1409 
<211> 474 
<212> DNA 
<213> B.fragilis 



<400> 1409 
gggacatcag 
catacaagca 
ttcagacttg 
aggacggatg 
attccccagc 
ataaatcacc 
gcagaggcat 
agtccggctg 



caattccgac 
aatcagaaaa 
gaacaatcat 
gggtacggct 
aatatgtgtt 
tctataaatc 
tgccattgct 
atttaaatag 



agacttaact 
tcagcggctt 
tgttccctat 
tatcattgat 
cacttggaac 
gtctaaagcg 
tatagaccct 
gtttaatact 



gtttcctttg 
aagatgaaaa 
attgaaatgt 
acctatttta 
tggctcggta 
aaaactaaac 
aaaaacggac 
gtgctgaaga 



ttgcgctgac 
gaagaaaaga 
tcgatgaagc 
acgacctgtc 
aaggttacaa 
gaggctttaa 
taataaaaaa 
cttgttcaaa 



acacgttata 
acctacggag 
agtagccata 
tattgatgac 
gtttaccgtc 
gaagttactc 
gccggaactt 
gtga 



60 

120 

180 

240 

300 

360 

420 

474 



<210> 1410 
<211> 267 
<212> DNA 
<213> B.fragilis 



<400> 1410 

gaactttttt atcgagatat ctatctattg cttacttcat gtcagttgat aaaccaaaat 60 

ccgtttactt atgaattgct tgtgccgatt cttcatcaag atacttttgt acaaatggta 12 0 

ttgcgaggca tcacttacct ccccctgctg atactggtca tggtggtcat ggatgggctg 180 

ttatgtatta tcggtttact gccattcgca gaccgtaaca aaggaattgg tcatgagtac 240 

gactgcaacg aacttttcgg aagatag 267 



<210> 1411 
<211> 189 
<212> DNA 
<213> B.fragilis 



<400> 1411 

aatcttctct atggcttcac tcaactcttt ttcaggagga attacattga aatcttccca 
aaagcttttg tcgtatacaa atggaatttc cgcaaagatg gtatgtacgg acaaccgctc 



60 
120 



565 



attatggtcg aaacggctga cattatccgt ttctatcttg caggtgacca tttcaaacca 180 
tgtatgtag 189 

<210> 1412 
<211> 204 
<212> DNA 
<213> B.fragilis 

<400> 1412 

aattacaatc tacagggtca gcgggcacag gatgtagaag cagtgtttgg caatctcaaa 60 

acaacaaaca atccaagaag gttttatctt cgtagagtgg agaaggttga tattgagttc 120 

ggattgctgg ccatagcata taatcttgca aaagtagcct cttcgacaac tttttgtgcc 180 

cccaaagaca tactgaaagg gtaa 2 04 

<210> 1413 
<211> 1584 
<212> DNA 
<213> B . f ragilis 

<400> 1413 

tcccggaatc aaaacccaaa aacaatgaaa gcctcgaaaa gtctctgcct acaatgcctg 60 

ttcacctgtc tgctattatt catagcagcc cgggtaaagg cggatgactc cggtatactc 120 

gatcgtatca tccggttacc gaaaagtgaa atgaccgttt ataaactact ttctaaaata 180 

acggaagaga caggatacct gttcatctac gacagcaaac tggttgacaa tgaacgtacg 240 

gtaaagttga aaggtggaaa acagaccgta cgccaagcca tttacagtat aattggtaat 3 00 

gacaacctta aactacgttc ggtagacaaa cacatcatca tttatcgacc ggaaacggca 3 60 

ttaagtataa gtaaggaaga aggtttatgc agagacagca cactctcatt cactcttgaa 42 0 

ggaacattaa tagaccaact atccagagag ccgatcccct atgctaccgt aggtgccgag 480 

ggttcctcca tcggtagtgt caccaatcaa aacgggagtt tcaggcttca tctgcccgat 540 

tccctccgaa acggccggat tcgcttttca catctgggct acgtaccgca aaccacggat 600 

gcatccctac ttgccggacg aaacggaact tttgctctgg agcctaaggt aattcccctg 660 

caagaggtta ttgtacgcat tgtcaatccg gtgcgtctat tgagagagat gctgcaattc 72 0 

agaaaaaaga attattccaa agtccctgtc tatctgacct ctttctatag ggaaggtatc 780 

gagcaaaaga accggtttgt cagcctgact gagggaatat tcaagatata taaggcttcg 840 

tcaagtaccc cggaaaagac cgaccaggta aagttgttga aaatgcgccg tatcactaac 900 

caagctgtaa aggatacctt aattgctaaa atgaaatcgg gtattcatgc cagcatcgag 960 

ttagacctaa tcaaaagttt gccggatttt ctgcttccgg actccaaaga atgtgtctat 102 0 

gtctacactt ccagcgacct tgctattatc gataaccggc ttgcccatgt agtctccttt 1080 

gaacagcgtc caagtatcaa gtatccctat tattgcggtg aactctacat cgactcggaa 1140 

aacagcgccc tgcttcgggc acgattcgag ttgactccgc ggtatataca taaagcagcc 1200 

aacatgctgg ttgagaaaag aagccgcaac atccggatta ttccccaaaa ggtggtttat 12 60 

accgtatcct ataaaccttg gaaacagaca tattacatcc accatgtacg cggagactta 1320 

cactttaaaa taaaacaaaa gaacaaatgg ctcaacaata ccatcctaca tacatggttt 1380 

gaaatggtca cctgcaagat agaaacggat aatgtcagcc gtttcgacca taatgagcgg 1440 

ttgtccgtac ataccatctt tgcggaaatt ccatttgtat acgacaaaag cttttgggaa 1500 

gatttcaatg taattcctcc tgaaaaagag ttgagtgaag ccatagagaa gatttcatcg 1560 

aaaatcgaag agacagaaaa ttaa 1584 

<210> 1414 
<211> 564 
<212> DNA 
<213> B.fragilis 

<400> 1414 

aaaataaaac accttattac tatgatcgta caggatttaa attttgatca aacagtcgcc 60 

atggcaaaag ccaaagcgga aagcatgata caaaacggac tttattcacg tttcgggtta 120 

tccgaaaaag agagactgga caaagccctt ctaggctgta tcggagagtt ggcattccaa 180 

aaacacctga aaaatctggg aatcccattc gaattggacc agacagattt ccaatcccat 240 

cactccgatg aatttgacgt aaaagtaaac ggtgccaaaa ttgatatcaa agtagcaaag 3 00 



566 



aaaacaactg ccaaccctcc aaccgacaat tggacctatg gttatccaca agagcagcac 360 

cccgaaacca aagattacgt cgttgtaggt tgggtagact tcaacagaaa agaagttggc 42 0 

ttttatggtt ggatcagagg aaagcaaata gtggaattta aagttgtcac ccaaaattct 480 

tatgccaagt atccctatct gacacctaac catgaattca aatggggatg tctcaccaag 540 

gatttgaatg agattttgaa gtaa 564 

<210> 1415 
<211> 1305 
<212> DNA 
<213> B.fragilis 



<400> 1415 
ataaccatgc 
cctgttttct 
tctctggatc 
cactctcggc 
gaaggaggtt 
gtcattcagc 
gactgggacg 
ctccgtatcc 
cgtaaacggc 
ttctttcccg 
gaaaacgatg 
ct tcgtgata 
at tgctaata 
ct tctcgtcc 
aactggatgt 
ggcgagatcg 
catcacgtcg 
tatatgggtg 
gaatacggtt 
ggttatttgg 
gcaaccaaaa 
ctgggacgtg 



atatgtttaa 
gcggtggtga 
tttcctatcg 
aggcacacgc 
ctgctttaaa 
atcgtcttgc 
gtctttcctc 
ttcgtgatac 
agttgatgaa 
aacttcgtaa 
atgtccctgc 
cggttgaaag 
tccccaaacc 
cgaatatcgg 
atgcctggtg 
gcctgcgtaa 
gcctttacgg 
gcaggcccgg 
attcccttgc 

ggggcactta 

accgtcgttg 
gcaattataa 



gaagctttta 
caccctgcat 
ggataacggt 
cgtactgcgt 
caaacgcttg 
cattcctgat 
tcttgttgaa 
tcccgtatgg 
tctgcgtggc 
cagttccgtg 
cccttctcaa 
agccattatg 
tttctatatg 
cgtggagttc 
gaacaacgac 
atatttcgcc 
acaaatgttg 
tggttctctc 
cattggcaga 
ttacgagtat 
gttcggtccc 
tgataagaaa 



atattgctgc 
tatcgtttct 
ttacgcctgg 
cgtgtaagtc 
tctgatagac 
tctatactta 
gggtctgaca 
gtcatccgga 
gggcgtgtat 
atcgaatgtg 
aaaggggtgt 
atgcgtgata 
ggactcaaga 
taccttggca 
agacgtcatc 
ccccgtgctg 
acctatgatt 
tgggaaaaat 
aggcttaatc 
gctcccatgg 
acgaaagcgg 
ggaggcagac 



tgtccttatt 
acttccgcgt 
actctcttct 
tatactccag 
gtcttgcatc 
tatccttctc 
ttccctaccg 
acggtatcgt 
ggcgttatat 
agttcgagcc 
gtccaccgga 
cggtagaggt 
cgaatctgct 
agggctggtc 
gttactggcg 
catatacacc 
tcgaatttgg 
ccaattacgg 
tcgacttttc 
atgggcatta 
aggtttcgct 
gatga 



ggattgtata 
aggtgcttcc 
ttctgttatt 
tgcctctcct 
cttacgcact 
tttaggagaa 
tggggaagtt 
cgtggactcc 
gcatgaacat 
ttttgttgcc 
tactgtaatc 
acccgttcct 
ctacgatgcc 
ccttggaggt 
tgtgtacggc 
ttttatcggt 
cgggaaaggc 
tttcggtctg 
tttgggcgtg 
cgtgtgggaa 
cgtgtggctt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1305 



<210> 1416 
<211> 975 
<212> DNA 
<213> B.fragilis 



<400> 1416 
agacacatgc 
atcttactca 
cttatcactt 
caaatgttta 
gcttacttta 
aaagagaata 
atgcccaaaa 
gaacagacct 
ctcgaaggag 
attatctcga 
gtctattact 
ctggttttgg 
gccatctcac 
ccgttggcgc 
gaacgcgagg 
ctgagcggat 
ctggaacaat 



aatacgtcgg 
ttctctttcc 
ttaccgtgga 
tcaacctgat 
ccaattccag 
aacgtatcta 
tcaacattat 
atacagttac 
tcatcgcaca 
ttgtatttgt 
cttcgtggac 
caatgattgt 
gaaaacgtga 
tggccagtgc 
atgtggcaca 
taagcggatt 
tctaa 



aatacagaca 
atgcctggtg 
agacgattat 
tccttatatt 
tattatcaaa 
caatttggta 
cgatgatgat 
cctatccaaa 
tgaactgaca 
cggtatcttc 
acgcagtaga 
agctgccatc 
atatatggct 
tctaagaaaa 
gttattcatt 
attcgccacc 



caacagagcc 
gcggtattga 
gggcagtaca 
ataggaggtg 
gctgccacag 
gagaatctct 
tcactaaacg 
ggtatcattg 
catatccgca 
tctatgctgg 
aatgataaga 
ggttatttct 
gatgcaggtg 
atatccgcag 
cagcatccgg 
catcctccta 



ggaacaatct 
cttacctgtt 
atacactggc 
ttctggtttg 
gcgcgcgtcc 
gtatgtcgca 
catacgccag 
aaaagcttaa 
atcatgatgt 
cacaaatagc 
ataatggagc 
tcgccacttt 
cagcagaaat 
atcccgatat 
gcaaacaagc 
tcgaaaagcg 



ccgttccggt 
ctgctatctg 
aatgaccaat 
gtttatcata 
gcttgaacgt 
aggtatgaaa 
tggaatcaat 
cgatgaagaa 
ccggttattg 
attgcgttcc 
catacttatt 
gatgcgtttc 
gacaaagaac 
agaagctgta 
caaaagtgcc 
aatagctatc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

975 



567 



<210> 1417 
<211> 402 
<212> DNA 
<213> B. fragilis 



<400> 1417 
caagaacagc 
ctggaatcca 
ggaaatgtca 
gccttgaaac 
aaccgggatt 
gccggactcc 
catggttacg 



aaaacctatt 
agagaggcat 
acaacgacca 
ggggctggat 
gcatggaacc 
agagcatggg 
tgctgaagaa 



ccccgaaaac 
tccccatctt 
tgacatacac 
gaccgccgcc 
cctgcaatct 
ctataaattt 
aggcaacgcc 



caaggcagca 
cacggcacgt 
ctgcgagcac 
gaggtgcggg 
atgaaaagca 
tgggagccgc 
atatactatt 



agggcacggt 


gtggctgtat 


60 


tctgccgtat 


cgacgaacgg 


12 0 


agcgggcagc 


cgggcgtgtg 


180 


agacaaacat 


cgggcatgtg 


240 


ggtcatggga 


cgagtacgcg 


300 


gtgacaacaa 


aaaaatattg 


360 


ga 




402 



<210> 1418 
<211> 969 
<212> DNA 
<213> B. fragilis 



<400> 1418 

gaaaggaggc agacgatgaa tagaatattg actaccaatc atatattaca tgttgtcagt 60 

gtttttacaa ttctgttatt gacatcttgc gaacataagg atttatgcta tgaccattcc 120 

gatgctactg aaatacaggt tgttttcgac tggacgaatg ctcctgatgc cactcctgag 180 

accatgcgcc tttacctgtt ccctgtgggg ggaggcactc cccatacata cgaattcccc 240 

gattaccgtg ggggtcgtat taacgtacct gcaggccgtt acaaggccct ttgtatcaac 3 00 

tccgatacgg agtccatact ttaccgcaat attgactcgt ttgacagttt cgaggcttac 3 60 

gctgcggacg gtgtcctgaa tgtgaggtca tcttcgtccc ctcgtgcgga aggtactgcc 42 0 

gatgagcgta tcgccaggtc ccctgaccgt ttttacagcg cccgtcttga cgatgtcacg 480 

attgaactct ccaaagagaa ccggacggta accctctatc ccgaaccatc ggtatgtcgt 54 0 

tgccgggtca cgataacgaa tgtgtccaat cttaaataca tctcctctga tggtatctcc 600 

ggtgctcttt cgggtatgtg cggagggctg ttggtaggtc gcaacgaagc tacgtccgat 660 

cccgtgaccg tccctttcgg ggtggtttcc gacggtgcct ctactctgac ggctgatttc 72 0 

ctggtttttg gacaaatcgg tcccagcggc tatcccgcac ataaattagt tatttatgca 780 

atcatgtctg acggcagtaa gaattactac acgttcgatg tgacgcgcca ggttgatgaa 840 

gctgctgatt cccacgatat ccatattacg ttggacggac ttccttttcc taaacccatt 900 

gttaacggcg gtggttttca tccggcggtt gacgagtggc agaatgtcga tgtggatgtt 9 60 

tccatgtga 969 



<210> 1419 
<211> 729 
<212> DNA 
<213> B. fragilis 



<400> 1419 

gaaaggaggc atgaaatgaa aaaggtatat actgtaatgg 
ggaatgctca tgctgctttc gtcatgcggg cacaaggact 
catgccccga aatcggaggt gcgcatagag gcacggtacg 
catgagggca gtaccgattg gaagagccat tccacgtggc 
tacgacgcgt tgcgccccgg aataccggat ggattgcgcg 
ggctcggacg agatcatcaa cacagctccc gagggggatg 
gaacactccc tgctgttcta caacaactac acggaataca 
tcgttcgctt cggctaaggc caccacacgt actcgtaccc 
tcctacatgg aaacggcgtc tgaaaacact gtgaaccagc 
tacatggagt catactgtgg cggaacgctc gaccgaaacc 
gcacccgatg gtgttcacct atctggtacg ttacgagttc 
gtctctggca cgcggtgcct tggctggtat ggcgcaggcg 
cacgtctga 



aatacgttcc 


cttgctcgcg 


60 


tgtgtttcga 


ccatgacgtg 


120 


aaaaggagtg 


gcaatatacc 


180 


ctgaatcttt 


cggtatggaa 


240 


tacaagtgta 


caacgcggat 


300 


tcgtttatat 


gcggccaggt 


360 


tagtgttcca 


cgagatgcag 


420 


gttcttctta 


tcttggcaat 


480 


ccgatatgct 


ttaccgcagt 


540 


gacgtgatac 


ccgtcaccat 


600 


agccacgggg 


tggaatacgt 


660 


gtatggttga 


acagcggaca 


720 
729 



<210> 1420 



568 



<211> 204 
<212> DNA 
<213> B.fragilis 



<400> 1420 

ttgtgggtta attttagtgt ctttatcttt gccgccgtaa ataaacaaac aactaaacca 60 

gatttaagga atacagtttc tataacccta aaaacattag agttttgctt tccttttttt 12 0 

cctatccatg caggcactac aacttttcaa aaagagtcga tatgggtgtt aagtaagcag 180 

gtaataccgc tatccgtagc ttga 2 04 



<210> 1421 
<211> 651 
<212> DNA 
<213> B.fragilis 



<400> 1421 

tgttttccta accggcataa agatatcatc ctgttgtatg 
ggtgtatatt tccttatgat ctgtctgacg ggacatggaa 
aaccatgatg ggaaaagcgg aatctatttc ctctcccacg 
atcacgggag aatcgagaaa caagaaacaa gaaacaagaa 
atcttccata tcatcaagga caatttgctg ccgtccggtc 
gatttcatgc ggctgacaca ggagaagtcc aaaaaggtca 
gaggtcagcc gtgcgccgga acacaccaaa gactttacta 
tgggacgact tcatgggtga gttcgacaac atcgaactgc 
tattccccga aaaccaaggc agcaagggca cggtgtggct 
gcattcccca tcttcacggc acgttctgcc gtatcgacga 
accatgacat acacctgcga gcacagcggg cagccgggcg 



atgacacatc 


acttaaaatc 


60 


caaagaagat 


cgggagaagg 


120 


ggataaatga 


cattcaatat 


180 


acaagaaaca 


tcccgagctt 


240 


tggatgccac 


gggtatatgg 


300 


aaaattccgt 


cattcacatc 


360 


tcgacgactg 


gcggcggttg 


420 


ttgacaagaa 


cagcaaaacc 


480 


gtatctggaa 


tccaagagag 


540 


acggggaaat 


gtcaacaacg 


600 


tgtggccttg 


a 


651 



<210> 1422 
<211> 1296 
<212> DNA 
<213> B . fragilis 



<400> 1422 

agtaaagata acatgcatat gagaggttta atagtatcag cgttcttgtt gctcggatgc 60 

atccaggcgt tcgggcaaga gaaccggaag gaggtctgta tcggattccc ggtcggcaac 120 

tcgacactgg acacggctta cggcaacaac gccgcgcgcc tgtccgaagt ggtgtcgttt 180 

ctggaaagtg tgaaaaaaga cagcacgctc gaattgaccg gggtgtcttt ctgtggttca 2 40 

gcttcgcccg agggcggttt tacagtcaac aggatgctgg cggaaaaacg ccgtaattcc 3 00 

ttggagcgtt atgtacgtga acgcgtatcg cttccggacg gtatcatttc acgtcccgaa 3 60 

ggatttatcg cgtgggaacg cctcgcggag ctggtcgaag tatccgacat gccccacaag 42 0 

gaagaggcgg tggacgtgtt gcgcaacgtg cctgaattta cctatggtaa taaaggtgta 480 

ttggttgaca gccgcaagaa acatctgatg gagctgcaat atggccgtac ctggcattac 540 

atgcacaagc atttctttga ccggatccgg aatgccagtg tcattctcgt gaccgtgcgt 600 

caaaaaccgc taatcgagga gaaaacggtt gtcaaggaag aaccggttgt gccgactccc 660 

gcagacgaca cgacaaccgt tgtggagaaa gcggatacgg tcgtggcagt ttcctctgaa 72 0 

acttcaaaac ctttctacat ggctctcaag accgacatgc tctatgacgt actggccgtt 7 80 

cccaatatcg gggtggaatt ttacttgggc aagaactggt caatcagtgg caactggatg 840 

tatggctggt ggaaaaagaa cagcaaccac cgttattggc gcgtctatgg cggtgacctc 9 00 

gccgtgcgtt actggctcgg gaagaaagcc catgaaaagc ctcttacggg acatcatata 9 60 

ggcatatacg ggcaggcgtt cacttacgat ttcgagtggg gaggcaaagg ttacatgggc 102 0 

ggtgaacccg gcggaatgct ctgggacaag acgaattacg cggctggcgt ggaatacggt 1080 

tactcgctgc ccattgcaaa ccgcctgaat atcgacttta cgcttggcgt gggctactgg 1140 

ggaggaaaat actacgagta cgtccccttg gacagccact atgtatggca ggccactaaa 1200 

aaccggcact ggttcggccc gacgaaagcg gaaatctctt tggtatggct tctcggaaga 12 60 

ggcaacagca ataataagaa aggaggcatg aaatga 12 96 



<210> 1423 
<211> 594 



569 



<212> DNA 

<213> B.fragilis 



<400> 1423 
tatcaatgta 
aaaacaagga 
gaagcgagac 
gaactgcttt 
ttttacacgg 
tgccactttc 
gtagttttcg 
gccccaatcg 
gtaagcatcg 
tctgataccg 



aaaatagaaa 
aactccttga 
gtatcacatt 
cagaactgtt 
atttcggtaa 
aagatcatgg 
ccacactcaa 
tactcggcaa 
ggaacaatgc 
tcgtcggtgg 



agaaataaat 
tacagaagaa 
tcaactcaac 
cggttatcgt 
gaacattact 
tgggattaca 
ccacggactg 
gaacgtgtgg 
cattgtcgca 
agtgccggca 



atgacgatca 
atccatcaat 
acgacatatc 
gttccctctt 
attggcgaag 
atcggtgacg 
ctacccgaag 
gtaggctcca 
gcgggagcag 
aagtttatca 



gagaatttaa 


agagcacgta 


60 


tcatggacat 


catgagtaac 


120 


atacgcccaa 


cgaggtacga 


180 


catttcgtgt 


atttcctccg 


240 


atgtgtttat 


caatgcctgc 


300 


gttgtcagat 


cgggcataat 


360 


aacgcaagtc 


cacccaaccc 


42 0 


atgccaccat 


tcttcaagga 


480 


tagtaaccaa 


agatgtcccg 


540 


aaacaatccg 


ataa 


594 



<210> 1424 
<211> 267 
<212> DNA 
<213> B.fragilis 

<400> 1424 

attaagaaca aaaaaattaa tggaatgaat atgaataaga agatgtatat attgccgggg 

gatgagcgca tagcagcctc cgatgccaaa gagtttgtac atgaacttcg gacgggcagt 

tggatggatt ccaactgcac agatgaacag tacatgtgca attttgccga acgttacgtg 

attcaggcag gtgtgaggat tgccactgat acaccggaga atttccttgc cgatttgatt 
cggacaggat acgccaaaga gatgtaa 

<210> 1425 
<211> 2073 
<212> DNA 
<213> B.fragilis 



60 

120 

180 

240 

267 



<400> 1425 
aatatgataa 
cccgatacaa 
caaaacacac 
gagaatgaaa 
gattatgggt 
gctccatcag 
attgacttgt 
ggacgtaatt 
ggtacgtatt 
tatacaaagg 
ggctttttta 
cgaatcgggc 
gaatattcca 
gaagccgtaa 
aacatacggt 
caggacaaga 
aagatacggc 
tatcattgga 
agtcgtttta 
gcattttatc 
cgttacgatt 
aatggggaaa 
cctaggttca 
aaaggatata 
tcccccgaat 
ctatcagctg 



aaagattatt 
tacaggttaa 
ctaatcgtga 
tatcgggagc 
ccaagcagaa 
taggctttta 
cggatataag 
ctattggtgg 
tccggttggg 
tgaacgagca 
ccaatttgca 
tcacatggaa 
atcaaggagg 
actacaacaa 
ataacggccc 
tgggaatcga 
aacatatgta 
taacgggtgc 
cagataccac 
atcaatctac 
atgaacacgc 
caaaatcgct 
gtatgcagta 
aagccggagg 
ataactggaa 
atttgagcct 



tttcctccta 
gcgtatcgat 
gccactttct 
taaagactta 
ttctccggtt 
cgtagatggc 
tagtatagaa 
aaccatcaat 
atatggcagt 
gttaggttta 
tacccataaa 
acccgcagcc 
atatccatac 
tgaaggatta 
ccatatcagc 
tcaagatttc 
ttgccaggaa 
gtttgttttc 
ccgacacctt 
actcgatctg 
ccgatgcgat 
tgaacaattc 
cctttcttct 
attcaatgtc 
ttacgaaata 
gttctatata 



cctttctcta 
ctggatgaag 
atctctacct 
agttccttac 
tatattcggg 
attccctatt 
gtacttcgcg 
gtatataccc 
tacaatgata 
tcctttagcg 
aaggcagata 
cattggacta 
ggattgtata 
taccgacgaa 
ttcaacagcc 
tcgcctcgca 
tttacgatta 
cgacagacca 
actaatagtg 
ttgcaaggac 
ttttccaaag 
aaccgatcgc 
cacaatcaat 
tctttcctca 
ggtaccaaac 
gattggcgca 



cgatagtctc 


ggccaatgag 


60 


ttacaatagt 


agcttttaaa 


120 


tagataatcg 


cttcctgaaa 


180 


ttcctaattt 


ctatatgccc 


240 


ggataggagc 


caaaaaggat 


300 


ttgaaacgtc 


cgctttcgat 


360 


gaccgcaagg 


cacactctac 


42 0 


attcgcccct 


cgattatcaa 


480 


tgcgattaat 


agcttcgaac 


540 


gtaattatca 


tcacaatgat 


600 


aacttgataa 


cggagccgga 


660 


cccgcttcat 


aacctcctac 


720 


atgccgacaa 


gggaacaaca 


. 780 


atctgctaac 


ctccggaatc 


840 


aaacatccta 


tcaatatata 


900 


atatattcta 


tggtcaaaat 


960 


aatcggtcaa 


taagagccgc 


1020 


taaacaggaa 


agttgacctt 


1080 


gaattcccac 


acaagggatc 


1140 


tgtcttgttc 


tgtagggtta 


1200 


tacagcagcc 


attaaatgga 


1260 


ttcacttcgg 


gcagtttact 


1320 


tgttctatgc 


ctccgtatcc 


1380 


acaatgacga 


ctacctttat 


1440 


tatcattcct 


gaacaatcgg 


1500 


atcagcagat 


aaccaatact 


1560 
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attccaactg taggtaacgt aatccgaaat gccggtcgct ctcgtaataa aggcatcgaa 1620 

gccagttttc aagctcgtcc gacgaaatca tggatgatgt atatgaatta tggatataca 1680 

gatgcacgat ttgttcacta tcaaaaagaa gaacgcggta tcctaaaaga ttatgaaggt 1740 

aactatctgc ctatggttcc ccgccacact ttctctttga caaccggata ttcattttat 1800 

gacatctgtt cctggatcga ccgccttacc ctcaacgccg gcgtatcggt aacaggtccc 1860 

atttattggt atgaagataa ccacccctca caaagcccgt atgcattggt caatctaaga 1920 

attagcataa acaaaggatg ctttacatgg gaagcctgga gcaagaatct cacaaatacc 1980 

gactacctga gctactactt cgtaaccagt aaagcctatg ctcaaaaggg aaaacccatt 2 040 

accttaggta catcagtcag catcagtctc taa 2 073 

<210> 1426 
<211> 252 
<212> DNA 
<213> B.fragilis 

<400> 1426 

taccccacaa acacatatat ttacataaag aagaatacta tttatatgta tcagacaatt 60 

tcagaatacg agtccctgca gccggttaca tatcaaatag acacaaaccg cttcgaaaaa 12 0 

aatcaaataa aagactccgc aaaagaacag tataaatcag ttaccaacct ctacttacca 180 

cgaaacgaca aaaactgtat atgcacgtca gtcccgaacg ttaataccca agaagccaag 2 40 

cctggcagat aa 2 52 

<210> 1427 
<211> 696 
<212> DNA 
<213> B.fragilis 

<400> 1427 

ctatataaaa agagcttaat tatgaagaaa attaaattta tggctttgtt tctaagcatg 60 

gcgcttgttt tcggaagttg tggaagcatg aataatacag ctaagggtgg tgtcatcggc 12 0 

ggtggttcgg gagcggccct gggagctatt atcggtggta ttgccggtaa aggaaaaggt 180 

gctgctatcg gtgctgcagt aggtactgcc gtaggtgccg gagcaggtgt tctcattggt 2 40 

cgtaagatgg acaagaaagc tgctgaggct gcaaagatca aagacgcaca agtagaacaa 3 00 

gttactgata acaatggtct ggctgccgta aaggtaactt tcccctcagg tatacttttt 3 60 

gcattcaact cttctgcact aagtgcagca tctaaacaat cattggctga atttgccaat 42 0 

atcctgaaag aagatccgac agtcgatgta gccattatcg gtcataccga taaagtaggc 480 

agctacgaag ctaaccagaa agtatcggcc aaccgtgcat acgccgttga aaattatctt 540 

caggcatgtg gcgttaaacc ttaccaattc aaaaaggtgg aaggtgtagg ctactcacaa 600 

tacaacgagt cggaaacacc ggaacaaaac cgtcgtgtag aaatatttat gtacgccagt 660 

gaacagatga ttaaaaacgc tgaagccggt aaataa 696 

<210> 1428 
<211> 1275 
<212> DNA 
<213> B.fragilis 

<400> 1428 

actctgaaaa agatgaaaac gcaatggata agatcaatcg gatgtatact ggcagttttg 60 

attctgtcgg gcatcatgcc actggctgca caggacaatg cggaaagata caccacgatc 12 0 

agtggagtgg tcaaagacaa actcaacaaa aagaaactgg agtatgtcaa tgtatcgata 180 

ccgggaagca gtgtcggtac cgtcaccaac gcagacggtg agtttactct aaagattccc 240 

gagtcggttc aggccaaaga cattgaagcc tcacatgtag gttacctcaa ttcccgtatc 3 00 

cctttaaaag aagaaaatcc cacagaacgg attgtctggc tcactcctta tgccaacctg 3 60 

cttagtgaaa tcctggtaag agccagagat ccacgcagca ttgtggaaga agcacttcgc 420 

aagattccgg ccaattatag tccccagagc aacatgctca caggattcta cagggaattg 480 

gctcaaaaag ggcgtcgtta tatcaatatt tcagaggctg taatcgatat ttataaaacg 540 

ccctacaatg aaactgccga acacgatcgg gttcagattt acagaggacg cagactgttg 600 

agccaaaaac agagtgacac actggctgta aaattactcg gaggccccaa tatggccatt 660 

tatatggata tagtaaagaa cccggactgc ttgttggctc aagaagacct attgttctac 720 



571 



gaatttcgaa tggaagaccc gaccagcatt gacgaccgat cccagtatgt catcagcttc 780 

cgtccaagag taaaattatc ctatccctta tgctatggta cactctacat cgataaagag 840 

cgactgtcat tcacacgcgc cgagtttaac ctcagcatgg atgataagaa taaagccact 900 

caagctatct taagaaaaaa acctttcgga ctgcgtttca aaccggtaga agtatcatac 960 

ctgatatcat acaaaaacct ggaagggatc acttacctga gttatatccg gaacaatatc 1020 

cgctttaagt gtgactggaa gcgtaaactg ttttctacca actataccat cttatcggaa 1080 

atggtggtta cggacaggaa agaaaacaat attacagcta ttccatataa agcagcattc 1140 

aaacaaaatc atgtattctc agacaaagtg gataacttta ccagtgacaa cttttgggga 12 00 

ggctataata tcatagagcc tacagagtca ttggagcatg cagtaaacaa attaaaaaaa 12 60 

cagcagaagc agtaa 12 75 

<210> 1429 
<211> 951 
<212> DNA 
<213> B.fragilis 

<400> 1429 

caagttatga atgaaacaat tagacgcatt ttagccgaga gtggaacaaa aacctcaaaa 60 

atccgtaagc ttcttctgac cggactttca caccgtgaaa ttgccgacct cgttacccgt 12 0 

ggaaaccgtg gcttcgtgtg gaacgtctat aagagaatga gggacgaggg cctgcttccc 180 

gcttcacaga cagcgactgt cttaagacca gaacccgact atactttcaa ccgttgcttc 240 

ggggttgaga tcgaagccta caactgcccg agacagacct tgacggatgc gcttcgggag 3 00 

actggcatcc ctgtggaaat tggaagccgt aatgccgaga ccaacagcaa ctggaaactg 360 

accacggacg gaagtttgga gggaagccat acttttgagc tggtcagccc gatcctctgc 42 0 

ggtgagcagg gtttggaggt actggagagg gtatgctggg tgctggacgc atacaatgta 480 

aagataaata gcagttgtgg agtccatgtg cattttaatg cgggtgactt taatcttaca 540 

acttggcaga acttaatcct ttcctacaaa catgccgaaa ctgaaataga caagttcatg 600 

cctgcctcac gcaggggaaa cagaaatacc tactgccgtt ctctcagagc gttctccgat 660 

gaagatatca gatcggcgga aagtatcgag tcactacaaa gactcttcgg cagcaggtac 72 0 

atgaaagtaa accttgaagc ttattcacgt cacaggacag tggagttcag acaacactcg 7 80 

ggaacgatca atttcacaaa aatagagaat tgggttagat tcttgggaag attgattatc 840 

tttgcatcta catcttcgct tcctgcggga atcagactgg aggattttcc tttcttggag 900 

gaaaaacaaa aattatatta taaattaaga acaaaaaaat taatggaatg a 951 

<210> 1430 
<211> 1206 
<212> DNA 
<213> B.fragilis 

<400> 1430 

aataatatga atttcaatga aataaaaaat agacttctta 
gtttttccct tttatgttca tgctcaaaac gataagatgg 
accaaagcct gggaaatagg ggtaggtgga gctcttatca 
tcgaattttc gtcaggtcga tgggaactat ctgtatcgaa 
ggcggtatcc aactctatgc agctcgtgaa ttgaatcctt 
gggacattgg gactggcaag aaaacaagtt gaaacaggcg 
tatatggccg gtccgggact tcaattccgg ttaaccccat 
gaaccttatt tacgcgtagg tgttaactac ctccatcatg 
ggaaagtttg aaaatgatcc tataggagaa gcagaatgga 
aaagagaaaa taggatctaa acaatcctat ttccccttat 
gcttggctta acgatcattg gggagtagga ttacagggag 
aaaaaacaaa cgcgttttgt tcaggcttcg atgcgtatta 
acaaaacgtc ctatgccggt tgtgcaatat atagaccgtc 
cgaattgttg aaaagagaat tgaagtgccg gctgtggtgg 
ttcgataaca ttcattttgc gtttgataag gatgtgatta 
ttggataaga ttgcagatct gttgaaaagt tatccggata 
tatacagatg caagaggaag cgacaattat aacatagatt 
gctgtgtata gtgcattgct gaaacgacaa gtacctcaac 
gtcggatatc atgctagttc agtgccggct tcaggtccgg 



taggagcatt 


ggccatcact 


60 


aaggacaaac 


agcacataga 


120 


actgggacag 


agtgactttc 


180 


tgaatatcga 


tcatcttttt 


240 


ggttttatct 


tgatttgcag 


300 


ggcgtaagtt 


tgatttcatg 


360 


tgtttaaatc 


aaaatatgta 


420 


atttttatgc 


aattaatgca 


480 


catcatccaa 


tccttggaac 


540 


ccttcggagc 


cggagtacaa 


600 


aatacatcat 


gcctgtcgat 


660 


tgttccgttt 


gggtggaagt 


720 


cggttgatag 


gattgtagaa 


780 


aaagtcatgt 


ttgtgattta 


840 


cttccgaatc 


tgaaatcact 


900 


acaatttctt 


gataaccggg 


960 


tgtcgaaacg 


ccgcgctaaa 


1020 


atatgttgaa 


atggcgcgga 


1080 


ataaagtcag 


gatgggtgat 


1140 
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cgaaaggtgt ctattgagag agtgacgaat tcagattatt ggggttggtt aacgaatgaa 12 0 0 
gaataa 1206 

<210> 1431 
<211> 906 
<212> DNA 
<213> B.fragilis 

<400> 1431 

aagaatatga aaacaactca gagaacggcc ggatggtatc atgtgatggc agcggtgaca 60 

gtaatgatat ggggaacaac tttcgttgct actaaagttt taataaaata tggcctgtca 12 0 

cctgtcgata ttttattcta ccgtttttta ttggcatata tttgcatctg gtttttctct 180 

cctcgtgtgt tgctggctaa gagttggcag gacgaactgc ggtttgtagg actcggacta 240 

tgtggaggtt cgctctattt tgtagccgaa aatacggcat tgggtatgac gcttgcttcc 3 00 

aatgtatcgt tgattatctg tacgactcct attctgactg cactgttggc accctttttc 3 60 

tataagggtg ataaattaaa agcacgtctg ataggcggtt ctctgatggc gcttatcgga 42 0 

gtgggactgg ttgtgtttaa tggtagtttc attttgcagc ttagtccggc cggtgatatt 480 

ctgaccctga tagctgcatt aatgtgggct ttttattgct tgcttctcag gaggatgaat 540 

actcattatc cgacattgtt cattacacgg aaagttttct tttatggttt ggtgactctt 600 

ttacccctat tcttagtgta tcctttacag acggatatac atatcctgtt ccggcccgtt 660 

gtcgctctaa atctgctttt tctgggggtg attgcttcga tgctgtgcta tattatgtgg 720 

aatacggcag tgaaacaatt gggagtggtt tgtgccacca gttatattta tgtagttccc 780 

cttattactt tgctgacctc tgccattgtg atcgacgaaa ccatcacaat agttgcttta 840 

ttgggatcgg cactgattct gagcggagta tatattgccg aaaggggagt gaacttgaag 9 00 

aaataa 906 

<210> 1432 
<211> 234 
<212> DNA 
<213> B.fragilis 

<400> 1432 

ctgacaatga ataaagtggt atatgtacat ctcgtttttg agaagaaaga ctattttttc 6 0 

ggcagcattg ccgccatcta tgactatttg agtgcggatc agatcggagc cggttacaat 12 0 

acgctccgga acgttcggtg gaaagaaacg tcagtgtatg ttactccaaa agccatcata 180 

aagatcggaa aacttcttcg ggcaggcagt tgcaaaaagc aaccgataaa ataa 234 

<210> 1433 
<211> 561 
<212> DNA 
<213> B.fragilis 

<400> 1433 

tctgtaacga aaatgaatac gaataaagaa tttttgacta aaatgattcg ggtttctcca 60 

agcgtaaaaa agcgtatgga aatttttcag ggaggtgatt ccgccaattc ctgtattgac 12 0 

agaatgatta cattttttga aatcacagga ttcaatcccc gctacgcatc ccggaatccg 180 

acggcactgg tggaaaagag aattgaggac gttgtcagaa tcatcaagtc ccaggaacgg 240 

gatatactca agcccgtact tgagaaactc tccgccataa acaacacccc gcaggagtca 3 00 

cctgactatg cccggttgat gaacgagctc cgggatctga aagatgaaaa ccggaaattg 3 60 

aaggaaaggc ttcaggcgga tgatctccat acccaagacg ccgccgtata ccaggacaag 42 0 

ctcaaacgtc tgggcggcct gctgaaatac cagcttgatc cggagaagtt tccaaggata 480 

aaatacagcg atgatgtaag agtccccgtc aacaccctgc agttgcttat caagaagatc 54 0 

aacgaggaat atgtattgta a 561 

<210> 1434 
<211> 459 
<212> DNA 
<213> B.fragilis 



573 



<400> 1434 

aggaaaacga aacccggaaa aaagaaacgg ggtgccagta ccatcagcca gcagacagcg 60 

aaaaacgtct ttctttggcc acaatcttcg tggatacgaa aaggatttga ggtctacttt 12 0 

acatttctga ttgaaacttg ctggtcgaaa gaacggatta tggaagtata tctaaactcc 180 

atcgagatgg gtaaaggtat ttacggtgct caggcaaccg ctaaatataa atttaaaacg 240 

acagctgcca aactgacccg gggacagtgt gccctgatcg cagcaacttt accaaatcca 300 

atacgattcg actcggcaca cccctcacct tatatcaaac gacgccaagg acaaattctg 3 60 

cgactgatga atctggttcc gaagttccct cctgttgata aggaaaaagc gaaaggacaa 42 0 

gatacaaaaa aacaaaagaa taagaaaaag aagaaataa 459 



<210> 1435 
<211> 615 
<212> DNA 
<213> B. fragilis 



<400> 1435 

ctaataaaga atacaataga ctcccccaat aatgacagat caattatgta caaaattata 60 

ttcgtatttc tggcaataat gggcatagcc actgcatcat gtgcccaaca aaaacaaggc 12 0 

gcaaacagaa agcagcccaa taacaaagtg cttatagcct acttctcggc gacaggaact 18 0 

acagcaggtg ctgctgaaaa attgtctaag gttacaggtg gagaacttta tgaaattact 240 

ccagcccaac cctatacaaa tgctgacctc aattggaata acaaacaatc gcgcagttcg 300 

ctggaaatga atgatccgaa gtcacgtccg gccatccgga aatcttccat agatatcgcc 360 

gattatgacg tgattttcgt cggctatcct atctggtgga atcttgctcc acgtattatc 420 

aatacattca tcgagagcta tcatttgaaa aacaagacaa tcatcttgtt cgccacatcg 480 

ggaagcagta gcatcactaa cagtatggca actctgaaga aaagttatcc cgaactgatc 540 

tggaaagagg gaaaactgct gaatggaatg aacgaaaacg atatccgcga atggatcagt 600 

aaattggact attga 615 



<210> 1436 
<211> 279 
<212> DNA 
<213> B. fragilis 



<400> 1436 

ctatcctgga acaattctaa tagtgcatgg cctttgctgt acaagaaaga acaccaaagg 60 

cctatcaccg caaaatacac agggctacac tgcatcaaaa gtaaacaacc gactatctat 12 0 

aaaaatgagt tgtataaatc tgtgcaatct atagtaaaaa agagttttga aactctctct 180 

tttattcttt gtgaaatctg ctccgacaaa cacgaaataa tattaaatca tttgtacgct 240 

ttgtacattt tatctcttcc gatcgttatc tttatgtaa 279 



<210> 1437 
<211> 318 
<212> DNA 
<213> B. fragilis 



tgtataacgg caacttgaaa 60 
gcaagaacat tcatgttgtt 12 0 
aacgtagttc tggtaatttc 180 
ctattaagaa cggtcaagaa 240 
atagctatgg caatgaccca 300 

318 

<210> 1438 
<211> 621 
<212> DNA 
<213> B. fragilis 

<400> 1438 



<400> 1437 
at cccaaacc 
tatgttgcca 
cacaatggcg 
agaacacaac 
gttgctatac 
tacccaccag 



tattgggagt 
ctaaagacag 
accaatggaa 
aagaagcatt 
acggattaga 
aaggttaa 



gggaatggga 
aaaggaggta 
agtaaagcaa 
tgagcgtgct 
tggacgtatt 



tttattcaag 
aatatggcag 
gaaaatgctc 
cgtgaaatcg 
cgtgaaaagc 



574 



atagagactg 
atgaacttac 
tataactcat 
caactgaagc 
gctcacgaaa 
cggacaatcg 
aagattacat 
gaagagattt 
acaaaggaac 
ggctttcata 
gctccgaaaa 



aatcacatac 
ttatcatcct 
tggtaaaatt 
agcgtcacga 
aagaaacact 
atgaaaaaat 
tggaagctta 
cggacgtaga 
tgaacaatgc 
aagaaatgat 
taaaatttta 



ctttggatca 

ggggattata 

gagaaataat 
ccttattcca 
tgagcgcgtc 
tacagctgaa 
tccggacctg 
aaataagctg 
tgtacagaca 
gttcgacttg 



agtttaataa 
atcatcctcg 
cgcgaaaacg 
caattggtag 
atccaagccc 
aatcaactta 
aaggccaacc 
gctgccgtac 
ttcccttcta 
ggcacagaac 



cgttaagaat 
tcattatcat 
catttgccga 
atacagtaaa 
gcaacggagc 
gttccgccct 
aaaacttcct 
gccgctactt 
acctgattgc 
aacgtgccaa 



caaaagaatt 
tgcttccatg 
cattgatgta 
aggatatgca 
tgtcagtgcc 
cgcaggattg 
tcagctacag 
taattcggcc 
caacatgttt 
tttagaagag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

621 



<210> 1439 
<211> 1311 
<212> DNA 
<213> B.fragilis 



<400> 1439 

gtacgggctg aatctggaag tacgcttgaa gaacggaaaa atcaagtcgt tcgatttcga 60 

cgtgaccgac caggtggtgg cgcaaccgca gggaggtgtc atcgtggtga agggcatcga 12 0 

gatttccgac gaagagggta cggaaggcgg ctccggcttc gacgtggatg tggacgactg 180 

gggagattac gaggacatcg aacttcctct ttaatttgca tagagagtta tcaaaacatt 240 

tttattcaca atttaatttt taatcaaatg aaaaagatct ttttgattgg attagcagca 3 00 

acagccatgt tggcaagttg cagcaacgac gagaccgtgg aaatggcaca gtctaaggcc 3 60 

atcggtttca gcaacgcctt cgtgaacaac ggaacacgca gtatcgtgga tccgagtttc 420 

acatcaacaa gtttggaaga ctttgccgtg tatggtttca cacgggcagg ccagatcttc 480 

aagggtgaca aagtgtataa agaaagtttg gaatctactc cccaatggtc atacgatgta 540 

ttgcagtact gggttccgga caacacttac actttcggtg ccattgcgcc ttacagtgta 600 

gcgacaaatg ttttcgatgt agcattgcct gaaaatgcca caaaggtaga aatgaaggtt 6 60 

gctttcacca acactgatgc agaccaggtt gacttgcttc acgcagctcc cacacaaatt 72 0 

gctggaacag aagtaacgga gacatacgca actcctgtta gcatgacatt cgaccaccag 780 

ctttcaaaag tgaaattctc gttcgagaat gcagtaggtg agggctacaa tgtaaaagta 840 

agcaatgtga gaatcacgga tgcttacaca aaaggtactt tgacagtaac cgctgccggt 9 00 

aacatttgga gtgctcaggc ggataacaat cttatattga acttcggcaa tgtggtagcc 960 

aacgatgcta ccgctgatga agctgctgtc attgctaacg ctacaacttc tgaaagctac 102 0 

aacgaaaagc tgatgatccc gatggctgct accgctacat atactgtaac tttcacggca 1080 

gaattgtatc atggtgacgt attgttaggt tcttacaacc acgaagtaac aattaagaat 1140 

gttgagttca agctcggtta ctgctatgat ttcaaggcta ctttgacttc tagtaatata 12 0 0 

acagacaaac cgcttaatcc tatcaagttt gaggttgaca atattactga ctggaacaaa 12 60 

actgacattg acaaagactt ggccgttccc acaacccaga gcggcaacta a 1311 



<210> 1440 
<211> 222 
<212> DNA 
<213> B.fragilis 



<400> 1440 

ggtcagcaag ataatttttt cctgtatacg tttgtctacg caaagtataa gatgtataca 60 

tataataggt taattatcaa cgctatgcgt cgattcatgt tcccgtgtgc gggaactcct 120 

cttgctcttc cttatcagga atttactaag ggacctccgg cccttagccc gggacgcttc 180 
cgagaggccc ggaacatcag gcaagggcat ggctgctggt ga 222 



<210> 1441 
<211> 2664 
<212> DNA 
<213> B.fragilis 



<400> 1441 

aataatatac ttatgagacc ttcccatttt acattacatt cctgcaggga tgccttttcg 



60 



575 



cattttctgc tttttacact cgtttccggc agtatctgtc tttcctcctg tgaggatgac 120 

atggaccggc cttcgccttc ctcacatgtc tctttcacca ccgagatcag ttcctcccgg 180 

actccatcca cccgttccac caccgatact gatactccgc agggtactgt caccgccttg 240 

caaggcggca gtacccctct ttacctgcac acgctttaca ccgacagcat cgccctccct 300 

ccttcggaca gccggcctga caggggcgtt cttacccgtg ccactcccat aaaggatgcc 3 60 

aacatgtacg agagtttcgg tgtctcggct tactcgtata ccggttcctg gagcgaagac 42 0 

aaaactccca actacttcta caacgccaca gccagcaagt ccgacggcgg ctatacgctt 480 

tcctccacct attactggcc cggttcttca tataaaatga agttcttcgc ctatgctccg 540 

acagccaaca ctcagtacgt actttccggc aggacacatg caggttctcc caccatcagc 600 

gtcactattc cgggcgatgt caatgaccaa aaagaccttc tcgtggcaaa gacggatgaa 660 

ctggccggca acaccaacac tgccgtggcg cttagtttca atcatgccct taccgccatc 72 0 

aggttcgtgt gcggagatga catgcaggta ggtaccgtaa agagcgtcag tttgaaaaac 7 80 

gtttattcca aagggactta ccatatgggg acacagtcat ggagcaatgt aggaactccc 840 

gctactttcc cgcagacatt gaataaatcc attacgggaa ccccggacga accccttact 900 

gtcgatgcgc agaccttcat gatggttccg cagacccttc ccgacggcgc gcaacttgaa 9 60 

gtcgtattca ctgataactc cagcatggac cacacgctga ctgccgatct caaaggcacg 1020 

gtctggcctg ttggcaagac cgtcacttat aagatttcca gcagttccat aaactggacc 1080 

tacacacttg ccgttacctc cccggccgac tttacctatg aaggcggtac gcagcaatac 1140 

aatgtgacca gttaccggca gaacaccaaa ggggttaaag aggctgtcgc atggaccgca 1200 

caatattcag aagatggcgg ggcatcatgg agcaatacca gacccggctg gctggatgcg 12 60 

tttaccgtat ccggaaatgg tggagatact ccacaatcat acaatgcaac tgtcatcgca 132 0 

cagactggcg tagaggctaa tcctcaacat acggctgctc tacagaatgc ttcggtcaaa 1380 

ggtactgaaa ctgtcccata taatcttgcc aaccagacta atggtggcac agtggatgaa 1440 

aacaccgcca actgctatgt tgtcagtggt tcgggatact actgtttccc tttagtatat 1500 

ggaaatgcca tcaaaggtgg tacgaccaat acgtccgcat atacctctac ggctccgtcc 1560 

ggaaccacta ttctgagtcc tttcatcaac catgctggta acgcgatcac tgccccctat 162 0 

atacgcaaca acgctgactg cacgcctgct aaagcagaac tggtatggca ggacgcaccg 1680 

aatttggtta ccgatataaa atacaacaat acgggcaacg gcaatatatc tttcacggtg 174 0 

gacaagaata ccatccgaca gggtaatgcc atcattgcca tcaaggatgc cggcgacaat 180 0 

gtcctgtggt cctggcatat ctgggttacc gatgaagata tcaataatgt cattgaaatt 1860 

accaatttcc agggtaagaa gtataaattg atgtctgtca atctcggctg gtgcgatgga 192 0 

agtaccacga attat gccga. acgtagttgc aaggtgaaat tcactgccgg aggagagagt 1980 

cagacaataa ctatcagaca ggcctctaaa tcaatcgttg tcggtggtaa taatccttac 2 040 

tatcaatggg gacgcaagga tcctttcctg ccttcgaatg gattacgcga tatcaataaa 2100 

acctggtacg acaaagacgg caatgctcac acgggaagtc ctaaaacgga ggacttttct 2160 

atcggcgccc cttgtatcac gaattatatt ctcaaaccgg atgtgatgca gagtcaagat 222 0 

tatggcgata atacatatgc aaatctatgg agtgccgata acaatgttta tactgccaat 2280 

gacgaaaatg tcataaaaac gatttatgat ccctctcctg tgggcttcaa agttcccccc 2340 

agtaatgctt ttacgggatt cacaacaacc ggaaacaata caagtacatc ttctgaaatc 2 400 

aacggaactt gggacagctc cttgaaggga tggaattttt acactgactc ctcaaaaaat 2 460 

aaaaccatct tcttccctgc gtcggggttt cgcgactatt cctatggcgg ggcgctcatc 2 52 0 

gttggcagct acggctactg ttggtcggcg gttccgagca tccagtacta cgctcgcaac 2580 

ctgaacttca actcgtcgtt cgtgaacccg ttgaacaact ccagtcgggc gtgcgggttt 2 640 

ggggtgcgtt cttcccaaga atag 2 664 

<210> 1442 
<211> 264 
<212> DNA 
<213> B.fragilis 

<400> 1442 

agaggggtac tgccgccttg caaggcggtg acagtaccct gcggagtatc agtatcggtg 6 0 

gtggaacggg tggatggagt ccgggaggaa ctgatctcgg tggtgaaaga gacatgtgag 120 

gaaggcgaag gccggtccat gtcatcctca caggaggaaa gacagatact gccggaaacg 180 

agtgtaaaaa gcagaaaatg cgaaaaggca tccctgcagg aatgtaatgt aaaatgggaa 240 

ggtctcataa gtatattatt ttag 2 64 



<210> 1443 
<211> 204 



576 



180 
186 



<212> DNA 

<213> B.fragilis 

<400> 1443 

gcgacgctct gggaaacaat tatgcaaccg tccgcattct tacgtgacaa ggtggaaaac 6 0 

tcgaaaaaca gactctcgga agccctgacc aaactggacg ggaccgtctc ttccctgcac 12 0 

gaggcaagca cgatttccgt cagcgcaaag gccaagcgga cgcttgagca gggaagggag 180 

gcgacttgcc agaaaaaggg ctga 204 

<210> 1444 
<211> 186 
<212> DNA 
<213> B.fragilis 

<400> 1444 

gaaaatggcc gagctaatcg gccttgcact gaacctgaag tgcacacgca atcaaaacat 6 0 
aatcaccctc tccgagtaat cccggaatca aaacccaaaa acaatgaaag cctcgaaaag 12 0 
tctctgccta caatgcctgt tcacctgtct gctattattc atagcagccc gggtaaaggc 
ggatga 

<210> 1445 
<211> 516 
<212> DNA 
<213> B.fragilis 

<400> 1445 

aaaccgcaac gtatgaaaag tttaagtttt agaaaagatt taataggagt gcaagaagag 60 

ttgctccgct tcgcttataa attgactgct aatcgcgaag aagcaaatga cctgttacaa 120 

gagacctcat taaaagcatt agataatgaa gataaattta tgccagacac taattttaag 

ggctggatgt atactatcat gcgcaacatc tttattaata attaccgtaa aattgtgcgt 

gatcaaactt atgtagacca aaccgataat cttttccatc tgaatctccc gcaggactcc 

ggttttgaaa gtaccgaagg agcctatgac ctgaaagaaa tgcaccgtgt agtaaatgcg 3 60 
ttgcccaaag aatataaagt tccattttca atgcatgttt ccggatttaa ataccgtgaa 

atagccgaaa aattagaatt accactcggc actgtcaaga gccgtatctt ttttacccgt 
cagagattgc aacaggaact gaaggacttt gtttga 

<210> 1446 
<211> 2235 
<212> DNA 
<213> B. fragilis 

<400> 1446 

gaccggttta gcatcgtgta ttctggaatt atcatccatg gaatagacga tctacctaca 60 
gggtgtaagg ggttgaatgt ccccttacac tcactaacag ccgttcaggc aaacaaatat 12 0 
aaaacagaag aaatgaaggt acataagaaa aatccgagtt ggatggcggg cggcatggcg 180 
acgatgttgc tgtgcacgct acttttctcg tgcaataacg aggactttct cgaaagcggg 240 
aatccggaga aagccggtga caacatttgt tttggcatat cgtccgataa gaacatgcag 3 00 
acaaggggat atgccggtag tgatgacgaa ggatataccg cggaccgttt cgtgttgcgg 360 
tcggacgact cggcagacac gctttgtgtc cgtgccattg tgtcggacgg tatcaacgtg 
tccggctttg agggcgaaca agccttgaca cgcggaacgc ttgttggcaa agacaatttc 
tataataagt tccatgtgct ggcatactgg agtaagaatg gggcgtccat tgaccagttc 
tacatgaaca cgaatgcttc caacgcggct gcttccgttg gaacaggtgc tatatggagc 
acggaacaaa tatactattg gccgggagca gaccattcgt tccaattcta tgcctgggca 
ccgacggatg ccggtggctt gatcactccg tccgatccgt caagcaaaga acttaaatac 720 
accgttccgg cagatgctgc cgaccagaaa gacattgtgg tggctactac caatgaaata 
ccgggcaaca acaatgcggc tgtacctctc aacttcaagc atatctgcac cgccgtccgt 
tttgccgtgg gcagccagat gcagcccggc tctatcaaga gcgtggcttt gaaaggtgtc 
aaaaatgccg gaacttacga tatggttgcc ggtacatgga ctcttggtga tgcgactgtg 
gatttctcgc aggaattgaa caaagaaact accggaagtg aagccaacgg agcggaaatc 



180 
240 
300 



420 
480 
516 



420 
480 
540 
600 
660 



780 
840 
900 
960 
1020 



577 



1080 



acctctgcag aaggcacatt catgatgttg ccgcaaacat taccggctga tgcgatggtg 

gaagtggtat ttaccaatgc taatgcttcc ggtgttgacc gcacactcac tgcgtccatc 1140 

ggaaatacag agtggaagac aggtacaact gtgacatata tactctcaat cacgccggaa 12 00 

tatgaattgg agttcgtttc ccaacctgaa acacaggatg cgcattatgt catttatccc 12 60 

attgccatta aggcggacaa gttcccagaa ggaggttgga ccttgacatc caatgacaag 1320 

gaaaatgtta cttttgttga aaaatttgct gacgacggaa taaaaaatct ggttgaccaa 13 80 

gggtattggc tcaaggatta ttgtggtgca agcactctga ccggttcatc cttcggtgag 1440 

gtcccaatat atgttttcct taaagagaat atatcggaaa aggaccgtga aattgtactc 1500 

tcgctcgctc cggctaatga tccaaatgca aaaccgaaga catttacatt caaacaacac 1560 

tgtccggcat ggaacaatgg tataggagtg gaacgtattc agggaaaaga ctatccttgg 1620 

ggattcaact ggagttcgga tatgaagatt acttattcca tgccatccgg cctttggtca 1680 

ggaattatac atgtactatt tgaaatcttt ggagaccatt cttatgtgga aagtagtgga 1740 

ttggctgtgc taggaacttg gaaagttatt gttaactttg ctaaagttcc gtccctgacc 1800 

atcgccataa gcccgacaga cggcattaca aacacatggg agctatacaa ctttgatggc 1860 

atcaacgaag cgtcaatcat catgagccaa ttggaatcgt ggggaggtgt cccagataaa 192 0 

gagttgccag tcaacccaag cgagtttgta gcatgggctt gtgcaaagaa gaacagattt 1980 

ggggtagaaa gaaagagtaa ttccggagaa acaatctatg taccgacttt agaacaaacg 2 040 

gacatggtat ggtatcttcc tgcacaagaa gaagcacttt atatgaaaga tgacctgtcg 2100 

gagaattatt ggacttccac cgctataacg gacccgggaa ccaccgctta tacatataca 2160 

gcagaaagtg gttctacatc cgggatggat cgtaatgagg taattcatgt tcgtgctgta 222 0 
cgaaagaaac cctag 

<210> 1447 
<211> 1494 
<212> DNA 
<213> B. fragilis 



<400> 1447 

cggggtggta gccggattga ccggttacca cggtcttttt attttcgaag ttatattggc 
ttccatatca ttgatctata tattttattt cttcagaaaa gaatctcaac ccattcattc 
ttaacatata tgtatactga aataatcaat aaatacaatg taccggtacc tcgatacacc 
agttatccac cggccaatta ttttgagcca tttaccaacg cccgctacct ggaggctgta 
cagcagtcga atcaggcttc agagcgtgca ttatcgtttt acctgcatat cccgttttgt 
cggcacttat gccactattg cggatgcaat tcgtatccca tggcacgtcc cgagattatt 
gagtcatatg tagtagcttt gcatcaagag atagatctga ttcttcccct gttagataag 
gatcggccga tcgcgcagat acattatggt ggcggaagcc ccacagccat tcccgttgct 
ttaatcaaag aattgaatgc tcacttatta tcatcattcc cagccatcga ccgccctgaa 
atagccattg aatgtcaccc aggctatctc tcagaaaaag actggctgca acttaccgaa 
tgcggcttta accgtctcag tattggtgtg caggacttta atatcgaggt actgaaaaca 
gtcaatcgcc gcccttcttt attaccgatg gaagatatat ttatcctgct acgcgaaaag 72 0 
ggaataagta tcaaccttga ttttctttat ggtttaccca aacaaactgt ggagaacttc 
acccgcaaca taaagcaggc tattctttta tcacccgaca gactggttat gttcagttat 
gcccacgtgc cttggattaa taagcgacag ttgcttctgg agaaatcagg cctacccgac 
aaccatgaaa aacagacaat gtttgacact gctgccggac tattgcataa atccggttat 
caatctatcg gaatggatca ttttgtactc cccaatgacg agctgagcat cgccatgcaa 
actaaaaaat tacatcgtaa ttttcaaggc tactgcaccc ggcgtactac cgcacaggta 
tatggtttgg gcgtaaccgc tatcagtcag cttgaatcgg cttatgctca aaatacgaaa 1140 
gatattcccc attacatcaa gactatcagt aaaggcgaac taagtattac caaaggttat 12 0 0 
gccctttccc caaccgaaca gctcaccaga gaggttatcg aaaccctaat gtgcaatggc 12 60 
tgtatcgatt ggagagatct ttcaaagcgc ctgcatgtat cggtatccac tttaaaggct 132 0 
gccactgcct acgatgaaaa aaaactatct ggctttgccg atgacggact gatttattat 13 80 
acagacgact atcttgagat gacaaccgca ggttcggcat ttgtacgcaa cgtagcggct 1440 
tcacttgaca aactgatgct ccactctcca cactcatatt caaaaccttt ataa 1494 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 



780 

840 

900 

960 

1020 

1080 



<210> 1448 
<211> 216 
<212> DNA 
<213> B. fragilis 



578 



<400> 1448 

agacaaatca gaccatcaaa aagttcccat 
aaagtaaagg agcaaggagg gggtccctca 
tacactaaat cactacctaa aatcaatctt 
agtccttcag ttcctgttgc aatctctgac 



aaaagatatg ttatatatca tattgtcaca 
cttactcctt cttttgcaca actaaaaaat 
tttattaatg aaaaatacct aatcaaacaa 
gggtaa 



<210> 1449 
<211> 1281 
<212> DNA 
<213> B. fragilis 



<400> 1449 

attatgaaaa gaggaaaacg attgatactt cccttactgg gaactcttat actgacaagt 
cttttttcct gtggtgtaga ccgctggccg gaatattatc cggagacggg acgcgatatt 
tggatagaca gtgtgatgcg tcaggagtat ctgtggtata gagatatgtc atcacctgct 180 
gctccggact atttccagaa accggaagcg tttctgaaaa aagccgtcgc ttccatggat 2 40 
aatggcttta gtaagatcga ctccttgctg gatgaaccca ttccgagcta tggttttgat 
tatactttat ataaagtgct tgataatgat acagcgtata acgctttaat ctcttatgtg 
gtgcccggat cgcctgccga agaagccgga ttgcagcgtg gtcattggat tatgatgatg 420 
aatggagatt atatcactaa aaaggttgaa tcggaattac tgcagggaag tacccgtcaa 480 
cttcagatag gtgtttataa agaagttgtc ggtgaagatg gtgaggtaac cggtggggtg 540 
gtgccgatag gagagacgac aatgcccgct tcacgttctt tggtggataa gcctgttcac 
cgttttgaga ttattccatg gaatgggaaa aaggtaggct atttgatgta taatgaattt 
aaggcagggc cgacgacaga cagtcaggct tataatgatg atttgcgtag ggccttccgg 
gatttccaga caggaggggt aaatgagttt gtattagatt tacgctataa caccggaggc 
agtttagatt gtgcccagtt gctctgtacg atgcttgctc cggctgataa gatgaatcaa 
ttgttggctc tattgagata cagtgataaa cgcgtggaag caaatcagga tttgacattc 
aatccggagc ttatccaatc cggtgccaat cttaacctat ctactgttta tgtactgact 
accaatgcta ccagaggggc ggcggagatg gttatcaatt gtcttaatcc ttatatgaag 102 0 
gttgtattga taggtactaa aacggcgggg gaatatgttg ctacaaagcc ttttgttcat 1080 
ccaacggatc ggtttatatt gaatctggtt gtttgcaatg tatacaatgc agaagaaaag 1140 
tcagattatg ccaccggttt caaacctaca tacgaataca atgaagattc ttatctgagt 12 00 
acttatttgc cttttggcaa tacgaatgaa actttattga atgcagcatt gaaaatcatg 12 60 
agtggaataa cggataagta a 1281 

<210> 1450 
<211> 612 
<212> DNA 
<213> B. fragilis 



60 
120 



300 
360 



600 
660 
720 
780 
840 
900 
960 



<400> 1450 

ataaccattt taatattgaa gagtatgaaa aagaatttta ttacagtaac tcccgatagt 
ggaacatcag gtagcagcaa taccattagt gtagctgctg aacctaatat actcctgaaa 120 
gaacgttctg aaatactgaa ttttaatgca agcggtggag tttctaagtc tgttcaagtt 
atccaaaatt cgatgcctta ctttccaatc agatttccct ttgtactgaa tatgaacaaa 
ttcattggaa atcttaaagt ggattcatcg ggtgtattac aaggagctta ttcgatatca 
gaaataaaaa gtatagatcc ttccgcttat gactcttctt taggagaagg gtggggaatg 
ggaagtactc aaaaaatgct catctacgac cctttcagta atgtaaccga aatcgtagct 
catattattg acagtagtgg aacctatgat gcaaacttta tctcttcaaa tgcaaatatg 
gggagaatat ggggtgaccc tggattatta taccaaaaat tactcaagat tatttcagaa 
gattatacaa aaggaggcta cgagattcgc ataaattcag tgcttgcaat gaaatttgtt 
tttgtggaat aa 



60 



180 
240 
300 
360 
420 
480 
540 
600 
612 



<210> 1451 
<211> 1167 
<212> DNA 
<213> B . fragilis 



<400> 1451 



579 



60 



tctaacatta ttatttttat gaagaaaaat cttttattta cggctatagc tgtagcagtc 
ctggcctctt gttccaacga tgacgtcgtt gatgtaaata atggtagcgg catttctttc 12 0 
cgtgcctctt tggataaggc cataacccgt tccaacgtga caaacttgca aaacctggct 180 
gcattcaacg tgacggccat cggtaacggc gccaattttt tcacagacct gtctgtcact 
tctactgaca acggtactaa ctggacaact gcttctactt actattggcc aaactatgcg 
ctttctttct tcgcctacgc ccctcaaact cccggcggta ctgttagtat agacaatacg 
gcaaagaaga taaccgggtt ctctcccgcg cagtccgttg cagaccaaaa ggaccttgta 
atctcttaca atacgggtac taagggctcc aatgaaaatg ccggtgttgc catgaacttc 
aaacatgccc tttcccaaat tgtggtaaat gccaaatgtt ccaatgacaa gattaaaatt 540 
gaggttctcg gcgttaaact ggtgaatgcc gcagcaaaag ccgactttgc ttttccggaa 600 
gcggtaacca ataccggata caccttgccg caaggccaat ggagtaactt gtctgaaaaa 660 
gacgatcctt caaaagctta tatgatcagg ggagatgctc ctcttactct gacggctgat 720 
gcccaatcta tcatgttcgg tgataacaac ctcatgctta ttccccagca gctaacggca ™n 
tggaacggta ctgtcgcaac tgccggtgct tatctgtctg tactctgccg tatttacagt 
ttggatggtg gcaatgagac ccttctttat ccagaaccga catctactga tgataagagc 
ggtaaatatg cattctctgc cataggtatt aataccaact gggaaccggg taagaaatac 
acctacacgt tgaacttctg tggtgatggc ggcggcggtg gtaaaattga tccgactccc 
actgatccga ccaagcccac tgatccgaca gtcgatccaa ctcctataga tggtggtagt 1080 
ggtggtgatc ctattttggg aaaaccgatc aaattcactg taactgttga cgattggacc 
gaccaacctg tggatgttcc tatgtaa 

<210> 1452 
<211> 876 
<212> DNA 
<213> B.fragilis 



240 
300 
360 
420 
480 



780 
840 
900 
960 
1020 



1140 
1167 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

876 



<400> 1452 

aatgatgtta tgaaattact ttattttaaa gaacatctgt catgcataaa ctatcaaata 
aatgttaata caggtttcgt ttattataat ttagagaaag atagtgtaag taaaatagat 
aatagtgctt ctccttgtat tcttttcctt cttgatggag aggtgtctat tgatagtggt 
gagtatcaga atgtacatat tgagaaagat aaaatggttc ttattccgca acatgtagat 
aataaaattg aagttatata tgatgcaaaa tgtcttttac tattttggaa taaagacata 
agggtgtgtg acaaagtata tatgaactct ctttcttctt ataaggaaag aaaaaaagag 
atgtgtgtgc ttcctataag agatcctttg caagctgttt taaactccgt tgtcgcatat 
ttatatgcta agatgcagtg taaacatatg catcttatca aacaacaaga ggttttgttg 
gttttgagag gatattatac gaagaaagaa ttatttactt ttttttcttc tatattgggg 
aatacagggc attttgaaga ttttgtaatg aataattata ggaaagtaaa gagtgtaaag 
gaatttgccg gtttgtattg tacttctgag cgctctttta atcgtaagtt ccaaaattgt 
tttaaagaaa gtccttatca gtggatgcaa aaaaagaagg cggagttgat cagagaaaaa 
ataagtgagt cggatactcc ttttcaagag atcgcaatgg attttgattt caattcgcaa 
gctcatttca cttcctactg taagagatta tttggaatga ctcccagcaa attgagaaca 
gaaagtaaga aggttgctcc tgatttggag tactga 

<210> 1453 
<211> 1248 
<212> DNA 
<213> B.fragilis 

<400> 1453 

tcaaacgata tgaatcacat gcgtataaac aagctttcac cgctttcccg gaaagcactg 
aacctcagta catttttttg cttatacatt gcacaagcca tcccgatgag tttcttctct 
acagccatac aagtactaat gaggcaagcc gattactctc tttcttccat cgccttatta 
caactcatca aactcccctg gatattgaag tttctttggg caccgcttgt cgaccggcat 
tgtatcaccc taaaagacta taaacgctgt atcattacat ccgaaatcgt atatgcatta 
ctgatcctga tggtaggcct gctcgatatc caaacagatc tctaccttat cattggatta 
gtatttctat cattgatagc ctcagctaca caagatatag ctacagacac acttgcagtg 
ctctcctttg gtaagtcgga taaaagtttg gtcaacagca tgcaatcaat gggtagcttc 
ggaggcacat tgataggaac aggtatatta ctcctcgttc ttcagcacta tggctggcat 540 
gtggtgatac catgcttatg catttttgta ctattggcaa ttattccgtt attgaaaaac 



60 

120 

180 

240 

300 

360 

420 

480 



600 



580 



aaacatatga 
tggttctttg 
agtattatcg 
aaagagatag 
ctggccggat 
ttcatcctgc 
atgctttgtt 
tataccactt 
acagtactca 
ttgaccggtt 
tatatatttt 



aaataatacc 
cccgtcgtaa 
gaattttatc 
gcattatgat 
tgctggttcg 
ttactacact 
tagggatcgt 
ctatggattg 
cacacttaag 
accacggtct 
atttcttcag 



caaagaacct 
catctggaaa 
ggtgttacgc 
aggtatcgga 
taaaatcggg 
ctattttatg 
cctgctatgg 
tgtacgcaaa 
cggcttacta 
ttttattttc 
aaaagaatct 



tcgaaacggg 
caaataggat 
tcttatctgg 
ggtaccggag 
cgatatcatt 
tgtatttcat 
agtgcctatg 
ggatgtgaag 
atagcctttc 
gaagttatat 
caacccattc 



cacaattcac 
tcctattact 
tcgatttggg 
ctgctttcgc 
ccagaatact 
ggacagtccc 
gaatggcaac 
gaaccgactt 
ttagcggggt 
tggcttccat 
attcttaa 



tgattttatc 
atattatgcc 
ttattcaatg 
atcatctttc 
atttgcaata 
ttcattttca 
tattgtagtg 
taccatccaa 
ggtagccgga 
atcattgatc 



660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1248 



<210> 1454 
<211> 852 
<212> DNA 
<213> B. f ragilis 



<400> 1454 
aaacatataa 
gcccgcaacc 
caggaaaagc 
gacaaggaaa 
gatctggatc 
ttgcgtgaac 
accggaaaga 
gcatacctgc 
cccgcgatga 
acgctgtttc 
caaggtaaga 
gcaggagacg 
atcaggctat 
cagataggga 
gggtactgtt 



tgaaatcttt 
tgaagctgcc 
aacagactta 
ggagaagtta 
tatatgattt 
tggtatggat 
cattcattgc 
tgaccttgga 
aaacatacaa 
ccctgaaagg 
catcacttat 
aagtttgtgc 
taggaaaaag 
ctgcacctca 
aa 



agaaacaatg 
attcctggca 
ctccgagttt 
tctgaccagg 
ctcacgtacc 
aagaaggaca 
ttcaggactt 
agaactgctt 
acgaataatg 
agaagatgta 
cattgccgca 
ggcagctcta 
ctaccgcatg 
aaaagggtta 



gcaaggtata 
gaacacctgg 
ctgtcaactt 
ttgaaatttg 
gaagggattg 
tataatcttc 
atccatgaag 
gtctgtttga 
aaagcgcagc 
ctgctgctgt 
agccgggatc 
ctggacagac 
gaaaacagga 
atgaaagtaa 



agaaagaact 
atgaaatact 
gtctcatgcg 
caggattgcc 
accaaaggca 
tgctggtagg 
cagtgaaagc 
aggctaagga 
tgctggcaat 
ttaaactggt 
ttaccggatg 
tactctattg 
aaacaatttt 
agaagagaac 



gaccgaatgt 
acatgaagca 
ggaacttcgg 
tgcaaggtat 
gatgcgcgaa 
agattccgga 
gggttataag 
gatatcacga 
cgatgatgtt 
gaattgcgtt 
gctggagatg 
ttgtgagata 
tagcaatcaa 
taaggaaagt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

852 



<210> 1455 
<211> 1785 
<212> DNA 
<213> B.fragilis 



<400> 1455 

cttacggcaa 

tccggtaagt 

atgggatgtt 

tatggtgaaa 

atacaaaccc 

gatgtccgtc 

ttttgtgggc 

ggaatgattc 

tatacggcta 

atggcaggtg 

catcccgaag 

attcttgaag 

tggggaaaag 

cgtgtacctg 

atcgatgatt 

ctgaaatgtg 

catgccggta 

ctgcttgtga 

ctggtaactc 



tgaaaaagga 
atcaaccccg 
ttgcccgtgt 
accccaataa 
acatgcttga 
ggttgatgta 
tgaatgaagt 
cgcaggcaac 
ttgctgataa 
tcggacgtcc 
tgttgattca 
tttgtgagaa 
ttcatccgga 
aaattaatat 
tccttggtta 
gcttgcccgg 
ttaatatgat 
tgttgttcga 
cattcagcca 



aattaaattc 
tgtcgaccag 
ggagaccaat 
agcagtgcgg 
ccggggactg 
caaagtcaag 
aaggaatatt 
tttgtgcatc 
gctgattgag 
tgccatgtta 
atatcatggc 
tggtgctgat 
cgtgatctct 
gaaagcctat 
ctttatggac 
aggaatgatg 
attaaagagt 
tgaagtggaa 
gtatgtgaaa 



agtctcgtat 
ttagtgcgaa 
ggcggagcgt 
gcttttacca 
aatggtttac 
catgcccagg 
attccttcca 
accttttcac 
gccggtgctc 
ggacaattga 
catagcgggc 
attattgatg 
gtacaggcca 
atgaaggcgc 
ccgaccaaca 
gggtctatga 
aataatcagc 
tacgtatggc 
aatgtggcat 



atcgggatat 
ttgccccttt 
tcgagcaagt 
aacctttcaa 
ggatgtatcc 
gagtagatat 
tacactatgc 
cggtacatac 
ccgagatttg 
caaaagccat 
ccggattgtc 
tagccatgga 
tgttgaaaga 
gtgccatgac 
aacatatgtc 
tggccgattt 
ctgaactcag 
ctaagttagg 
taatgaatgt 



gtggcagtcg 
gattattgaa 
caatttattg 
tgatgccgga 
cgttcctgcc 
cacccgtatc 
acttgaggga 
agtagaatat 
tctgaaggat 
taaggaacgt 
aatggcttcc 
acctatgtcc 
tgccggtttt 
acaggagttc 
ttccttattg 
gaaaggtgtg 
cattgacgat 
ttatcctcca 
aatggcacgt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 



581 



1200 
1260 



gtgaaaggtg aggaacgctg gagcatgata gacaataata cctgggggat gattctaggt 

aaaagcggtc gtttacccgg tccgttggat ccggaaattg tagcattggc caaagagaag 

gggtacgaat ttacagatga agatccgcag aagaactatc ccgaccagct tgatgaatat 132 0 

cgtaaagaga tgcaggagaa tggttgggag tccgggcccg atgatgaaga actgtttgaa 1380 

ctggccatgc atgacaggca gtatcgtgat tataaatcgg gagtagccaa gaagaggttt 1440 

gaagaagacc tgcaacgtgc caaggatgcg gcattggcca aacaaggttt ctcagaagaa 1500 

gatgtgaaaa ggatgaagcg tgccaaggca gagccaatca ctgcgatgga aaagggtagg 1560 

attatctggg aaatagatgt tgaatcgccc tccatgcctc cggaagtagg gcataaatat 162 0 

gaaccggatg atgtattttg ttatattgcc actccatgga acacttatga tagagtattg 1680 

gctaatttta gtggacgcat cattgaggta tgtgccaagc aaggtgcttt ggtcaataaa 
ggtgatgctt tggcttatgt agaaagatgt gaagaaccgg cataa 

<210> 1456 
<211> 459 
<212> DNA 
<213> B.fragilis 



1740 
1785 



<400> 1456 

atgagccaaa ttggatacaa aagttcagag acttctgttg tttatataat tacaaaaacc 

atgccatatt ttaacggaca actccatttc ttcactttca cgttacatat tcagtatcaa 

tcattaaaat tactgaatat gaaattagac gaaaacattt tgaagacctg tcaaggactt 

gtaatgaact gtaattgtaa ggttttaatc cttaacgtat tgggtgaaca ccgtgtattc 240 

cttgtgaatg atgtacacct aaagacccgt gagtgccgat acaatgaagt ccgtgatgcg ™ n 

caagacatca ccactcttgt cttgaatatc gggcataact ttgccaatgg tatgaccgaa 

cagaccttat tggaacgtac ccaatctatt cacaaggaag atttcaagtt tggaactgat 
aattacctgt ggataacaaa agttgatttg aatagataa 

<210> 1457 

<211> 2319 

<212> DNA 

<213> B.fragilis 



60 

120 

180 



300 
360 
420 
459 



60 



<400> 1457 

tataacaaaa tgaacaatgc taaattatta catttaatgg tttacatctt aaccattatt 

ttagggcagt cttgtacaga agtggatatt acgatgccca aaggaccgaa aggtgataga 12 0 

ggaatgtcag cttatgaatt ttggaaagag aatgtagaga atggagtgat ttcttggcct 18 0 

aagaaagaga ctgaaataac tgattttttt aagtatttaa aaggtaagga cggtctggat 240 

ggaaaaagtg cttttgaact gtggaaggaa gaagtagcta ctggtgctct ggataatcct 3 00 

caccgcccgg gaagtatgtg gcctgtatcc cagaataatc ttagagattt ttggtattat 360 

ctgacaggag cgagtggcga gaatgggcaa acacctcata taggtaataa tatgaattgg 42 0 

tggattggca ataaggatac cggaatacgt gctcagggta gggatggaca gaatggagaa 48 0 

gatgctgttc caccggtagt tacgatcggg gataatggta attggttgat tgatggagta 540 

gatacaggaa aaccttccag aggtgaagaa ggagttgcag gaacaacacc tactgttaca 600 
attggagaaa atggaaattg ggtaatcaat ggaaaagata ccggaaaggc tgcgataggt 
aaagatggaa gatcgccaga ggtaataatc ggtaccaatg ggaactggta tattaatggg 

aaagataccg gtattcgtgc atatggtaaa gatggtgcca atggtaagga tggcattaac 780 

ggtaaggatg gtgccaatgg aaaggatggt gccaacggaa aagatggtat taacggtaag QAn 
gacggggctg ctggaaaaga cggcgctaac ggtaaagatg gtgctaatgg gaaaagtgcc 
tatgaattgt gggtagagag tgttgaggcg ggttgtaaca atactggccc taaagtgaaa 
aatcctcata atccgtcttt ggattgggat tgtggtaaaa caactttaag tgatttttgg 
gagtttttga gaggtgcgga tggtaaagat ggtgcggacg gtaaggatgg aaaacccggt 

gttccgggaa aaccgggtgc tgaagttact attatcaaag gagtacctaa cgtgattgca 114 0 

ctttattcac aacaagaatt tggagagtat gttcgtacaa ccgatggagg agtagcttat 12 0 0 

cgtgtgtatg acgaatctgg caataaggct ccgaaggctg tggttaaggg aattcccggt 12 60 

ttggatccgg ctaaaactta tacagctaat gaagagggag aatttattat tccgaaagaa 132 0 

gatcttcctc aaattgacga tatagatgcc cgatggggga aagttaagga ggtgactatt 13 80 

aataaggtga caaaggaatc tgcagaaaat acttatgtcc ccaacagaat gcagattaga i A A n 
atgatttata ttgccacgtc cccatatctt gattatgaac ataacctaca gtttagagtg 
gaaagaaaga cggatcctag tgcggaatgg aaaacattgc ccagctattt gcctaatgtc 



660 
720 



840 

900 

960 

1020 

1080 



1440 
1500 
1560 



582 



aatgccgtat ttacggcata tcaggttaca aatccggaag acccgacatc tcttgataaa 162 0 

acgaaaaaga tagagagtac tacacctaat atgagtagta catcaatgtc tattaatcct 1680 

aatcgatatg ttaaagagaa tcctgccggc ataaaaaatg gaataactga tttttgggat 1740 

ggaaaagaca actatttctc aatagtaaaa gatacccctt attatggaga aacgatttat 180 0 

tggaatggag tatgtaagat ggcaccttat cagatacctc ctacacttaa aactctagcc 1860 

ttaacaaagg catctgctga aagtggagat gatgtattct tgaataaagc ccagggggaa 192 0 

tttgactttt cgactattga tttcaatatc atatgtaaac atgaattggt aaaaacagta 1980 

aaaccgaacg gaatagatta cattgaacct gaatattatg ctccggaaga ggcgaaggaa 2 040 

ctcctactct gttatgttaa gtttacttat acctctccat tgggggtaca aacagccaca 2100 

agcgaactta atatgtcgag ttacaagaaa cctgagtatg ctgcccttag cccgtacttg 2160 

ggagctacaa tctattcggt aggagcgagt agcactttta tctactctag caatgtgtct 2220 

ttaggagttc tcaagaagaa agcagataac ggtacgtatt atgttgagaa tacatataaa 2280 
gatatgcctg aaattagtgt aacctataaa gaaaaatag 



2319 



<210> 1458 
<211> 549 
<212> DNA 
<213> B. f ragilis 

<400> 1458 

accagcccga tatgctttac cgcagttaca tggagtcata ctgtggcgga acgctcgacc 

gaaaccgacg tgatacccgt caccatgcac ccgatggtgt tcacctatct ggtacgttac 12 0 

gagttcagcc acggggtgga atacgtgtct ctggcacgcg gtgccttggc tggtatggcg 180 

caggcggtat ggttgaacag cggacacacg tctgatgaag ccgctaccgt gctgtatgat 0/m 

tgtacggtag aagatttcgg cacacaggct ttggtgcgtt ctttcggaat acctgatttc 

cctaacgaac attacggcac aagggcagaa cgtaagtacg ggctgaatct ggaagtacgc 

ttgaagaacg gaaaaatcaa gtcgttcgat ttcgacgtga ccgaccaggt ggtggcgcaa 

ccgcagggag gtgtcatcgt ggtgaagggc atcgagattt ccgacgaaga gggtacggaa 

ggcggctccg gcttcgacgt ggatgtggac gactggggag attacgagga catcgaactt 540 
cctctttaa 

<210> 1459 
<211> 261 
<212> DNA 
<213> B.fragilis 



60 



240 
300 
360 
420 
480 



60 

120 

180 

240 

261 



<400> 1459 

agatacgaga tttttataat tgtttctatt ttacaaaaag caatgacagc ttacgattta 

aagaacattg catctgcagg tggaaatatt gtcgttaatg cagaagattt ttcagcgtat 

gatttaaaga atattgctga aaacggagta gcaacaaagg caaagctaac catcaaaaac 

gcaggtggat tatctggata tgattgcaaa aatattgcat cagccaatcc agggaatgta 
acatttgatt ttagcgaata a 

<210> 1460 
<211> 705 
<212> DNA 
<213> B. fragilis 

<400> 1460 

acattgtgga taataaaagg aagtgccttg atactcgaag aatcgccaat tcaacaaaaa 
gtacaaacga ggcacagtgg gcaacctata tcaagtgggc tgttcatctg tgtgtttgta 12 0 
caagcgtttg gcgatgcttt cgagttcaca caggtggcag tccactttct ttgtaataac 
gtaatttctc actggctgtc tgagaaaaac tcgaaaaatt tcgccattat gaaaaagtat 
ttcgttttat cagtatttat tatgctgatt ggagcattta caaatgttca aggtcaaaat 
tctgcaactc ccgataaagg agtgttagtt catggaaacg tccgtttatg caactatgaa 360 
agaggctgtt taataacaga caacgacatg aagcattgga cgcaaaaact tgaaatctcc 42 0 
tacgacggct cggacaagac atacggcatc tatatcaatg tccgagggga tattatcaat 480 
ttgggtgtca aatataaaag tagcggaaca gaatcttata catacgaagg aacggacagg 540 
gtaacaggac gaaaggttgt tgtcgtaaca aaacaaaagt tgagttggta tttgaataac 600 



60 



180 
240 
300 



583 

aatggagttg attctcatac agaggttgaa agtccaaagg ggataatcgt taccgttcct 660 
gccacttata cagtgttttc agtagttcct attaagaata agtag 7 05 



<210> 1461 
<211> 849 
<212> DNA 
<213> B.fragilis 



<400> 1461 

aacaatcact tcattataac atatatcatt atgaacaaag tttttttatt tttattattc 60 

agctttttaa caataacgag tatggcacaa gaaaaaatca aacagacagc cgggcgcgat 12 0 

caacttggtg attttgcccc taaatttgcg gaactcaacg acgatgtcct tttcggcgaa 180 

atctggagcc gcactgacaa actcagtctg cgtgaccgta gtttggttac gatcacttca 240 

ctcattagcc aaggtataac ggataactca ctgacgttcc atctccagtc ggccaagaat 3 00 

aacggtatca gtcgcacgga gatatccgaa atcatcacac atataggttt ttatgcagga 3 60 

tggccgaaag catgggccgc ttttcggctt gccaaggagg tatgggcaaa agatacaacc 42 0 

ggggtagatg caaaggccgc tttccagcgt gaaatgatat tcccgatagg agaacctaac 480 

acagcctatg cacagtattt caccggtaat agctaccttg cacccatatc gcatgaacag 540 

gttaatatct ccaatgtcac gttcgaaccc ggttgccgaa ataattggca cgttcatcat 600 

gcgaagaaag gtggcggaca gatgttgatc ggtatagcag gccgcggctg gtatcaggaa 660 

gagggtaaac cggcggtaga gattcttccc ggtacagtca tacatatccc tgccaacgtg 72 0 

aaacactggc atggtgcaac agccgaaagt tggttcgcac accttgcatt cgaaattccc 780 

ggggaagact cctctaacga atggctggaa cctgtgacta ataaagaata caatagactc 840 

ccccaataa 849 



<210> 1462 
<211> 186 
<212> DNA 
<213> B.fragilis 



<220> 

<221> unsure 
<222> (159) 

<22 3> Identity of nucleotide sequences at the above locations are unknown. 



<400> 1462 

attaatactg tcccatcatc attgcttaca cttacggcac gtaaaaacgg tcagcacgat 60 

acagagtgca agaaaacctg ccgtacagcc gaacgttttc tagagcatgg cactgtgtat 12 0 

gagccgggca ttggccatac aggtgcggat gaaagccgnc aaaagaaaaa tgagcttttc 180 
cattaa 186 



<210> 1463 
<211> 225 
<212> DNA 
<213> B.fragilis 



<400> 1463 

tatttgtatg ttgcaccgga tcacataaaa ttgaacttac agttcaaaaa cggcttttat 60 

cacttaaaaa aacaggatga cggatggctc actacagaga taaatcttgt accatttctc 120 

tcagtaaagt tcttaactgc actcttgttt atacagaaac ttttctactt tcaggaatta 180 

atattgtcga atggatgcaa agcactaaaa actcaaagca attaa 225 



<210> 1464 
<211> 1911 
<212> DNA 
<213> B.fragilis 



<400> 1464 

acctttaatg tgatatttag tatgaaacaa atgatgaaaa aatatctata tatggcagct 6 0 



584 



gtggctgttg taggtacagg cttcctgatg tcgtcttgta aagacgaatt tgccggacag 12 0 

aataccaatc cctccacagt ctcaaaaccg aacgtacgct atttatttac tcaatgtgcc 180 

atgagttttc agccggccga ttatcttcag tggtttgctg gtttcgatgc aatgtctacc 240 

tgggtgcagg caactgcctc aggaggtgga aactccagca aattgaatat ggtaactcag 3 00 

accggctgtg gctatcaggt caacgaggtg cttcgttata cgaatgaaat aaagcatcag 3 60 

atcagtctga tgtcggatga tgaaaaagca aaatacgaat atattgctta tttatgtaat 420 

ccgatgctgg tgtacttggg acttgaagac tcggatatgt atggatcccg tcaatattca 480 

gaggcagaaa tggcccgtta tggtgggact ctgactccga aatacgatac gcaggaagaa 540 

ttgttcgaac tctggctgaa acagcttgac gagacaatta actatctgag agagaacaat 6 00 

ccgcaagacg tgcttggtgc gcaggatttt atttatagag gaaaacttga taaatgggct 660 

aaactggcaa actcattgaa actcagaatt gctgcacgcc tgattaataa agacaaggct 720 

cgtgcaattg ccattgtgaa tgaggctgcc cagaatccgg ccggtcttat tttaactctt 780 

gacgatgatt ttgttttcaa taaaggtaaa agagacaata actggaacaa tgatatttcc 840 

gttggtgcgg gaactaagca gttaatcgat tttatggtga gcaatcgtga ccctcgtttg 900 

ttttactttt tccagaagaa cgattacaac tctaatgtag ttcaaggttt ctttgatcaa 960 

aaaagagctt taccgtctta tgtagaagcc aatgtgaact atacggtcga tgcggacgga 102 0 

aagaaacact ttgagagctg gaaagctccc ggagagcctt gggtacgcta ttatggagtt 1080 

ccttgtcaag tggatatcaa taaaaaggaa gagtacaaag actatttcga ccccaataac 1140 

gagttgttct atttgctgag caaagacggt gcgaaaaaga cctatactcc gattgcctac 1200 

cggaataccg aaaatattaa aggtctgttg atttacacat tccccgatgt tcctgatgta 12 60 

gctcccgtac aggataaaga agaatacggc tggtacggac tgtacttctc tgcaggtgaa 1320 

accaacctcc tgctggcgga attcaaatta ttgggtgcca atctgccgat gaccgcacaa 1380 

cagtatttga gtgcaggtgt cgagatgtct gttcgtggtt atgattttgt ttccgctaag 1440 

aatcatattc cttattatga taaaacctac acaggcgatg tacacgataa gacaatcagc 1500 

ctgaaagaag gcatgattga tgaaatgctg tcacatgatg cataccatct gacaggtgat 1560 

ttgagtaaag accttgagaa agtttatatt cagcaatata ttcactatct gatgcttccg 162 0 

atggacatgt ttgttaccgc ccgtcgttcg ggagtgccaa tgaagaacag taccttgttg 1680 

ccatatcagg attttgatcc gttattgggt gaccagtacg tcattcctcg acgtttcccg 1740 

gtaagcaaac ctcttgattc tgatttgctc cgtgacatta caattgcagc ctatcaagca 1800 

cagggttata cgtatgaagg tgagatgagt aattcacctg tgacgttaag caaagaacgt 1860 

gtctggtatg ataaagaggc accggctttt ggtacaggtc ctcaacagta a 1911 



<210> 1465 
<211> 375 
<212> DNA 
<213> B.fragilis 



<400> 1465 

gaagtggcag tttatgataa tttgcctgtg tataaggctg catatgactt gttaaggagt 60 

gtgtatgaga agacgggaaa gattccccgt gatgtgaaat atacactggt ggaggtgttg 120 

aaaaaggatc tgaccgagat tatggtaatg atatacaggg ctaatgctac gactggaaaa 180 

cttccgtata ttgaacgggc aagagatctg gttgtaggag tcaaggtccg tttaagactg 240 

ttgcaagata tgcggcatat cagtgtgaag cagtatgcgg cgtttgccca acaggtggag 3 00 

ttgctgtcga agcaattgtc ggcttggcat gattatgcac ggagacagga cgcaaagagt 3 60 

caagaaaaaa tataa 375 



<210> 1466 
<211> 1750 
<212> DNA 
<213> B.fragilis 



<220> 

<221> unsure 
<222> (2) 

<223> Identity of nucleotide sequences at the above locations are unknown. 
<400> 1466 

anaaccttat ttatcatcct cttttctttg ggatattcag gaatatactc acaggaacag 60 
caggtgaaga aagactctgt ctaccaattg caagagatag tggtatcgtc ccaacagata 120 



585 



cttgggagta agtttaaagc aagaaaccgc acaggatcgg catattatat ttcgcctgag 180 

gaaattcgca ggttgggata tacggatatt aatcgtatgt tgaaggccgt tcccggagtt 240 

aatatgtatg aagaagacgg tttcggtctt cgcccgaaca ttagtttgag aggaacgaaa 30 0 

gccgagcgaa gtgaacgcat ctcgattatg gaggacggtg tactggcggc accggctcct 3 60 

tattccgctc cggcagctta ttatttcccc aatgtagccc ggatggaggc catcgaagtg 420 

ctgaaaggaa gtagccaggt acaatacggt ccgttcacta cgggaggagc tattaatttg 480 

gtatcgactc ctattccgaa cagtttttcc ggtaaagcga acatttctta cggaagcaaa 540 
aatacgttta agtcgcatac atctgtcgga agcagttgga agcatttcgg gtatatggta 
gaatatttgc gttatcagtc agatggtttt aagaaatacg aagatcatgc tgccaaagga 
tttaaaagaa atgatattat agctaaaata agggttaaaa cggatcatgt aaaaggagtg 

aatcatgctt tggaactgaa attcggatac gcagacgaaa attcggatga aacgtatgtg 780 
ggactctctg cagatgattt taagaagact ccttttctca ggtatgcagg ttcgcaaatg 
gataaactta aaaccgatca tcggcagtgg gtagcaactt atctgctgac tttttccaac 
aagttgaaaa taactaccaa cgcctattac aactatttcc accgaaattg gtacaaactg 

aatgatgtgc gcgcaggaat cacttcaaaa gagaagagat ccatcgccga tgtacttgtg 102 0 

gatccggaaa cgaatatccg ttacttcgac attttgacgg ggaaaacaga tcgggaaggg 1080 

gaagcactgt tggtaagagc caataacaga acttaccgtt ccagaggtat acaaaccagg 1140 

gccgaatacc gtttcaacct gaacgagttt ttcttcgatc tggagttcgg acttcgttat 12 00 

catgccgatg aggaagatcg ttttcagtgg gatgattctt actctatgaa aaataagaaa 1260 

atggtactgt ttatggaggg tattcatggt acgaatgcta accgtgttac ttctgccaac 132 0 

gcgttagccg gttacctgct tgctaaatta agatatgacg cgtggactgt cactgccggg 13 80 

ctgcgatatg aagatgtaga cttactgaaa aaagactata cgaaagaaga tttggcacgg 1440 

tcgggtaagg tacgtattga aactccgaat catgcgcgtg tactgattcc gggggtagga 1500 

ttacattatc aattgatgcc ggctgcttct gttttcttcg ggattcataa aggctttgcc 1560 

cctccaagcg cggaattata tcaaaagcct gaaagcagtg tgaatatgga actgggtaca 162 0 

cgtgttgcta tcgggaattt tagggcggaa ctaatcgggt tctacaataa ttacagtaat 1680 

atgctgggaa gtgatctggc tgcttcgggt gggggtcttc accacggggc tgcaaggagc 1740 

, , 1750 
cgtgcagtat 

<210> 1467 
<211> 186 
<212> DNA 
<213> B.fragilis 



600 
660 
720 



840 
900 
960 



60 
120 
180 
186 



<400> 1467 

tgccggcatg cctgcccctc cggcaggggc ggaccgggtg acggcgtgtg tccggctctc 
ggtacgggta gcgggggaga ctacccgggc ggcggtacag gcggatgcga cggtggagga 
gaagatctct tctgttgccg tttttctggt gtcggtcgac gggagtggaa aggaggattg 
gaatga 

<210> 1468 
<211> 1152 
<212> DNA 
<213> B.fragilis 

<400> 1468 

gttctcgctt atcaggatat cttaagagaa aaagatctct gtggaactct gtgttactct 
gtggtgaaac acccgttcaa tcataaaatt ttcccaacca tgtccgataa gtttcagact 
ttctgttttt cccattccgg cagttggttt ctgccgttct tgtggctttc gttgttggcg 
ggcttatctg cctgctcgtg gaccggggac gaccgtagcg actgtcccag tggtttccgt 
attcgtcttc agcctgcatt gcatgcacag atacagcccg acagcgggac aggcgtcatc 
accgacgaga tcgacacgct gtccctttac gtgttcgacg cacagggaca gttcgtctgc 3 60 
ctgcacacag agaacaggca atcgctgact gaaaacgatt atatcattac cctgccgctg 42 0 
gaatataaag acggagacgt ttacgaactg gtgttctggg cgggagggga caaccggcat 480 
taccggatgc cacaactcac accgggcagt tcgacccgtg acgagctgac cctccggttg 540 
gaacgtgacg gagacggacg tcaggatgac gaattggggc acttgtggta cggtcatctc 600 
cggttgagcc ggatacagcc ttcggaactg acatcggtca gcgtaccgat gttgaaggac 660 
agcaaccggt tcgtcattac cttgcacgat acgtcggggc aggggctgga cgccgatgat 72 0 
tacgacttta cgctgttggc ggataacggc cggatgaatg ctgacaacga agtgatgacg 780 



60 

120 

180 

240 

300 



586 



ggcgaccggg tgacttatgc cgcctatcat accgagtccg cttccgaaac ggaaccggcc 840 

gccacccgta cgggagaagt cagcctagcg cgtgcccgct tgaacacact gcgcttactg 900 

gcggatcagg aggcccgtct ggtggtgacg gaccgtgtct cggggcagaa agtagtggat 9 60 

gtcgacctga cgcgttatct gctgatgacg cgccccctgt ttgaagagag caacggtgtg 102 0 

gagctcagcg accaggatta ccttgattac gaagatcggt tcaacgtgat tttctacctt 1080 

accccgatgg gaaagctgga ggcgctgaac attaacggat ggattatcag actgaacgat 114 0 

gcacaactgt aa 1152 



<210> 1469 
<211> 879 
<212> DNA 
<213> B.fragilis 



<400> 1469 

cgtaaaaaag aaatgaaaaa actaaagtac atgagtatga tggggttggc tgctttattg 60 

ctgacaacct gggccgcctg ttccgacgat acggatgctt cgggcggaga gaatccggaa 120 

gaagcgagag cttataccac agtgaccatt gccgtaccga atggtgtggc ggagacaagg 180 

gcctccgatc cgacggcgga tactgacgat acgaatatgg atatcggttt aacggatgaa 240 

tacaaagtga cgaaggccaa tctgtatctg tttccgggag gaacgggtag tagctttggt 300 

agcgctaagt tgacagagat tatttccatc agccagttta cgcaaaccac cactactact 3 60 

accgaccaga agaccattgt atggaccagt aagaaaacag ccctgacccc gggagactat 42 0 

cgtatttata tagtggtgaa cggtacggtc aatggggtgg gtgacagtga caagggaact 480 

ctgaccgaag ctgcttttct cgcaaagaca acggctgctg ctacgagtgt gatagctgct 540 

gtaccgagtg acggactggt aatggcgagc cgttctccca acagtaataa ctcgaatact 600 

cttccttata ttgcccagga gataaccaaa gacccggagc agaccattgc ggcaacagtg 660 

gagcgtgtga tgggaaagat tacggtgact gcgggaggaa ccagtgcgtc ttctgctgct 72 0 

actgttaata aatatacttc gttttctacc acagtagctc agatcaacaa tattaaggat 7 80 

atcaccctaa aaactcatta tgtagtccac gccggaaaag agggatatta tttccgtcat 840 

gtggataaag aaagctctgc aacgaatcct ttgacttag 879 



<210> 1470 
<211> 753 
<212> DNA 
<213> B.fragilis 



<220> 

<221> unsure 
<222> (170) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 1470 

cagatccgag aacgtatgat aaaacttata cttccaccac ctgttttagc caatagctat 60 

ggcaattggt atctgcaagg atcaagcgca tttggcttgt cgtctttcgg cactttttcc 12 0 

ggaacttata cggatatgcc gggctactct tccggggcgg tggaaactan agtcgccgct 180 

tactgctatg aaaacacgat gctgaaggat aaacagaaga acggatatac aaccggcatc 2 40 

gtatttaaag cggaaatagc tccgagtaaa atgatgaaaa aaaggtcttc gggcggtggt 3 00 

gtggaagaaa ctactacaat tggttcgatt ggtgaaatct tctaccattc cggtatcttc 3 60 

tacaaagata ttgaagcgct gaaagaagcc ggtgtattac tggcagacgg aactacttcc 42 0 

agttcggcca gcggtgtccc tgccgacctg aaaaagaacg acgtccagtg tttcaagaaa 480 

ggaaataccg atggcaagtt cttttgttat tatccgtatt ggatcaaaca tctcccctcg 540 

gatacagcag aagatgtgat ggagttcggc attgtccgca acaatgtcta tcaagtaacc 600 

gtcgccagta ttcaaggtgt cggcaaagac ggtgtaaccg aaaatatcat taccgatacc 660 

gaaaccgatg atccgactac cgtattgctg aatgtgaagt taagtatcaa accttgggta 720 

gtgcgtgcga atagtgccgt attgggccgt taa 753 



<210> 1471 
<211> 1488 
<212> DNA 
<213> B.fragilis 



587 



<400> 1471 

atatcgggga gcgaacaaga aaaattttct ggcgccgacc cgtgcgggca tcagtctgat 60 

ttatatgtta aaataaaaag tcttatgaaa gagaagatac gatcgacccg tttaagaatg 12 0 

tgctttcgga agatgccgcc ggcagtgtgc ttcctgacgc tctttgcact gtgcttcggt 180 

gcgctgtctg tccgtgcggc agatccgtcc ggaagagtag ccctttccgc cgtccgaatg 240 

cagcgtgcgg gtggacaggt atatgtctcg tttgccgtaa agatagcccc ccgtgcagtg 3 00 

cgtgcccgtc accgctgggt gattacccct tgtctgggca acgcctcgga tagtgtgttg 3 60 

cttggcccgt ttgtggtgac gggacgcatc atggcgcgcg aggaaaatca gcggcgccta 42 0 

ttggccggcc ttccggaccg tgacgtcaat catcggtgga ccgcccgcaa tggagacacc 480 

ttcttgtata ccgatacgtt gcgctatgcc ccgtggatgg agaatggctt gaacctgcgg 540 

ctcgacatcg accgggaagg ttgctgccgg gtacagacag tgggaagcat cgtctcctcc 600 

ggcgcttttc cggtggcttt gccctatcgt ccgtcggtta gtgagctcac tccgagggtg 660 

agccggacgg tggcggaaca tgcggatgac tatccgttcc tgtgcgaggc aggcagccgc 72 0 
cccctgcatg aaagtggcat cggtattcgc ttccgtgcgg catcggcagt ggtggatacg 
ctgtattccg ccaatgccgg aaacctgcgc cggataacgg aagccatcgg gttgctgcgt 
gcggacagtt gcgcatttct gcaaggtatc tcgatcagcg gatatgcttc gcccgagggc 

acgacgggac tgaaccggaa attatcggcg aaacgtgccg aagctctgcg gcatgctctc 9 60 



780 
840 
900 



1020 



tcggtgcgca tgaacctgcc tgtatcgttg tttgaactga atgccggagg agtagactgg 

gacaggctgg ccgaactggt gaatgggagt gacatgacat ataaggagga agtgctcgct 1080 

attctccgca gtcatccgga ggaagagcgg aatgacaggc tgaaagcctt ggcgggcggg 1140 

cgtccgtatc gttcggtgct ggatgtgctc tatccgcagt tgcgcgatgc ctgctacatc 1200 

cgtgtgcagt atgccaaccg ccctgacagc gtggcggata cggtgaaccg tgcgatagaa 12 60 

gccattcggg ggcggaagta tgaagaggca ttccggttgc tgaagacggt ggaggcggac 132 0 

gaacgctcgt ggaatgtacg gggagtctgc catctgttgt gcggagacga caaggaagcc 13 80 

gggctatggc tgcatagagc ggtgaaagcc ggaaaccggg aagcggaaga aaaccttaaa 1440 

aagatgaatg cggaacgacg ggccgctacc atcggtataa cgcaataa 1488 

<210> 1472 

<211> 339 

<212> DNA 

<213> B.fragilis 

<400> 1472 

gccccgctat tgcctttgtc aagtcattct acggacacct tcttgggtgt acatgcgggt 60 

gggggagaat ccaacctgag ccgggtgcat ctcctttttg tttcccgttg ggtttcttcc 120 

cgctacgagg ggtgggcgct gatgccgggg ttttcatcgg gttactcgtg ggtgctcggc 

aaacgctgga atctggaggc taccataggt gcagggtggg tgcatgccca atacaaacgt 

tttaattgtc cggtctgtgg tgaatatcgg ggagcgaaca agaaaaattt tctggcgccg 

acccgtgcgg gcatcagtct gatttatatg ttaaaataa 

<210> 1473 
<211> 1035 
<212> DNA 
<213> B . f ragilis 

<400> 1473 

cgtactgcgt cgccgttctc tccatccggc acggcgtctg tgaacggaag atctatgaaa 60 
tcgtcggaag attcaaaaag gagtgtacgc tccatgcagt ataaatgccg tccgtttttt 120 
gttttcatgg gaaaaactga agaattttgc tgcccggaaa tacaaacaca tatattacac 
gacaaaatga ttatgaaaaa ggaaaagact tactcccgtg ctccgctccc tttcgtgggg 
cagaagcgca tgttcgtatc ggaattcaaa aagatcctga aacattttga tgacaaaacg 
atatttgtcg acctgttcgg cggctccggc ctgctatcac acattaccaa acgtgaaagg 
ccggatgcgg tggtcatata caatgaccat gacaactacc gcgagcgttt ggaaaacatt 
gaccggacca ataccctgct gagagatctc cgtaaaatag tcgggatata tccccgccat 
cagaagatta ccggaaaaat gcgcgaggct ttccttgaac gcatcaggct ggaggagaca 
accggtttcg tggactatct taccctctct acttccctac tgttttccgg aaaatacgca 
caaaacatgg aggaacttga aggattgtat ttttataaca agatacgcca gtctgactac 
cggtgtgacg gctatctgga cgggcttgag gtagtctgct acgactataa ggaactggca 72 0 



180 
240 
300 
339 



180 
240 
300 
360 
420 
480 
540 
600 
660 
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gacacatacg 
gatatcagta 
ctgaaaggac 
cgctggatgg 
cttaccgcac 
ccaagggcgg 



gggtgtttcc 
catacaagat 
acccgtttgt 
aagagcatcc 
ggatgaatta 
cctga 



gggagtggta 
ggactggaag 
ctatttcact 
cgggatcggt 
caactcctcc 



ttcctggttg 
ctggcggatt 
tccgggaaat 
aatcctttca 
tataccgata 



atccccccta 
acctggatgt 
cccccatact 
agggagccgg 
tcatgctcta 



tatgggaacg 
cctgctggta 
ggatttttgc 
ccggtccaca 
caaagacctg 



780 

840 

900 

960 

1020 

1035 



<210> 1474 
<211> 264 
<212> DNA 
<213> B.fragilis 

<400> 1474 

gcgcgcggca caaccacata catacagttc attacacttt ccgcccattc aatggtacgt 60 

gcgacaacgg ttgcaccatc cttgctcttc aaggtaatac ccgtacaggc tcccgccggt 120 

atttgcggaa cgctacataa aactgacatc aataatgctg ataattttgt attcatactt 180 

tctattttaa tggtcgttta tggtagtgta acacattcca ttcccgaaag gttgattaca 240 

atagagagta attattcatt ttaa 2 64 

<210> 1475 
<211> 435 
<212> DNA 
<213> B.fragilis 



<400> 1475 
aatttgaaaa 
agtaaaacaa 
tcatgcggcc 
aaggacgagc 
atcaccgcat 
ggccatgaaa 
aacgcctcct 
gcggacgccg 



cgcataaaga 
tgaaaatatt 
cggatgaact 
cgggtgaaga 
cattgcagaa 
tgggagtctt 
atctttttga 
gtctt 



agtaacttca 
tagatatata 
gataccggaa 
accgga.gga.g 
catgcagcag 
tgtcggaaca 
tgggaaagta 



aataaaagta 
ttgctcgcct 
tccgtgccac 
ccggaagagc 
accaggggaa 
agtcagacag 
tggaatgccg 



aaacactgga 
cgcttacctg 
cggtggtgaa 
ctgaaaagat 
tcatagaggc 
atgaagcagc 
gacaggatgt 



ttttgtaata 
tacgcttttc 
tcccggggat 
acagttagcc 
ttttgctccc 
aggtataaaa 
accggtggaa 



<210> 1476 
<211> 351 
<212> DNA 
<213> B.fragilis 



<400> 1476 
catacagatg 
tatctgtata 
gagattttaa 
gatgactata 
aaggtgacgt 
tatgaaatcg 



cgcatcaccg 
tgaatgattg 
attttaatag 
aatatatcga 
actgcgtcgc 
tcggaagatt 



cacgaccggg 
gctttgcaaa 
agaactactg 
cctgtacaag 
cgttctctcc 
caaaaaggag 



ggtaaatacc 
gataaaaaaa 
gaacgtctga 
gagtatgaac 
atccggcacg 
tgtacgctcc 



cttgtcgcgg 
tattcgctat 
cccgcatggg 
agatgcgccg 
gcgtctgtga 
atgcagtata 



gatgcgcttt 
gacactattt 
tttcaaaccg 
gcagggtgat 
acggaagatc 



60 

120 

180 

240 

300 

360 

420 

435 



60 

120 

180 

240 

300 

351 



<210> 1477 
<211> 1101 
<212> DNA 
<213> B.fragilis 



<400> 1477 
acgaccatta 
tgtagcgttc 
ggtgcaaccg 
gtggttgtgc 
aagttcaggg 
gagggcatga 



aaatagaaag 
cgcaaatacc 
ttgtcgcacg 
cgcgcgctca 
caaagcatgg 
acgaaaaggg 



tatgaataca 
ggcgggagcc 
taccattgaa 
agagttgcag 
ctttgtgggc 
actttccgcc 



aaattatcag 
tgtacgggta 
tgggcggaaa 
tcactgactc 
ctggcggtag 
ggattatact 



cattattgat 
ttaccttgaa 
gtgtaatgaa 
cctccggtat 
agcagaagga 
attttccgaa 



gtcagtttta 
gagcaaggat 
ctgtatgtat 
ggatggactt 
atttgtggtg 
ctatggtagg 



60 

120 

180 

240 

300 

360 
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tatcctgttt atgatgcggc acagagggac aagagtcttg cggattttca gttggtatca 420 

tatgtgctgg cagaatgcag cacggtagat gaagtgaagg aggccctttc gcaggtgcgt 480 

gtcatcaata ttgatccccg ttcgtccacg gtgcattggc gctttaccga agcatccgga 540 

agacaggtgg tgttggagat tgtaaatgaa atgatgaact tctacgacaa tccattgggc 600 

gtgttaacca attcaccggg tcttgaatgg cattggacca atctgaacaa ttacatcaac 660 

ctacaaccgg gcacgttacc tgaacataac ttcgggccgt tggagccgaa gtctttcggg 72 0 

catggcagtg gtctgctggg acttcccggt gattttacac ctccatcccg ttttgtgcgt 7 80 

gccacctttt tccaacttac ggcaccacaa caacccgatg caaaaggaag tgtgttccaa 840 

gcgttccata ttctgaacaa ctttgatatt ccgacgggta gtgaacagcc ctggggaaag 900 

gcgtcagcca atgtaccgag tgccacccag tttaccgttg cgtgcgatat acgggaccag 960 

aaggtttatt atcgtaccat gtacaacagc aacatccgtt gcattgattt gaaaacgata 102 0 

aatttcgaca atgtaaaata tcaagcggat cctttggatg aaacgaagga gcaaccggtg 1080 

gaaatgaaag tgataaaata g 1101 

<210> 1478 
<211> 189 
<212> DNA 
<213> B. fragilis 

<400> 1478 

aaggtgacaa aattgaatca aattgctctt ctgtatattc caaccctgtg tggactgtat 60 

acgtcatttg ttgctttcgt tgtgctgcaa catagccaat ctgctaacca cagaactgta 120 

tacatggcaa atggtaaaga gaatgtcaag cgtaaccgta tgaaagggtc ggacaagaca 180 

aagtactaa 189 

<210> 1479 
<211> 426 
<212> DNA 
<213> B. fragilis 

<400> 1479 

ctcaataaga agaagctaag ttttctattt ctaaaggtga taattacaga agctatcaaa 60 

caactttttt cttctcacac attccaaacg ctctcccttg aaagtaggac attattagac 12 0 

gaatataatt tcacaaaatc catgatagca aatcttttgg ataaacaaga aaaactctac 180 

cttgtaccct ctactaaaaa ggaaaatgaa cttttagcag ggattatcct taatgatgaa 2 40 

attatttatc tactaaaatt ttcaaaggca tctgataaca tttatactct ttacaacgaa 300 

acaaacgaac ctatatgcga tgtcaaatat gattttgaaa aacaaaatat agttattatt 3 60 

agcaactatg gaaatgatgc tatcccccct acaacacaag ttggtacagt tttgtttgta 420 

atatag 426 

<210> 1480 
<211> 816 
<212> DNA 
<213> B. fragilis 

<400> 1480 

tcaaggagat gcaaacgtat gaaaacaatt acaacggcat gtgtgaacca taagggaggt 60 

gtcgcaaaga caacctcgct gctgaacctg gcagccggga tcgcacggat gtataagaaa 12 0 

agggtctgca ttatcgatgc ggatccgcag gcgaatacga caatggcagc gttcggggag 180 

gaaatggcaa gccttccccg ggaggttctg ctcgagagtg cgctacagga ctgtatgcag 240 

gacactccgc cggagttaaa gccgcaaaag tggctggaga aggtggacat actgccggcc 3 00 

tccctggatc tggcggctac ggaagtaatc atgtacacca cacccggaag ggaattcctt 3 60 

ttcagggaaa tagtaaaggg gctggaagag aagtatgacc acatacttat cgactgtccg 42 0 

ccatcattgg ggatcatcac gcagaacgcg ctgatggcaa gcgattacgt gatcatacct 480 

acggacggga attacttcgc catgaaagga attgaaaaga tacactatat catcggcctg 540 

ctcaaaagga agctgggagc cgaagtccgg atactcggat actttatgac caagtacaat 600 

gccaggagaa agctggatat ggatatcagg gagagtctgg taagaagttt gggagatggt 660 

gtcttcgaaa cggtaatacg cagcaatgtc gccctgggag aggcacaata caaggcacag 72 0 

agcatatttg actatgcgcc ttcctcaaac ggggctgatg actacaggga gctggtcaag 7 80 
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gagttcctgg gcagaattaa aaaaataaat aaatag 816 

<210> 1481 
<211> 294 
<212> DNA 
<213> B.fragilis 

<400> 1481 

aagaaaggca ggattatgaa agactttaca tcgaaaggaa tatccctgga aaacatggtg 60 

ggagaaaccc cgggaaaaga aaaaggtatg acaggaaaaa catcacccaa aacgaaccag 12 0 

accgttgcac tgacggaaga tctgaaatgg gagttacgga cgttcgcttc ggaccatcgc 180 

tgcaggggag tcaagacact gcttgaaacg atgatagaat gtttcgtcag ggaagacggt 240 

acgcttgacc gtgacaagtt agaaggcttc tggcgggaat atgtcgaaaa ataa 29 4 

<210> 1482 
<211> 1569 
<212> DNA 
<213> B.fragilis 

<400> 1482 

ttgtatacca tgaaaaaaat atcaatttta attgcggctt taaccctcag cataagcctg 60 

aaaccacttg ccgctcagaa taaaaaggtt tttatcatcg ataaacagac cgtctatcaa 12 0 

gaaatagaca acttcagcgc ctcagacgct tggcgctgcg ccttcattgg taaaaactgg 180 

cctcaagaga aaaaagaaaa aattgccgac ttactattca aacgtgaatt tgacgaaaaa 240 

ggtaacccca tcggtatggc cttgactaac tggcgcgtaa acatcggagc cggaagctac 3 00 

gaaaaccgtg aagcaaagga ggtggataac tcctggaacc gtaccgaatg tttcctctca 3 60 

cccgatggta aatatgactt taccaaacaa gctggacaac aatggttcat gaaagcggcc 42 0 

cgtgaacgag gcatgaacaa ctttctgttt ttcacgaact cagctcccta ctttatgact 480 

cgtagcgctt ctacagtttc tactgaccaa gattgcatca atctgcaaaa tgataaattc 54 0 

gatgactttg cccgtttctt ggtgaagagt gcccaacatt tccgtgaaca aggctttcac 600 

gtaaattaca tcagcccgaa caatgagcca aacgggcaat ggcatgccaa ttccttccaa 660 

gaaggcagct ttgccaccaa ggccgacctt taccgcatgg tagaagaatt ggataaagca 72 0 

atcagcgaag ctcaaatcga cacaaaaatt ctaattccag aagtaggtga catgaaatat 78 0 

ctatttgaaa ttgattcgat agccaaaatt ccagatgata tcatccactc tatgttctac 840 

aaagacggac aatacagcgt gctgaagttc aaaaacctgt ttaattgtgt agcagcacac 90 0 

gactattggt cggcctaccc cgctaccttg ctggtggata tacgtaaccg aattcacaaa 960 

gagctctcag ccaacggtca caacaccaaa ttttgggcat cagaatactg cattctggaa 1020 

aagaatgaag aaattactat gccagcctct ccggaacgca gcattaacct aggcttgtat 1080 

gtagcccgta tcatccacaa tgatctaact ttggcaaatg cttcggcttg gcaatggtgg 1140 

actgccgtat cactaggcga ggatgtgccc attcagctat tgccacttga aggtt caaac 12 00 

ggattgtcac tacaatatga cggtgaaatc tctaccacca aaatgctgtg gactactgcc 12 60 

aactacagtt tctttgtgcg tccgggtatg aaacgtatcg ccgtaaaacc tacctataag 13 2 0 

gtaagtgact tggaagccgc tacttcactg atgatttcat cgtatactga tgggaaagaa 13 80 

gtggtgaccg tagccatcaa ctattcaaag gaaaatcagg tgattagcct aaactgtgac 1440 

catgcccaaa aaggaaaagt ttatctgacc accatcgaca agaatctgcg atacatgggt 1500 

gaacaaccgc tgaaaaagtt acagctgcca gcacgttcgg tagctaccat tgtagtcgaa 15 60 
gacaactaa 1569 

<210> 1483 
<211> 222 
<212> DNA 
<213> B.fragilis 

<400> 1483 

agcgtgtttg ttgccagcat gaaacggcag ggtgtagaag tgcatttcag gcacatggaa 60 

cagagcaata agctacagga catcgtattc actatggaca gctaccattt caatggttcc 12 0 

aaagtgggca ggcgtttcag ttattctaag tttggtacaa ctcttttttc gtgggctgta 180 
ccgcctctgt ttgccagtgg aagcctctgc tgccggatat ga 222 
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<210> 1484 
<211> 1269 
<212> DNA 
<213> B.fragilis 



<400> 1484 

aaaacgaaag ttcgctttct acgggcttgg cgttactaca tgatggtaac tgcatatggt 60 

gacatcccct tggttacaga agtggttcct tctttggaag atgccaaact gccggctaat 12 0 

ccagaaactg atgtagtgga attcatcctt gatgaactaa atgacatcac caaagacgga 180 

gcactggacg taagtccaaa acagaaagga agaattacac gcggtgctgc tttggccttg 2 40 

aaggtaagat tgtgtctgtt ctacaaaaag tatgatgaag tgattgacgc tgccaatgaa 3 00 

atcaatagct tgggtgtgta taatctgtac caagaaggtg aagttcctta ctctgaattg 3 60 

ttcaaagaag ccaatgaaga caactgcgaa atcattctgg ctgtaaagaa agtaatgaac 420 

gactacaaaa accaaaccat cattgaattc tgtaacgtaa ttgatggcgg ttggtcggca 480 

ttcgtaccca tccaatctct gattgatgca tatgaaatga aggatggtct gacaatcgaa 540 

gaagctcagg ctaaaggtga gtataatcca gaacatcctt acaaagatag agatcctcgt 600 

ttctatgcta caatccttta ctcgggtgct gattggatgg ataacaaggg tagaaagaga 660 

atctataata cgctggacag aaacattaac ggtgaaccta acaaagatca tcgtcttgat 720 

tctagaaatg cttctcaaac tagctattct atctgtaaat acatgaaacc actgactcaa 780 

tattcagata taaacaacac tggtctggat atgattgtat tccgttatgc tgaaatccta 840 

ctctcaaagg ctgaagctat gattgaaaag aatacagacc tttcgggtgc tactgacttg 900 

attgacctga tcagagaaag agctggcatg ccaaaagtag acagagcaaa atataataca 9 60 

caagctaaac tgagagaatt gctcagaaga gagcgccgtg tagaatttgc tttcgagggc 102 0 

ttgagaagag atgatatcat ccgttgggac attgccaagg acgtactgaa tggtccaatc 1080 

tatgcttcta accaaggtac cgtagatatg gatacaagca ttccccaaga ggagcgtgct 1140 

acaattttcc aaggtgaaaa gaaccaggtg gtactcgaga tccgtaaatt caggaaccgt 1200 

tacatgccga ttccacaagc tgaattggat aagaacccga acttgaaaca aactaacttc 12 60 

aaaatataa 1269 



<210> 1485 
<211> 246 
<212> DNA 
<213> B.fragilis 



<400> 1485 

cctgttttat cacttcacct atgtatatat 
gtctttccca aagataccaa tcacataaaa 
ggtataaata aatggactta tttatttata 
ccggatggag atagaggtca aaacttaaat 
ctgtag 



tactcttttt catattccaa ataccattca 60 
atccccgaca gccgtcacgg acaccgggga 120 
aaatcgtcca cccggccgat ggcctatcag 180 
ccgatcttaa atgtcggagc ggccagcaca 2 40 

246 



<210> 1486 
<211> 459 
<212> DNA 
<213> B.fragilis 



<400> 1486 

ataaataagt ccatttattt atacctcccc ggtgtccgtg acggctgtcg gggattttta 60 

tgtgattggt atctttggga aagactgaat ggtatttgga atatgaaaaa gagtaatata 120 

tacataggtg aagtgataaa acaggtcata gccgaaaagc aggtgacaaa ggccgagctt 180 

gcccgtaggt tgggggtaaa accacagagt gtggactatc tgctgacacg gaaaagtatc 240 

gatacggata ccctgtatag cttgtcgttg gcgctggatt atgatttcgc tgttttatat 3 00 

tccataaaga aagaacatgc tcttgctacg gacgaagagt ctccgtttaa agtgggaaat 3 60 
gcaaagatca gtttagagat cgagttgcgt cccgatgaaa tgttgaaatt gaacctgaaa - 420 

cagaagattg cagacctgtt ggaaggaaag ggaaagtga 459 



<210> 1487 
<211> 2250 
<212> DNA 
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<213> B.fragilis 



<400> 1487 

cggagatatc tgccaccggc tattaatttt gtttcaaatt gtataacgat tgataccaaa 60 

ctcctgtcag aagcggatac tggcagccag ctccacggta aacggacgaa tataactgcc 12 0 

cgccatccaa tgaccgttat aagatgccgc atcttttttg tccaccagtt cggtcccggc 180 

aatgcttcct ttagccccgg tctgattcag gaaattgata accgtggcac tcaacgcaag 240 

atgcttgttc accgtccagt tgatacctcc gaaggtttcc cagcgtccgt tgaaatagta 3 00 

agcattttta atgttggcat acgtcttgct gaagtagcgg aagctggccc agactctgag 3 60 

atcattcgtg atgttatagc tgggatccag ttcgacaagc actttcggaa tctccgtcac 42 0 

aatattccct gtggcatcga tctcccccac cgtaccgtct ttaaaggtca cactcgtttc 480 

atacttcttg taagtcggac tttgataagt gaagaggaag tggaagttga agcccttaaa 540 

cggcttgatc actgcatccg tcgtccagcc gatggtttgg atgtcataac tcaacggggc 60 0 

ggcgagaatc tcagtctggt cattcggatt aatcaggttc agtgtcgagt tgttattcgt 66 0 

cttcgagata taagagaaca gagaggtcag actgatccaa tcggtgttat aatagatccc 72 0 

ggcacgtccc aggggaacag agattttatc ggtattgggc atagtggccg gtgcaaagtt 780 

cgacaagccg ggacgctgtg tgttgtaagt gaaatcggcg gtaaaaccaa attcacgggt 840 

cagcttataa gtagcggcgg cagtaaaggc catgttcatc cagtcgtagt cgaaaagacg 90 0 

gggagtgatc ctgactcctt tatcggacac agctcccagg taataatccg gaaaacggcc 960 

tacggcattc ccttcggcat cgtatacagc cgcattttca ccttccaggt gctgccattc 1020 

cagacgggcg ccataataca cgttccactt gggcgaaata tcccaatcgt gagtggcata 108 0 

aacagccagt ttgttttcat gtcctttata atattccgaa gcattcttat tgaaatcgta 1140 

atatacacca tctgtattcc cttcacgtac cagacgttcg ggatactcct ctaccgtatg 1200 

atcgtacatg gtagtattcg aagcataatc caccttatag tgccattcgt tcactccgat 12 60 

tctccaggtc gtattacgat aggaacggga gagttcggag gtagcgaaaa actcatcgat 13 2 0 

ttttccccgg ttaaggcaag acatgcggct ctgcacatat ccttcatacg gtttcaatgt 13 80 

accgtctacc gctttcaggt agtaaccggc agaggcatct ttctgttcca tggccattgg 1440 

agtctggtag acatacgagc ccaaggcatg atcatacttc agattgatct tccagttcag 1500 

gccattatcc caggtgtacg tgttgagcaa ggtcaactgg tttcctttat tcaaagtggc 1560 

atcataaaga ttggtctcct tcagttcacc cgtacgcatg tcacggtagg ccatttcgcc 1620 

cgttgtaggc agataggaac ttgttccgag gctgaatccc ggtatttctt tcacgctacc 1680 

gtcgcctaca tagatgaaag gggcggaagt cacttcgttg cttggccaac ggctgttgct 17 40 

gtagtgatag atagccgaca gctgtccgcg tccttcatta taattcttgg tcagggctgc 1800 

tttatagatt tgagtacggt cctgattcga tgtaaaacgc agtttgaaac tgcccggatc 18 60 

aaaattttgg taaacgcttc cactgtagaa ccatcctttc cccataccac cgctgatatt 192 0 

agcatcgaac tgctgcttgc caaagtgatt ggtactataa ttgacgattc cacgaaactt 1980 

ttctgttccc aactgggtaa aagagtttac agcataaccg atattgccgg tagtaatggc 2040 

ggtttcggag attttgagca acccgacgtg cccgagactg ctgtctccac gccagtgcgt 2100 

atttacatta tgcggattag aggtatagac tacggggagt ccgttttcga gtacatttac 2160 

gtctccacct ggcaagccga tagagatttc gcggggacca ttggcgctcg aagcgttcag 22 2 0 

cataacatta cgattgcctt cttcctttga 2250 



<210> 1488 
<211> 411 
<212> DNA 
<213> B.fragilis 



<400> 1488 

gtgaagacgg agtataatga cgcattggct gccgtatcgg gtgataatgc tacggcatat 60 

gctaatttat taactgctat ggataatgct gttaaagcac gagtggagac tctaatagca 12 0 

tactataaag ctgatcataa ctattctgtt cagaacacat tggcttatac attacaaact 180 

atagctgatg gcttggctga ttatgatcag ttgattttag tccagaaaca agctattgct 240 

gctgctgatg aaaatatagc taatgccgct tcagttgtat caaaggaaca ggctattgct 3 00 

aatcaggaga aaaccattgc tgaccttgaa aatagtttgg ctgtaaatga acctatttac 3 60 

aatgattatt tagctcagat caaagcttta gtaggtgact ctgcagaata a 411 



<210> 1489 
<211> 786 
<212> DNA 
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<213> B.fragilis 



<220> 

<221> unsure 
<222> (510) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 1489 

aaaaagtatc gtatgaaaat aaaaagactt ttagtgttgg ccgttctacc catgctgtgt 6 0 

cttgcagtga atgcacagaa ctccagtaaa gacaatactc ctaaaaaagg agactttact 12 0 

gtagcagcta ctgttggata caatagttac acaagtgtca cagccccttc ggggctgctg 180 

actgactatg aagtcagagc gctctcaacc aactgggcag acaaaaagct gatggttggt 2 40 

tttgaaggag gctggttctt caaagatcag tggaaactaa atttgggtgg cggtgtcagc 3 00 

ttcacgaata accccggtta tccggctgtt cccggcacaa tagacgattc gaataagaat 3 60 

aactcggctg acgagaatat gggagagatt cctaattatc gtgccgtagc cgatgctcag 420 

tcgttcgcct ataatgtgtc agcaggtgtt gatcgttatt tcaacatcaa gcgtgttcct 480 

aacctgatgt ggtatacagg tattcgcgtn aggtttgctt acggtgaaaa tgaaatgaag 540 

tatgatgaag agacctctat gggcaaatct attgccgaga gttggaatct tcgcggcgcc 600 

ttgactatcg gtgtcgacta ctttgttttt cctgcactct atatcggtgc gcagatcgat 660 

ccgtttgctt atacgtacaa caagactacg tataatccgc aagcaggtct tggcgatctg 72 0 

tcggcagaca gccacaacta cagtgtgctg gccgctccga catttaagat cggatttaag 7 80 

ttttga 786 



<210> 1490 
<211> 795 
<212> DNA 
<213> B.fragilis 



<400> 1490 

aaataccatg ttatcatgaa aacaactttt atgaatgtaa gtagaggagt gatcggtgct 60 

ttggctttct ctctcggaat ttcgtcttgc caaagttcac aaagtaaaat gacatttgaa 12 0 

caagaaggcg acagccttac ggtgattcat attacaaatc ctacacagta tttacttttg 180 

cccgtcgagg agaagactcc cgaagcacag gtctgcattg cttcggactc ggttccggta 240 

gacatggacg tacgcctgtc aagggagaaa gtggactatt ttgttccctt tgctttgcct 3 00 

aagggagaga aagaggtagc cgtgcgtatc cgtcacttgc cgaaggaggc tttgtgttgg 3 60 

aaagaactta agctttcgga tacttttgat acgaccaata cagaccaata ccgtcccttg 42 0 

tatcaccata ctccgctcta cggatggatg aacgatgcca acggactggt atataaagat 480 

ggtgagtatc acttgttcta tcagtataat ccttacggct cgatgtgggg caacatgcac 540 

tggggacatt cggtgagcaa ggatctggtg cactgggaac atctggagcc ggcacttgcc 600 

cgcgatacgc tgggacatat tttctccggc agttcagtag tggatgatgc caatacagcc 660 

ggatatgggg caggggccat cgttgccttc tacacttcgg ccagtgataa gaacgggcag 720 

atacaatgta tggcctatag cactgacaac ggacgtacgt ttaccaaata tgaaaagaat 780 

ccgtcttcac cacgg 795 



<210> 1491 
<211> 2373 
<212> DNA 
<213> B.fragilis 



<400> 1491 

aacaaatctt ttagttggga aatgggattc 
aatcaactaa aaaatttaag taccatgatg 
gcttgcttgc tgatagcggg cttcacagct 
gcttccggat caaaggaaga aggcaatcgt 
ggtccccgcg aaatctctat cggcttgcca 
ctccccgtag tctatacctc taatccgcat 
agtctcgggc acgtcgggtt gctcaaaatc 
ggttatgctg taaactcttt tacccagttg 
tatagtacca atcactttgg caagcagcag 



tgcggcatca 


aaacaatctc 


60 


aaagggctgt 


ttgcttgctg 


120 


agaaagaatc 


tgctcaaaca 


180 


tgaacgcttc 


gagcgccaat 


240 


taaatgtact 


cgaaaacgga 


300 


cgcactggcg 


tggagacagc 


360 


ccattactac 


cggcaatatc 


420 


agtttcgtgg 


aatcgtcaat 


480 


atatcagcgg 


tggtatgggg 


540 
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aaaggatggt tctacagtgg aagcgtttac caaaattttg atccgggcag tttcaaactg 600 

cgttttacat cgaatcagga ccgtactcaa atctataaag cagccctgac caagaattat 660 

aatgaaggac gcggacagct gtcggctatc tatcactaca gcaacagccg ttggccaagc 72 0 

aacgaagtga cttccgcccc tttcatctat gtaggcgacg gtagcgtgaa agaaataccg 780 

ggattcagcc tcggaacaag ttcctatctg cctacaacgg gcgaaatggc ctaccgtgac 840 

atgcgtacgg gtgaactgaa ggagaccaat ctttatgatg ccactttgaa taaaggaaac 90 0 

cagttgacct tgctcaacac gtacacctgg gataatggcc tgaactggaa gatcaatctg 9 60 

aagtatgatc atgccttggg ctcgtatgtc taccagactc caatggccat ggaacagaaa 102 0 

gatgcctctg ccggttacta cctgaaagcg gtagacggta cattgaaacc gtatgaagga 10 80 

tatgtgcaga gccgcatgtc ttgccttaac cggggaaaaa tcgatgagtt tttcgctacc 1140 

tccgaactct cccgttccta tcgtaatacg acctggagaa tcggagtgaa cgaatggcac 12 00 

tataaggtgg attatgcttc gaatactacc atgtacgatc atacggtaga ggagtatccc 12 60 

gaacgtctgg tacgtgaagg gaatacagat ggtgtatatt acgatttcaa taagaatgct 132 0 

tcggaatatt ataaaggaca tgaaaacaaa ctggctgttt atgccactca cgattgggat 13 80 

atttcgccca agtggaacgt gtattatggc gcccgtctgg aatggcagca cctggaaggt 1440 

gaaaatgcgg ctgtatacga tgccgaaggg aatgccgtag gccgttttcc ggattattac 1500 

ctgggagctg tgtccgataa aggagtcagg atcactcccc gtcttttcga ctacgactgg 1560 

atgaacatgg cctttactgc cgccgctact tataagctga cccgtgaatt tggttttacc 162 0 

gccgatttca cttacaacac acagcgtccc ggcttgtcga actttgcacc ggccactatg 1680 

cccaataccg ataaaatctc tgttcccctg ggacgtgccg ggatctatta taacaccgat 1740 

tggatcagtc tgacctctct gttctcttat atctcgaaga cgaataacaa ctcgacactg 1800 

aacctgatta atccgaatga ccagactgag attctcgccg ccccgttgag ttatgacatc 1860 

caaaccatcg gctggacgac ggatgcagtg atcaagccgt ttaagggctt caacttccac 192 0 

ttcctcttca cttatcaaag tccgacttac aagaagtatg aaacgagtgt gacctttaaa 1980 

gacggtacgg tgggggagat cgatgccaca gggaatattg tgacggagat tccgaaagtg 2040 

cttgtcgaac tggatcccag ctataacatc acgaatgatc tcagagtctg ggccagcttc 2100 

cgctacttca gcaagacgta tgccaacatt aaaaatgctt actatttcaa cggacgctgg 2160 

gaaaccttcg gaggtatcaa ctggacggtg aacaagcatc ttgcgttgag tgccacggtt 2220 

atcaatttcc tgaatcagac cggggctaaa ggaagcattg ccgggaccga actggtggac 22 80 

aaaaaagatg cggcatctta taacggtcat tggatggcgg gcagttatat tcgtccgttt 2 3 40 

accgtggagc tggctgccag tatccgcttc tga 23 73 

<210> 1492 
<211> 384 
<212> DNA 
<213> B.fragilis 

<400> 1492 

atccgcagca cccccaaggc gatttcttca caataccttt accagatatt tcgatacgac 60 

ctacttttag taaaaaggag ggagtcatgg ccccatttcc catgctccta ttattttccc 12 0 

gtggtcgtca atcttccgga ctttgatttt aaacaccaac ttccttcatt gaactactat 180 

cctctaaatc ctttaggtgg aaaaatttta ctcaagtgtt caaccagccg tttattccgt 240 

tcttcgtttg cacgaaatat taatccttta ctggaaaaat cccgttgtca acagactccc 300 

actgtcaaga ccctatttta ccgtaccgcc ttaacaatac gcctggcacc tggttcatgg 3 60 

aacaatccct acaccctctg ttaa 384 

<210> 1493 
<211> 1203 
<212> DNA 
<213> B.fragilis 

<400> 1493 

actaatactg cgattgttat gaatactaca gaatatttac agacttggtc tgactcttat 60 

aaaaatgaca tgataagcaa tatcatgccc ttttggatga aatatggttg ggatcgcaag 12 0 

aacggaggtg tttatacctg cgtcgaccgt gatggtcagt tgatggatac caccaaatct 180 

gtttggttcc aagggagatt tgcttttaca tgttcatatg catataatca cattgagcgt 240 

aatactgaat ggttggcagc tgcgaaaagc actctcgatt tcatagaagc acattgtttt 3 00 

gatacggatg gacgtatgtt ttttgaagta accgagaccg gattacctat tcgtaaacgt 3 60 

cgttatgtct tttctgaaac atttgctgct attgcaatgt ccgaatatgc cattgcatca 42 0 
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ggagatcata gttatgctgt aaaagctttg aaattgttca atgatatccg tcacttcctt 480 

tcgactccgg gaatcctgga gcccaaatat tgtgaacgtg tacagatgaa gggacattct 540 

attattatga ttcttatcaa tgtagcttcc cgcattcgcg ccgctattaa cgatccggtt 600 

ttggatcggc aaatagagga gtctatagcg attctgcgca aagactttat gcatccggag 660 

tttaaagctc tgcttgagac tgtaggtccc aatggagagt ttatagatac gaatgccact 720 

cgtaccatta atcccggcca ttgtatcgag acctcatggt ttattctgga agaagccaag 7 80 

aaccgcaatt gggataagga aatggttgat acagcactta cgattctgga ttggtcgtgg 840 

gagtggggct gggacaaaga atacgggggt attataaatt tccgtgattg tcgaaacctg 900 

ccttcacagg attatgccca tgacatgaag ttctggtggc cacagaccga agcgattatc 960 

gcaactctat atgcgtatca agctactaaa aatgaaaaat atctggctat gcataaacag 102 0 

atcagtgact ggacttatgc ccattttcct gacgcagagt ttggtgaatg gtatgggtat 108 0 

ctccatcgtg acggaacgat ttctcagcct gcgaaaggaa atctgtttaa gggaccattc 1140 

cacattccta gaatgatgac gaaaggctac gcactttgtc aggaattact gtcagaaaaa 1200 

taa 1203 



<210> 1494 
<211> 222 
<212> DMA 
<213> B.fragilis 



<400> 1494 

cgaccacggg aaaataatag gagcatggga aatggggcca tgactccctc ctttttacta 60 

aaagtaggtc gtatcgaaat atctggtaaa ggtattgtga agaaatcgcc ttgggggtgc 12 0 

tgcggatcta tcctttctaa tatgtatcaa attcgggatt tcaatggagc attattgaca 180 

atggttgttt tcttggaaac agtagacggt cgttattcct ag 222 



<210> 1495 
<211> 1254 
<212> DNA 
<213> B.fragilis 



<400> 1495 
cctcttattt 
tggggggtag 
atgcaggtag 
ttcctttgga 
cgtaagtggc 
attgcagaaa 
ctttatattc 
ttagcggttg 
gctactgttg 
attgcctatg 
gaacgtttga 
ttgctgttca 
ccgggttggg 
atgtcacagg 
attctaggtg 
accggtgcaa 
agttttgttg 
gctaataata 
ggtatcatga 
tggaccgatg 
gctttggcag 



ttatcatgaa 
ccctactcaa 
atattgtgga 
tttatggcct 
tgattgtcgg 
catttaatca 
cggccggtct 
gtatccacat 
ccgctgcttt 
ctttggtttt 
aaccttcatc 
gtaacatagc 
ctactaagaa 
ccggacctat 
gtaccctttc 
taggcttggg 
ctgttgtggg 
tgcctattct 
atatgaccgg 
gaggaaattt 
tgcaactcta 



aaactcaaaa 
ttatatggac 
acttcagtcg 
tatgagcccg 
cagtcttttt 
ggttttttgg 
ttctcttatt 
gactggtctt 
ctcatggcat 
ggtattattt 
aaagaatggt 
tttctgggtg 
ttggttgcct 
gtcgactatc 
tgacaaatgg 
attgactatt 
agccggatta 
ttgtcagttt 
ggtatttgca 
aggtttaggt 
cttcctgcgt 



atttatcctt 
cgacaaatgc 
gcaaccaatt 
atttccggta 
gtctggtctt 
ctgcgtgcat 
gccgattatc 
tataccggac 
accacattcc 
ctgaaagata 
gaaaaagctg 
atattacttt 
actctgttcg 
acgattgcat 
gtacaaaaga 
ccttctttat 
ttattcggta 
gtctcttcaa 
ggagcgttta 
tttgccatgt 
ccgaagacag 



ggatagtggt 
ttagcacaat 
ttggccgttt 
tgattgccga 
ttgtaaccta 
taatgggagt 
atactgaaaa 
aagctattgg 
attggtttgg 
agaaagaaca 
gcttgtttaa 
attttgcagc 
ctgaaaatct 
tatcttcatt 
acatccgtgg 
tgttattggg 
tcggttatgg 
agtaccgtgc 
tcacggattt 
tagctatcat 
ataatatgga 



tgccctcctt 
gaaagatgct 
aatggctgtt 
tagattgaat 
tttgatgggt 
gagcgaagct 
gtcacgttct 
tggatttgga 
tattattggt 
cgttaaaaca 
aggtttgtcg 
acctagtttg 
tgatattcca 
tattggcgtg 
tcgtgtatat 
attcgggcat 
tatttttgat 
gacagcatat 
gttgggtaag 
cgtatttatt 
ataa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1254 



<210> 1496 
<211> 450 
<212> DNA 
<213> B. fragilis 
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<400> 1496 

acggtcgtta ttcctagctt tgccgaaata aaatatactt ttgaaagtat gtatgaatac 60 

aatcagtgtc gtatgtataa agggggtaag tttgatatgc ttcacggaca agataaaacg 12 0 

atccttccat gcctagctat gggaggtccc cagggaggta ttggaggaac tgccaactac 180 

aatggtgtaa atctggttgg tattatagaa gcatggaaag caggtgatct tgagaaagca 240 

cgtgaattac aaaatttctc tcaggaagtt attaatgtca tttgtcattt ccgcgaaaat 3 00 

atcgtaggtg gaaaacgaat catgaagttg ataggattgg atttgggtaa aaatcgtact 3 60 

cctttccaga atatgacgga cgatgaagaa gtacgtatga agcccgaacc gcaagccatt 42 0 

catttcttcg atcgttgcaa taagttttaa 450 



<210> 1497 
<211> 453 
<212> DNA 
<213> B.fragilis 



<400> 1497 

aaggaaatga ttgtatctaa tttgcaaaac agtcaacggg tggaaggact ccacccactg 60 

tttaaaactc tgtttgatta cgtaaaaaca catgatttat ttcatgccga attaggacga 120 

attgagatag atggtgataa tttatttatc aataacgtga atcctgagtg tgttgcacgt 180 

gacaagcaag ttttggaact acatcgcgat tatattgatg tacatatttt gttggaaggt 2 40 

actgagacta ttggttggaa ggctatcgaa gatctgaaag atgaagtgaa accttatgag 3 00 

gcgaacggtg attgtgctct ttactctgat gcacctacca cctttgttga tttgcttcct 360 

gggcaattca tgatagtata tccggaggat cctcatgctc ctcttatagg acaaggtaag 420 

attcgtaaat tgatagcaaa agttaaattg tag 453 



<210> 1498 
<211> 2094 
<212> DNA 
<213> B.fragilis 



<220> 

<221> unsure 
<222> (2002) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 1498 

acatctttat tattaactca aaaagcaata cttatgaaga aaaccatctt cttgattttg 60 

tgcattttat gttctcttgg agccatggca caaaagaaat caatcacagg tgtggttacg 12 0 

gatgctagcg gtgaatcagt catcggagcg agtgttgtcg aggtcggtac caccaatggt 180 

gtgattactg acattgacgg taagtttacg ttgtcggtcg atcctaacgg aaagatcaga 2 40 

gtatcttata tcgggtatca gcctcaggta cttgatgtaa agggcaaaaa ttcttttaat 3 00 

attaaattga aagaagactc tgaaatgctg gaggaagttg ttgtaacggg gtatggtggc 3 60 

aaacagctgc gtacgaaagt gacgaactct attgcaaaag taaaagatga agc&t tgaaa 420 

gtcggcttat tctctaaccc cgctcaggca ctctccggag cagttgcagg tttaaaggtt 480 

acccaagcct ctggtagccc gggtgcggct cctaaagtaa cgcttcgtgg cggtactaac 540 

ttcgatggtt caggtgaccc tctggttatt gtagacggac aattgcgtga cggtatgcag 600 

gatatcaatc cggaggatat tgaatccatg gaagtcttga aggatgccgg agcaaccgct 660 

atttatggtg cgcgagcaag taatggcgta attttaatta ctacaaaaac aggtaaagaa 72 0 

ggacgtcgcg aaatcaactt caaagccaaa atgggtttga gctatgtaaa taacccttat 7 80 

gattttttgg gagccaaaga ttatatcaac gtactgcgta caggctatag taaatccgga 840 

tttacaacct cagacggaga gtatgtctct attgccccac ttggtaactt gacaagtgct 9 00 

tctccattcg gtactggtaa tacactgaat gataaaacga tctggaatat tatgaataaa 9 60 

acggcagaca atgcctatct gttacagaaa ggatggcaag aaatgccgga tcctctggat 1020 

cccagcaaaa ccattttata taaagatact aatccggcag attataacct gaataatccg 1080 

gcaatatctc aggactataa tatcaatatg tccgggggta atgataaggg tacttactat 1140 

gcaggattag gttacaaccg tcaagaggga cttcctatca agacattcta tgagcgctat 1200 

agttttgttt tgaatgccag ttataaaatt acagattggc ttaccagttc atccaatttc 1260 

aattataacc gtgcaaattg gaaaaacatg ccgggatcac aaaccagtga aggcaattac 1320 

ttcggacgta tcatgtctac acctcccact gtccgcttcc aggatgagga tggaaatcca 13 80 
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actttaggtc cggtagctgg tgatggaaac cagaattatc agcccgacaa atggtggaat 1440 

tttaatcaga gtgacaaatt taccatggta caggccttcc agattgatat tttgaaaaat 1500 

ctttctgtaa aaggtactgc caactggtat tactccgaat cattggctga aagtttcacc 1560 

agagactatg aaaacacgcc gggtcaattt gtgagaacac gtagttcttc agcaagtttc 162 0 

tccagagatt tctctcagac ctataatgtg gtattaaact ataatcaaac tttcgctaaa 1680 

gatcataatg tggctgttat gttgggtatg gaatattttg atagatatag ccgcagcttt 1740 

agtgcatccg gttcaggagc tccaacggat gattttgccg atctatcatt gacagataat 1800 

ggagaaggga aacgttccat tgattcagga catagcgatt atcgtattct ttcttatttc 1860 

ggacgtctga attacgacta taaaggccgt tatttacttt ctgctgtctt ccgtcaggat 1920 

ggatattcat ctttattagg tgacaaccgt tggggatttt tcccgggagt ttctgccgga 1980 

tggatttttg gacaagaaaa tntcgtaaaa aatgctctgc ctttcctgtc atttggtaaa 2 040 

ttacgtgcga gttatggtgt aaatggtaac tcaaccggaa ttggtgcgtc ttca 2094 

<210> 1499 
<211> 222 
<212> DNA 
<213> B.fragilis 

<400> 1499 

agcctaaggt ctatcggttt aaaaccaaga cccctttcct ttgaaaccaa cggaactttg 6 0 

gttttaaagg tattttcacg ttggtacaat actatctatc agcacattgt gagtgagaca 12 0 

ttcgccaaag caggctcctc ggttcgcatc ttaattactg acaaaaagaa ctacaaaccc 180 

acttccgaga gatgtgtcga aactcttttc accacagagt ag 222 

<210> 1500 
<211> 990 
<212> DNA 
<213> B. fragilis 

<400> 1500 

agaaaaaaca agaagatggg attattcata aagaaaccct ttgaagccct attggcagag 60 

gccaatgcgt cgggcagtaa atcattaaaa cgagtattag gcccctggag tctggtagca 12 0 

ctgggcgtcg gtgttatcat cggagcagga ctcttctcaa tcaccggcac cgtagcagcg 180 

ggctacaccg gaccggccat caccctttca ttcgccatag ctgcactcgg atgctgcttc 240 

gcaggactct gctacgctga gttcgcttct atgattccgg tggcaggcag tgcttatacg 300 

tattcatacg ccaccatggg cgaactgata gcctggatca tcggctggga tctcgttctc 3 60 

gaatataccg tagcagccac taccgtcagt atcagttgga gccgatatct cgtcgtcttt 42 0 

cttgaaggac tcgatataca tctgccgcaa gccctgaccg cctgcccatg ggatggagga 480 

atcgtcaata tcccggcgtt cctgatcgta gtgttgatga gcatcttcct gattcgcgga 540 

acagaaggca gctccatctt caacggcatc attgtatttc tcaaagtatc ggtgatcgcc 600 

atattcgttg tcctgcgctg gaaatatatc aatgccgaca actatactcc atacatcccc 660 

gccaatacag gtacactggg cgaatacggt ctctcgggtg tcctgcgtgg agccgccatc 72 0 

gttttctttg ctttcctggg attcgatgcc gtcagtacgg ctgcacagga aacaaaaaat 7 80 

tccgaaacgg aatatgccga tcggtattct ggtatcactc ttggtatgta ccgtacttta 840 

tatgcctgtt gcccacgtaa tgacaggagt agcccattaa taccgaattt taacggccag 900 

aagggcatcg caccggtagc cattgccatt cgaacacatg ggacatgccg atgcaacagg 9 60 

gcatcattca cccggattat cccgtggtga 990 

<210> 1501 
<211> 351 
<212> DNA 
<213> B. fragilis 

<400> 1501 

aagtggattg aagaggcgcg tgcgttgggt gatacacccg aagaaaagga attgtacgaa 60 

tggaatgccc gtgtacagat tacgacctgg ggtaaccgga acgcggccga ttacggtggt 120 

ctccgagact atgctcacaa agagtggaac ggcttgctga aagatttcta ttacatgcgt 180 

tggaaactat atttcgactt tctttctcag cggatagagg gaaagacccc tgcggaaatt 240 

gatttctatg ccatagagga accttggacg aaagctgcca atccctattc tgccgaggcg 3 00 
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gaaggagact gcattgaagt agcgaagcag gtgatgcaag cggttgaata a 3 51 

<210> 1502 
<211> 609 
<212> DNA 
<213> B.fragilis 

<400> 1502 

gccatgacgg actacttccc cccattcttt tcacacatca acgagaagtt ccgtacaccg 60 

gcgcgcagta acctcctgtt tatgctgata gtgggcctgc tcgccgcatt tgttccggca 12 0 

cgcctggcgg gagagatgac cagcatcggt acactgatgg ctttcacact ggtatgcgca 180 

gccgtcctcg tagtgcgaaa gaccatgccg aacgtacccc gttcatttaa aactccgttt 240 

gtgcctctcg tccccattct gggaatactc acttgtctgt gcatgatgct tttccttccg 3 00 

gccgatacct ggatacgatt agtactgtgg atgctgatcg gactggacat ctatgtcggc 3 60 

tacggcatga aacacagtaa actggaacat ggtgtgaaaa atcgccgggg acaatcggca 42 0 

ttgaacatga tcggcattgc actgtctctg ctttgtgtca ttaccggctt atggcatcag 480 

cagactgtag gttggaatga aagtaaaata ttgctgatca tctcgtttgt tttcgcattt 540 

acgcattgtg catattatat gatgcggata tggaaaggga caacaaaaca aacgaatgac 600 

aacggttaa 609 

<210> 1503 
<211> 1298 
<212> DNA 
<213> B.fragilis 

<400> 1503 

cattgtgctg tgctcctttt gccatgcagg cacagcagcc acaaatatat accctgaaat 60 

cctgcctgga gtatggactt cagaataact actccctgca aatagtccgc aatgaagagc 120 

aagtgagcag gaacaacgcc accccgggca acgcaggcta tctgccgaca ctcgatttta 180 

cggcaggata caaggggaca gtggacaaca ctaataccaa ggtccgggcc accggagaat 2 40 

cagtaaaaga aaacggtgtc ttcgaccaaa ccttgaatgt gggtctgaac ttgaactgga 3 00 

ccatcttcga tggctttaac attacagcca attaccagaa actgaaagag ctgcaactac 3 60 

agggagaaac caatacccgt atcgccatcg aggacctgat agccaatctg gcagccgaat 42 0 

attacaacta tgttcagcaa aaaatccgct tgcagaattt ccgttatgcg gtatctttgt 480 

cgaaagagcg cctgcgaatc gtagaagaac gttaccacat cggtaacttc tcccgtctgg 540 

actatcaaca agcaaaagtg gacttcaacg ccgacagcgc caaatacatg aagcaacagg 600 

aattgctgca tacctcgcgc atccagttaa acgaactgat ggccaacgaa gatgtggatc 660 

aaccgctcgt gatagaagac agcattataa aagtgaatgc cgggcttcga ttcgaagagt 72 0 

tgtggaatgc taccttactg acgaacgcgt cactgttgaa agctgaacaa aacaacacgc 7 80 

tcgccatgct ggactataag aaagtaaact cgcgcaacta cccttacctg aagatgaata 840 

ccggatacgg atataccttc aataagtacg atattgccgc taatagccaa agaggtaatc 9 00 

tgggagccaa ttttggagta acggtaggct tcaacatctt cgatggaaat cgccgacgcg 9 60 

aaaagaacaa tgcacgcatt gccataaaaa acgcccgcct tcaacgtgaa caactggaac 102 0 

aagggctgaa agcagacctt agcaatctgt ggcaggccta ccagaataat ctgcaaatgc 1080 

tgaaactgga acgacagaat ctggtagccg ccaaagagaa tcacgagata gctatggaac 1140 

gctacatgct aggcaacctt tcgggtatcg aaatgcggga agcgcagaaa agtttgctgg 12 00 

atgccgaaga acgcatactt tcggctgaat acgataccaa gttatgtgag atttcacttt 1260 

tacaaatcag tggaaagatc acgaaatacc tggaatag 12 98 

<210> 1504 
<211> 1341 
<212> DNA 
<213> B.fragilis 

<400> 1504 

agacctgtgt ggcgcaatgt gcttgataag ttgggatata ccaagacaga gatcaatgaa 60 

ttcatttcgg gtccgggatt ttttgcctgg tgggttatga ataacttggg agggtggggt 120 

ggtcccaatc ccgacagctg gtatacccgg cagattgctt tgcaaaaaaa gatcctgaag 180 

cgtatgcgcg aatacggtat agagccggtg cttccgggct attgcggcat ggtacctcat 2 40 



599 



aatgcgaaag agaaactcgg cctgaacgta tccgatccgg gaacatggtg tggctaccgt 3 00 

cgtccggcgt tcctgcaacc gagtgatccg cgtttcgagg agatttcttc tctttactac 3 60 

aaagaacttg agaaactgta cggcaaagct aacttttact ccatggaccc ctttcacgaa 42 0 

gggggaaaca ctgcaggtgt cgacctcgat gcagccggta aggcagtgat gaaagctatg 480 

aagaaggcca atccgaaggc tgtctgggtg gctcaggcct ggcaggcaaa tccgcgtccc 540 

aagatgattg agaacctgaa agctggagat ttgctgatac ttgacctgac cagtgagtgc 600 

cgtccgcagt ggggagattc tacttccgag tggtatcgca agaacggata cgggcagcac 660 

gattggatct attgtatgct tttgaattat ggtggcaatg tgggattgca cggaaagatg 72 0 

gacaatgtga tcgataactt ctatcttgcc aaagccgatc cgcatgcaag cgctacgctg 780 

aaaggggtgg gaatgactcc tgaagggatt gaaaacaatc cggtgatgta cgagctggtg 840 

atggagctgc cttggcgtcc cgaccggttc acgaaggaag agtggctgaa ggagtatgta 9 00 

aaagcccgtt atggcgttga tgatccggtg gtacaggctg cctggaccaa tctggcgaac 9 60 

tctatttata actcgccgaa gaacctgacc cagcagggga cacacgaatc agtattttgt 102 0 

gcccgtccgg cggaagatgt gtaccaggtg tccagctggt cggaaatgaa agattactac 1080 

cgtccgcagg aggtgataga agctgcccgc ctgatggttt ccgtagccga tcgctttaaa 1140 

ggtaacaata attttgaata cgatttggta gatattgtcc gccaggcact ggcagagaag 12 00 

ggacgtctga tgcagaaagc tgtgactgcc gcttatcgtg caggtgataa acaactcttt 12 60 

gcactggcat cgggaaagtt cctcgacctg attttgttgc aggataaact gttgggaacc 132 0 

cgtccggagt ttcgagtatg a 1341 



<210> 1505 
<211> 903 
<212> DNA 
<213> B.fragilis 



<400> 1505 

aaatgtatat caatgaaacc ttaccatatt aacaagaagc aaatccttat catgggctgc 60 

ctgggaatgt ttcccttgtt ttcttccgcc caggacattt tatcaacgtc aggtacctcg 120 

cgttgggatt attcaaacag ccgtgtggag cgtgaaccgg gtagagatgc gttggatatt 180 

actttcagtg tcttccctct ggcagggctt ggaagccagg aagtcgctta cctgttcccc 240 

gtctatgtct cggctgacgg gcgtgacagc gtccgcctcg aaccggtctg cgtggccggc 3 00 

aagagaagat ataaggtgat caagcgtcga aaggcgttgg gcaacctgaa acccggtaac 3 60 

cccggatccg gcgaggtccg ttccgccaag gtgctggaaa gttcgggcct gactgtgaaa 42 0 

aggagcgtcc cttttgaacg gtggatggct gacgggcgtc tcgttgtcag ggaggtgtcg 480 

tacggctgtg ccgagtgcgg cacgaacgag agcgaggata ttgcctttca ggccggtatc 540 

cccctcttcg gggaaaagga ttatgcttac agtttcatcg aaccggaaaa ggtcatgatc 600 

aaatgctaca aggattcctt cgactgtaag gtggtctttc ctgttgcacg ccacgacctg 660 

caggaagatt ttgcaggcaa tgctcaggag cttgacagtc tgaagaagtt tctctcggag 72 0 

aacatgaata ttcaaggaac gtcactcaag gaggtacata taaaaggcta tgcctctccc 7 80 

gaggggagct tcgattacaa caggtccctc gcgcaacgac gcacccaaac cctttcagat 840 

tatatttcgc gtcaataccc cgccttgtct tcaccacggg gctggaagaa tccgcgctgg 900 

cgc 9 03 



<210> 1506 
<211> 219 
<212> DNA 
<213> B.fragilis 



<400> 1506 

ttgaccgggc ccgaccgcga aagatatcag ctcaaattct tcagttcccg tctggacgat 60 

ggacggcggt taagcactcc caaagaggct tttaccaccg ataattttac cattcccgga 120 

tgtttctgta aatcccgcct tggtgcacgt ggcgggatta tacaggaaaa aaaaggaaca 180 
ccttatttag ctaatttatc ttataccgtc ttcaattga 219 



<210> 1507 
<211> 3000 
<212> DNA 
<213> B . fragilis 



600 



<400> 1507 

tttatgaata cacttcaaaa aacagcggga gcaattctgt cgctctcgct attgtgtggg 60 

atgcaggttt atgcgttccc tgactaccct tccctgaggg gacaaattat cgaacagagt 120 

gatatctgtc aaggagttgt caaagatgcc aatggcgaaa gcattatcgg cgcatccgta 180 

cttgtcaaag ggacgacaaa cggcagcatt acgggacttg acggtgattt ttcccttagg 240 

aacgtaaaaa aaggggatat aattgttgtt tcttatgtcg gctatcagag ccaggaaata 3 00 

gcctggacag gtgaaccgtt aaacatcgtt ttgaaggaag atgccgaagt ccttgacgag 3 60 

gtggttgtta ttggctatgg tgctgtaaga aaagcggata tggccggttc cgtagccgtg 42 0 

ctggacaaca aaaatttcaa ggaccagcct ataacccagg ttgcagacgc tcttcagggg 480 

cgtgtcagtg gcgttcatgt tgagaacagc ggcgtccccg ggggaagcgt gaaaatccgc 540 

atccgcggtg ctaactccat cagcaaaagc aacgaccctt tgtatgtggt tgacggaatc 600 

gtccgcgaaa gcggtttgga cggtatcaat cccgaggaca tccgttcgat gcaggttctc 660 

aaagatgcat cgtccactgc catatacggt tccaggggtt cgaacggggt cgttctgatc 72 0 

actaccaaga ttggtaaggc cggtgtgcgt gaaattatgt tcgacgcatc agtgggagtt 7 80 

tccaatgtat ataagagata cgatatcctt ggagcatacg attacgctct ggcgctcaag 840 

gaggtcaaag gtattgattt ctcaaacgaa gagatgcaat cctatcagaa cggaacgggg 900 

ggcatcgact ggcaggatga gattttccgt acggggatca cccagaatta caagttggct 9 60 

ctttccaatg gtagcgagaa gacccaatat tacatttccg ccaactacat gagccaggaa 1020 

ggtgtggtca ttgaatcgaa gaacgagcgt tatcaggcga aggcgaatct ttcctcacag 1080 

cttaccgact ggctgcatat cacggctgac atcaatgctt cccacggtgt gcgtcgcggg 1140 

ggatcttttg cctccggtaa ggataatccg atctggattg ctttgaacta ttcgcctacc 12 00 

atgacgatga tggccgaaaa cgggaactat aacaccgata cttataattc catagcttcc 1260 

aacccggtcg gtatcctaaa gttgcagtcg ggagagacaa tgaccaatgt tttcaacgga 1320 

cgtgttgatt tgcgccttga tataatgaag gggctgacat tcaccacaac caatggcgtg 13 80 

gactattatg acgggaaaag ttatagtttc agctccaaac gagtggggac caagagtggt 1440 

atgggcaata acgatactta tcgtctgatg ctgcaaagct ccaacaatct gacatataca 1500 

ggttcctgga atgaccatca tctgacagcc acggctgtat atgaggtgac ctcctccgaa 1560 

accaggacaa tgggtataac cggtaacaac ctgctgaccg agggtgtggg ctggtggaat 162 0 

gtgggtatgg cttcctctcg tgatgcaaac aatggttacg agcaatgggc gctgatgtcg 1680 

ggagtggccc gtgtcatgta caattttaag gatcgttata tgcttaccgg aacatttcgt 1740 

gccgacggtt cttctcgctt tgccaaaaag aaatgggggt actttccttc tattgccgca 1800 

gcttggacgt tgagcaatga ggatttcatg aaggatgttt cctcggttca ggacattaaa 1860 

cttcgcgcca gttatggtat tgtgggtagc caggccatca gtccttacgc gaccatgggc 192 0 

cttatgagtg ctactgcata caattttgga accaacagta attttaccgg ttattgggca 1980 

aatgacatag cgactcccga gcttacgtgg gagaagacca agcagttcga cctgggtctg 2 040 

gagttctccc tgttcgacag gcggctgaat ttcagtgtgg actatttcta taagcgtact 2100 

acagatgcgt tgcttaaaag aagtatcccc ggatatgtcg gcggtaactc tttttgggtg 2160 

aatgacggcg aaatcagtaa tcgtggtatt gatttgagcg tgaccgcccg gattatgcag 222 0 

aatgaccggt tccaatggac ctccaccttg aacggtactt atttgaagaa ccgcgtggaa 22 80 

cgcctttccg gtggtgagaa tgatttcatc aacggctcca gtccggctgc cggtatggtt 2340 

gattatgcca ccattatcaa gccgggtgag gccatcggta ctttttgggg atatgaatgg 2 400 

accggcttgg atgaaaacgg acatgacact tacacggatg tagatggaaa tcagatgata 2460 

gacggtggcg accgcaaggt catcggcaag gccaatcccg atttcaccct gggatggaat 2520 

aactccctgt cttataaaaa ctgggatctg aacctgttct tcaacggatc tttcggagcc 2580 

aaacgtttga atcttgtaag atatacgatg gcttcggccg aagggaattc ccgctttgta 2 640 

actctggcag acgcctatct gaagggattc gacaagattg gctcttcggc aacatacccg 2700 

agcctgaccg aaggtggaaa taatctgcag ccggtttcaa ccaaatggct ggagaacgcg 2 7 60 

gacttccttc gtctggagaa tatcagtctt tcatacactt tccccaaaaa gacaacggga 2 82 0 

ttcgctgatt tgcggcttac tttcagctgt cagaaccttt ttacaataac cgggtacaag 2 880 

gggatggatc cggccggtac taccttctcg aacagcagcg ttgacgttga cgcgggtatt 2 9 40 

gacatgggag cctatccttc cccaagaaca ttcacattcg gtctccggat gaatttttaa 3000 

<210> 1508 
<211> 207 
<212> DNA 
<213> B.fragilis 



<400> 1508 

cgccccggca cgttcaccgg tcacgaccac ttccccgagt ttttgtatgt ccggggagag 



60 



601 



acatacggta tgtatgctgc tttctttgaa tacctcgttc aaaggcacac gccgggaacc 12 0 

gtaacccagg cactgcacga ggaccgtatc catgaaacac gttccatgac cgaaaacaat 180 

cattccacgg ggggagctga caagtaa 2 07 

<210> 1509 
<211> 864 
<212> DNA 
<213> B. fragilis 

<400> 1509 

gcaacaaaaa cagaaacaat ggaacttttt tatttgaacg agcacacgtc ttgttataat 60 

tactcaaaat ctatgcagga gggattccgg tattataaat ttgacgaagg attgaaccac 12 0 

gaggaggaat tggtaaaaga ttgtatcctg ttcgtactca aggggagtct gcaattctcc 180 

tgcaacggtt ttcagttcac tgtttcatca ggagagatgg tctttttttg ccgtgacagc 240 

ctgtttaaca cccaatcact ggaaaaatgt gaagtcgtgg ctgccctgtt cgagggggga 3 00 

gtatggccct gtcaaagggc ctccttttca gagttatacc acctgaggga aattgtcgaa 3 60 

tatcgtatgg aacctcttga aatcagggat cggttatgca aatttcttga actactggtg 42 0 

tgttatctgg aagatggagc gaactgtatc cattttcacg aaataaaact caaggaactt 480 

ttctggaaca ttcgctttta ctattccaga caggaacttg cgaatttttt ctatatgatt 540 

ataggccgtt cgcaagactt caaaaataag gtcttgaaca attacaaaag ttgcaggacc 600 

gtaaaagaac tcgcatcggc ctgtgatatc tccctttccg ccttcaaaag acagttttcc 660 

gcggagtttg gagaggctcc agccgaatgg atgcagaaac agttgttggg agaaatcaaa 72 0 

tacaaacttt cagttacaga cctgccgttg ggaaccattg ccaatgaact ggagttttcc 7 80 

tctcttgcac acttctccag attttgcaaa agatgtctgg gatgttctcc cagagagtta 840 

agacagcaga taaaaggcgg gtaa 8 64 

<210> 1510 
<211> 624 
<212> DNA 
<213> B . fragilis 

<400> 1510 

agtaacgaaa caatgaataa cagtatgatg aaaccgtttt ttcttctggt gtttttgctt 60 

tgttcggctg gtgccttctc tcaaaaggtt gccttgaaga acaatctggc ctacgatgcc 12 0 

ttgaaaactc ccaacctctc cttggagttc tccatggggc gtaaatggac ccttgacaca 180 

caggtgggta tgaatttctt tttctacacg cgggacgcca cgtcttcccg gtacaaggca 2 40 

aagaagttca gtcactggct tgtccagccg gagctccgtt attggacctg tgacgttttc 3 00 

aacggctggt tcttcgggtt gcatgcccat ggcggccaga tgaatatcgg aggtgtcgat 3 60 

gtccccttcg tgcttcagaa aggggacgga aacatgaagg accaccgtta cgagggttat 42 0 

ttctggggag gcgggttgag cgccggttac cagtgggtgc tttccaaccg tttcaatatc 480 

gaggcctcct tgggtatcgg ttacgtgcac gcccgttatg acaagtacaa gtgtaccacc 540 

tgcgggcaga agctgggcaa gggggatgcg gactacatag gtcccaccag ggccgccatt 600 

tcaattattt acatgttgaa ataa 624 

<210> 1511 
<211> 291 
<212> DNA 
<213> B. fragilis 

<400> 1511 

aaatcagatg cgttttccgt tcgttcattg tcacatccta caaatataaa tccggccaag 60 

agcaatacaa cactccataa tgtgaattta cttttcatat ttatatttac aacaataatt 12 0 

gataaaaata atatatctat aaatcttgcc aagaagaata ttaatcatct aaggcaaaca 180 

aattatctaa ttaacaacaa tatatttatt gaatataata cgcagcccat tatttattgt 240 

gcaaagaaaa aacaagataa aatattattg attatcattt tagccctata a 2 91 

<210> 1512 
<211> 1404 
<212> DNA 



602 



<213> B.fragilis 



<400> 1512 
aaaaaatcct 
ccgtctttcg 
atgcatataa 
aaaaaactta 
aaattcccgt 
ttcagtatcc 
aacaccccgg 
ttaaagtgta 
aggacttacg 
cgagagcttc 
tccatgcaga 
atttccgact 
tatctgcata 
ccttgcccat 
ccggttgtga 
gcccgggaca 
ttgctaaaaa 
caattgatgc 
gtaccgggag 
agaagtgcct 
gaattcccgc 
ggtgcctcgg 
atctatctga 
ttcattttag 



atacatacaa 
tgcaattttc 
gtttatgctt 
tggcaacagt 
tagtcgtgca 
cggataccct 
agtcgattcg 
tttcaatgat 
gcgtgttaac 
gtgaaagcgg 
ggtacctggg 
accaggggaa 
atatcaaggc 
ttcgcgatat 
ttaaaacact 
ttttcatgtt 
agaggcatgt 
gggtaggtat 
aatacatctt 
accataggat 
tgagacttca 
ttcatgttat 
aagaactgga 
cggtggatga 



gacgtaccgt 
ccgcttggtt 
cctttccagg 
taaaacagta 
ggttcttcac 
gtttgatccc 
gaggatcaac 
agagaagaaa 
ccgcgaagcc 
tcatgaggga 
aaaaaaggat 
actactcgct 
tcttttcagg 
aagcattaaa 
atccgcaatt 
cagcttttac 
tttcccgggt 
aaacaaggag 
cccgttgttt 
tcgttactcc 
cgcggcccgc 
tggtgagtgt 
tcattccgca 
ttga 



tcccttaccg 
actgaacttt 
aaacaaggaa 
ttggtaaaag 
aagcgaagga 
gtaaaaggga 
aacaggtgcg 
tcccgggaat 
ggtttttatt 
acagcccggg 
tttcctttca 
tcgggtatat 
aagggatgtc 
acggaaaaaa 
gaactggaaa 
acgcgtggaa 
gaaatctgtt 
atcagtcgta 
ccggcagaac 
ctgggaaaga 
cattcatggg 
ctggggcata 
ctggatgcgg 



acgggtctgg 


aaccgacttc 


60 


tttggtcttt 


ttccgatttg 


120 


gcgtcaaatg 


tctaactaat 


180 


gaaggagcaa 


ccggtccggc 


240 


aaaaggttgt 


ttatacagga 


300 


gggttatcga 


cggcggggaa 


360 


aaagtatctc 


aagggtactg 


420 


acgaaataga 


ggacgtattc 


480 


tctatttttc 


acggaaaatc 


540 


catacgcctc 


cagcctgcgt 


600 


taaaactctc 


ttcccggatc 


660 


gcgataacac 


cattggtttt 


720 


gggagatggg 


cctggaactt 


780 


ctctcaaacg 


ctcccttgat 


840 


aggacagccc 


gttatctctt 


900 


tgt catt cgt 


cgatat tgca 


960 


acaggcgtca 


caagaccgat 


1020 


tcctggaaag 


atacaagaat 


1080 


gggatccgta 


cgctggttat 


1140 


tttcccgggt 


cattggtttg 


1200 


ccacgatagc 


caaggtgaac 


1260 


cgtcggaaaa 


aacgacccga 


1320 


ttaataatca 


agtggctgac 


1380 
1404 



<210> 1513 
<211> 1461 
<212> DNA 
<213> B.fragilis 



<400> 1513 
aaccggctgc 
cttgtcgaat 
cgaagccatc 
caggttcaga 
attggccttg 
cgtgtaagtg 
gatggcctca 
gccgttgatg 
accgttcaag 
caaatcaata 
atatccgggg 
actgaaattc 
cttctcccac 
gttggttcca 
ggcctggcta 
atccttcatg 
ccatttcttt 
acgatcctta 
accattgttt 
cagcaggttg 
agccgtggct 
ttgcagcatc 
ggagctgaaa 
cagccccttc 
ctctcccgac 



agattatttc 
cccttcagat 
gtatatctta 
tcccagtttt 
ccgatgacct 
tcatgtccgt 
cccggcttga 
aaatcattct 
gtggaggtcc 
ccacgattac 
atacttcttt 
agccgcctgt 
gtaagctcgg 
aaattgtatg 
cccacaatac 
aaatcctcat 
ttggcaaagc 
aaattgtaca 
gcatcacgag 
ttaccggtta 
gtcagatgat 
agacgataag 
ctataacttt 
attatatcaa 
tgcaacttta 



caccttcggt 
aggcgtctgc 
caagattcaa 
tataagacag 
tgcggtcgcc 
tttcatccaa 
taatggtggc 
caccaccgga 
attggaaccg 
tgatttcgcc 
taagcaacgc 
cgaacaggga 
gagtcgctat 
cagtagcact 
cataactggc 
tgctcaacgt 
gagaagaacc 
tgacacgggc 
aggaagccat 
tacccattgt 
ggtcattcca 
tatcgttatt 
tcccgtcata 
ggcgcaaatc 

g 



caggctcggg 
cagagttaca 
acgtttggct 
ggagttattc 
accgtctatc 
gccggtccat 
ataatcaacc 
aaggcgttcc 
gtcattctgc 
gtcattcacc 
atctgtagta 
gaactccaga 
gtcatttgcc 
cataaggccc 
gcgaagttta 
ccaagctgcg 
gtcggcacga 
cactcccgac 
acccacattc 
cctggtttcg 
ggaacctgta 
gcccatacca 
atagtccacg 
aacacgtccg 



tatgttgccg 


aagagccaat 


60 


aagcgggaat 


tcccttcggc 


120 


ccgaaagatc 


cgttgaagaa 


180 


catcccaggg 


tgaaatcggg 


240 


atctgatttc 


catctacatc 


300 


tcatatcccc 


aaaaagtacc 


360 


ataccggcag 


ccggactgga 


420 


acgcggttct 


tcaaataagt 


480 


ataatccggg 


cggtcacgct 


540 


caaaaagagt 


taccgccgac 


600 


cgcttataga 


aatagtccac 


660 


cccaggtcga 


actgcttggt 


720 


caataaccgg 


taaaattact 


780 


atggtcgcgt 


aaggactgat 


840 


atgtcctgaa 


ccgaggaaac 


900 


gcaatagaag 


gaaagtaccc 


960 


aatgttccgg 


taagcatata 


1020 


atcagcgccc 


attgctcgta 


1080 


caccagccca 


caccctcggt 


1140 


gaggaggtca 


cctcatatac 


1200 


tatgtcagat 


tgttggagct 


1260 


ctcttggtcc 


ccactcgttt 


1320 


ccattggttg 


tggtgaatgt 


1380 


ttgaaaacat 


tggtcattgt 


1440 






1461 



603 



<210> 1514 
<211> 1029 
<212> DNA 
<213> B.fragilis 



<400> 1514 

ataaaaagaa aaatgagaac aattgcatgt aaaaccgtgt gggcacttct gataggagtg 60 

tcccttgtcc tgtcgctgaa ctcttgcagc aaggatcctg taataccaga agacgagacg 12 0 

aagaacaaac tgcatgagga cccggcaaaa atgaccgtcc gcctcgttga atgccacctg 180 

cacgctgact ggaacgagat acagaaggcc ggaggtcccc accaaaatcc ggaatccccg 240 

gccaggtata tgaaacgtgt ccaggagatc acttatgaac tgaagaccgg cagtggatgg 3 00 

acccttgctg aaggaagcca gggcaagttt tacgttcaga aaaacggcga atataaaaat 3 60 

ggaaacaact ttaccccggc cccggtttac ctgatgttta tctattacta caattccaaa 420 

ggagagttga tgaacggcca gttcgtggag aacgggcagg agaatatcca ccagcatttc 48 0 

ttcaccccgg agaacgtgag acctaccttt gacgggaaac cggaagctga cgacaatgat 540 

ccggaggcac tggtggatta tctctatgtg gataccacgc cctgggacaa gaccaaacat 600 

gacaacgagg cggaaattac gggaagcact aacccggtag gattaaaagg agttatccgg 66 0 

ttcctgaagg accgcaagga gtttgacctg aaactccgcc tgtatcacgg ctacaattct 72 0 

aaaaagaacc cgcagacaaa cggctttgac ccgttctaca agccctccgg ggtattgatc 7 80 

cagcgtggaa catgggatat taacctgagc atcccggtag tggtgttttg gagccgcgag 840 

gagtttgttg atgtggaccc ggaggcagat gtgaacctga tcggggagga tagcctggat 900 

gaagacagca accgcacgct acactccatc atgaaaacct tcagtcttac atggaaggag 9 60 

gcgcttgagg agttcatttc ctatacctac caggcggggg atgtggaagc tggatccata 102 0 

tggctttga 1029 

<210> 1515 
<211> 198 
<212> DNA 
<213> B.fragilis 



<400> 1515 

tgccacggga acgatctcca caccccattc ccctatcagc tcaaaagtca cgtgcaggtc 60 

acataccctt ctttccgaaa acatatgacc ggaaaggcag gagcagagaa cacggaaaaa 12 0 

acaattcaaa atttagtaca tgatattaaa aaattataca ttgaaaatca gaggagcgac 180 

ccgccaatca actggtga 19 8 



<210> 1516 
<211> 783 
<212> DNA 
<213> B. fragilis 



<220> 

<221> unsure 
<222> 

(210) , (221) , (222) , (237) , (242) , (251) , (2 55) , (2 56) , (2 64) f (265) , (266) , (267) , (271) , (2 
90) , (291) , (292) , (2 93) , (3 02) , (319) , (322) , (323) , (324) , ( 32 5 ) , ( 32 6 ) , ( 327 ) , ( 33 8 ) , ( 3 59 
) , (376) , (377) , (379) , (3 95) , (396) , (398) , (401) , (406) , (407) , (410) , (411) , (412) , (428) , 
(429) , (430) , (431) , (432) , (438) , (439) , (442) , (443) , (444) , (445) , (446) , (449) , (450) , (4 
51) , (452) , (453) , (454) , (455) , (456) , (457) , (458) , (459) , (460) , (461) , (462 ) , (463 ) , (464 
) , (465) , (467) , (468) , (469) , (470) , (471) , (472) , (473) , (474) , (475) , (476) , (477) , (478) , 
(479) , (480) , (481) , (483) , (485) , (486) , (487) , (489) , (490) , (491) , (492) , (493) , (494) , (4 
95) , (496) , (497) , (498) , (499) , ( 500 ) , ( 501 ) , ( 502 ) , ( 503 ) , ( 504 ) , ( 505 ) , ( 506 ) , ( 507 ) , (508 
) , (509) , (510) , (511) , (512) , (513) , (514) , (515) , (516) , (517) , (518) , (520) , (521) , (522) , 
(523), (524), (525), (526), (527), (528), (529), (530), (531), (532), (533), (534), (535), (5 
36) , (537) , (538) , (539) , (540) , (541) , (542) , (543) , (544) f (545) , (546) , (547) , (548) , (549 
) , (550) , (551) , (552) , (553) , (554) , (555) , (556) , (557) , (558) , (559) , (560) , ( 561 ) , ( 562 ) , 
(563) , (564) , (565) , (566) , (567) , (568) , (569) , (570) , (571) , ( 572 ) , ( 573 ) , (574) , (575) , (5 
76) , (577) , (578) , (579) , (580) , (581) , (582) , (583) , (584) , (585) , (586) , (587) , (588) , (589 



604 



) (590) (591) , (592) , (593) , (594) , (595) , (596) , (597) , (598) , (599) , (600) , (601) , (602) , 
(603) , (604) , (605) , (606) , (607) , (608) , (609) , (610) , (611) , (612) , (613) , (614) , (615) , (6 
16) (617) , (618) , (619) , (620) , (621) , (622) , (623) , (624) , (625) , (626) , (627) , (628) , (629 
) (630) , (631) , (632) , (633) , (634) , (635) , (636) , (637) , (638) , (639) , (640) , (641) , (642) , 
(643) (644) , (645) , (646) , ( 647 ) , ( 648 ) , ( 649 ) , ( 65 0 ) , ( 651 ) , ( 652 ) , ( 653 ) , (654) , (655) , (6 
56) (657) , (658) , (659) , (660) , (661) , (662) , (663) , (664), (665), (666) , (667) , (668) , (669 
) (670) , (671) , (672) , (673) , (674) , (675) , (676) , (677) , (678) , (679) , (680) , (681) , (682) , 
(683), (684), (685), (686), (687) , (688), (689) , (690), (691), (692) , (693) , (694) , (695) f (6 
96) (697) , (698) , (699) , (700) , (701) , (702) , (703) , (704) , (705) , (706) , (707) , (708) , (709 
) (710) (711) , (712) , (713) , (714) , (715) , (716) , (717) , (718) , (719) , (720) , (721) , (722) , 
(723), (724), (725), (726), (727), (728), (729), (730), (731), (732), (733), (734), (735), (7 

36) (737) , (738) , (739) , (740), (741) , (742) , (743) , (744) , (745), (746) , (747) , (748) , (749 
) , (750) , (751) , (752) , (753) , (754) , (755) , (756) , (757) , (758) , (759) , (760) , (761) , (762) , 
(763) , (764) , (765) , (766) , (767) , (768) , (769) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



60 

120 

180 

240 

300 



<400> 1516 

cgcggcgagg acgccgcgcg accgcgcgca acaagcgcgc gccgacgagc cccggcgccc 
acgcacgacg cggacaggag cacacgagca ccgcgcacga caggcagggc cccgcccgcg 
cgaacaggca cgccgcaacc cacaacccca gaccgcgcgg cgcgccgcgc acggggagca 
agggagagag ggaggaagag acggaggaan acgcgaaaaa nnaaggaaga aaaaaanaaa 
anaaccaaga naaannaaaa aaannnnaaa naaaaaaaaa agaaaaaaan nnnaaaaaaa 
anaaaaaaaa aaaaaaaana annnnnnaaa aaaaaaanaa aaaaaaaaaa aaacaaaana 3 60 
aaaaaaaaaa aaaggnnana aaaaaaaaaa aaaannanaa naaaannaan nnaaaaaaaa 42 0 
aaaaaaannn nnaaaaanna gnnnnnaann nnnnnnnnnn nnnnnannnn nnnnnnnnnn 480 
nanannnann nnnnnnnnnn nnnnnnnnnn nnnnnnnnan nnnnnnnnnn nnnnnnnnnn 540 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 60 0 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 660 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 72 0 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnc accaatgaat 7 80 

7 83 

tga 



<210> 1517 
<211> 330 
<212> DNA 
<213> B.fragilis 



<400> 1517 

caaacacaca gggggtgcac ggaggaaatg cgggaaaaga atgacgatga catctgtttc 
cccggagata agatacctta caggcctgcc agtcggaata taatcgtata taaaaatccc 
tgcctcgaca aacggggaga gcgtaaagag acagatggta caattttcag gagaatcgtg 
acgggagtgt ccctgttcgt tatgggatac gcacgaatgt tccccgtgaa catgaaaagc 
tttcacgagg aaaaagggca tcaatgtcag caacaacata catgctatga cagcccgctg 
tctctgtctt tttttcgtac gatccattag 
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120 

180 

240 

300 
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<210> 1518 
<211> 780 
<212> DNA 
<213> B. fragilis 



<220> 

<221> unsure 

"(212 )\ (223) , (224) , (239) , (244) , (253) , (257) , (258) , (266) , (267) , (268) , (269) , (273) , (2 
92) , (293) , (294) , (295) , (304) , (321) , (324) , (325) , (326) , (327) , (328) , (329) , (340) , (361 
) (378) (379) , (381) , (397) , (398) , (400) , (403) , (408) , (409) , (412) , (413) , (414) , (430) , 
(431) (432) , (433) , (434) , (440) , (441) , (444) , (445) , (446) , (447) , (448) , (451) , (452) , (4 
53) (454) , (455) , (456) , (457) , (458) , (459) , (460) , (461) , (462) , (463) , (464) , (465) , (466 
) (467) , (469) , (470) , (471) , (472) , (473) , (474) , (475) , (476) , (477) , (478) , (479) , (480) , 



605 



(481), (482), (483), (485), (487), (488), (489), (491), (492), (493), (494), (495), (496), (4 

97) (498) , (499) , (500) , (501) , (502) , (503) , (504) , (505) , (506) , (507) , (508) , (509) , (510 
) , (511) , (512) , (513) , (514) , (515) , (516) , (517) , (518) , (519) , (520) , (522) , (523) , (524) , 
(525) , (526) , (527) , (528) , (529) , (530) , (531) , (532) , (533) , (534) , (535) , (536) , (537) , (5 
38) (539) , (540) , (541) , (542) , (543) , (544) , (545) , (546) , (547) , (548) , (549) , (550) , (551 
) , (552) , (553) , (554) , (555) , (556) , (557) , (558) , (559) , (560) , (561) , (562) , (563) , (564) , 
(565) , (566) , (567) , (568) , (569) , (570) , (571) , (572) , (573) , (574) , (575) , (576) , (577) , (5 
78) , (579) , (580) , (581) , (582) , (583) , (584) , (585) , (586) , (587) , (588) , (589) , (590) , (591 
) (592) (593) , (594) , (595) , (596) , (597) , (598) , (599 ) , (600) , (601) , ( 602 ) , ( 603 ) , (604) , 
(605) , (606) , (607) , (608) , (609) , (610) , (611) , (612) , (613) , (614) , (615) , (616) , (617) , (6 
18) , (619) , (620) , (621) , (622) , (623) , (624) , (625), (626) , (627) , (628) , (629) , (630) , (631 
) , (632) , (633) , (634) , (635) , (636) , (637) , (638) , (639) , (640) , (641) , (642) , (643) , (644) , 
(645) , (646) , (647) , ( 648 ) , ( 649 ) , ( 650 ) , (651) , ( 652 ) , ( 653 ) , ( 654 ) , ( 655 ) , ( 656 ) , ( 657 ) , ( 6 
58) (659) , (660) , (661) , (662) , (663) , (664) , (665) , (666) , (667) , (668) , (669) , (670) , (671 
) (672) , (673) , (674) , (675) , (676) , (677) , (678) , ( 679) , (680) , (681) , ( 682 ) , ( 683 ) , (684) , 
(685) , (686) , (687) , (688) , (689) , (690) , (691) , (692 ) , ( 693 ) , (694) , (695) , (696) , (697) , (6 

98) (699) , (700) , (701) , (702) , (703) , (704) , (705) , (706) , (707) , (708) , (709) , (710) , (711 
) , (712) , (713) , (714) , (715) , (716) , (717) , (718) , (719) , (720) , (721) , (722) , (723) , (724) , 
(725) , (726) , (727) , (728) , (729) , (730) , (731) , (732) , (733) , (734) , (735) , (736) , (737) , (7 
38) (739) , (740) , (741) , (742) , (743) , (744) , (745) , (746) , (747) , (748) , (749) , (750) , (751 
) , (752) , (753) , (754) , (755) , (756) , (757) , (758) , (759) , (760) , (761) , (762) , (763) , (764) , 
(765) , (766) , (767) , (768) f (769) , (770) , (771) 

<223> Identity of nucleotide sequences at the above locations are unknown. 
<400> 1518 

ggcgcggcga ggacgccgcg cgaccgcgcg caacaagcgc gcgccgacga gccccggcgc 
ccacgcacga cgcggacagg agcacacgag caccgcgcac gacaggcagg gccccgcccg 
cgcgaacagg cacgccgcaa cccacaaccc cagaccgcgc ggcgcgccgc gcacggggag 
caagggagag agggaggaag agacggagga anacgcgaaa aannaaggaa gaaaaaaana 
aaanaaccaa ganaaannaa aaaaannnna aanaaaaaaa aaagaaaaaa annnnaaaaa 
aaanaaaaaa aaaaaaaaaa naannnnnna aaaaaaaaan aaaaaaaaaa aaaaacaaaa 
naaaaaaaaa aaaaaggnna naaaaaaaaa aaaaaannan aanaaaanna annnaaaaaa 
aaaaaaaaan nnnnaaaaan nagnnnnnaa nnnnnnnnnn nnnnnnnann nnnnnnnnnn 
nnnanannna nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn annnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn ncaccaatga 

<210> 1519 

<211> 1539 

<212> DNA 

<213> B. fragilis 

<220> 

<221> unsure 
<222> (839) 

<223> Identity of nucleotide sequences at the above locations are unknown. 
<400> 1519 

tttcttaacg gaagatccca gccaaaactc actccggaaa acttcttctc tacccaggat 
gaattgaaca tgagcatata tgccctttac cagaaagtca atctttcaca ggtatatacg 
aacatgcagc tgtcccagtg gcagggggat gatataacga ccaatccggg gagcaacaaa 
cagtctgccg cagaaatgga caagtttgcc gcagcaaaca acaacaaggg tgtcaaagat 
gcgtggaaca tgcattatgc cattgtaaag gctgccaatt tgatcataca gggggcttct 
aaaacaccta ccactcaaga tgagataaat atcggcctcg ggcaggctaa attctggagg 
gcatacgctt attttaccct ggtgcgactt tggggaccgc tgccgatgaa tctggacaat 
gtcaacgatg attataccaa acctctatcc cccgtggaag aagtgtatgg tcatattgtg 
caggacctga ccgaagctga ggccgtattg cctacgggtt acagtggcag cccccgcttt 
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606 



ctgaacggag tgaatgtgta tgtaacccgg caggcagcca aatctacttt ggcagcagtg 

tatatggcta tggccggctg gccgatgaac aaaacggaat attacgcaaa ggctgctgaa 

aaggcgaagg aggtcattga gggtgtgaac agaggtgaat acgagtataa gctcgataag 

gattacaaag atgtgtatgc tatgagcaat aactataata atgagacggt gctcggcata 

aattattcac cgttcgtgga ttgggcacag gattcggagc ttacttcatg taaccagtnt 

gaatcgctgg gcggctgggg agacgcctgg ggtgaaattc gtttctggaa ggagtttcct 

gacggtccga gaaaagatgc gacttatgat cccaagattc gtctgaaaga cggaacgttg 

gttgactggt gggagttgaa ggaggacggt acccctgtcg ttccggaaca tcaccccatg 

ttcagtatat tctccgtcaa ctgggatcct gcgtcaaaag tgaataccag tgccccgtat 

gattatacaa aaccggccag tcagaacatg tgtaatgacc atcggcatag aatcattcgt 

tattcggagg ttttgttgtg gtatgcggaa gccaaggcca ggacggggca gacggacgag 

ttggcattca agtgtctgaa tgatgtccgc gagcgtgccg gactggaacc gctcaccgga 

ctttctgcag atgaccttgc cgaagcggct tataaggagc atggttggga ggtggcaggc 

tactgggtgg cccttgtcac acgccgtgcg gaccagttcc gtatgaacag actgaaagac 

acctttaagg aaagagcgga gaacacggct gttgaagtcg ccgacgggat cctggtaaag 

gagtctgtag aatatacgaa caggacatgg agtgacaatc tgatgtatct tccatatcct 

gatatggatt cccagaaaaa cccgaatctg gtaaggtga 

<210> 1520 
<211> 837 
<212> DNA 
<213> B. fragilis 

<220> 

<221> unsure 
<222> 

(211) , (222) , (223) , (238) , (243 ) , (252) , (256) , (257) , (265) , (266) , (267) , (268) , (272) , (2 
91) , (292) , (293) , (294) , (3 03) , (320) , (323) , (324) , (325) , (326) , (327) , (328) , (339) , (360 
) , (3 77) , (3 78) , (3 80) , (396) , (397) , (399) , (402) , (407) , (408) , (411) , (412) , (413 ) , (42 9) , 
(430) , (431) , (432) , (433) , (439) , (440) , (443) , (444) , (445 ) , (446) f (447) f (450) f (451) , (4 
52) , (453) , (454) , ( 45 5 ) , ( 456 ) , ( 457 ) , ( 458 ) , ( 45 9 ) , ( 460 ) , ( 461 ) , (462) , (463) , (464) , (465 
) , (466) , (468) , (469) , (470) , ( 471 ) , ( 472 ) , ( 473 ) , ( 47 4 ) , (475) , (476) , ( 477 ) , ( 478 ) , ( 479 ) , 
(480) , (481) , (482) , (484) , (486) , (487) , (488) , (490) , (491) , (492) , (493) , (494) , (495) , (4 

96) , (497) , (498) , (499) , (500) , (501) , (502) , (503) , (504) , (505) , (506) , (507) , (508) , (509 
) , (510) , (511) , (512) , (513) , (514) , (515) , (516) , (517) , (518) , (519) , (521) , (522) , (523) , 
(524) , (525) , (52 6) , (527) , (528) , (529) , (530) , (531) , (532) , (533) , (534) , (535) , (536) , (5 
37) , (538) , (539) , (540) , (541) , (542) , (543) , (544) , (545) , (546) , (547) , (548) , (549) , (550 
) , (551) , (552) , (553) , (554) , (555) , (556) , (557) , (558) , (559) , (560) , (561) , (562) , (563) , 
(564) , (565) , (566) , (567) , (568) , (569) , (570) , (571) , (572) , (573) , (574) , (575) , (576) , (5 
77) , (578) , (579) , (580) , (581) , (582) , (583) , (584) , (585) , (586) , (587) , (588) , (589) , (590 
) , (591) , (592) , (593) , (594) , (595) , (596) , (597) , (598) , (599) , (600) , (601) , (602) , (603) , 
(604) , (605) , (606) , (607) , (608) , (609) , (610) , (611) , (612) , (613) , (614) , ( 615 ) , ( 616) , (6 
17) , (618) , (619) , (620) , (621) , (622) , (623) , (624) , (625) , (626) , (627) , (628) , (629) , (63 0 
) , (631) , (632) , (633) , (634) , (635) , (636) , (637) , (638) , (639 ) # (640) , (641) , (642) , (643) , 
(644) , (645) , (646) , (647) , (648) , (649) , (650) , (651) , ( 652 ) , ( 653 ) , (654) , (655) , (656) , (6 
57) , (658) , (659) , (660) , (661) , (662) , (663) , (664) , (665) , (666) , (667) , (668) , (669) , (670 
) , (671) , (672) , (673) , (674) , (675) , (676) , (677) , (678) , (679) , (680) , (681) , ( 682 ) , ( 683 ) , 
(684) , (685) , (686) , (687) , (688) , (689) , (690) , (691) , (692 ) , (693 ) , (694) , (695) , (696) , (6 

97) , (698) , (699) , (700) , (701) , (702) , (703) , (704) , (705) , (706) , (707) , (708) , (709) , (710 
) , (711) , (712) , (713) , (714) , (715) , (716) , (717) , (718) , (719) , (720) , (721) , (722) , (723) , 
(724) , (725) , (726) , (727) , (72 8) , (729) , (730) , (731) , (732) , (733) f (734) f (735) , (73 6) , (7 
37) , (738) , (739) , (740) , (741) , (742) , (743) , (744) , (745) , (746) , (747) , (748) , (749) , (750 
) , (751) , (752) , (753) , (754) , (755) , (756) , (757) , (758) , (759) , (760) , (761) , (762) , (763) , 
(764) , (765) , (766) , (767) , (768) , (769) , (770) 

<223> Identity of nucleotide sequences at the above locations are unknown. 
<400> 1520 

gcgcggcgag gacgccgcgc gaccgcgcgc aacaagcgcg cgccgacgag ccccggcgcc 60 
cacgcacgac gcggacagga gcacacgagc accgcgcacg acaggcaggg ccccgcccgc 120 
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720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1539 



607 



gcgaacaggc 


acgccgcaac 


ccacaacccc 


agaccgcgcg 


gcgcgccgcg 


cacggggagc 


180 


aagggagaga 


gggaggaaga 


gacggaggaa 


nacgcgaaaa 


annaaggaag 


aaaaaaanaa 


240 


aanaaccaag 


anaaannaaa 


aaaannnnaa 


anaaaaaaaa 


aagaaaaaaa 


nnnnaaaaaa 


300 


aanaaaaaaa 


aaaaaaaaan 


aannnnnnaa 


aaaaaaaana 


aaaaaaaaaa 


aaaacaaaan 


360 


aaaaaaaaaa 


aaaaggnnan 


aaaaaaaaaa 


aaaaannana 


anaaaannaa 


nnnaaaaaaa 


420 


aaaaaaaann 


nnnaaaaann 


agnnnnnaan 


nnnnnnnnnn 


nnnnnnannn 


nnnnnnnnnn 


480 


nnanannnan 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnna 


nnnnnnnnnn 


nnnnnnnnnn 


540 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


600 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


660 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


720 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


nnnnnnnnnn 


caccaatgaa 


780 


ttgacttgtc 


ctaaaataac 


cttacagctc 


ctctctattt 


ctcttattat 


actttga 


837 



<210> 1521 
<211> 1293 
<212> DNA 
<213> B. fragilis 

<400> 1521 

aggaggtggg gcatgatgcg ccggatggat gtccacgagg ccgccaaccg gagattgaag 60 

atcatctttg actattttga ttacgtgtat gtctcttttt ccggtggcaa ggatagcggt 12 0 

atccttctcc atttgtgcat ggattatatc cgcatgcatg cacccggtag aaaactgggt 180 

gtgttccaca tggattacga ggtgcagtac cgccagagca cggagtacgt ggaaaggatg 240 

ttttccaaca accgggatat ccttgaagtg ttccactgtt gtgtcccctt caaggtaccc 3 00 

acatgtactt ccatgtacca gcagtactgg cgtccctggc aagaggggta ccagaatatc 3 60 

tgggttcgcc agatgccggg caccgctctt accgtgaaag attttgattt ctggaacgac 42 0 

agcctgtggg attacgactt ccagtctctt ttcccttcct ggatacgtcg taagaaggga 48 0 

tgcaagcgtg tttgctgcct cgtggggatc cgcacgcagg agagcttcaa ccggtggcgg 540 

gcgattcatt ccgacaagaa ttaccgtaag ctggccaact acaagtggac tcaccgtgta 600 

ggatattaca cttataacgc ctacccgatg tacgactgga agacaacgga cgtgtggacg 660 

gggtatgccc ggtatggttg ggattataac cgtttgtatg acttgtatta ccaggccggg 72 0 

attcccttgt cccgccagag ggtggccagc ccgtttatct cacaggccgt ttcgacccta 780 

catctttata aagttattga tccggatacc tggggacgca tggtcagccg ggtcaacggc 840 

gtcagtttcg ccggaatgta cgggaacacg gttgcgatgg gctggcgctc catcagttgt 900 

ccggacgggt tcacgtggaa agagtacatg tacttcctcc ttgacacgct tccccgggcg 960 

acaagggaga actacctgga gaaactgagg gtgagccaga aattctggag agaaaaagga 102 0 

ggctgcctgg gagaggaaac gatcgggaag ctgcgtgcgg ccggtgtacc gttcacggtg 1080 

gaagaatgca cgacataccg gactgacaag aggcccgtcc gcatggagta tatagacgag 1140 

atagatatcc ctgaatttcg tgaaataccc acttacaaac gaatgtgcgt ctgcatcctg 12 00 

aaaaacgatc atacctgcaa gtatatgggt ttcacgcaga ccaaacggga gagggagatg 12 60 

aaagagagag ttttgaaaag atataaattg tga 12 93 

<210> 1522 
<211> 531 
<212> DNA 
<213> B. fragilis 

<400> 1522 

ttaatattct tcttggcaag atttatagat atattatttt tatcaattat tgttgtaaat 60 

ataaatatga aaagtaaatt cacattatgg agtgttgtat tgctcttggc cggatttata 12 0 

tttgtaggat gtgacaatga acgaacggaa aacgcatctg attttcagac aggaattcat 180 

aaaattgtaa tccaacagtc cggtgacaca gattcatttg aagtgagtgt aagtatcggt 24 0 

ggtgctgaca agggtggtcc cgccaagttg tacaatgaca agggagaata cattggtgat 3 00 

tcctattctg cccaaataag gacggcaact atgtcctgtt gtacaaatgg gaatgcgttt 3 60 

ttcatgacct gtgccggttc ggtttccagt atctcggagg cgggtaaacg gcttcatata 42 0 

acagtaatag gctacattga cgacaaggag gttaaccgtt tggaaaaaga atatataaca 480 

gatggtaata cccttattga aactttcagc gtttcaacca aggagatatg a 531 



<210> 1523 



608 



<211> 2358 

<212> DNA 

<213> B.fragilis 

<400> 1523 

cccgtttata ttttttcaag tatgacattt catgcatatc tccagtatgt gtgcctgctt 60 

gttgtctgcc tatgtaccct cccttctccc atggaagcac gggtacaggc tgaccggcaa 12 0 

tcgacgggta cacgcgatac tctgctggtc attgaccagg aaagcgggct tcccattgaa 180 

ggggcctata tcctgaccag ggaacggtta cttgtcagct ccccccgtgg aatgattgtt 240 

ttcggtcatg gaacgtgttt catggatacg gtcctcgtgc agtgcctggg ttacggttcc 3 00 

cggcgtgtgc ctttgaacga ggtattcaaa gaaagcagca tacataccgt atgtctctcc 3 60 

ccggacatac aaaaactcgg ggaagtggtc gtgaccggtg aacgtgccgg ggcgtcaccc 420 

aacgtggtga gccggcgcct ttcatctccc gagatcagga acgcgctggg aacctcgctt 480 

gccaccctgc tcgaacgtgt cagcggggta agttccatca gcacgggaac cactgtatcc 540 

aaacccgtta tccaaggaat gtacgggaac cggatactga tcatccataa cggtgcccgc 600 

cagaccgggc agcaatgggg ggccgaccac gctcccgaag tggacatgaa cggcagctcc 660 

tctgtttcgg taatcaaggg ctccgatgcg gtaagatacg gttcggatgc ccttggaggg 720 

attatcgtca tggagcagtc tccgcttcct ttcagaaaac gctcccttca agggggaatc 7 80 

tccgcacttt acggaagtaa cgggcgtcgc tacgtggcta ccggacagct cgaaggtgct 840 

tttcccggtg atttcgcctg gcgtctgcag ggaacctggt caaattccgg ggaccgttcc 900 

actgcgcact atcttctgaa caacacggga accagagagt atcacgcttc cgcctctctg 960 

ggctatgacc gcggacgtct gagagtggaa ggtttctaca gccgcttcta cagccggaca 102 0 

ggggtgatgc tcagcgccca gatggggagc gaggacctgc tggcggaacg tatccggctt 1080 

ggtcgccccc tgcacacgga tcccttctcc cgtggtatca gggctccctg ccaggaggtc 1140 

acccatcaga tcacattcgg caggatgcgg ctcggcatga agaagggggg aagtattcac 1200 

tggcaaagta cctggcagaa ggacgatagg caggaaaacc gtgtccggcg gctggattct 12 60 

aacattccgg cggtttccct gcacctgaat tcattccagc atcttctgcg ttggaagcgg 132 0 

gattaccgct cctggcaagt cgaggcggga ggtcaggtca tgttcatcga gaaccacagc 13 80 

cgcgcgggta ccggattcgt gcccgttatc ccgaactaca cggagacaca ggcagggata 1440 

tacggaatcg ggaaatatca cctggccagg ggaggcgttg aagcaggcct ccgcctggat 1500 

atgcaggaaa cccgtgccag tggttatgac tggacgggaa gcccctatgg cgggacaaga 1560 

aagtttaaca acgtgtccta cagcctggga ggacactatc aactttccag acgctggagg 162 0 

ctcacctcca acttcggtct ggcttggcgt gcccctcacg tgtatgaact gtacagcaac 1680 

gggaacgagc tcgggtctgg gatgtttgtc aggggagact ctgcgatgca ctcggaaaga 1740 

agctacaaat ggatatcttc cctccgttac ggcgacggga tgttcagcgt ctgtctggac 1800 

ggttacctgc aatgggtgga cggctatatc tatgacgggc cggagaaaga gacagtcacc 1860 

gtgatttcgg gagcataccc ggtcttccag tacaggcaga ccccggcttt cttccgcggt 192 0 

atggactttg acctgcgttt cactccgggc ggttcatggg actaccatgc cgtcgtctcc 1980 

tttatacggg caaacgaacg gacaaagggt aattatcttc cttatattcc ctccttccgt 2040 

ttcagccatg aacttgcgtg gatacacgag acgaaatcgc atctcaggct gcgtctgaac 2100 

atcaggcacc gtttcaccgc aaaacagagg cggtttgatc cggacacgga tcttatcccg 2160 

tatactcccc cggcgtacca tctcctcggg ttcgacgctg ctcttgaact tcccgtgaaa 2220 

cggggacacc aggtccggtt catgctgtcg gcagacaacc tcctgaatcg tgagtacaag 2280 

gaatacacca accgctcgcg ttactatgcg catgatatgg gacgtgatgt gcgttgcggt 2340 

gtaaactgga ttttttaa 2358 

<210> 1524 
<211> 417 
<212> DNA 
<213> B.fragilis 

<400> 1524 

cggattggaa tcatgaacat tctgaagtta caaggattgg acggtcggct ttttgacctt 60 

gtcgcgcctt tggtaatgaa cccggccgta ttgcggcaaa ataataatta ccctttcaaa 120 

acgacgcgta accatgtctg gtacattgcc atggatgagc aaagggtgtt gggattcatg 180 

cccgtgaaaa tgaccttgac aaacaattgc atagataact actatattag cggggataac 240 

tcctctgtaa tagaggtgtt attggaccgt attatccatg atttttcttc tgatggttcc 3 00 

cttgtggccg ttgtacacga acgccacgtg gaagactttt caatgaagaa ttttatcccc 3 60 

tgtgtcgagt ggaagaagta tgtcaagatg cgttatcatg aaggaggtgg ggcatga 417 



609 



<210> 1525 
<211> 534 
<212> DNA 
<213> B.fragilis 



<400> 1525 

attgtgattg tcatgtcaag ttttaaaagt ccggcgtata acgtcaaggc cgtgccagtc 

gagaaaatcg tggccaacag ctataacccg aatgtcgttg ctcccccgga gatgaagctg 

ttggaactgt ccatctggga agacggttac acgatgcccc tcgtgtgtta ttaccgggaa 

gaagaggata tctacgagct ggttgacggt taccaccgct atctggttat gaaaacatcc 

gtcaggattt acaagcgcga gaacggattg ctgcctgtaa cggtcataaa caaagacatc 

tcgaacagga tggcctcgac tatccgtcat aacagggcca ggggaatgca ctcgctggaa 

cttatgacag gtattgtggc ggaactgtca aaatcgggta tgtccgacag ctggatcatg 

cgcaacatag gcatggacaa gaacgagtta ctccgtttca aacaaatctc aggtctggct 

gaattgtttc gtgacaggag tttcgggctc tcggacgact ggttggagga ataa 

<210> 1526 
<211> 279 
<212> DNA 
<213> B.fragilis 



60 

120 

180 

240 

300 

360 

420 

480 

534 



<400> 1526 

tcatcttgcg taggatggaa ttctctcttt 
ttttttgtct tgtgcaacga aaaaatccca 
ggggatgtat tgggacatgc tttaccatct 
ttgaaaaaaa tcctatacat acaagacgta 
cttcccgtct ttcgtgcaat tttcccgctt 



atcaatctac aggcggcagg tggggccgcc 
tctgccatgc ttaatccccg tggtttactt 
caccctgttc catttttcat gtttggcaag 
ccgttccctt accgacgggt ctggaaccga 
ggttactga 



<210> 1527 
<211> 1506 
<212> DNA 
<213> B.fragilis 



60 



180 
240 
300 



<400> 1527 

aacctgcctt tagcgacaaa acctcccctc ccgtctatcg cttttgcaca tctttctatt 
gcagcgtact tttgtttcta taaaataaaa agaagcacga gtatgacgaa cgtaaataga 120 
atcaccctaa taacagtctg cctggcagct atcttgcccg gcaacggatt gtgggcacaa 
cagacggaag ctaccggaac atcgcaaacc gccgactcgg tatccatgcc cgcgcaatgg 
gatctgcaga gctgcatcga ctatgccttg cagcaaaaca tcagcatccg tcgtaaccgg 
atcaatgcgc agagcacaca ggtggacgta aagacagcca aagcggccct cttccccagc 3 60 

480 
540 
600 
660 



ctctcgttct ccagcagcca aaatctggtg aaccgtccct accaagagtc cagcagcatt 
atcagtggct cggaagtact gaagagcagc aacaagacca cctacaacgg aaactacgga 
ctgaacgcgc aatggaccgt atataacggc agtaaacgcc tgaaaacaat cgaacaggag 
aagctgaaca accgcgtggc agacctcgat gtagccactt cggaaaatga tatcgagcaa 
tcgatcgccc aggtatatat tcagattctc tatgccgccg aatcagtcaa ggtgaacgaa 
aacaccctgc aagtatccga agcccaacgg gaccgtggca aacaactgct ggatgcggga 72 0 
agcattgccc ggagcgacta tgcccagttg gaagcccaag tcagcaccga ccgttatcaa 7 80 
ctggtgaccg cacaggccac acttcaggac tataagttgc aactgaagca actcctcgaa 
ctggacggcg aacaggaaat gcaggtctat ctgcctgcat tgggtgacga aaatgtactg 
tcgcccctcc ccaccaaaac ggatgtcttt cgttccgctg tggccctccg cccggaaata 
gaggcaagca agctcagtgt agaggcatcg gaactgggga tcggaatcgc caaatcggga 102 0 
tatctgccca gcgtcagcct gacagccggt atcggtacca accataccag cggaagcgac 1080 
ttcaccttcg gcgagcaagt gaaaaacgga tggaacaact ccatcggact cagcatcagt 1140 
gtcccaatct ttaacaaccg acagaccaaa agtgccgtag aaaaagccaa acttcagtat 12 00 
cagaccagtc aactgactct gctcgacgaa cagaaaacat tgtataaaac catcgaagga 12 60 
ctatggctcg acgccaacag cgcccagcag cgatatgcgg cagccataga gaaattgcac 1320 
agcacacaga ccagctatga actggtcagc gagcaattta atgccggaat gaaaaatacc 1380 
gtggagcttc tgaccgaaaa aaacaatctg ctacaggcac agcaagagct gttacagtct 1440 



840 
900 
960 



610 



180 
186 



aagtacatgg ctattctgaa tacacaattg ctgaagttct accagggaga taagataaca 1500 

j_ i. 1506 
ctgtag 

<210> 1528 
<211> 186 
<212> DNA 
<213> B. fragilis 

<400> 1528 „ 
aaaaaacgtc cgaccttcac aggccggacg tttttcgttc tatttgaata cgttatttac 60 
atacaacaat taaacagtat gataattaaa ataacaactc aacaaaatgg aatccgggag 120 
aatcacttcg gccggattgg cttatctcac ttaaatgtga cttacgtaat aattatgact 
aattaa 

<210> 1529 
<211> 1557 
<212> DNA 
<213> B. fragilis 

<400> 1529 

gggaaaaacc ccgctttcaa tttaaaaaga ggctcacttt acccaatttc cggggttttt 
tcacaagcat gcgatgaaat aggaatgtta ttctgggcag aaaacgcatt ttggggaatc 120 
ggaggacaca aaggagacga ttattggaat gccagtgcat accctgtaaa cgaatccgac 
agggcagaat tcgaaaacag cgtaaaagcc caactgaaag agctgattcg catccaccgc 
aaccatcctt ccatcatcgt atggagcatg agtaacgagc cttttttcac agcacccgag 
acaatcgatc cgatgcgtaa actactggaa gaaaccgtca aactttccaa acaactcgat 
cctacccgcc cggcagctgt cggtggtgcg cagcgtccgc tgggagagaa acgaatcgat 
aagttgggcg acatagccgg atataacggt gacggcagct acattccgga gtttcagcaa 
ccgggcatgc caacagtagt ttcggaatat ggcagtacta cagccgatcg tccgggagaa 
tatgatccgg gatggggaga cctggctaag aataatgcac agaacggttt tccatggcgt 
agcggacaag ccatctggtg cgcattcgac cacggaagca ttgccggcag tgcccttggc 
aaaatgggca tcatcgatta cttccggatt cccaagcgtg cctggtactg gtaccggaat 720 
gcctacaaag ggattacccc tccggaatgg ccgcaagaag gaacacctgc ccgcatcagc 
ctggttgccg accggactga taacataaaa gcggacggaa cggacgatgt tatgttatca 
ataaccatcc tggatgcaaa cggaaagccg gtcagcaatt ctccggcagt taagctcgac 
attctatccg gcccgggaga gtttcccacc ggaacctcaa tcctgtttga aaaagagagt 
gatatccgca tactcgacgg aaaagcggct attgaattcc ggtcatatta tgccggagaa 1020 
actgttatcc gtgctacttc accgggactg gaaccggccg aagtaaaaat ccgattcacc 10 8 0 
ggaagcactc cctatacaga ggccttcaaa gtaaaagaac gtccctatac acgcttcgaa 1140 
actccgacaa aaacggataa tctgcaaacc ttcggaccta acaacccgac tttctgcagt 12 0 0 
tcgtctgcca acggacattc atccgctttt gcagccgacg gagacgaatc tacatactgg 1260 
caagcttcag agaacgaccc ggaacgttcc tggacactgg ataccgaaaa aggattgtcg 132 0 
atacgtcaca tccgaatcgc atttccggat ttagcccctt atcaatacaa agtagaggta 13 8 0 
tccatggaca gagaacactg gtcgctcata ccggatcaaa ccaataataa gcagaacgag 1440 
aacattcgga tgattcaggt tgttcccggt atacaaggac gtttcgtacg tatcagtttc 1500 
acaggagaga aagctgctat tacagacgta caggtcatcg gaacggtaat tgactaa 1557 

<210> 1530 
<211> 1491 
<212> DNA 
<213> B. fragilis 

<400> 1530 

atacatatta caattatggc aactcagaat aaaacagata taggcctgat tggtttagct 



60 



180 
240 
300 
360 
420 
480 
540 
600 
660 



780 
840 
900 
960 



60 
120 



gtgatgggag aaaatttagc tctgaacatg gaaagcagag gttggagcgt atcggtgtat 

aatcgtacgg taccgggtgt ggaagaaggc gtggtggaac gtttcatcaa cggaagggct 180 

aaaggcaagc atatagaggg cttcacggat atagaggctt ttgtagaatc gatcgctttg 2 40 

cctcgtaaga taatgatgat ggtacgtgcc ggtagtccgg tggatgagct gatggagcaa 

cttttcccgt atctttctcc gggtgatata ttgattgacg gtggtaactc gaattatgaa 



300 
360 



611 



gatacgaacc gaagagtgaa actggcagag tcgaaaggtt ttctgttcgt cggtgccggt 420 

gtttcgggcg gagaagaggg agctttgaac ggagcttcta tcatgccggg cggttcggag 480 

aaggcatggg aagaggtgaa accgattttg cagagcattg cggcccaagc gccggatggt 540 

actccgtgct gtcagtgggt gggacctgcc ggatcgggac attttgtgaa aatgatacac 600 

aacgggatag agtatggtga tatgcagttg attgctgaag cttattgggt gatgaaggag 660 

ctgctggata tgactaacga ggagatggca tctgtgttca cccgttggaa tgaaggtaaa 72 0 

ctacggagtt atctgataga aattacaggt aatatactcc gccataaaga taagacaggg 7 80 

gtctatctga ttgataagat cctggatgcc gccggacaga aagggacagg caaatggtcg 840 

gtgattaatg ccatggaatt aggtatgcca ctgggattga ttgctacggc cgtttttgaa 900 

cgcagtttgt cggcccggaa ggaactgcgt gaagctgctg cccggcaata tcaatgcagg 9 60 

cactcgatgg ctgtatataa taaacaagat acggaaaaag agattttctc ggcattgtat 102 0 

gcttcgaaac tggtttcgta tgctcaaggg tttgcggtgt tgcaacgtgc ttccgataca 1080 

tttggatgga acctcgatct ggcttcgatt gcacggatgt ggagaggcgg ttgtattatt 1140 

cgcagtgttt tcctgaatga tattgctgcg gcatttgagg caaaagaaaa gcctaaacat 1200 

ctgctgctgg ccccttattt tgaagaagag atcaaaggtc tgttgtccgg ctggaagaac 12 6 0 

ttggtggcac aagccatgcg tgaggaactg cctgttccgg ctttctcttc ggctttgaat 132 0 

tatttctatt cgttagtgtc ggctgatctt ccggccaatc tggtacaggc acagcgcgat 13 8 0 

tatttcggtg ctcatacatt tgagcggaaa gatgagttac gaggagtctt tttccatgaa 1440 

aactggacag gacacggagg agatacaaag tcgggtacgt ataatgtata a 1491 

<210> 1531 
<211> 411 
<212> DNA 
<213> B. fragilis 

<400> 1531 

tccggatcta cacaagtctg gcatgtgcat tgcataacaa tattcagatg tttagttata 6 0 

gttagtagat ctagaacaat taagttttca agaattatga aaaaggtatt agtagcattg 120 

gtaatagtaa tgggattggg attttcagta gcaaaagcag atgaacctct aaagaaaaag 180 

tccccgaaag tggaacaaag agattcgcgg gaggacttta cacctattga ggttaataat 240 

cttcctgaag cggtgattga tgagttatcc tgcgaaggag cattgattaa agaagctttt 3 00 

attgcttata gtcgttcgga gggtaaactc tacaaagtga ttatattgtc gagtgatttt 3 60 

catgaacagg ctgtattctt gaatgaaaga gggaatatac tgaatagata g 411 

<210> 1532 
<211> 225 
<212> DNA 
<213> B. fragilis 

<400> 1532 

aaagcttgta cacaagggca tggagtggat caagaggaag aggtgagtcc gaatcaggtg 60 

gcggctcttc gttgtctgtt tactccggaa tatactcgtc ccgcagctat cggcactacc 12 0 

cgtgctttgc ttcagggagt ggaattccag aaagtcagtg cgataggggg agcaattaat 180 

crgcggatcgg gtagtggaga tattgtggat gatccgacag cctga 225 

<210> 1533 
<211> 1152 
<212> DNA 
<213> B. fragilis 

<400> 1533 

actatgggta catacatttt tctacaacaa tattggtggc tggtagtctc actactcggg 60 

gccatactcg tatttttact gtttgtgcag ggtggcaact ctctgctgtt ttgtttgggc 12 0 

aaaaccgaag agcatcgtaa gatgatggta aactctaccg gacgcaaatg ggaatttaca 180 

tttactacgc tggtcacttt cggtggcgct ttctttgctt cgtttcctct gttttatagt 240 

accagtttcg ggggggccta ctggctgtgg atgattattc ttttcagttt tgtgttgcaa 3 00 

gctgtcagtt atgaatttca gagcaaagcg ggcaacttgt tgggaaagaa gacctaccag 3 60 

acttttctgg tgattaacgg tgtggtggga cccttgctct tgggaggcgc tgtggccact 42 0 

ttctttaccg gttcggattt ctatatcaat aaggggaata tggtgaacga agtgatgcct 480 



612 



600 
660 



780 

840 

900 

960 

1020 

1080 



60 
120 
180 
189 



gtgatcagtc attggggcaa cggttggcac ggactggatg cgctgaccaa tatctggaat 540 

gtgattctgg gattggccgt gttcttcctg gcgcgtgctt tgggagctct ttactttatc 

aataatatcg ctgataaaga gttggtcgcc aaatgtcgtc gttcgttgat agccaatacg 

gtcctgttcc tggtgttctt cctggcattc gtggttcgca ctttgctggc cgatggatat 720 

gctgtcaatc cggaaacaaa agagatctac atggagcctt ataaatactt caataatttt 

attgaaatgc cggtggtgct tatcgtgttc cttgtgggag tcgtgctggt gttgtttggc 

attggcaaaa ccctgctgaa aaaaacgttt gataaaggaa tctggtttgt gggtatcggt 

acggtgctga ctgttctggc actgctgctg acagccggat ataacaatac ggcttattat 

ccgtcgaata cggacataca aagttcgctg acccttgcca atagttgttc cagccagttc 

acgctcaaga ccatggccta tgtttctatc ctcgttccgt ttgtcatcgc ctacattttc 
tatgcttggc gcagcatcga caaccggaag atcgatgcca aggaaatgga cgaaggcgga 1140 

j_ ^_ 1152 
catgcttatt ga 

<210> 1534 
<211> 189 
<212> DNA 
<213> B. fragilis 

<400> 1534 

attacaaggt ttccatccaa ttatatatta gatttcattt ttatcagtag taagataatg 
ctgatttaca accacttcat gattgtatct tattttaaaa ctactgataa aattatttat 
ttttttacct ttttatttaa cctcttttgg gttagtatgc cccttgcttc aaatgttttt 
atagtataa 

<210> 1535 
<211> 711 
<212> DNA 
<213> B. fragilis 

<400> 1535 

tcatccatag ccgtgaaatt acgttgcgcc atagtcgacg acgaacctct ggcactcagt 60 
ctgctggaga gttatgtcaa caaaacacct tttctcgaac tggcgggaaa gtattccagt 12 0 
gccgtacagg ccatgaaaga acttcccggg aaccagatcg acctgttgtt ccttgacatt 
cagatgccgg aactcaacgg tctggaattc tctaaaatgg tagcccctcg tacccgtatt 
gtgttcacta ctgctttcgg tcaatatgcc atcgacggat accgggtcaa cgcactcgac 
tacctactca aacccatctc gtatgttgac ttcctgcagg cagccaacaa ggcgttgcaa 
tggttcgaac tggtccagaa gcccgaagaa gtagacagta tttttgtcaa aagtgactat 
aaactggtgc aggtagaact caaaaagata ttatatatag agggtttgaa ggattacatc 
aaaatctaca ccgaagatgc ccccaaaccc attttgtcac ttatgagcat gaagtccatg 
gaagagttgc ttccccccgc ccgctttatg cgtgtacatc gttctttcat cgtccagaaa 
aacaaaatac gcatcatcga ccgcgggcgt atcgtatttg ataagaccta tatccccgtc 
agcgatagct acaagcagac tttccaaaca ttcctcgatg agcgaagttg a 

<210> 1536 
<211> 1353 
<212> DNA 
<213> B. fragilis 



180 
240 
300 
360 
420 
480 
540 
600 
660 
711 



<400> 1536 

cccaatacag cagccgtaac atcgtgttgc gcgacggaca agtcaaggaa gacagcacta 
acccggacat tctttccgca gccgaagcat tggccgctct gccggtacaa gaagaataac 
agacagatta ttatgaacgg aaccaattta tttaaaatag ctttgcgcgc cttgaataac 
aacaagttgc gcgcgttcct tacgatgctg gggatcatca tcggtgtggc atccgtcatc 
accatgctcg ccatcggcca gggatcaaag aaaagtatcc aggcacaaat ctccgagatg 
ggctccaata tgattatgat tcatccggga gcagacatgc gcggaggtgt tcgccaggac 
cccagtgcca tgcagacact gaaactgacc gactacgaaa cattgcggga tgaaaccagc 42 0 
tttctggctg cggtcagtcc taatgtttcc agttccgggc agttgattgc aggcaacaac 480 
aactatccgt cgtccgtgaa tggcgtggga acggagtatc tggaaattcg acagctctcg 540 
atagacaatg gagagatgtt cagcgaagcc gatatccagt cgagcgccaa ggtatgcgtg 600 



60 

120 

180 

240 

300 

360 



613 



660 



ataggaaaaa ccattgtaga caatcttttc cccgatggag aagatcctgt aggacgcatt 

gtccggttca gcaaaatacc gttccgtgta gtaggcgtac tgaaatccaa gggatacaac 72 0 

tctatgggta tggaccagga cgacatcgta ctggcaccct acaccaccgt gatgaagcgt 7 80 

ctgctggcac agacctatct gcaaggtatc tacgcttctg ccctttcgga agacatgacg 840 

gacaatgcta cggaagagat taccgaactt ctccgccgca atcacaagct gaaagaggcg 900 

gatgacgatg atttcaccat ccggagccag caggaattga gcagcatgct caactctacc 960 

accgacctga tgaccacact gctcgcctgc attgccggca tatcgctcgt agtaggcggc 102 0 

atcggcatca tgaatatcat gtacgtcagc gtcacagagc gtacccgtga gatcggtctg 1080 

cgcatgtcgg tcggtgcacg tggcgtcgac atcttgagcc aattcctgat agaagccatc 1140 

ctgatcagca tcaccggagg ccttatcgga gtaatcatcg gctgcggtgc cagctgggtt 1200 

gtgaaaagtg tcgcccattg gcccatcttc atccaaccct ggagcgtatt cctgtcgttt 12 60 

gcggtttgta ccgtcaccgg agtattcttc ggatggtatc cggccaagaa agccgccgac 1320 

ctcgatccga tcgaggcaat ccgatacgaa taa 1353 



<210> 1537 
<211> 255 
<212> DNA 
<213> B.fragilis 



<400> 1537 

gttgtaaatt tgcggaaaaa caagagtaag atggaatata agtttgacga acagagtgtg 

aaagaattaa tggaatgggc acagactgca cagttgcctc aggaactgga actgagtaaa 12 0 

gcggagcgta tttttgatgt aaaactttgt atagaatctg atttatcgtg tatcagggcc 

cattatcccg atgctttcta caatccggct ataactcgtc tttatcgtat cagggagaag 
ctggaggaga aatga 



60 



180 

240 
255 



<210> 1538 
<211> 1290 
<212> DNA 
<213> B.fragilis 



<400> 1538 

cagatttatt acaaatatat gcaaccatta gcggaaagac ttcgcccaaa gacattggat 
gattatatcg gtcagaaaca ccttgtggga ccgggtgcta tcctgcgcaa aatgattgac 
gcagggcgta tctcttcatt tattctttgg ggacctccgg gagtgggtaa gaccacactg 
gcccaaatca ttgccaataa gctggaaact cccttctaca ccttgagtgc tgtcacctcc 
ggggtgaaag atgtgcgcga agtgatagat cgcgccaaga gcaataagtt tttcactcaa 
tccagcccta tcctgtttat cgatgaaata cacaggttca gcaaatccca acaagattcg 
ctactgggag ccgtcgaaca tggcacagtc acgctgatag gtgccaccac cgaaaatccg 
tctttcgaag ttatccgtcc gctcctttcc cgctgtcagc tctacacact gaaatctctg 
gaaaaagaag atttactgga attgctgcaa cgtgccatta ctaccgacgt agtgctgaaa 
gaacggaaaa tcgaattgaa agagaccggt gccatgcttc gcttttcggg aggcgatgcc 
cgcaaattac tgaacatact ggaactcgtg gtagaatcgg aaacagaaga aaccgtaatc 
atcactgatg acctggtaac agaacgcttg cagcaaaacc cgctcgcata cgataaagac 
ggtgaaatgc attatgacat tatttcggcg tttatcaaat cgatacgggg aagtgaccct 
gatggcgcca tctattggct ggcccgtatg gtagaaggag gcgaagatcc ggcattcatc 
gccaggcgcc tggtcatctc tgccgcagaa gatatcggcc tggccaatcc gaacgcattg 
ttattggcta atgcctgttt tgacacattg atgaaaatcg gctggccgga aggaagaatt 
ccactggcag aaacaacgat ttatctggca acaagcccta aaagcaattc ggcctacaat 
gccatcaacg atgcactggc actggtacgc gaaaccggta atctgcctgt tcccctacac 
ttgcgcaacg ctcccaccaa gctgatgaag cagttgggat atggccagga gtacaaatat 
gcacataatt acgaaggcaa ctttgtaaaa caacagttct taccggatga aatcaaggcc 
aaacaactat ggcaacccca acacaatccg gcagaacaaa aacatgccga gcgaatgaag 1260 
caactatggg gaaatgaaaa gaactattaa 1290 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 



<210> 1539 
<211> 1578 
<212> DNA 
<213> B.fragilis 



614 



<400> 1539 

cgtttaactc taactactaa tgatatgatt gaaagtattg acacttcact gattgactgg 
tcgagagccc aatttgctct gacagccatg tatcactgga tctttgtgcc tctcacgctt 

ggtttggcgg ttgttatggc cattatggag accctttatt ataaaacagg caatgaattc 180 

tggaaacgaa cggctaaatt ctggatgaaa cttttcggta tcaattttgc cgtgggagtg 240 

gccaccgggc tgattctgga gtttgagttc ggaaccaatt ggagtaacta ttcctggttt 300 

gtaggcgata ttttcggtgc acctcttgca attgaaggta ttttggcttt ctttatggag 3 60 

gccacgttta ttgctgtcat gttctttgga tgggacaagg tgagcaagag attccacctc 42 0 

atttcaacct ggctcacggg cttgggagct acgatttctg cttggtggat tctggttgcc 480 

aatgcgtgga tgcagcatcc ggtaggcatg cagttcaatc ctgacaccgt acgcaacgag 540 

atggtcgact ttatggctgt tgcgttttcg ccggtggctg ttaataagtt ctttcatacg 600 

gtactttcaa gttgggtgct cggagctgtt tttgtgattg gcatcagttg ctggttcctg 660 

ctgaagaagc gtgataagga atttgcggtg gcgagcatca agataggtgc agtcttcggg 720 

ctggtggctt ctttactgac ggtgtggacc ggtgatggtt ccggttatgc catcgcacag 7 80 
acacagccga tgaaactggc tgctgtggag ggctattacg aaggacaaaa tggtgccgga 
ctggtggctg tcggactttt gaacccggag aagaaaacct atgatgacgg tcaggacccg 
ttcctcttcc gtatcgagat tccgaaaatg ctttcgctgc tggccgaacg taaggtggat 
gcctttgtgc cgggtataaa aaatattatt gagggaggat atgaactgaa agacggtaca 

aaagctcttt cggctgccga gaagatagaa aaaggcaaga aagcgattgc tgccctggcc 1080 

acttatcgta cggccaaaaa agaaggtgat gaagctgccg ccaaggaggc ttataccact 1140 

ttgcaggaaa atgtgcctta ttttggctat ggctatatta aagatgtaaa tcagttggta 12 00 

cctaatgttc cgcttaactt ttatgctttt cgcgtgatgg tgatcctggg cggatatttc 1260 

attctgtttt tcatcctggt gcttttcttt gcctataaga aagatttgtc gaagataaga 132 0 

tggatgcagt acgttgctct gtggaccatt ccgttggctt atattgccgg acaagccggt 13 8 0 

tgggtggtgg ccgaatgtgg tcgtcagccg tgggcgatac gggatatgct tcctacgtct 1440 

gtgtctatct ctaagctcga tgtgggctct gtgcagacca ctttctttat tttcctcgtt 1500 

ttgtttaccg tgatgttgat tgctgaaatt ggcattatgg tccgtgagat caagaaagga 15 60 

ccgacggtta atcattag 157 8 

<210> 1540 
<211> 429 
<212> DNA 
<213> B.fragilis 



60 
120 



840 
900 
960 
1020 



60 



<400> 1540 

gttaatatgt atatgaaaac tttattttta aaacgattga tgtttatttt tcttcttttt 

gtagcatgtt attcaagcgc tcagtctctg aggtccttgc cctttcaaaa aagagacagt 12 0 

acattgattc ggattgcaaa ggaaacgttg aagaagaaag cgcctgagta tttaattgaa 18 0 

aatggtgccc cgattatttc gaagcaccgg gttcgctatt tgactccagc agaagaaaaa 240 

gaagtgcctg aatttagtac gttttatggg gccaagtcag gccaagtcta ttatattgtc 3 00 

gaatttcctc aagatgaatc aatagaatct tttgatgctg gatttgtagc ccaagtttac 360 

atttgggaag atacctcaag acctttttct attgctttag gaaatagtct gattatggat 42 0 

429 

ttgaagtag 

<210> 1541 
<211> 1341 
<212> DNA 
<213> B. fragilis 

<400> 1541 

atttatcgtc tggggaatag tgacctatac catcaaaaag aacaggagaa aaagataatg 6 0 

tttggactaa tggactgcaa taacttctac gcaagttgcg agagagtgtt caatccggct 12 0 

ttgaacggta agccgatcgt tgttttaagt aataacgacg gatgtgttat tgccagaagt 180 



aacgaagcga aagcattggg gattaaaatg ggagttccgg cttatcagat caaagatgat 2 40 

attcagaaat atggtatatc tgtcttttca tcgaactata cgctatacgg cgacatgtcc 3 00 

ggacgcgtga tgtccatact ggcagaacaa gtgccggaaa tggaagtata cagtatagac 3 60 

gaagcatttc ttaacctgga agggattcgg gatattcagt cactcggaac agacatcata 42 0 

aacaaagtaa tccgcggaac cggcatacct gttagtttgg gtatcgcccc aaccaaaaca 480 



615 



ctcgcaaagg 
atcgatacag 
gggatcggtc 
tttaccgagt 
tggaaagaac 
cagatatgta 
gcaatcgcca 
atgtcactaa 
aggaatacag 
gcactggctg 
attatcaccg 
aaacgtgaac 
aaattagcaa 
cattatacca 
caacgaaaac 



tcgccaataa 
aagaaaagcg 
accggcaagt 
tgcccgaatc 
ttcaaggcat 
caagtcggtc 
ctcacgcttc 
tggtattcat 
ttgtacatct 
ggctaaaaac 
aaatcaccga 
gtcttcaaca 
ttcaaggtac 
ctgacatcaa 
aatatagctg 



atttgcaaag 
caccaaggct 
tgccaagtta 
ttgggtgcgt 
ttcctgtatt 
attcggcaag 
tacatgcgca 
ccatacgaat 
tccgatacca 
gattttcatg 
tagcacccaa 
aacaatagat 

agggaggaat 

tcagattata 
a 



aaatatcctg 
ctgcagctta 
gaaaagcagg 
aagaacatga 
gatatggaaa 
atggtggagg 
aagaaactcc 
aacttccgta 
accaatgaca 
caaggctatc 
ttgggactgt 
aagataaacg 
tggaagctta 
agcattaatt 



cttacaatcg 


tttatgtatc 


540 


ctgaaattgg 


agacatctgg 


600 


gagtcaaaac 


agcctatgac 


660 


ctgtagtcgg 


agaaaggacc 


720 


ccacaccacc 


ggccaaaaag 


780 


acatcgatac 


aatgtcggaa 


840 


gacaacagaa 


aggctatgca 


900 


aagattcgcc 


acaatattgg 


960 


cattagaaat 


tgtacattat 


1020 


aatataagaa 


agccggagtt 


1080 


ttgattcagt 


agatcgcgag 


1140 


gtaaacacag 


ccgactcgtc 


1200 


aacaaaaaca 


gctatcaggg 


1260 


gtacttaccc 


aactgcatgt 


1320 
1341 



<210> 1542 
<211> 864 
<212> DNA 
<213> B.fragilis 



<400> 1542 
aaacaattac 
tttcagaata 
cagctcgcag 
ggatatgata 
attgcaattg 
tctgctttct 
acagcaccct 
tctttactgt 
ttctgcttaa 
cttttaggga 
cttcctttat 
ttaggatata 
gtcaaagcga 
ggcatagtga 
agtgttttca 



atatgtataa 
atgctctatc 
ttttatctgt 
aggaggatat 
gagtctttat 
ccacatgggt 
ttatttttgt 
tatcttcatc 
atatcggtca 
tcctgattaa 
ttatgatttt 
acggagtgct 
tattttctat 
ctttaacggc 
gaagtaaatc 



aaacatttta 
cggtgggttg 
actcggtaca 
ttgtaacgga 
ggaaatcaat 
tgcgcgttgt 
tgtatggctt 
gttggaaaag 
ggtgatgttt 
ttctcgtatg 
atatccacat 
ttgtgccata 
catactgtct 
accatttgtc 
ctga 



atattaggtc 
atgttattag 
gtcgttagca 
ttgtatggtt 
gttacatcta 
tttcgttatc 
ctgcttgttg 
cctgaactaa 
cagggaaata 
aatgctctgt 
actgatcttg 
gccttgggag 
atcgttctgc 
ttttccgtat 



gtggaattgg 


gcaagtgatg 


60 


gaattgcttt 


taattcatgg 


120 


ctttgacagc 


ttcattatcc 


180 


ttaatgggac 


attggttggt 


240 


tattattgct 


tatctcaggc 


300 


agaacagagt 


atcgggactg 


360 


ggtgtcatta 


tctataccct 


420 


caatggatat 


tttccgttca 


480 


tactttcggg 


attatttttt 


540 


atacactgac 


cggtgcaata 


600 


ctgcatggaa 


tttgggatta 


660 


ataagacagg 


cataggagta 


720 


agttaacggg 


tatgcatatg 


780 


ggattaccgg 


gggcctgttc 


840 






864 



<210> 1543 
<211> 1080 
<212> DNA 
<213> B.fragilis 



<400> 1543 
ttaaacagac 
tggagcatcg 
ggacaaacac 
ctgatagcgg 
atgaaagtgg 
ttgatggaca 
tgctggggat 
gagaagttcg 
atttcctgga 
aaccttaaaa 
aagtcacttg 
tccgatctgc 
gatggcgata 
tggctgggca 
gatatcctgc 



atactatgaa 
atgtccgggg 
aggtgaagaa 
aaaaaacgaa 
aagccaaaaa 
agatattgaa 
atgaagggga 
cccatttcga 
ttgagcaagt 
tcaaaggaac 
aaattatcag 
cgaatcttga 
tgaatgtatt 
ttgtcgatgc 
cgcaactgga 



acgagtattt 
cactgacgta 
tttttcgtcg 
aaaaggatat 
gtatgcgttg 
agacaaaaaa 
agactgctcg 
aggtctgttt 
cgatttgagt 
gaataatctg 
tggcggcctg 
aaagctggtt 
cagacctctg 
cgaagagcag 
aacaatggat 



gtatttcagg 
atagtgaatt 
gcaggggaag 
gtggagacct 
agctatgacg 
ctgccgtcac 
gacattgccg 
tggggggata 
ccggtgctgg 
agcattggta 
cccgattcgg 
ctctatgtag 
ttctctaaag 
aatgcggttg 
atttccgccg 



actttaagtc 


ccagaaattt 


60 


acggtaagtt 


ggggacggat 


120 


ccgaaaaagc 


tgccggtaag 


180 


tggaagaggt 


tgccaaagaa 


240 


aagcggaaga 


gggcgtaaac 


300 


tcaagcagat 


cacgataggc 


360 


atggcattgt 


ggagaataaa 


420 


tagattttga 


ggaacaggag 


480 


atgcgatgcc 


tctgctgaat 


540 


agaaaccgcg 


tccgaacttg 


600 


tggtggaaga 


tatcctgggt 


660 


gagtggagga 


ttatgggttt 


720 


accgtttccc 


taacctgaaa 


780 


tagagatgtt 


tcttgaatcg 


840 


gtgtgttgac 


ggatgaaggg 


900 



616 



gcacggctat tgctggatca tgtggataaa atcaagcatc tgaagtttat caatatgaaa 960 
tacaattatc tgagcgacga gatgaagaaa gagttgcaga aatcgctgcc catgaagata 102 0 
gatgtttccg actcacagga atacgatgac gattacagtt acccgatgat tacggagtga 1080 

<210> 1544 
<211> 777 
<212> DNA 
<213> B.fragilis 



<400> 1544 

ccatccgggc agcaagaaga aaggaaaata agtatgaaca aaacagtcat cgaacttcag 
aatatcaaac gtaacttcca ggtgggagac gaaaccgttc acgcattgcg cggggtttcg 
ttcaccatca ccgaaggaga gtttgtcacc attatgggta cgtccggttc gggcaaatca 
acgctactga atacgctggg ttgcctcgac acacctacca gcggagaata tctgctggat 
ggaatctcgg tacgtaccat gagcaaacct cagcgtgcca tattgcgcaa ccgaaagata 
ggctttgtct tccaaagtta caatctgctg ccaaagacga ctgctgtgga aaatgtagaa 
ctcccgctga tgtataattc gggagtcagc gcttccgaac ggcggcggcg cgccattgag 
gcgctgcaag ccgtaggact gggcgaacgg ttggaacaca aatccaatca gatgtccggc 
ggacagatgc aacgtgtagc catcgcccgt gcgttggtca ataatccggc agtcatcctt 
gcggacgagg caaccggtaa cctggataca cgcacttcgt tcgagatcct ggtactgttt 
cagaaactgc atgccgaagg ccgcacaatc atatttgtaa cgcacaatcc ggaaatagcc 
caatacagca gccgtaacat cgtgttgcgc gacggacaag tcaaggaaga cagcactaac 72 0 
ccggacattc tttccgcagc cgaagcattg gccgctctgc cggtacaaga agaataa 777 

<210> 1545 
<211> 318 
<212> DNA 
<213> B.fragilis 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 



<400> 1545 

tctaattaca aagaaattat ggctaatcta ttcataacta tagtgtcgac aaaaaaaatg 

ttgaacggca aacacaaagt acggattgcc gtatcccaca acttagcaac cagatatata 120 

ccaactaaca tcataataga tgctgagaat gaattcaaaa acgggaaagt agtaaaaaga 

cctgataagg acatattaaa tgcacgatta aagaaaatat acgatatgta ttatgaacgt 

tgcatgaaaa tagaatatgc taatacgttg acttgcacac aactgatcaa atactgtata 
tttgcagaat caagataa 

<210> 1546 

<211> 1518 

<212> DNA 

<213> B.fragilis 



60 



180 
240 
300 
318 



60 



<400> 1546 

tgtataagca aaaaaagaaa tatgagtaag tttgtaatga ctatctttgg tgcttcgggt 

gacttgacga agcgtaagtt gatgccggca ttgtattcgt tgtatgtagc caagcgtctt 12 0 

ccggaagaat ttgaaattct cggggtggga cgtacggttt atgaagatgc ggactatcgg 180 

acttacattt ataatgagat ggagaaattt gtgaagtcgg aagagcagaa taaggagaag 2 40 

atggacgctt tcgttggaca tcttcactat ctggcaatag acccggcatt ggaaagcgga 3 00 

tacggacagc ttcgcctgcg cattgaagaa ctgagcggag atagccggcc ggatgacctg 3 60 

ctgttttacc tcgctacgcc tccatcattg tatggtgtga ttccgttgca cctgaagtcg 42 0 
gtgcatctga ataaaggccg tgcacgaatt atcgttgaaa agccattcgg gtatgatctg 
gaatcggctg agaaactgaa taaaatttat gcttctgtat tcgacgaaca tcagatttac 
cgaatcgatc atttcttagg taaagaaacg gctcagaacc tgttagcttt tcgttttgcc 
aatggtattt ttgagccttt atggaaccgt aattatatag attatgtgga agtgaccgcc 

gtggagaatc tcggaatcga gcaacgcggc ggtttttatg atactacggg tgcactgagg 72 0 

gatatggtac agaatcatct gatccagctc gtggcattga ctgctatgga acctcccgct 780 

gtgtttaatg cggataattt ccgtaatgaa gtggtgaagg tatacgagtc tctgactcct 840 

ttgaccgaaa cggatttgag tgaacatatt gtccgggggc aatatacggc agggggaaat 900 

aaaagggggt atcgggaaga gaagaatatt tcacccgact cacgtaccga gacttatatt 9 60 



480 
540 
600 
660 



617 



gcaatgaaac tgggtatcag taattggcgg tggagtggag taccttttta tatcaggacc 1020 

ggaaaacaga tgccgacaaa agtgactgag atagtggtgc atttccgtga aactcctcat 1080 

cagatgttcc attgcgcggg tggtaattgt ccgagagcca ataagctgat attgcgtttg 1140 

cagcccaatg aaggtattgt gttgaaattt ggcatgaaag tgccgggacc cggatttgag 12 0 0 

gtcaaacagg taacgatgga tttcagttat gatcagttgg ggggcgtccc cggcggagat 12 60 

gcctatgccc gtctgataga agactgtatc ctgggcgacc agaccctctt tacacggagt 1320 

gatgcagtag aagcatcctg gcatttcttc gatccgattc tgcgttattg gaatgaacat 13 80 

cccgaggcgc cgttgtatgg ttatcctgct ggaacttggg gacctttgga gagtgaggct 1440 

atgatgcatg agcatggtgc cgaatggacc aatccatgca agaacttaac aaacacagat 1500 

caatattgcg agctatga 1518 

<210> 1547 
<211> 240 
<212> DNA 
<213> B.fragilis 



<400> 1547 

ttagagttaa acgtcagtcg tctattgccc gcgttctata agttcgcccg caacgtattc 
tcccttgtcc ttatcggtgg gctggctgct cagaaagttg gggaagaaaa acaggcggag 
aatgaagaac atgataaaga gtttcacaag tataatgagc cacagggtac gtccccaggg 
tcatgcttcg aaagccttct acatagaaat tccagatgtg gagcagtgta ttcttcataa 

<210> 1548 
<211> 2073 
<212> DNA 
<213> B.fragilis 

<400> 1548 

tcaataatga agcaaataca acctgttttt ttagctatga agttttcttt tcttattttc 



ttctttttta gtttgcctgt cggtgcacaa agtatttttc agaaatacgg gcgtttcctg 
actgaacctc gttcatatgt gtgttatcgt acggatggta aactgaaaat agatggcaaa 
ctggatgaag tttcgtggca gaaggcaaaa ccgacagctc cgtttgtaga tatcagcggg 
gaggggtttc ctacacctaa atatgaaact acggccaaga tgctgtggga cgatgaatat 



60 
120 
180 
240 



60 
120 
180 
240 
300 

ctttatatag gggcgatgtt gcaggaggat gatatcaagg cacgtcttac acagcgggat 3 60 
actattattt attatgataa tgattttgag gttttcatag atcctgactg ggacgggcac 42 0 
aactattttg aaatagaaac caacgcgcgt ggggtcatat tcgacctaat gctggacagg 480 
ccttatcggt ctagtggaaa ctttatggta caatgggatt gtccgggatt aaagttggcc 540 
attcatcgtg aaggtacgct gaataagtcg aaagacaaag ataaatattg gagtgtggag 600 
atggctattc cacataaagc attgactatg aatttcaata atccattgaa agctggtaat 660 
tgctggcgaa ttaatttttc acgtgtacaa tggctgaagg caggaggacc tgaagaaaat 72 0 
tgggtttgga cacctaccgg aaaagttgat atgcatatgc cggatcgttg gggatatttg ™ n 
ttttttgctg ccgagaaagt aggaacaccc gaacatactt ttgcattacc gtataatgct 
tctgtgtata aactgctttg ggctatgttc tatgtacaac aggaaaggta tgcgaaagag 
aaaaattatc tgcgtacgga acaggatttc ttcctgacag atgccgaatt gaaaggtctc 
ccgcaaggtg cgcaaatctc ggttgaagcc acttggaata catatcaaat agccattact 
gttccgggtg aaggcagacg ttacatcatt aataatgagg gacggttttg gacagaaaag 
gttgctccgc gtcaagtgaa gaactgggtg tggacacgta taaataagag taaaagtgaa 
acggattacc gccaatggtt tgccctgctt aaagagtgcg gcatcagcgg ggtgatgttt 12 00 
gaaggatatg atgaaaacct ataccgcatg tgtaaggaag ccggtcttga agctcatttc 12 60 
tggaagtgga caatgaatcg tgccgaattg ctcaacgtac atcccgattg gtttgcggtg 132 0 
aatcgtaagg gggagtctac gcatgataag cctgcttatg tggattatta ccgttttctt 13 80 
tgtcccaacc atgaaggagt ggcccaatat ttggcagatg attatgtgaa gatagctcat 1440 
ttaccctatg tagatggagt acatttggac tatgttcgtt tcccggatgt agttctgcct 1500 
gtcagcttgt ggaaaaatta tggaatagaa cagacaagtg aacatcctga atatgattat 1560 
tgctattgtg atgtttgccg tactaaattt aaagaacaaa cagggcgtga tccgttggag 162 0 
ttaaagtacc cgatggaaga tcagtcatgg atcaatttcc gcttggatgc gatcagtcgt 168 0 
gttgtcgacc aaattacaaa agcggtgaaa gccgatggga aagcaatttc tgctgctgta 
ttccccggtc cttctatggc caagaaaatg gtgcgtcagg attggggtaa ctggtcgctg 
gatgcttatt tcccgatgat ctataatggt ttttattacg aaggaccgga atggatcggg 



780 

840 

900 

960 

1020 

1080 

1140 



1740 
1800 
1860 



618 



cgttcggttc aagagagtgt taagaccgtt gacggacgtg cgaaagtgta tgccggactg 1920 

atgtttcccg atataaagaa cgattttgag aaagcattgg atgaagcatt tgataacggt 1980 

gcatccggtg tttcattctt tgacggacca tcagacgaat atctgcatcg gtttaaagcc 2 040 

tatctggaca agaaaggatt aaagacggaa taa 2 073 



<210> 1549 
<211> 894 
<212> DWA 
<213> B.fragilis 



<220> 

<221> unsure 
<222> (778) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 1549 

gatcatgctg accgaccggg gttcgactat tcagaatggg agaaaatagg acttgcccac 60 

tctttcagta ctccatattt catgtcgaag gacttttatg taggctacgg atggtaccgt 12 0 

aaagcttttc cggtaaaaaa agagattctt ggcaagaaaa gttttcttga attcgatggc 180 

gtatttcaag aagcagagat tttcgtcaac ggacacttgg caggcactca caaaggagga 2 40 

tataccggat tttccatcga catatcagct tacctgaaag aagggaaaaa cctggtagcc 3 00 

gtccgagtaa acaactgttg gcgccctgat cttgccccgc gtgcaggcga acatgtattt 3 60 

agcggaggta tctaccggaa cgtacgtctg gtaataaagc cccccactta catcgattgg 42 0 

tatggcacct gggtcacaac cccggacctg gcagagaaca aaggtaaatc gggaagcgtc 480 

cacatacgga cagacgtatg taatgcttca ggaaaaacag acacttaccg actcctgacc 540 

accgttgtcg atgcacaagg caaagaagtg tcttcggttt ccacatccca agtattgccg 600 

gacaatgcaa cctacacatt taaacaacaa accaaagaaa ttcaggcacc tcaactgtgg 6 60 

catcccaatc atccggcact atataaagtg ataagctcac tctatcacgg acaagaattg 720 

atagaccgtt acgaaacaac attcggattt cgctggttcg aatggactgc agaccggnga 7 80 

tttttcctga atggggagca cctttatttt aaaggagcca atgttcacct agatcatgcc 840 

ggatggggag acgctgtaac ccaaaaccgg aatgccaaaa aaaatctccg gtag 8 94 



<210> 1550 
<211> 1026 
<212> DNA 
<213> B.fragilis 



<400> 1550 

ttacggagtg agcggatgag aatacttgtt gccagtaact cgacttccaa gcgtacggac 60 

tattttatca aagcgggtag aagcctcggg gcggacacct gctttgtcac ttatgacgag 120 

ttgtcggccg ttcttcccga ttgtcgcgat acggttgtaa agctggagcc tccggtgttt 180 

cgggaggcgg actttcggaa atacaatttg ctctgcgagg agtatagaag tctgttgtcc 2 40 

cgactggccg atatggataa gtcggaaagt gtacactttc tgaatgaacc ggctgcaatt 3 00 

ctttgtgcac tcgataaagt gtatactcag cggaaactga ccggggccgg cctgaaaaca 3 60 

actccgttgc tttcggatgc gcttagcaca tttgatgatt tggccgccat actttgccgg 420 

cagaagaggg gaggatttct gaaaccccgt tatggttccg gggccggtgg gattatggct 480 

gtcaggtata atcatcgccg ggatgaatgg gtggcttata cgacgatgtc ctgggaagga 540 

gggcgcgttt gtaatgcgaa acgtatctgc aggctgacga accggaaaga gattgccaca 600 

ttggcggaag aagtcatacg gtgtggggct gtccttgaag aatggatggc aaaggaaaag 660 

ctggaaggtg agaattatga cttgcgtgtt gtctgcaggg gggatgaagt cgattatgta 72 0 

gtggtgcgtt gcagtgacga tgccataact aatcttcacc tgaacaataa agcgaggctg 780 

ttcgaagaac tttcgttggc tccttccgtt cgtgaagagc ttttctgtcg gagcatcact 840 

gccatgaagg ccttggggct gcgatatgcg ggcatagacg tgctgatagc ccggaatacg 9 00 

gacacacctt atattataga ggtcaatggg cagggagacc atatctatca ggatatgtat 960 

acggaaaata agatatatgc caatcagata aaaacgatag aatcactttt caatggaaat 102 0 

agatga 1026 



<210> 1551 
<211> 1236 



619 



<212> DNA 

<213> B.fragilis 

<400> 1551 

cgttataacg atatgaagaa gaaaaagatc attcttattg ccgtaagcct cgccatactg 60 

gcaggcggag gggtttggct ctttggcggt tctacggcca agcacaaagt gacctatgcc 12 0 

acggcaaccg taagcaaagg cgagatatcg gagtcggtaa ccgccacagg aactatcgaa 18 0 

ccggtaacag aagtagaagt cggtacacag gtatccggaa ttatcgacaa aatctatgtg 240 

gactataacg cggcagtgac caagggacaa cttatcgctg agatggaccg tgtgacactg 3 00 

caaagtgaac tcgcctctca acgtgccacc tacagtggtg caaaggcgga atacgaatac 360 

caaaagaaga actatgagcg caacaaaggg ttgcacgaaa aggggctgat cagcgatacc 42 0 

gattacgagc aatcgctcta caactacgag aaggccaaaa gctcgttcga aagcagccag 48 0 

gcttcactgg ccaaggcaga acgcaacctg tcctatgcca ccattacttc tccgatcgat 540 

ggcgttgtca tcagccggga tgtggaagaa ggacaaacgg tggcttccgg attcgagaca 600 

ccgactttgt tcaccatcgc agccgacctg acccagatgc aggtagtggc cgacgtagat 660 

gaagccgata taggcggcgt ggaagaagga caacgggcca catttaccgt agatgcctat 720 

ccgaacgatg ttttcgaagg aatagtgacc caaatccgtc tgggagacgc aagcagtacc 7 80 

agcaccagca gctcgtctac taccgtagtc acatacgaag tagtgatctc cgcccataac 840 

ccggacctga aactgaaacc ccgcctgacg gctaatgtca cgatctacac actggacaga 900 

aaggacgtgc tctctgtacc ggcacgtgca ctccgcttca caccggagaa acccctgatc 960 

ggcgataatg acatagtgaa ggactgtgag ggcgaacata aaatatggac acgtgaagga 102 0 

aatactttca cggcacaccc cgtgcagata gggatcacta acggcatcaa tacagaaatc 1080 

acccaaggtg cttccgaagg catggtagtt gtcaccgaag ccaccattgg aaatatgccg 1140 

ggcggcaatg tatcgcctga aggcggacag gaaggcggag gagaacaaag tccgtttatg 12 0 0 

cctagccatc cgggcagcaa gaagaaagga aaataa 123 6 

<210> 1552 
<211> 621 
<212> DNA 
<213> B.fragilis 

<400> 1552 

aaccccgccc gtcgtgaagc ggagaaaagt cgcaccgaag cagagttgaa gaacttgcga 60 

aaccaactta acccgcattt tctgctcaac acgctgaata atatttatgc actcatcgcc 120 

tttgacagcg acaaggcgca gcaggccgtg caggagctca gcaagttgct acgctatgtg 180 

ctctatgaca atcagcagaa ctatgtaccc ctttgtaaag aggtagactt cattcgcaac 2 40 

tacatcgaac tgatgcgtat ccgtctttcg ggaaatgtag aggtcattac acaattcgac 3 00 

atacagccgg acagccggac ggagattgct ccactgatct tcatctcact gatagagaat 360 

gcctttaaac acggcatctc ccccaccgaa ctgagtttca tccacatcct catctctgaa 42 0 

aacaaagagg agatccggtg tgagatacgc aatagttatc atcccaaaac caacacggat 480 

aaaagcggat cgggtatcgg gctcgaacag gtaaggaagc gcctcgaact ctcttatccc 540 

ggacgttatc aatgggataa agccatctcc ccggatggca aagaatatat atcgaaatta 600 

ttaatattta atcatccata g 621 

<210> 1553 
<211> 780 
<212> DNA 
<213> B.fragilis 

<400> 1553 

gcatggtgcc gaatggacca atccatgcaa gaacttaaca aacacagatc aatattgcga 60 

gctatgaaac cctacatttt tccttcgtcc atagagacgg cacgtgcact gatattacat 12 0 

ttggtgaaac tgatgttaga tgaaccggac aggacctttt gtatcgcgtt tagtggtgga 180 

agcactccgg cactgatgtt tgacttatgg gcgaatgaat atacggatat cactccttgg 240 

gaacgactga aagtgttttg ggtagatgaa cgttgtgtgc ctcccgaaaa ttcggacagt 3 00 

aattatggca tgatgcggtc gttgttgctg agtattgtac ctattccgta cgagaatgtg 3 60 

tttcgaatac agggggagaa gaatccgaag aaggaggctg cccgctattc gaagctggtg 420 

atgaaagaag tgccggtgga gaatgagttc ccgctatttg acgtagtgct gctgggagca 480 

ggtaatgacg gacatacgtc gtctatcttt cccggacagg aagaattgct ttcaactgat 540 



620 



catatatatg aggcgaattt taatccgaat aacggtcaaa agagaatagc tttgacagga 600 

cttccgattt tgaatgcccg aaggatcatc ttcctgataa caggaagggt gaaaagtccg 660 

gttgtagaag atatcttcta ttcgggagat accggaccgg ccgcctatat agcgcatcat 720 

gccgataacg tggaactatt tatggataat gcagctgctg aaaaagtcat tcgcggataa 7 80 

<210> 1554 
<211> 1281 
<212> DNA 
<213> B.fragilis 



<400> 1554 

tatcaagcgc ttgtattaag atttgtagta ttaattaata atttgtttta tatgagaata 
aaaaggttat tgtatgctat tgctacaata cttccctttc tgtttctctg ttcatgttat 120 
gaagaacagg aacctcaaca ggagaaacag gataaggaaa aatggacaat gcaggttgcc n Qn 
ggtaatcagt taaatgaatt tttaaatatt aatccggatt tacggaacct ttacgcttat 
ccggactggg atgctgcgca gattataagg gagcggagcg atacagtttc atattacgtc 
cctgtagtgg atataacagc tgatacatgc tcttatttaa taatagcacg cgcttcgaat 
gatgtttatt tgtacatggt aagacttcct gaggaatact ccggctttga ttcctttttg 
gaagaacatt taaaaatatt acggattatt gatggtgccc ggagagtccc tgttggatat 
ttgcataatt ttccggatga tgtactgact cgtacccgtt cttcaggctc tctgtttaat 
catgaccgcg aaacgaatac ggaaattctt gaaaataaca cctttgttaa agacgatctt 
tttggtgccg gtttttcgtt acccgaagtg acagtaatcg gtagacgtcc tacatcttct 
gaagacccgt ttaaatggcc ttttggggat atgccttctg agtctccgaa agcccttccc 
ggacttgatg actttttttc tcctcaagga ggcggttcat cttcgttatc ttcgccacag 
caatcaggct ctttgcctaa acctgaagaa gtcattaagg atgcaactgt aaaaaaggct 
ttggaagaag cctggagtga tatgcttaag cgttccacaa aagatcagag gcaagaggtc 
ggtttctgga tttattatga tccggtgaaa aagcaatatt atataggtaa gaaacgatat 
ggtatagcag tgaagaatga cggaaaagca agagggaata taagccttgg agacaaatcc 
ccttctgtaa atggtgtgcc tgccacagca aaggtggttg cttcttttca tacacacact 
ccaatgactg aaataaaagg catgaaaaga aaagtaggtc catctaaaga agataaaggg 1140 
aatgctgata aaaataggat tccaatcatt gtttatgatt acattggtac aaaagatcct 1200 
cgaacaaatg attattatgt tattggtgga cataaagtaa gtgaccccaa aaaaatgtat 12 6 0 
atttaccaac ctaagaaata a 

<210> 1555 
<211> 1260 
<212> DNA 
<213> B. f ragilis 



60 



180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 



<400> 1555 

aaacaattga gtatgaaaat gattttaaat ttttctgaat gtattattag ttttttctcg 
ttatgtttat tgtgtgttgt cttctcttgc gatgaaatgg atattgacca aggtcctcca 
acttccgtta cgcgtaatct gatgacatca gatggtcccg gttttaagat cgatacagtt 
acgtacgata aaattccggc tgaatatgcc cggaaaatat tgtctcttga agaacctact 
tcagttgtaa ctgataagag caagcgctct ttcagagtga atgaactttt tcaaattaga 
aaagcaaatg atcaattgct ttcaattacc agctattccg ccaagacggt ttatgatctc 
atacttgaag tctatgtaga aggtggttcc cagtatgttc ctattgctta tctggactcc 
ataccgggat tctcacaatt tgagtttaag ccatcgttga tcaatggaaa tttcatatat 
aaaaaggata acggtgtgga tactctgtcc ctttcgagcc tgaacgaaaa gagaatgaaa 540 
tttcgtttac ttagcgatga taagcatttt gaaatgcttt ctaaaataga tgcggagtgg 
aatatttctt tttcaaatta tgattggaaa ccggggtatg aaagtggttc atggcgcgag 
ttgagtgcca tctatgcacg tgagtgggtg gtcattatta caaattatgc ctatatgatg 
actactcccg agtatgcttt tatcatgaga aattttagta aaatatttgg tggagaactt 
tatgataata accgtgttaa gttcacaccg gaaaagtatt tatcagaaga aaaacgtttc 
aaacaaccgc ataactttgt ttgtggacga tctaaaccct ctgttggcgg tttgggcgga 
ggaaacgtgt ggggagtaac tcactggaat tattatggtc attatgcttc ttttagcggt 
tgggaatcaa ttacacatga atttatgcac tgtatgggat atggtcattc tagtaacatg 
acttatgctt ccggtggagt gggatggacc gagttcatgt ggcaactaca tacttatttg 1080 
agagggaatg attggctgcc atatacggat cggaatctgt taggttttca taagccggaa 1140 



60 

120 

180 

240 

300 

360 

420 

480 



600 
660 
720 
780 
840 
900 
960 
1020 



621 

aacgcgaaat atcgtgatgg tggaattgac cctgataaac tgaatgataa taagattctg 1200 
cagttttata ataaaagtaa agttacccaa tattttttag ctaatccgtt gtctaaataa 12 60 



<210> 1556 
<211> 477 
<212> DNA 
<213> B. fragilis 



<400> 1556 

actaaaatta ttatctttgt tcccatgaaa caatctctga catcagcccg ccgtcctctg 60 

gaaatcctga tacacatcat cagttggggg attgtgttcg gtttcccgtt cttcttcatc 12 0 

gatcgtacag gagacagtat caattggcat gcctatctgc gtcattctgc cgtacccctc 180 

tcttttgtca ccgtattcta tttaaactat ttcctcctcg ttcctcatct cctcttccag 240 

gaacagaaga ataaatacat catctacaac atcttattgg tctgcctcat cggactgctg 3 00 

ctgcatatct ggcaaagcct gaatgccccg gctcccactc ttaaaaaacc gcatatgcct 3 60 

cccggatggg atttttttcg taagagacat tctaagcctc atcttcacca tcggactgag 42 0 

tgccggcatc cgcatgagtg ccccgttggg gacaagctga aaccccgccc gtcgtga 477 



<210> 1557 
<211> 1548 
<212> DNA 
<213> B. fragilis 



<400> 1557 

tgtccgaata tgtatatccc tgatccgaat agaatgatgt cctctttgag taccgtccgg 60 

agtatctatt atagaggtag tctggagcat tgcaattata cgtgttcgta ttgtccgttc 120 

ggcagaaagt ctgtgtctgc cgatacgaca gaagatcagg aagcattgga tcgctttatt 18 0 

tcccgtatcg gcgggtggaa atacggttca ttacgcatcc tgattattcc ttacggggaa 240 

gcgatgatac atcgctacta tagagagggc atcatgcgcc tggccgctat gccccatgtg 3 00 

attggagtct cttgccagac caatttgtcc ttttcggtat cccgtttttt agatgaggct 3 60 

gaggcggagc aggcagatgt gtctaagttc aggttttggg cgagctatca cccggagatg 42 0 

gttggggtag gggagtttgc atccaaagta gagatgcttc gtgcggccgg catcggggta 480 

tgtgcagggg cagtcggtga tccttcggca aaggaacaaa tccggaaact gagacagctg 540 

ctggatccgt cggtttacct gtttgtgaat gccatgcagg gattgcggaa gccgctgtcg 600 

gaagaggata tccgtttctt tggtgaaata gacaatctgt tcgattatga ccggagaaat 660 

gcaaaggcgt gcttggacgg ctgtgtggga ggtagggaaa cactttttat cgaccggaaa 72 0 

ggggatatgt atgcttgccc gagaagtggg atacggatgg gaaactttta cgatgacccc 7 80 

acttcggatt ttcagccctt ctgccttcgt aaagtttgtg attggtacat tgctctcagt 840 

aatttgtgcg atacgccctt gaggagaatg atgggggatg gcgctatgtg gcgcatactc 90 0 

gaaaggaaga aggtggaagc tgtcttcttt gatgtggatg gtacgctgac ggatgctcag 960 

ggacggattc cggaccgtac ggtttcggta ttggagtata tggctaagcg tttgccttta 102 0 

tatctgagta ctgctttacc ggtgtcgcat gctaaaaaac ggcttggcaa tgtgttcggc 1080 

ctgttctcgg gcggagtttt tgcggacgga ggtctgttat gctacgggga aactatcgaa 1140 

tgtgttccga ttgcaaatcc tgtgactgcc ggttttccgg gttgcagggt gacccgttat 12 00 

acccgggagg ggaaagtctt taaatatgct gtgcttgcac cgaatacccg ggaagctgtc 12 60 

cggtggctga ccgaattgga tgaagaggcg tatcaattgt atcaggaggg acgattgctg 132 0 

acggtggtag acagtaaagc cggtaagaag aacggtctga ttactctgtg tgctcgattg 13 80 

gggatttctc ttagggaggt tttggtagta ggcaatacga tgcatgattg gccgatgatg 1440 

tccgtagccg gctattcttg tgccgtgatg gatgcggaag aaaagttgag gaaactatcg 1500 

ggatatgttc tgaaccccga tagtattcct gtattttttg atatctga 1548 



<210> 1558 
<211> 1188 
<212> DNA 
<213> B. fragilis 



<400> 1558 

aacctcgtaa tgaagaaaat acacatagga cttttgccac gtatcatcat agctattata 
cttggtatcg ctatcggaaa tttcctgccg acacctttgg tacggctgtt cgtgaccttc 



60 
120 



622 



aactccatct ttggagaatt cctcaatttc tccatacctc ttatcattct cggactggtt 180 

accattgcca tagccgatat tggtaaagga gccggacgaa tgctgcttgt cacggcactc 240 

attgcatatg gtgccactct tttctccgga tttctgtcct acttcaccgg agccgccatt 300 

ttcccttcgc tcattactcc gggagcacct ctcgacgaag tgagtgaggc gcaaggaatc 3 60 

ctaccctatt tctctgttgc cattccaccc ttgatgaatg tcatgacggc actggtcctt 420 

gcctttaccc tcggcctggg gttggcaagc ctgcatagtg acgccctgaa aaacgtagca 480 

cgagactttc aagagatcat cgtacgtatg ataagcgcag tcatcctccc gttgctgccc 540 

atttacatct ttggtatttt cctcaatatg acacactccg gacaagtatt ctccatcctt 600 

atggtgttta ttaaaattat cggcgtcatt ttcatactac atattttctt gctggttttt 660 

caatattgca ttgcggcatt gtttgtccgt aaaaacccgt tccgcttgtt gggacggatg 72 0 

ctgccggctt atttcaccgc tttaggcact cagtcatcag ccgccaccat ccctgtcaca 780 

ctcgaacaga ccaagaagaa cggtgtatca gccgatatag ccggatttgt catcccgctt 840 

tgcgccacca ttcatttatc cggaagcacc ctgaagattg tggcctgtgc tttagcctta 900 

atgatgatgc agggcatgcc tttcgatttt tccctgtttg caggtttcat tttcatgctc 960 

ggcatcacga tgattgccgc tcccggcgtt cctggaggcg ctattatggc ttctttaggc 1020 

atcctccagt ccatgctcgg tttcgatgaa tcggcccagg cattgatgat cgccctctac 1080 

attgctatgg acagtttcgg tacagcttgt aacgtaaccg gtgatggagc catcgctctg 1140 

attatagaca agatcatggg gaaaagaaaa actcccgaaa gcctctaa 1188 

<210> 1559 
<211> 450 
<212> DNA 
<213> B.fragilis 

<400> 1559 

aatagaatga tgaaaagaaa acttgaaata cataagattg acgtatccag cagtttgccg 6 0 

atcccatacg ccgatgaagg tatacgggcc ggtttcccgt caccggcaca agactatatg 12 0 

gagcaagcca tagatctgaa caaagagcta atcaaacatc cggccagtac attctttgga 180 

cgtgtagtag gcgattcgat gcgggatgaa ggcatagaag aaggagacat tctggtcatc 2 40 

gacaaatcac tggaattaca ggatgacgac cttgccgtgt gttttattga tggagatttt 3 00 

actgtaaagc gggtacgaat tgaacctaat gccgtctggt taataccggc gaatccgaaa 3 60 

tactccttga ttaaagtaac aaaggagaat gaatttatcg tctggggaat agtgacctat 42 0 

accatcaaaa agaacaggag aaaaagataa 450 

<210> 1560 
<211> 960 
<212> DNA 
<213> B.fragilis 

<400> 1560 

tttatgaaga tagtagtttt agacggttat gccgccaacc ccggagatct gaactgggac 60 

gaattgagaa ctttgggtga gtgtgaaatc tatgaccgca cggctcccga cgaggtactt 120 

gaacgctcga aagatgcaga agcgattctt accaacaagg tggtgatcac ggcagaacac 180 

atggcatcct tgcccaacct gaagtatatt ggcgtaatag ccaccggata taacatcatc 240 

gatgttgccg cagccaaaga gcgtggcatt accgtaacca atatccccgc ctacagcact 3 00 

ccctccgtcg gacaaatggt ttttgcccat atcctgaaca tcactcagcg agtacagcat 3 60 

tatgccgacg aggttcgtca aggacgctgg actcagagtc aggatttttg ttactgggat 42 0 

actccgctta tcgagctgtt gggaaagaag ataggtctta tcggcctggg acaaaccgga 480 

tacaacacag cccgtatcgc tatcggattt ggtatgaaag tgtgggctta tacatctaaa 540 

tcacgtctcc aactgcctcc tgaaatccga aaagcagaac tcgaccaaat tttccgcgaa 600 

tgtgatattg tcagcctgca ttgcccactg acggaatcaa cacgtgacct ggttaacacc 660 

cgtcgcctgg aactgatgaa gcccaatgcc attttaatca ataccagccg tggtccgctg 72 0 

gtcaacgagc atgacctggc agaagctctg aataattaca aaatctatgc cgccggactg 7 80 

gatgtgctct ccaccgagcc accccgtgcc gacaatcctt tgctgactgc cagaaactgc 840 

ttcatcaccc cacatatagc ctgggctact tcggctgccc gcgagcgcct gatggctatc 900 

ctggtcgaca acctgaaagc ctatatcggc ggcaagcctg tgaacaacgt ggccaaataa 9 60 



<210> 1561 
<211> 804 



623 



180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
804 



<212> DNA 
<213> B.fragilis 

<400> 1561 

aacatacggg tggctatgat taaaaaaagt gataagaaaa atgtgaaaaa atgtataaga 60 
acgaatttta atttttatct ttgtccaaat ttgtatgcga tgcttgtccc aaccatgacc 120 
accgaagagg tgtgtaaaga aataaagaat gactatccgg ctttttatga aaaaatgttg 
gataataagg ctagtaacta ccgaaagttt attaaagctg tcctatttcc ggttatacat 
cagttttcat ggaaatcgtc atcgggtaat atgtggaatg tgataatgtt ggctcgttat 
cgtaatgaga gaaaatgtcc cggtattgtc ccttacctta aatatgaaaa ttggggtatg 
ggaattattt atcctaaaaa tatatatagt aatctgtcta taattgactt taaacctcat 
ttttggaaaa ggtatcggga gcgtcagcta atacccaacg gcttagaagg gatttctttt 
gatgaacaaa taaaatattt ctttttaaat agcggtctct ttacttttga tttcagagaa 
ggctctaata aaggacatga gggttttgtc gggtatacta agaccggaat tttctttggt 
gtcgtaataa aagagttgga ttatctctgt gtcaaaactt atgtgtctgc taatatgctt 
tttgataatc agatagaaag cttggatagc gctgatgagt taagagagaa gatattgtcg 
catccggact attttcagaa aagagggaaa ctctttcata tcatgaatga ctcttctttt 
tggatggatg agacgatacg ttaa 

<210> 1562 
<211> 864 
<212> DNA 
<213> B.fragilis 

<400> 1562 

aatcactttt caatggaaat agatgaatta ccctccggca cgcaggatca gaagccggac 60 
atagatatga atgaaattgt aggtacgcat gatatactga tgctctgttt cgatactttg 120 
cgttatgacg tcagcgtggc cgaagaagcc tccgggggga ctcctgtact gaatagctgt 
ggcaacggtt gggagaaacg gcatgctccc ggtaatttca cttatccgtc tcacttcgct 
attttcgcag gattcttgcc gtcacccgcc gagccgcata tgttgcgtaa ccgaaagtgg 
ctcttttttc cctttcaggc cggtacggga cgtatacctc ccgaaggcag ctatgctttc 
aaagaggcta cgttcgtaca gagtctggct caggtaggtt atgaaacaat ctgcatcgga 
ggagtcaact ttttcagtaa gcggaatgat ataggaaggg tatttcccgg ctatttcaat 
aagagttatt ggctgccgac tttcggttgc acggataaga acagtgctgc caatcaggtg 
gactttgccg tcgacaaact ggaaaagtat ccggcggacc ggaaagtatt tatgtatatc 
aatttttcgg cgattcatta tccgaactgc cactacgtgg aaggaaaaaa gaaagacgat 
aaagagtcgc atgcggcagc cctacggtat gtcgacagtc agctgccccg cctgttcgag 72 0 
gctttcagga ggcgctcgga cacgttggtc attgccctgt ccgatcacgg gacctgttac 780 
ggtgaagatg gttacgagta tcattgcatc tctcacgaaa aagtatatac ggtgccttat 840 
aaacacttta ttctcagaaa atga 864 

<210> 1563 
<211> 1299 
<212> DNA 
<213> B.fragilis 



180 
240 
300 
360 
420 
480 
540 
600 
660 



60 

120 

180 

240 

300 



<400> 1563 

acactttatt ctcagaaaat gaacgaacaa cagcagattt cacgatatgt cagctatatg 
tacagttatc cgcataagac ggcttaccgt acgttgactc ctccggtctc tctttctcct 
tatcttgaac ggctggaagg aagggaggct agtttatatt tccacatacc tttctgtgcc 
cataagtgtg gctattgcaa tcttttttca cagcagtgtt gcgatgcgga gcgcatttca 
ttgtatctcc acacgatgcg ccgccaggcc gaacagctgt ctgtggcggc acaaggcctg 
aagtttactt cgtttgccgt cggagggggt actccgctta ttctggatga aggacagttg 360 
gaagagttgt tctgcctggc cgaactgttc ggtgtgcatc cttcccgggt gtttacttct 42 0 
gtcgagactt caccggaata tacgcaaaag agtgttttga ggcagttgcg ggcgagggga 480 
gtggagcggt tgagcatggg ggtgcagagt ttcaatgaga cggagttgaa gaaactgaaa 540 
agaagacccg gactcggtac agtagtcggt gcactcgaaa atattgtgga ggcaggtttt 6 00 
cctcagttta acctcgacct gatttatggt atcgagggac agacggtaga gagctttatg 660 
cgctctctga acactgcact tacttatcgg cccaacgagt tgtttattta tcctctttat 72 0 



624 



gtccggccgg 
aaatctgccc 
gtcaggcgcg 
ggggcgggag 
cagcaggcaa 
gctaacggtt 
atgtatcacc 
cggaatcttt 
cgtctcacgg 
gccgtgagga 



gtacacgcat 
gtgagttact 
aaacaacgga 
gccggagtta 
tagccgatga 
tcttgctttc 
ggggagtgga 
tccgggagtt 
aggaaggaat 
aactgatgtc 



cgatgtacgt 
ggtagggcaa 
aacggaattt 
tttgggaaac 
aatagaccat 
cacagaagag 
cttggcggag 
cacagatcgt 
ggcttattcc 
cgaatatgta 



<210> 1564 
<211> 1608 
<212> DNA 
<213> B. f ragilis 



tcgacggatg 
ggatttgtac 
tcgtgtggtg 
ttacactatg 
tatatccgga 
atgcaaatca 
tatgaaaaac 
ggatggattg 
gattatatcg 
tatccctga 



acataggtta 
aaacgtccat 
acgaagtgat 
ccactcccta 
ctaccgattt 
gattcattat 
gttttggtga 
aagagactgg 
gacaggcatt 



tgctatctac 
gcgccgtttt 
gctttcctgc 
tgctgtacgt 
tatgactgcc 
aaagaacctg 
gaagccggac 
ccggatagtg 
tatttcacct 



780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1299 



<400> 1564 

gttatgaata 

ctgtacgcgg 

ccggaaccgg 

tatatggaaa 

tgggtgccgg 

gggccttcag 

acattgggca 

gtgttccagt 

ggaaatctgc 

gcagcgatgg 

gatatgaaca 

agccacaacc 

ttgtcgcagt 

ggcaatgcgt 

aatacaccta 

tcgggttccc 

gcgaataacc 

accattacac 

caaaacggat 

aatttgaaga 

catttttatt 

aacactgcat 

gtacgatgta 

ctatttttaa 

tcaggaacta 

gctctggatt 

caaattaatg 



cgaagacaaa 
cttttttaat 
acggggagga 
gccatgttga 
tgaaagagcc 
cggtacggat 
acgacattta 
cggctgcgga 
tcaccagagc 
gaacgatccc 
gtgactttat 
tgtcggttag 
ttgggagcaa 
ctgcctggac 
cattcaatat 
gggcaatcac 
ggaatatcac 
tgaagtttga 
gtacggcgag 
gtacaggaaa 
ccttcaataa 
attatggtac 
cagatagagt 
aagcagcagg 
gtggtgttta 
ttggaacaac 
gttattctgt 



acttctgtat 

gataacatcg 

tatgattcct 

tactcccgac 

acccgccagc 

gacgctccga 

tttccggttg 

ttttacgaca 

aggaactgtc 

ttcatcctat 

ggtctatgac 

tttcacgcaa 

cacgtttacc 

gataggcccc 

agccaataat 

ggtgcatatc 

ctcaagccag 

gttggggata 

tgataagaat 

tgttaattac 

actttatgat 

aagctggcga 

ttataacgga 

aatgcgaccg 

tctaacttca 

atatatagtg 

ccgttgtgta 



<210> 1565 
<211> 1425 
<212> DNA 
<213> B.fragilis 



gtaggctttc 

tgtggcgaca 

gttacggtca 

acaaccgggg 

agggccatac 

gaagagccac 

atcgctttcc 

aacggggctt 

cgtgtgatcg 

acttataaca 

tcgggggaca 

aaactctgca 

aactgtacgg 

tctacaaata 

tcgactgcca 

ggtacgttga 

aatgtacagt 

caattggcgg 

gatttggcaa 

gtatgggcat 

ggaagcaccg 

acaccaacta 

ggtatgtggt 

gaaactggac 

acactaggga 

gtcactgata 

aaaggtacca 



gtgcgttaaa 

atgttgtgaa 

gccgggttga 

gaagaacgct 

cggctgccgt 

aggtcactac 

gtaaagtagg 

ccgctcccac 

gatactcgtt 

gtacttccgt 

tagcgaatgt 

agctgacggt 

gggtatatgt 

atgtaagtgc 

ctgtacggtt 

aacttagtaa 

tgctgccggg 

caagtgacat 

agctgagatg 

cttcgcaaac 

gagatccatg 

aaaatgaact 

ttatgaataa 

cgggcttaga 

accgtaaaaa 

ctggtgcttg 

aacaataa 



cacatgctgt 

ccccgacagc 

ggatggcagc 

tgtagacgaa 

gccttacgag 

ccgtgccgct 

cagcaattac 

actcaggcaa 

caatagtacg 

cactatcccg 

gagcacaatc 

taaactctcc 

ttctcaaggg 

caataccgga 

agtccctttt 

ttatttcaat 

aaagagttat 

taacctgaca 

ggctacagga 

agaaggcgga 

ttctaaacta 

ggaaaaactg 

ccgtttaggg 

gggaacggga 

tacttgttat 

gaatgctctc 



<400> 1565 
ccttgcatca 
gagaacaaac 
ttaaatatgg 
gatatggctt 
ctgggaggag 
ttttttattg 
tttcgcctga 
tctgaaatat 



acaatcacac 
ctttattcct 
caggtatttc 
tgggattggt 
gttttagcga 
tttcttcttt 
tatgcgggct 
ctcctgcccg 



atcgtataac 
atataccttt 
cggagccgtt 
ggtaagtatt 
ccgttatggc 
gggctgcgca 
gggtatcgga 
gctgagggga 



tactataaat 
atcaccgtct 
ccttttttac 
ctgaccgtag 
agacagaaag 
ttatccggta 
gtgatctctg 
acactggttt 



ttagaatcaa 
ttggtggatt 
aggaacaatt 
gttgcctttg 
tcatgttctc 
atttggtttc 
ccgtagcacc 
cctacaatca 



aatgaacaga 
gattgtggga 
catgttggat 
cggtgctttg 
gtcagctgtc 
actgctggtt 
tatttatata 
gttggctatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1608 



60 

120 

180 

240 

300 

360 

420 

480 



625 



gtgataggaa ttctaattgc ttacattgta gattatatat tgctggacta tgagcggaac 540 

tggcgcctga tgttgggatt cccgttcttc tttagtgtgg cctatctgtt gttattgggt 600 

atattgcctg agagcccacg ttggctttcg gctcgtggaa aagcaggtag agccaggcag 660 

gtggccagta aactgaacct ggaagccggt gagatgaccg tgtccgacac aaacacacaa 72 0 

gaaggtagag ataggataaa ggtaactgaa ttgtttaagg gtaacttggc taaagtggtt 780 
ttcataggtt ctatactggc cgccttacag caaatcacgg gtattaacgt catcatcaat 
tatgcacctt ccatcttcga gatgacaggt gttgccgggg atattgccct tgtgcaatct 
atcctggtgg gagtggtgaa tctgctgttt accctgattg ctgtgtggct ggtcgataaa 

gtagggcgta agattctgct tctttgcgga agtctgggga tgggcatctc attgctgtac 102 0 

ctggtttaca ctttcgtagt tccggcagcc aatggtatcg gtgccttaat agccgtgtta 1080 

tgctatatcg gattctttgc cgcttcactg gcacctttga tgtgggtggt gacttccgag 1140 

atttaccctt ctcgtattcg tggaacagct atgtcactct ccaccggaat cagctggtta 12 00 

tgtacttttc tcaccgttca gtttttcccc tggatactga ataacctggg cggatcggtt 12 60 

gctttcggaa tctttgccat tttcagcatt gctgcattcg cattcatttt gttttgtgtg 1320 

cccgagacca agggcaagtc actggaggca atagaaaaag agctgggagt ggataaagag 13 80 

gctgaggaga atgtgaaaga agaacatgct ttctcgaaaa tataa 1425 

<210> 1566 
<211> 555 
<212> DNA 
<213> B. fragilis 



840 
900 
960 



60 



480 
540 



<400> 1566 

ggagaatgtg aaagaagaac atgctttctc gaaaatataa taaacacaaa attggatata 

agtatgaaac aaagtaaaag aaatgtatgg tggtccgtct gtgcaggact tttgttctgc 12 0 

tcgtggggat gtagttcccg ggtgtcggat aagccggtaa cgctggaaac gttgttggac 18 0 

gaaatggtat cggtggagga acaggcgctg tatccggttc cctcttatac ctgtcgtcag 240 

gaaagtagct acgaccgtgc atcagtatca cccgattctg ccggttggtt tgccaatagt 3 00 

gacgggtttg gaatcaaacg ggtagatacg gtggcaggcc gcattgagaa agtgatgttc 3 60 

gacgaagtcg gacctggagc gatcactcgt atctggatta ctaccattga caagcggggt 42 0 

acgtggcggt tttattttga tggttctgat caaccgggct ggatcattcc ggctcatgac 
ctgttgcgga tcaatgtacc cggattggga aaggtaatgt tacatggcac acaccatcta 
tacgccttat ggtaa ^55 

<210> 1567 

<211> 186 

<212> DNA 

<213> B. fragilis 

<400> 1567 

tgcaacaaag ataagacgct tttttcatct gtgcaaatat ttatacatta ttctaataat 60 
cagttattta tctcttcttc ttatattcag tgcacagagc caaaacagtg tttcaaaaaa 12 0 
ggaacgtctt ttgaaacaca agagtcgttc tatctgtcta atttacacaa acttaccgca 
aaataa 

<210> 1568 
<211> 1512 
<212> DNA 
<213> B. fragilis 

<400> 1568 

tatatgaata gaacttttat tatgcgagag tgtctcggta aagccttatg gctgtgtttt 
tgcctttcga tagcaggatg tgccgaagat gacagaatga ctcccctgtc tgctgacagc 12 0 

180 
240 
300 



180 
186 



60 



ggggatactg ccgacgagtt aatccccatc catatcagtc tgacaggtga caacgactat 

cattcttcct cttttaacaa cgcttcgacc cgtagccact ctcccctgat cgccgaatgg 

gtgggggtaa aagctttctc acctacacgc acaggagagc aaccggacta tgacggtcca 

cggatagcct cgatggaact gacggaagat accctgcccc gtgtaagtac ccgtgcaaca 360 

gtgcctgcgg gagtctattt ccggctgatt gtttttcgga agtccggaaa taactatgtc 42 0 

ttccagtcgg ttgccgatta cgcctccaat ggtacgggca ctcctgtact caaacaaggg 



480 



626 



aaattgctga 
gctgccgact 
cctaacatga 
ctcagccaca 
agtcctaccg 
gggggaaact 
aataatacgg 
tttgcagggg 
gtgaacaaca 
tatacactaa 
aaccttacag 
gcggacggga 
agaggttatt 
gatccatgtt 
aacgaactaa 
tttatgaata 
ggagccaata 
accaaagaat 



ccicgctcggg 
tgggaactat 
gcaaggactt 
atcttccggt 
gatttccaag 
ccacatcatg 
cggcattcag 
ccagaacgat 
acaccgaaat 
aaatacagtt 
gtaacgggtg 
atctgaaatc 
actatacttg 
cgaaacttaa 
ccaagttatc 
gttccaaagg 
ctggtggtaa 
aa 



aacgatacgt 
gctttccacg 
catgaccttc 
gagtttcaat 
caatacgatc 
gaagatcggc 
tccgagcacg 
aacagtgcat 
cacgtccacc 
taaaaaggga 
tacggctcag 
aacaggaaat 
gtatagtact 
cgtttcaact 
tcggtgtaca 
gctctttctg 
tgcaataatg 



atggtgggtt 
tatgcctaca 
gattccggag 
cagaagttgt 
accaattgca 
ccttcgacca 
gccctaagta 
ttcaatacac 
caaagcgtac 
cccggcatca 
gataagaagg 
tctaactatg 
tacacaggaa 
tatggaaccg 
aataaggcaa 
ccattagctg 
aacggtaatc 



actcctttaa 


taccgctacc 


540 


acagcagcac 


aqtqtctatc 


600 


atataacgaa 


tgtaaatagc 


660 


gcaaactgac 


aattaccatc 


720 


cgggtgtata 


tgtaaagcaa 


780 


atgtggtggc 


caccaacacg 


840 


caaccatacg 


catggttccg 


900 


tgacggtagg 


cggacggatt 


960 


agttgaagga 


aqggaaaagc 


1020 


atgtgttgga 


gagtgatata 


1080 


atctggctaa 


attaatatgg 


1140 


tgtggaccac 


ttcaacagat 


1200 




CLCLCLL-CLClL-Clk_,V_. 


1260 


ggtggcgtac 


accgtcgcga 


1320 


aagtaaataa 


tggaatgtgg 


1380 


gacatacacc 


tagtgcatct 


1440 


gccatggtaa 


ttattggtgt 


1500 
1512 



<210> 1569 
<211> 213 
<212> DNA 
<213> B.fragilis 

<400> 1569 

cccccgttat ttattttatt ctccttctta cctagaattg ttcgaatatt ctttccgaat 60 

acaatgcata ttggaatcgt cttattcact aaacctatgc cttttatatt atattctgcc 120 

gttgtgctca tctatatttc taattatata tccttattct ataggttata cctaaactct 180 

cttgctccta ctccacttat aaactctgga taa 213 

<210> 1570 
<211> 330 
<212> DNA 
<213> B.fragilis 



<400> 1570 
cttttccctt 
atccgtccgc 
ggaaccatgc 
gtgttggtgg 
tgctttacat 
atggtaattg 



ccttcaactg 
ctaccgtcag 
gtatggttgt 
ccaccacatt 
atacacccgt 
tcagtttgca 



tacgctttgg 
tgtattgaaa 
acttagggcc 
ggtcgaaggg 
gcaattggtg 
caacttctga 



gtggacgtga 
tgcactgtta 
gtgctcggac 
ccgatcttcc 
atcgtattgc 



tttcggtgtt 
tcgttctggc 
tgaatgccgc 
atgatgtgga 
ttggaaatcc 



gttgttcaca 
ccctgcaaac 
cgtattattc 
gtttccccct 
ggtaggactg 



60 

120 

180 

240 

300 

330 



<210> 1571 
<211> 618 
<212> DNA 
<213> B.fragilis 



<400> 1571 
ataaaagaaa 
gaagccggac 
gattttgaga 
ttgcgcgaag 
gagattgata 
cggctgaaaa 
gacatacctc 
tctatccttg 
tgttatgact 
gggcacggta 



cgatgctctt 
gcggttgtct 
atgaattgtt 
tgattgagcg 
agataaacat 
cccgtcctca 
atactactgt 
ccaaaactta 
gggagcataa 
ctactccgta 



accctggtta 
ggcaggtgcg 
gaatgactcc 
ggatgcggtt 
cctgaatgca 
gcatctgctg 
tataaaggga 
tcgtgatgac 
caaaggttat 
tcatcgcatg 



aacgaagaac 
gtttatgctg 
aagcagttgt 
gcctgggcca 
tcgtttctcg 
atcgatggaa 
gatggcaaat 
tacatgaaca 
cctacgaaaa 
actttcaatc 



tgatagaagc 


cggttgtgat 


60 


ctgccgttat 


cttaccgaaa 


120 


cggagaagca 


gcgatatgct 


180 


tcggcattgt 


ttcgcccgaa 


240 


ccatgcaccg 


ggcggtagac 


300 


accgtttcaa 


gaagtaccct 


360 


atctttcgat 


agctgctgct 


420 


ggctgcatca 


ggaattccca 


480 


agcaccgtgc 


ggctattgcc 


540 


tgttgggcga 


cggacagttg 


600 



627 



gaattgttct caaaatag 618 

<210> 1572 
<211> 936 
<212> DNA 
<213> B.fragilis 

<400> 1572 

aatattatga gtaagaaagc ccttttaatg attcttgatg gttggggatt aggcgaccac 60 
gggaaagatg acgtaatctt caacactgct actccttact gggattatct gatggagacc 12 0 
tatcctcact ctcagttaca ggccagcggt gaaaacgtag gtttgcccga cggacagatg 
ggtaactcgg aagtgggtca cctcaatatc ggcgcaggac gcgtagttta tcaggatttg 
gtgaaaatca acctctcttg tcgtgacaac agcatcctga agaacccgga gatcgtttca 
gctttctctt acgcaaaaga aaacggaaag aatgttcatt ttatgggact gacttcggat 3 60 
ggtggcgtac atagctctct ggaccatctg ttcaaacttt gcgatattgc taaagaatat 
aatattgaga acactttcgt tcattgcttt atggatggac gtgacacaga cccgaagagc 
ggtaaaggct ttatcgaaca actggaagcg cattgcgcca agtctgccgg taaagtggct 540 
tccatcattg gccgttatta tgctatggac cgtgacaaac gctgggaacg tgtgaaagaa 
gcgtatgacc tgctggtaaa cggcattggt aagaaagcta ccgacatggt gcaggctatg 
caggaatctt atgatgaagg ggtaacagac cagtttatca aaccgattgt gaatgccggt 72 0 
gtagacggta ctatcaaaga aggtgacgtg gtgatctttt tcaactaccg taacgaccgt 780 
gccaaagagc tgactgtggt tttgaccaac aagatttgcc tggagccagc atgcccacaa 
taccggggat tgcagtacta ctgttttgat tccggaccag agctttggtt caagggttgg 



180 
240 
300 



420 
480 



600 
660 



840 
900 



gcatattttg gttccataat ggaaaacgtt ggctaa 936 

<210> 1573 
<211> 948 
<212> DNA 
<213> B.fragilis 

<400> 1573 

caaacgaaag gaggcaataa gatgcgaaca tatctttatt gtgaggccgg ctttgtggaa 

aaagcacaat ggcttcataa cagctgggtc aatgtagtat gcccggacag cagtgatttc 12 0 

aaattcctga ccgaaaccct gaaagttcct gaatcttttt taaatgacat tgccgatacc 180 

gacgaacgtc cgcgtacgga aacggaaggc aactggttgc tgaccatact gcgtataccg 240 

gtccagaacg ctcaaagcag tatcccttat accaccgtac ccatcggcat catcaccaac 

aatgaaatca ttgtttctgt gtgctaccat cagacggata tgattcccga tttcatcgaa 

catacccgcc ggaaaggcat cgaagtacgc aataagctcg acttgatttt ccgactgatc 

tactcttcgg ctgtctggtt cctgaagtat ctgaaacaga taaatataga catcaccgct 



60 



300 
360 
420 
480 



600 
660 



780 
840 
900 



gccgagaagg aactggagcg aagcatccga aacgaggacc tgctgcgatt gatgaagtta 540 
cagaagacac tggtctattt taatacttcc attcgtggca atgaagtgat gatcggcaag 
ttgaaaacca tcttccagga taccgattat ctggatgaag agttagtgga ggacgtgatc 
attgaactga agcaggcatt caatacggtc aacatctaca gtgacattct caccggaacc 72 0 
atggacgcct ttgcatccat catctccaat aatgtgaatg cgatcatgaa acgtatgaca 
agtctttcca ttacattgat gatccccacg ttaatagcca gtttctatgg catgaacgta 
gacatacatc ttgaggagat gcctcatgcc ttcctgctga tcatcctggt atccgtattc 
ctgtctgccc tctcctttgt gatcttcagg aagataaagt ggttttaa 948 

<210> 1574 
<211> 185 
<212> DNA 
<213> B. fragilis 

<400> 1574 

ttacgtttgg gccgaaggta tttggctttc ttaaacggga aaaaaaaaca gcttttattt 60 

tttgtcgcaa accacggaga aatagtgcgt acacgtaaca tgttattttt ttttaacgtg 120 

gggggtgtgt tgaaattcgc gctatgtgga ataacgaaat atgccggctt actctttgtt 180 
ttccg 185 



628 



<210> 1575 
<211> 579 
<212> DNA 
<213> B.fragilis 



<400> 1575 

cccgaaaaga cggaggcacc ggcaagggca aacccctttt tcggatggca cccggaacat 6 0 

agcttttcac ccatcaccac aaccgacttt gccaaagctg ccagagtgga acttgagcat 120 

cgcggcgacg gagcaaccgg atggagtatg ggatggaaac ttaaccaatg ggcacgtctg 180 

caagacggta accacgccta caaacttttc ggtaatctgc tgaaaaacgg tacactggac 2 40 

aatctgtggg atactcaccc gcctttccag atcgacggaa actttggagg taccgccggt 300 

atcacagaga tgctgctgca aagtcacatg ggcttcatcc aactattgcc cgcacttccg 3 60 

gatgcctgga aagacggaag catcagtgga atctgcgcca aagggaactt tgaggtagac 42 0 

ttgtcatgga aaaacggaca gcttgcagaa gcaaccatct tctcaaaagc aggcgaacct 480 

tgtacggtga gatacggaga taaaactctc tctttcaaaa caagtaaagg aaaagtttat 540 

aaattggctt tagatgcaga ccgactggtc atcaaataa 579 



<210> 1576 
<211> 270 
<212> DNA 
<213> B.fragilis 



<400> 1576 

agaaaaagga cgaatcagaa ggttcgaagt tatataaaag ccgatttatc acttttttcc 
aagttccgtg agaacttaat cacctttgga caaacaaatt taataaacaa atcagcatgg 
ttcagaaagg agcaagaaaa gaatgacata catacagtaa cctctaataa ttatgcttat 
gaagtagaaa aaacgaatcc tttacaaacc ttattccgga aacaaaataa aatccggaat 
cctaaaacaa actatcaaat taatgtttaa 



60 

120 

180 

240 

270 



<210> 1577 
<211> 189 
<212> DNA 
<213> B.fragilis 



<400> 1577 

ccttttgcaa atagagagag gcttattttt ctaccacctt cctctgaggc ggcacgtttt 
attcattatc tgctgctttt taataaaata gtggtgcaaa gagcagtgat aaagaaaatt 
attattataa attcttgcat attattaaaa gttatcgcat ctttgccgag agatattgtg 
attttataa 



<210> 1578 
<211> 288 
<212> DNA 
<213> B.fragilis 



<400> 1578 

ttctatcaaa taaaaatatc cccggaagaa aagtattttt tccggggata tttttgcttt 60 

attacacaga gttcactcca tccggagaag tatattcatc atacagagat atccaaacct 12 0 

aatgttactg ccatccggat cttactacta acgttctact atctttattg taaaatccaa 180 

aaagtatcct atttaacatt ttggtatttc tcaactccaa cattcctaaa tataaacact 240 

attaaacaga accggataga aaaccggaac gaaaagacaa aaccttaa 2 88 



<210> 1579 
<211> 1164 
<212> DNA 
<213> B.fragilis 



<400> 1579 

gatcacacat caagatctct tgaaaaatcg cttttatatg attttactaa ctctcaaaaa 



60 



629 



acggtttaca tgaaaatagg aatacttact tttcatgatg ctcataacta tggtgccatg 12 0 

ctacaatgct acgctctcca acagttcctc tttaaaaaag gatacaacgt agaagtgatt 180 

gactacagac cggcattcta tcaaaagcaa taccatcgtc acagtctatg cccgtggata 2 40 

gggaaaaatc cagtacgcac aatgaagagt atctattata attattatct tttcaacaaa 3 00 

cgttgtgccg cattctctga ttttcacaat cgccacctct acatgtctat accggctact 3 60 

cgtactaata taccacaaag ctatgatgct tatattgtag gcagtgatca gatttggaat 42 0 

ccccaactca ctaatggctt tcatgatgtc tatttctgcg atttttgttt tcctaaagga 480 

aagagtcggt tcattgcgta tgcccccagt atggaaatca gcagattatc tacacaagaa 540 

gcagaatacc taactcgtgt tcttaactgt ttcgatgctc tctctgtacg tgaatcatca 600 

cttattccca tactacagcc tctcgtcagc cagcccatcc aacaagtact tgacccgaca 660 

cttttattag atgcgacagc ttggaatcca ttaataggca aatgccctga aaatcgtcct 72 0 

tacgttgtat tatatcaagt tcgtgaaaac ccagcagtcc ggatgaaagc tttcgaaata 780 

gcccaatcta ttggaggcat agtagtagaa ctgacagcac gtatcgactg tcattattcc 

actaagtatc aaacagcctc acctgcagat tttgtcactt acatccgata tgcaacctat 

gtagtcacca cctcattcca tggaaccgct ttctcactta ttttcaatcg tccattttac 

acattttctc ttggagataa ttttgactcc cgttcagctt ccctccttga gtcagtaaat 

ctcaccgagc gtttagtatg ccccgacgaa cagtttgaaa taagtctaat tgatttcaaa 

caagcgaata aaagactgaa gcatttgaga aaacagtcat gtgatttttt aagacagtca 1140 

ttacacgaaa gacaagaaca ataa 1164 

<210> 1580 

<211> 2400 

<212> DNA 

<213> B.fragilis 

<400> 1580 

atatacttgt ttccggttta tcattcctat cgcatgaaac gctgtctgct atatatcctt 60 

tgctgctttg gggttgtgca actgtcagct cagccggcgg ataacatccg taattcgttt 12 0 

gtaaaggcgg aaacatttta taagagcgga gatatcgatg aagcttgtcg tatcttggag 180 

gagaatctgt cgtcttttca aggtacgatg cataccgagg cgtgccgttt ggctgcatta 240 

tgctgtttag ctttagaccg cttcccggaa gcggaaaagt atgtttcgct gttattgaaa 3 00 



840 

900 

960 

1020 

1080 



420 
480 
540 
600 
660 



840 
900 
960 



gacgaacctt attattatat ctcgcttcaa gatccggaac gctttgcgga tatggtaagg 3 60 
aaacacaggg agactaaagt aactctggtc acagcctcac aacaggtaga gaccccggaa 
gaagctccgg tgcctgttac tttaattacg gaggagatga tacgggctat ccatgcccgg 
tgcctgcggg atgtactgat agcttatgtg cccggtatat ccggtttgtc ttccaatgaa 
gagatgaatc tggctatgcg cggggtatat tctcccgaac aggagaacat attgattatg 
caggatggac aacgtctgaa cagttatatt actaatgctg tttcgccgga ttatggtatt 

agtctggcaa aagtcaaaca aatagaggtg ctgcgcggac cggcttcttc gctttatggc 72 0 

agtgtggctt tgacagccgt gattaatata gttactaaag acggggtgga tgttcgtaat 780 
ggttccatat ctgtcagtgc cggtaatcgg ggccagttgg ctgctgacct gctattggga 
aagcacgaca tgaacatgga ctttatggca tggttctcct tgtaccgtgc aacgggagaa 
tcggtttttg tcccggccga aaaacaatat gctctttacc ctagggacgg attcatccgt 

ctggacaatt attcgggatt tcctgccatg gatggaggaa ttaaattgca acgtggaaat 102 0 

ttgcttttta gtttcagtat gaattacgcg aagaaaaggc aaccttatag catgtggctc 1080 

ttttcttctc catattctta tgagcggttt cgtacttttg acggttccgg cccgggatat 1140 

tccagatggt cggcaagaga acaggctgtt tatagccgca cgtggcagcg gatcactttc 12 00 

agtaccgcat tttataccga ttggaataaa aacgtacatt atgaaacctc gggagatact 12 60 

ttacaagact atcctatttt tcctaactat gattatcaac ctattattta tccgactcgc 132 0 

ggagcatttc aatacatccg ctggatggat tttaacgtag gctttaatgg gcgtgtaaac 13 80 

tatgcctatg attgggggaa gctggggaaa ggcaatatgc ttggcggtgt ggaatggaat 1440 

cggtatacgc tttatgattc cgagtatttg gaaggtatga attttaaaga gatcgtcagg 1500 

acgtggatcg aaaaacgact ttacacggga cacgagatga acacggatgc ctttcttcag 1560 

attaaacaca acctgcataa aaattggatt gtcaatgccg gcatcaggta tgactataaa 162 0 

cggcgaagta ataagcggac attacaggct ttttctcccc gtctgtcact gatttatctc 1680 

aggaacggat taaatatcaa agccagttat tccagagcat ttgtcgatgc gccctattat 1740 

tacagaaaca atgaaatgga tacctattcg ggtggcgaaa atctgcaagc tgaatatcta 1800 

agttcatatc aggtgacttg tgcttatcat cattcacctt cacatataga tgtagagtgt 1860 

aatttatttt ataatcgggc gtcccatttc ctgttcacac agccggaaac acgtgtttat 192 0 

gaaaatgccg gttctctgga tatgggaggt gtcgaggttg ttgcccgtta taaggcagac 1980 



630 



<400> 1582 

atcctattac ctaaaaacgt atcagccatg aaaaagatag tctttttgat aatgtgtctt 



tgctcggtat atgcaaattc acaagaaggt atcccgtttt ttgtaaacta tccggcttcg 

gtttatcagg cgcacaatcg taatttcgat gtcgtttgcg acagttgtgg aaatgtctat 

tttgctaatt tcgaaggtat ccttcattat gactataacc gttgggaaac tatttataca 

ccgggttttt cccgtgtcac ccgtttgttt cgtgattcgg aaggtaggat atgggtggga 



cgactgagcc tggatggaaa cctgtgtttt cagaaagtat tgaattatac caattttttt 2 040 

gttaccgatg gatctgtgaa taacgtaccc ggcttttcta tgaatcttgt agcaaactac 2100 

tttttactta aaaagaaagt acagagctgg tctgcccatt taaaattgaa ttgcagtagc 2160 

cactgctaca ctcagataag tgtattggaa gatggactca atggagatgc gacgaattac 222 0 

acagtccgtt tgcccggtta tgctgtcttt tctttcacta ccagatataa gtataaaagg 2280 

atagaaggca gtctgggcat tgagaatctt ttcaataacc gatatgaatg cggaggagcg 2340 

actgttccta tccgccagaa aggaaggtgg atttccgcca gtatattgta taatttctaa 2400 

<210> 1581 
<211> 204 
<212> DNA 
<213> B.fragilis 

<400> 1581 

ctgcaattcc cctcccccga ggggaaagag gattctgttc ctctaccaat accaggatac 60 
gcctttaata caatggccca caattatccg ggtctggcag atatcctgaa aagattaggc 12 0 
attaacgagg tgaacgaagt aaatgccatt ctcagactgc cagattatga aagaaaggga 
acagtacagc ccctcttttc atag 

<210> 1582 
<211> 2865 
<212> DNA 
<213> B. fragilis 



180 
204 



60 
120 
180 
240 
300 

ggatataacg tgttcggacg gatagagcgt gatggacgcg gatgtattac gttgagaacc 3 60 
cttctttccg atctggacac agactctttg ggcgaattgg aagatatggc agaaatagat 
aaacgcattt atctgaaagc tacatcagga agatattata cggttcagtc agattccatt 



420 
480 



780 
840 
900 
960 
1020 



ttgacaccgg ttcaggtact tccggcccaa ttgcaggaaa agtggagaaa tcaatcctca 540 

gtctcaatga atcgggcgtt ttctctaccc ggaggagaaa caatctcgat aaactctgcc 600 

catggattaa taatggatga cagtgggaaa aaagaacggt tttccgtaac tgagagaaat 660 

ggcttatgca gtaacgctgt ttcgggtatt gcggccgatg ggcgtggcaa tctctgggga 72 0 
gctacggata acggtgtgtt tcatgttttt attccctctt tgttcagccg ttatacctct 
ggtgaaggtc tgaagggaga ggtaatatcg gctgtctctt acaaaggaat gatatatact 
gggaccttac agggattata cgtattgaag caaaacactt ttgttcctgt tcagggaatt 
tcgcaagcat gttggcgact gtgtctttct ccacaaggag aattatatgc agcgtccggc 
gatggtgtgt atgtgatccg ggattataat cattcggaaa aactcacaga tatggcagct 

tattcattgg cattcattgg gcatacgaac cttctgatgg gcactatgga tgggatttat 1080 

caatattcgg cagatgagga acggataaag aaaatttccg atgtcgaaaa agtcgttcgt 1140 

ctggaagtga agaaagatcg ttctgtatgg gctaaaacgt tgtacggaga aatctatttg 1200 

cgggaagagg gcgaaagctc ttttgtactt caagacagag agagtgaaga agtaatgacc 1260 

gagtatacgg acaacgatgg ctgtcattgg cagacaaatc tgaaaggaaa agaggtacag 13 2 0 

gtacatcatt cgcaaattga tacggagaaa ttcaaccaat gcctgtatgc cattcggaat 13 80 

tatgtcgtga gggtgattta tattgaagaa gacagggcgg catggttcgg cggtgatttt 144 0 

ggtttaattc ggatggacct tgaaaaggcc cggacattta ctccggttgc tccccgcatc 1500 

tatcttcgtg aaatttgtct gaaccgtgat tctgtctatt ggggaggaga tttgccggaa 1560 

gagtctgggg gagcggactg gcagataaat agtacagcac ctcgtttggg gaatgatgta 162 0 

cgttccattc gtttttcatt tgctaccgat gcgccctgtt tcaccggctc caatgagtat 1680 

agatatcgcc tggtcggtta tgatccggaa tggagttcgt gggatcccgg gactgtgaaa 1740 

gagtatgcca acctttcgtc gggtacgtat acgttttgtg tccgggcccg tgatatttac 1800 

ggtacagaga gtgaaatgaa ccaattccgc ttttcgttat tgcctccttt ttatttgcaa 1860 

tggtattgtc tgattctata tgccgttgct tttggaatgc tgttgttctt attatttaaa 1920 

tggagaatgc gcagtttgct gaaagagaag gagaggttag aggcccttgt cggtcaacgt 1980 
accaaacagt tggtccacca aaagaatgaa attgaggaaa agtccttaaa attggaaaaa 



2040 



631 



gcattgaagg agctgggcca ggcacaagat gaattggttc gccaagaaaa aatggctact 2100 

gtaggtaaac tgactcaagg gttgattgac cgtattctga atccgttgaa ttacattaat 2160 

aatttttccc acttgacctc aggattactg aaggacctat atcagaatct ggagagtgta 2220 

aaagaacttt tggatgaaga tacctacttg gattcggtag atgtcatcaa catgatgaga 22 80 

gataatctgg aaaagattga ggaacatgga agtaacacca ctcgtgtact gaaggccatg 2340 

gaagaaatcc tgagagaccg taacaggcaa ctggaaaaaa cagaattaat cggtttatgc 2 400 

cggaaagaca tggagctatt gagcagttat tatcagaagg agattacggc tatgcacatt 2460 

gcggttcgta cttccttgcc cgataatccg ttgtttattg acgggaatgc cgaacaactg 2520 

ggaaaaacca ttatgagtct tcttaacaat ggtatgtatg ccatagccaa aaaatacggt 2 5 80 

aagaaagcct atccggcaga gattggcctg gcattggaaa gtaaggacgg acaggcagtc 2 640 

attcgcctat atgacaatgg agtggggatc gagcagagta ttctggataa aatcttcgat 2700 

ccgttcttta ctaccaaaac aaccggagag gcagcaggta tcggtttgta tctgagtaag 2760 

gaaatcatat taaat caeca tgggcagata geagtaegtt ctgaaaaaga tgaattgaca 2 82 0 

gaattcacta tcactctgcc attgtgggag gaaaagtcag tataa 2 865 



<210> 1583 
<211> 1032 
<212> DNA 
<213> B.fragilis 



<400> 1583 

tctatgatac ctaaagtttc agttatagta ccaatatata atgtagaaaa atatctagac 60 
caatgtgtac aggcacttct tgcacaaaca ctatcagata tagaaattat tctaattgat 12 0 
gatgagtctc ctgacaattg tccaaagata tgtgatgatt atgccgctca ataccctaac 180 
ataaaggtta ttcataaaaa aaatgcaggg ctgggtatgg cttgtaacag tggcttagat 240 
gtagctaegg gagagtatgt ggcattttgc gactctgatg attatgtaga ttctgacatg 
tatatgacca tgtataatgt ageccaaaaa tatacctgtg atgctgtttt tacaggctta 
aaacgaataa caatggctgg catccctaca ggaacagtga ctcatcaaaa agaatttaaa 
ctatacaaaa ataaaaatga aattcatacg cttctaaaag atttaatagc ttcagatcct 
tatgeacgeg aggagegege tattcaagta teegctaagg tagtcctcta ccgtcgtaat 
ttgatagaaa aaaaacatct aegattegta teagaaegta tattaccttc cgaagacttg 
atattcaatg tagatgtatt ggctaatagt aatattgtat gtgtactacc acaaaccttc 
tataactatc ggacaaatcc gatctcaatt tcgcacacaa taaaaaaaga taaattcagc 
ctttttaaac aattatatat agagataacc gaccgttgcc atcgattagg agtggaagac 
aatgtacaac tacggataca aagaatgttc cteggttaca cacgcaacta tatatgeaac 
atactcaatt ctagcataac aaacattgag aagaaacaaa ttacttcttc aatatgtaaa 
gaeggtattt ggaaacccat ttggaaaaca tatcctctgt cagtaatgee tttaccacat 
agaatattta cattegctat gcgtcataat ttctattcat tactgttagt attagcaaaa 
atcaagaaat aa 



300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1032 



<210> 1584 
<211> 231 
<212> DNA 
<213> B.fragilis 



<400> 1584 

aaaaaggaac eggaagcttt tatacaaatt aattatgcag attcctattt aagattcagc 60 

ctgatctgtt atggtccagg aaataaaggc caacagaata tagecaaaca aaatagcaac 12 0 

cggtcagccc ggattaagac agaaaagaga tcagcacccc tgccgctacg gcagacccga 180 

tcactcccga gatattgetg gecatgeaat attgeaatae atgattttta g 231 



<210> 1585 
<211> 432 
<212> DNA 
<213> B.fragilis 



<400> 1585 

geaatgaegt tacttcagtt tcgccctgtc gatgggaggt tagatgaaat tcttagtatg 
ctgatgeata gtcgtgaagt tgcttcacat ccttccctgt ettatgecat ccgtttggtc 



60 
120 



632 



agtgaggaga tcattgtcaa tattctcaat tacgcttatc cgcagcaggc agaaggttat 

ctgactctgt gcttatggga tgaagacgga gagattactt tggagtttat agatggtggt 

attccattca atcctttgga taaagccgat cctgatattt cattaccgtt agaacaacgt 

gaaataggag gattgggcat tttcctggta cgtgaaatga tggatgacgt agcgtatacg 

tacgtgaata aggagaatcg gttgaccatt aaaaagaaat atctgcaacc cactgatgaa 
cctgtatcat ga 



<210> 1586 
<211> 1551 
<212> DNA 
<213> B. fragilis 



<400> 1586 

tatcatatgc caagcgattc tcaaaataac aaacgcatag cccaaaatac actgttattg 
tattttcgaa tgctcttttt aatgctggtt agcttatata ccagccgtgt caacctaaat 
gcattaggta tagaggattt tggtatatac aacgtagtag gcggccttgt cgctatgttc 180 
tccattatat cgggctctct ggtgtcatcc atcagcagat ttatcacctt tgaacttgga 240 
acagaaaata aagaaaaact aaaaaaagta ttttcaacag ctgtttctat acagtttttt 
ctggttatca tcgtggttat tcttgccgaa accattggac tttggtttct aaataacaaa 
atggtaatac ccgaagaacg tatacttgcc gccaatatca tttatcagtt ctctattata 
tcttttgcac tttcgttaat gagtattccc tataccggaa caatagtagc acacgaaaaa 
atgtcagctt ttgcctatat cagtattttt gatgtcatag ggaaactggc tgtcgctcta 
accatttcta tagctccgat agataaacta atttggttcg caggcttcat tgtattcaat 
tctaccatca tacaaagtat atatattttt tattgtaaac gccattttga ggaatgtact 
taccatttca tttttgataa gtctttactc aaaaatatgt tcggtttcgc cggttggaat 72 0 
tttatcggct ctatagcagc tattctccgt gaccaaggcg gaaacattgt catcaatatg 700 
ttttgtggac cagccgtaaa tgcagctcgt ggagttgcca tgcaagtcaa caatgcagtc 
agtggttttg tctccaactt tcagacagca ctcaatccac aaattacaaa aagttatgct 
tctggcaact atgattacat gatgcaactc atctttcaag gtgcacgact ttcttattac 
attttactta tccttgcctt acctatcatc agtaataccc attttattct tcaattatgg 
ttggggcaag taccaaaaca tactgtacta ttcgtacagc ttgtcctatt cttcactatg 
agtgaatctt tagccaatcc tctaataaac gctatgctgg ctactgggaa gatcaaaaaa 1140 
tttcaaatta tagtcggtgg acttaatctg gtcaatttac ctttatctta tatctgcctt 1200 
cgtttaggat gtattccaga atcagttgtg ataatagcta ttatcatatc tatgatatgc 12 60 
gaaatggccc gtgttattat gttacgtaat atgatacact ttcctgctcg ctccttcctt 132 0 
aaaaaggtat atttcaatgt aatatttgtc accatcacag cctctatact ccctctgtac 13 80 
ctacacttta tacttgaaga aaatatttat acttttactc tcataagtgt agtttcattt 1440 
tcatgtacac ttctctctat tttatatatt ggttgcagta gtgaagagcg tgtcatggta 1500 
tttagtaaag ttaaggtaat agtgaacaaa gtatcaaaac gctataaata a 1551 



60 
120 



300 
360 
420 
480 
540 
600 
660 



780 

840 

900 

960 

1020 

1080 



<210> 1587 
<211> 453 
<212> DNA 
<213> B. fragilis 



<400> 1587 

tcacaattca cacacactaa tcacatgacg attgctgttg attttgatgg taccatcgta 60 

gaacaccgct atccgaagat cggagaagaa attccattcg caacagaaac cctgaaaata 12 0 

ttggctcagg agcgacataa gcttatctta tggaccgtac gcgaaggaga attgcttgaa 180 

gaagcgattg aatggtgccg ccaacgggga gtctttttct attctgtcaa caaggactat 240 

ccggaagaag aaaagagtca taacggattc tcccgtaaac tgaaagcaga cctgtttatt 3 00 

gatgaccgga acctgggagg tttgcctgac tggggaacca tctaccagat gatccatgaa 3 60 

caaaagccat acgaacctgt tctatgtgac aggcagaaac cgaccggcga tttaagctgg 420 

atagagaaac tgctcggcaa acgtaacaaa taa 453 



<210> 1588 
<211> 1065 
<212> DNA 
<213> B. fragilis 



633 



<400> 1588 

agaaagaggt tgacaatgaa caatcatgta gtaattatgg ccggtggcat aggaagtcga 60 

ttttggccca tgagtacacc ggaatgtccc aaacaattca tagatatatt gggatgtgga 12 0 

aaaacactga ttcagctaac tgtagagaga ttcggtaatg tttgtccaca ggagaacatg 180 

tgggtggtca cttcggaaaa gtatatagat actattcggg agcaactgcc gggtatcccg 240 

360 
420 
480 
540 
600 
660 



780 
840 
900 
960 
1020 



gaaagtaata tactggcaga accctgtccc agaaatacag ctccctgcat tgcgtatgcc 

tgctggaaaa taaaaaagaa atatccggaa gccaacattg tcgtgactcc ttccgatcaa 

gtggtaatcg ataccactga atttcgcagg gtgattgaga aagcgctttt gttcactgat 

aaaagcagtg ctatcatcac attgggaata aaacccgccc gtccggaaac cggatatgga 

tatattgccg caggtgaacc gataacgaga gacaaagaaa tattccacgt agaagcattc 

aaggaaaagc ctgataaaga aactgctgaa aaatatctgg cagcaggcaa ctacttctgg 

aatgcaggaa tattcgtttg gaatgtgaga acgatcacag ccgtaatgcg agtatatgca 

ccggggatag ctcagatttt cgaccggata tatcccgact tttatacaga acgcgaggaa 72 0 

gaaagcgtga agaagctatt ccccactgcc gaaagtatct cgatagatta tgcagtgatg 

gaaaaagcgg aagagattta tgtattacct gcccaaatgg ggtggtcgga cttaggtacc 

tggggagcat tacacacctt gttgccaaaa gataaagaag gaaatgcaac agtaggaccg 

gatatccgga tgtatgaaag tcaaaactgc atggtgcacg cctcacagga aaaacgagta 
gtcatacaag ggctgaacga ttacatcata gccgaaaaag acaatatatt attaatatgc 
cagttatcag aagagcaacg aattaaagat ttctcaaaag aataa 10 65 

<210> 1589 
<211> 1110 
<212> DNA 
<213> B.fragilis 

<400> 1589 

tataacacaa ccatgacaaa tatattggga ttgaaacaaa acagatggat cgttggggca 60 
gtactgctgc tgacaacctg taacttacag gcacaggaac agccaaacgc cagggaaaag 12 0 
tttgaaagag gaaattggtt tgtatccggt gccctgaatg gagaatggct gaccaagaca 180 
atcggtaacg tatatgcagg cggcaaaatc tccggaggtg tatatctgac gcctctttcg 240 
ggattcagag ccactgcaga gatcggtaag aactggatag ggaatgacac agaagccacg 3 00 
caactcagcg caaacctgga ttatatgttg actctgatag gaaacaatgg attcaaaaga 3 60 
tttaatctgg cggctattct gggtgccggt ttcaactatt atgactttgg ggacaatgat 42 0 

480 
540 
600 
660 



840 
900 
960 



ccgaaatata caagggtcaa cactatttcg ggtaacttct cgattcaggc ttcgtacaat 
gtaaacagaa aattcagcat atttatagaa ccgggattaa aggttttacc caaatactac 
agcaaagaac tgaacaacaa aatttatatg caaagcaacc tcacaatagg acttgcatat 
actttcagag ataaatatcg gaagagcgtg gacaacagca tccatcccct ctatctaccg 
gaagcggatc tgctggagat aaaggagaaa atagggatgc tctgtgaaga ggtaatgcaa 72 0 
atgaagcagg aattaaagga acgccggaaa ataacagacg ggcaaaacct gatgattgtt 7 80 
ccgcaaaagg atgcactctc catcgatatt atgttcgacg aattcagctc gttcgttagc 
gaagagcagg gacagaaaat agacggaatt ggcgaatgga tgaagaataa taacgcaagt 
atccggatca ttgctttcag tgataacctg accgacaaaa aagcggatca ggagctgcgt 
aaacgtcgtt cggaagccat ccggaagata ttgatagaga aatatcacat ctctcccgaa 102 0 
cgcatttcgg aatcgacacc ggaagcgatg ggatatgaaa acaaaacggg atgtaatgca 1080 
atgattgtat acattcctga aaacaaataa 1110 

<210> 1590 
<211> 1752 
<212> DNA 
<213> B. fragilis 

<400> 1590 

aggcgtgaaa caacgtacaa attaaaagaa acaacattca aatacatgaa aaagagcata 60 

ctattcacat ttgtgctttg cctcctgtca caatggagtg tagcacaaga ccctaagtgg 12 0 

gttgagaaag cgaaacgatc ggttttttcc atcgttacgt atgataaaga cgacaaaatt 180 

cagaataccg gtaacggatt ctttgtaacc gaagacggag tggctctgtc tgactattca 240 

ctgtttaaag gtgcccagcg tgctgttatc attaactcag aaggtgaaaa gatgcctgtt 3 00 

gagtgtattc tgggagccaa tgatatgtat gatatcatta agtttcgtgt gggcattaca 3 60 



634 



gtgaagaaag taccggcttt acaagtggct gctcttgctc cggctgtggg tgctgaagtt 42 0 



<400> 1592 

tgtatctttg caccgcctaa aacaaaaact gtaaaacaaa tgaaacagaa tttttttcat 



cggtatcttt ctgcgaaggt actgcccatt tggactattc tattgattga tatatttatt 

atcgtcgcat cctgcttgct cgcttactcg cttcgttacg attttcgcag cattttcttg 

gattcgtcga caatagataa gaccattctt tggacggtag tggctaactt aatcttcttc 

cgggtattcc gtacctattc aaatgtgctt cgcttttctt cgtttgtaga cattatgcgc 



840 

900 

960 

1020 

1080 



tatttattgc cttattccac tcagaaaggc ggtaacgtga cccgtggtaa agtgaaaaag 480 

gtagataata tcggtggtga taaatatcat tactatacgc tggacatggt attgaaagat 540 

aaaatggtca gttgtccggt cactacggcg gacggtaaag tgtttggagt ggcacagaag 600 

tcctcgggac aagatactgc ctctatcagc tatgcggccg gtgcggcttt cgccatgtct 660 

caaaatatca gtgcacttgc gctgagcgac cctgccttga atgctatcgg cataaagaag 72 0 

gggttgcccg aagatgaaga tcaggctctg gtctatctct ttattgcttc aacacaatcc 780 
acacctgagg cttatgccat cgcccttgac gattttataa agactttccc gaacagtgcc 
gacggttatc ttcgccgtgc cggaaactat gtttttgcag acaaggatga aaaccacatg 
gataaagcgg ctgccgacct ggaacatgcg ctgaaggttg cacagaagaa agacgatacc 
tattataaca tagccaagct gatatacaat tatcaattga gtaaacccga aactgtttat 
aaagactgga cgtatgataa ggctctggag aatgtacgga gtgcgattgc cattcagagt 

ttacccgtct accagcagtt ggaaggagat attctttttg ccaagcagga ttatgcaggt 1140 

gcatttgcca gctatgacaa agtgaatcag accgaactgg catctcctgc ctctttcttc 1200 

agcgctgcca aagcgaagga gttgagtaaa gctgctcctg aagaagtaat tgctttgttg 12 60 

gacagttgta tagcccgttg ccagacacct ataacttccg atttggctcc ttacctgttg 1320 

gaacgtgccc agatgtatat gaatgtagag aagtatcgtt tggcactggc agattatgat 13 8 0 

gcctatttca atgcagtgaa aggtagtgtc aatgacctgt tctattatta ccgtgagcag 1440 

gctgccttca aggctaagca gttccagcgt gcgttggatg atatagcaaa ggcgatcgaa 1500 

cttaatccgg aagatctcac ttatcgtgca gaacaggctg tggtaaatct ccgtgtaggc 1560 

cgttacgaag aggctgagaa agtattgaaa gacgcattag ctatcgatcc gaaatatgct 162 0 

gaagggtatc gtttgctggg aatctgccag attcagttaa agcaagagaa agcggcttgc 1680 

acaagctttg ccaaggcaaa agagcttgga gaccccaatg tagacgaact gattaaaaaa 1740 

cattgtaaat aa X/D * 

<210> 1591 

<211> 318 

<212> DNA 

<213> B.fragilis 

<400> 1591 

aataaaagaa cgatggaaaa gtatgaaatt cattttgtag gttccgtctt ggattcaaat 60 

acaagcggtg acgaacaagc aaaaattgta gctcttatcg agcaaggaca ttctgttgca 12 0 

ttagatctca gcggctgttc ttatgtatcc agtgccggat tgagagtcat gctttatgcc 180 

tttaagctgg cgaaagctaa aagtagagat gtttgccttg tcggtgtgtc acaagaggtt 2 40 

aaagacgtga tgcacatgac cggattcgat aaattctttc gtttttatca gactctcgat 
gaattatcac aaccctaa 

<210> 1592 
<211> 1944 
<212> DNA 
<213> B.fragilis 



300 
318 



60 
120 
180 
240 
300 

atatttgtgt cgcttacggt ctcctatggg gtattgatga ttctgagtct tctgctggat 360 
gcttacctgg gcattcggat tggtgccatc agtgtactgt ttatggcata tgtgatcaat 
tttgctatga tggcctgttc gcgtattgtg gtcaaaatgt tcttcgaggt actcaatttt 



420 
480 



gacggtagcc acacgaccaa cgtctttatt tatggtgcta aagaagccgg agtaaacatc 540 



600 



gccaaatccc tgcgtgtcaa tttgcgtaat cattatcgtc ttcgtggttt cattgccgat 
gaacccgaac tgattggtaa ggtgatgatg ggggcgaaag ttttcccgaa tgatgaagca 6 60 
ttgattgaaa acatgaatga ccgtgatgtg cataccatca ttgtttctcc ggctaagatg 72 0 
gaaaagctga agaaatcaga tatgattgat actctgcttt ccaataacgt gaagttgctt 
actgctcccc ctttgagtga atggggtggg caggcactga ataaaactca gttgaaggaa 
atacagattg aggacctttt gcaacgcgaa ccgattgagg tggacatcca taagatagct 



780 
840 
900 



635 



tctcacctgg 
attatgcgac 
actccgttgc 
acgattgtag 
cctcagtata 
tcggagtcta 
ttcggttcag 
atgggttgtt 
aaggaaggca 
aacggttcgg 
acccatcccg 
gaagcaggaa 
aagattgtcg 
gagtttaccg 
ctgaccaaac 
gacgaagtga 
aagatcgtag 
gaggcgttgg 



aaggtaaacg 
aggtcgcttc 
atgacattcg 
ctgatatttc 
tattccatgc 
ttcagattaa 
agaaattcgt 
caaagcgtat 
cacgttcggt 
ttattccccg 
aaataatccg 
gtatgggtaa 
atctggcaaa 
gtttgcgtca 
ctacttatca 
aagagagaat 
ctgccatgaa 
ataagaaaga 



tgtaatgatt 
tttcaatcca 
tctggaattg 
gaacgcaact 
agctgcctat 
tgtatcgggt 
aatgatttct 
ttgtgaaatc 
acagttcatc 
tttccgcgat 
ttatttcatg 
tggtggggaa 
acggatgatc 
tggtgagaaa 
cgaaaagatt 
ccagaaatta 
agatattgtc 
ctaa 



acaggagccg 
tataagttga 
caggatcgtt 
cgtatggagg 
aagcatgtgc 
acacgtacgc 
acggataaag 
tatgtacagt 
accactcgtt 
cagattcagc 
accattccgg 
atctacatct 
agtctttcgg 
ctgtacgaag 
atgattgcca 
attgatgtaa 
cctgagtttg 



ccggttctat 
tcctgattga 
ggcgtgacat 
ccattttccg 
cgatgatgga 
ttgccgatct 
ccgttaatcc 
ccttagccaa 
tcggtaatgt 
gtggaggacc 
aggcttgtcg 
tcgatatggg 
gacgtacgga 
agttgctcaa 
ctgtgcgtga 
gctataccta 
tcagcaagaa 



tggtagcgaa 
tcaggctgaa 
tgatgctgaa 
tgaatacaaa 
agacaatgtt 
ggctgttaaa 
gacaaatgtt 
gaaattacag 
attaggatct 
tgtcacagtg 
tctggtactg 
taaaccggtt 
tgtgaaaatt 
tgtgaaggag 
gtatgattat 
tgatcagatg 
ttcttgtttt 



960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1944 



<210> 1593 
<211> 1152 
<212> DNA 
<213> B.fragilis 



<400> 1593 

actcttcatg 

ttcctacaag 

tgtgctctta 

tctattatca 

attacccaga 

agcggtgagt 

aaaccaaacg 

tactctattc 

caacaagctg 

aaccgtcaga 

caactccttc 

taccataaat 

tgctgttcca 

tacgactttt 

gcacagatag 

ctcaatgaac 

caagcaattc 

gattatgaaa 

actgcaagaa 

agaatgctat 



caacgtctac 
atgcgtttga 
atattccttt 
taccagttta 
catttactga 
tatgtgaaaa 
gaggcgttgc 
atgtagatcc 
ttaaagaaca 
tacacaattg 
agcaggaacg 
accaactaca 
ttctgctaca 
atacaaacga 
acttctgcag 
taaaaggtat 
gttctctttt 
aatccaacta 
gaagaatgtt 
aa 



atcatttttt 
tacattactt 
caatcttaaa 
tatggcagaa 
ttgggaactc 
atatgcttcc 
ttccgcacgt 
agatgattgg 
agctgacatg 
tcaaaagcca 
ccatggcagt 
tttcccggaa 
tggatgtaaa 
taatagtatg 
acttatgcag 
aacccttatc 
tccagaaatc 
ctatggccta 
tgtagccaaa 



tactatacca 
ggtaaaaagc 
gcattaatct 
tcctatctac 
cttttggtgg 
gcctacaagt 
gaatttggta 
attgacagta 
gtcatatgcg 
caattattag 
ttatgtaaca 
aaaatgatat 
ttagcttatg 
gtacgccaca 
gccaaaatat 
acggcctttc 
aatgattggt 
acattagttc 
ttcctagtac 



accaatcagt 
aaacattaca 
ttatgagttc 
accgatgtgt 
atgacggaag 
cccgaataaa 
tgcaacaggc 
atacactaga 
actttatgat 
attccagcag 
agcttatacg 
gttgggaaga 
taccccacgc 
ctgatatgcg 
caccagagta 
gtaaccaatt 
atgtaactag 
ttcgtggcta 
aaataaaaaa 



tactctcgta 
cgatggccca 
gcctaaaata 
agatagtatc 
tcccgatcat 
agctttccac 
gcgaggcgaa 
agaactctac 
ggaataccct 
tttcatgcac 
taccgaactt 
tctttatatc 
attatatcac 
tggtcttcag 
tctgcctgaa 
gcttaatgaa 
atatggacat 
caatttcaaa 
taaaattaca 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1152 



<210> 1594 
<211> 1650 
<212> DNA 
<213> B.fragilis 



<400> 1594 
acaacgaatg 
ctgctgtgta 
aaggatttta 
gatcttgccg 
ctttatgttt 
acgccgcacg 
aatgatcgca 



ccatgaaact 
caggctttgc 
atctctacac 
gaatgataca 
ccaaattatt 
ggttgagcaa 
atcaggtgtt 



ccgtacgata 
catgttttct 
gcttgttccg 
ggggatcaat 
ttcttatctg 
gcagatgaat 
gtattgcagc 



gtgaaaatag 
ttcttcaggt 
ggttcggcca 
gagttgagtt 
aagctccatc 
aaagtgttgt 
ttgggaaacg 



ccgtaatctc 
tgtcggcggc 
ctgtagtcct 
gcagtaaaga 
tgtatacgtt 
tgagtttcca 
gcgattacga 



gtctgttgta 
ggaaggacgc 
cgagacggat 
ccgacatttt 
gttggaagat 
tgaacccgac 
gttggtggaa 



60 

120 

180 

240 

300 

360 

420 



636 



480 
540 
600 
660 



aagtttatac gtaaatactg ttcgagtagt ttcccgtcaa agctgtttga ctataaaggg 
gaagagattc gtatttatcc gatgccggat gacagtttcc tggcttgtta tttcacttcc 
gactttctgg ttgtgagtta tcagaaaaaa ctgattgagc aagtgatcga tgcccgtttg 
tccaagaaat ctttgctgac cgatgcctcg tttgccaaag tacatgagga taaacgtgcc 

cgtgtggcag ctactattta tgcccgtatg caaccgttga gtatgggaaa ggctaccgac 72 0 

gggattcgtt cgtgtacaca gttgggcgga tggaccgaat ttgatatgaa aatgaatgga 7 80 
gatgccattt atttctccgg tgtcagccat gatacggata cttgtctgac atttatgaat 
gtgttgcgcc aacaacaacc ggtggaggat ttcccgggag atatcctgcc ggcttctact 
tttttcttca ataaacggtc ggtaaccgat atgcaggcta tgcttgattt cactgccagg 

caggagtata cgacatcgac ttattccgac tatatcagag atcgggatgg ggagttgctc 1020 

gcatatctta aagagaatgc agggggggag attgtgactt gtctgtttca ttcgacagat 1080 

actctttcaa atccttgtgc ggtgatgagt attccattga gggatgggca gcaggctgag 1140 

cgtgtgttgc agggaatgct tcgcactgct ccgaaggagg tagacggtcc tccgaaacca 1200 

cgtactactt tctgtaaaac tcctttaagg gcttatacgc tctatgtact gcctcgtaat 1260 

acgttgttta cgcaattgac cgggataaca gagtcggctt tatacattta cgcttgcttc 1320 

tacgagggaa gactggttct ggccccggat gtggaaagcc ttaccgcata tctccgtcat 13 80 

ctggataaaa aagaaattct ggatgatact cccggatatg aagaggcggt ggtcaatctt 1440 

tctccgtcgt acaactttat gatggttgct gatttgggag agactttctc acaaccggag 1500 

aattatgtga ggctgatacc tgcatttttc tttcgtaatc aggagttttt ccgccacttt 1560 

attctttctg cacaatttac ttgtacggac ggaattgtat atcccaatgt ggtattgatt 1620 
tacaaaggtg agtctgatga catttcctga 



840 
900 
960 



1650 



<210> 1595 
<211> 204 
<212> DNA 
<213> B.fragilis 



<400> 1595 

atctttcatt ttaaggttga gattggtgac ttggacgaga tagtctttaa agaaattttc 
ggattggatc ttgaatcaca agttgtagaa tacgggcata aaaaatgttc ctccgaccac 
gcctggccgg aggaacacac acaaacaaaa caaataaaag cagtcacgat tcgcatcggg 
acgcccctac acccggtatt ataa 



<210> 1596 
<211> 1473 
<212> DNA 
<213> B.fragilis 



<220> 

<221> unsure 
<222> (145) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 1596 

aaaaaacatt atcgtctggt tatcagtatg tttctgtcag gagggcggac gctgtcatgg 60 

cgagtcaggg gccaaaagtc caagggccac tctatgctct cagaggaggg ggagagtgaa 12 0 

gaggatatgg aggatgtgtt gttcngaaac cctgaacaat ttcgtcgggt ggctccttac 180 

tttcccgaag aggtgtttga gcatttgcct gatctgcttg cgcagggggt gaaggcggcc 240 

ggcaattatc gggaacgcga tatgttgctg atggcgatga tcaccaatat cagcgcctgt 3 00 

ctgcccgagg tgcgtgtatt gtacgatcag gtgtattact cgccgcatct gtattacatg 360 

gtgatagccc atgcgggagg cggaaagggg gtggtgtctc tggccggctt gttgcccgga 42 0 

gagattcacc gctattatga gaagcagaac gaggagatgc gcctggtgta tgataaggcc 480 

ttttttgagt gggagctgga gttgaaaaag gcgcaggcgg aaaagcgttc gccggatttt 540 

tcgctacgtc ccaaagagcc tgtccgtaag ctgttgacgc tttctcccaa tgtgtcgaag 600 

agtatgctga tcagtgcatt ggaggagagc ggcaagctgg ggtgctgcat caatgctacg 660 

gagctggaca tggtgtcggg agctatccgg aatgattacg gaaagcacga tgatgtgttt 72 0 

cgggcggctt ttcaacacga agtggtttcg gccgatttca aggtgaacgg ccgtcaggtg 780 

gtggcccata atccgcattt ggcgttgtgc cttgccggca ctcccaacca gctggtgcgt 840 

ttcattcctt cgttggagaa cggtttgtac agccgtttcc tggtgtatac gggccagagc 900 



637 



gattggtgtt ggcgttcggc tgccccgcgg gagggaggtg aagatcatcg ggcgatgttt 960 

gcccgcctga gcggccggtt gttggagttg caccagttct tgcttcagtc tcccacggag 102 0 

gttacgttta cggctgcaca atgggaggaa catacgtctc gtttctcctc acacttgtcc 1080 

gaggtggtga acgagcggga tgattcgccg ggagcgatcg tgcttcgtca cgggctgatg 1140 

gcgagccgga tagcaggggt gctcactgcc cttcgcaagg gagagtgcgc ctgggcgatg 1200 

ccgcagtatg tatgttcgga cgaggatttc cataccgcta tgttgatgac ggatgtgttg 1260 

ttggagcata gcttgctgct gtccaccagt gtgcggaaga gcgagagtaa gtccgggccc 1320 

ctgaagcctt atttcaggtt gcgtccggtg ttacagactt tttcaggaac ttttacttta 1380 

tatgatgcga tggatcgtgc tgtagagatg gggatctcgg tacctacttt taactgctta 1440 

ttcatgagga ctatagagct taaaaataat tga 1473 



<210> 1597 
<211> 1380 
<212> DNA 
<213> B.fragilis 



<400> 1597 

aacatgaatt atcaggagac tttagactac ctatacaaca gcgttcccat gtttcagcag 60 

gtaggaagca gcgcgtacaa agagggactg gaaaacacct atgcattaga tgaatatttg 12 0 

ggacatcccc acacagcatt tcaaagcatt cacattgcag ggaccaacgg aaaaggttcc 180 

tgctcacata cattggctgc cattctgcaa tcggccggat atagagtcgg gctctacact 240 

tcgccccatt tagtagattt tcgagagcgt atccgcatca atggcgagcc catcccgcaa 3 00 

gaatacgtca tccgctttgt ggaagaccat cgtctgttct ttgaacctct gcatccttct 360 

ttctttgaat tgaccaccgc aatggcattc cgatactttg ccgatcaaca tatagacgtg 42 0 

gctgtcattg aagtcggatt gggagggcga ctggactgta ccaacatcat ccgtccggac 480 

ttaagcatca tcaccaacat cagtttcgat cacatgcagt ttttaggtaa tacgctggcc 540 

caaatagcaa cggagaaagc gggtatcatc aaaaggggca taccggtcgt tgtcggtgag 600 

acgacagaag agaccaagcc tgtattttat cggaaagcac aagagatgga ggcacctgtc 660 

acctttgcag aagaagaaca acgcctaaaa ggagcaacta aattcaaaac tcgcgcggat 72 0 

cgaaaaccgg gacagtatac agatcatcag ccgaataccg cagcagaaaa agatacagag 780 
tatattaccg gctggatcta tgaaaatgat gtatatcccg ggttggaagg agtattggga 
ggctcgtacc aactaaaaaa tacaaatacg ctactgtcgg ctctccctgt tctgaaaagc 
ctgggctata agatagaaga tcatgacgtg agaaacggat tcctgcaagt agataaaatg 
accggtttgc aaggacgctg gcagaaattg agtgactctc ccactgtcat ttgcgatacg 
ggacacaatg tagcagggat ttcttatatc gtggagcaac tgaaacaaat gaaatacaat 

tgcctgcaca tggtcatcgg catggtgaac gacaaagacg taagtggaat attgtccata 1140 

cttcctgaaa acgcagtgta ctatttcaca aaagcgagcg tcaaaagagc attaccggag 12 00 

gctcagttac aacagatcgg agcttctgcc ggactgcaag gaaaggctta tcccgatgtt 1260 

cagtctgccg taaaggccgc acaagaaaaa agcctcccgg aagacctgat cttcgtagga 132 0 

ggcagtagct ttattgtagc cgacttatta tcgtgccgcg atgcactcga tctcgactaa 1380 



840 

900 

960 

1020 

1080 



<210> 1598 
<211> 267 
<212> DNA 
<213> B.fragilis 



<400> 1598 

aatacttctc tgagaaataa gatgataata gcgagtggtg aacaagcttc acagtttgtt 60 

caccactctt ttttaatcca cagttctctt ttaattcaaa ttttattgag atggttcaaa 12 0 

ggttataatt gtcctacgaa aatttctggt aagccgctgc taatttttta tcatattgat 180 

tctgctcata agcaggacca ttataacgtt tggcaaactc tgcccagtct ttggcttgca 240 

aagcggaaag cattccggat tgtttga 2 67 



<210> 1599 
<211> 1065 
<212> DNA 
<213> B.fragilis 



<400> 1599 



638 



acaaagtatc 
gttcgttctt 
catgccgatt 
ttttctgtat 
ttattatcag 
cagtgtgcat 
ttatcgataa 
tgttatttag 
aaaagaacat 
ttttccaatg 
tatcttttag 
aagaatctcc 
tgctactttg 
atagcaggtg 
atcaattgga 
tactttatga 
ttggtaatca 
cgtctctata 



aaaacgctat 
ccaatctgga 
ttcaggctct 
ttcgaattat 
gctggtttgg 
tctttttaat 
gtggtattaa 
gtatgtttat 
ttagtacagt 
gggcccccta 
cgcgctacgt 
ttatgtacat 
acaaagttgg 
cactttatct 
ttgcggcgtc 
aaccacacat 
cgggtctatt 
tatggaacca 



aaataatatg 
gtcgttacgt 
tggtatgcct 
tatagaagca 
cataaacttc 
tggtatctat 
aagatgtttg 
tatggctccc 
gctgctctct 
ttttgaaaaa 
tcgtatctat 
aagtttgtct 
atatttttgt 
gctccttttc 
ctgttttgct 
ccaacaacta 
attaacattc 
ttttgtatct 



tccttagtca 
ttgctttcaa 
gacaggacag 
tttgctatcg 
cacattaaaa 
acttttacaa 
atgctcacta 
atcctcaatg 
ttttttatct 
ggatattctg 
caaccatctt 
acttttacag 
atgttttgga 
tttaataggt 
gtttatctat 
gcagttacct 
tttacagcag 
aaatatatca 



ataacttcac 
tgtttttcgt 
aaatgacaat 
tctgtgtcaa 
gtctatgcaa 
tattgattgg 
ataatgtttg 
ccttcgttga 
tccagacaat 
ccttctcttt 
atacacagtg 
ccctaatgct 
cttatacgag 
tcgcttttca 
tccatttctt 
acaatggtat 
ctattttaat 
gttaa 



agttaaaaaa 
tttggtagta 
tcttccttta 
ctctttcgtc 
tctgcttttt 
cattgaacca 
gtttgtgaag 
aaaaactgac 
atatggctgg 
tatggggctg 
gtcaaaaagc 
aatcattaca 
cccattagtg 
aaacaaagga 
catatgggaa 
atcatgcctg 
agataagata 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1065 



<210> 1600 
<211> 1266 
<212> DNA 
<213> B.fragilis 



<400> 1600 

atgtgtctta 

ctctcctcct 

gaaaatatcc 

tacagaggaa 

aacggcaacc 

ctgccggata 

gaacggatgg 

ggggtgatca 

gaaaactcca 

atggacttgt 

atgaagggcg 

ttccggaata 

gcacaggctg 

ggcacaaccg 

atcaatgcag 

caggccggaa 

ggagagtatg 

cattttgcca 

ccggctgagt 

aaaatgggat 

ccggaatata 

ttttaa 



atccattaag 
ctatcactgt 
ggaccattga 
cctaccgcgg 
tggaactggt 
cgttgatagc 
agatgagtta 
accgctcggc 
ctttcgataa 
ggaaaggagc 
aatacaaaaa 
attttctggc 
aagtgaaata 
gatactcggc 
ccgtgaaagg 
gatatttgta 
ccgtaggagt 
tccccctgcc 
tttttgccgc 
atacctatca 
tacggcattt 



ctgtatgaag 
ccatgcacag 
aacgggagga 
agtgggaaaa 
ggcactggac 
gggctataaa 
tgataccgac 
atggaaagcg 
actgtattca 
gaaagcaacg 
gatacgtcca 
ccgcatcgtg 
tcgtaccggc 
catcacggac 
ttcgctctat 
tggagactat 
gtatgccatg 
cggaaagaaa 
cgaatatagc 
gacacgaccg 
cctgataaaa 



aaacaattga 
gtcgcccaaa 
acgactgttg 
gcaatcatcg 
ggaaacggta 
tccggtggga 
cgccctatgg 
gacatcgtcc 
tacagagtca 
gcacaagtgg 
ggggtaatga 
gcaggcaact 
aacggacggg 
gatggatggt 
gtaccgcaat 
gggctgagag 
tatgtggaag 
tggaaccgga 
atggtatcgt 
gcagaaaacc 
agtatcgaaa 



caataataat 
agctcaggga 
ccgcctttga 
caggaatgga 
ttccacaact 
tttcattgaa 
gactattgaa 
tgtatccgga 
atctgtcgcc 
tcttccccat 
cgatttcaca 
tcaccgatca 
tggagctggg 
atatcggtac 
tcaatactca 
gagactgcac 
gagaagttaa 
atcatgctgt 

ggggcgagta 

ggagcaacgg 
aagagagaaa 



cggcctgctc 
gctgggtatg 
ggacaatgta 
agggatgggc 
cagcatcagc 
agaggtgtac 
aggaagcgca 
agtgtcactg 
ggctgtagaa 
cgcaaccaac 
ggaaataagg 
tcggatcgga 
agcgcagata 
ccgtcaacga 
actcgaccta 
acgccacttc 
cggaggattt 
caggatgaag 
tgccgatagg 
attctttcag 
caaaaaacag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1266 



<210> 1601 
<211> 753 
<212> DNA 
<213> B.fragilis 



<400> 1601 
cgtacttata 
ctgaagcaca 
aacgcccatt 
ggcggagcac 
gagagcatag 



actacccatt 
ggagcgactt 
catataatac 
tgattcctga 
aacgtactgc 



ccatctcaga 
atctatattg 
agcgctgaaa 
cggggcaagc 
cggatgggat 



aacatgttga 
cccgaaggta 
gatgccgggt 
atggtattgg 
ctctttgaat 



agctcaaaac 
aattattgat 
ttgcagaagc 
cgtttagatg 
atgaaatgga 



agtttcactc 
aaacacgatt 
cttgctcaaa 
gctccgaaaa 
acggctgaac 



60 

120 

180 

240 

300 



639 



360 
420 
480 



cggaaaggag gtatttgcta ctttctggga agcagcaaaa atacacttaa attgatcaaa 
gaaaaagcta aaaccgtata tccgaatatt cgaatagaga cttactcccc accctacaaa 
ccggaattta cagaagaaga aaatcaaatg atgatagatg ccataaatgc ggtaaagccg 
gacttattat ggataggtat gacggctccc aaacaagaga aatgggcgta tacgcatctg 540 
gatgcactgg aggtgaccgg acatatagga actatcggag cggtattcga cttctttgcg 600 
ggtacggttg aacgtgctcc ggtccggtgg caagagcacg gactggaatg gctatatcgc 660 
ttgatcaaag agcctaggcg catgtggcgc cggtatatca tcgggaatgc cctgttcctg 72 0 
tggaacatca ccaaagaaaa attctctata taa 753 

<210> 1602 
<211> 813 
<212> DNA 
<213> B.fragilis 



60 



<400> 1602 

ataaaaaata attggaatat gaagactatt aaattaggtt atgaaggtga agaagctctc 

ttgctgtgtc gggagttgaa acgcaatggt tattcagtaa aggaaagccg gacttttaca 12 0 

j j j 180 

240 
300 



caagaaatga aagaggcagt tattgatttt caacagaaaa acaagttgga tgctgatgga 
atcgtgggat atcgcacttg ggaagttctg ttctttacag ggcatcccat taccgaacgt 
ttgactgaag aagattttat tcttgtggcc cggttgctcg atgtggaagt ggctgcttta 
aaagcggtac agcaagtaga aacaggaggg agaggaggat tttttgctcc cggtaagccc 360 
gctatccttt tcgaaggtca tattttctgg aatcaattga aaaagcggaa tatcaatcct 420 
gaatcgcatg tgaaggggaa tgaaaacatt ctctatccca aatgggagaa gggacattat 480 
aaaggcggta tgggtgaata cgatcgtttg gaacaagccc gtaagatcaa tcatgaagca 
gcggatgctt ctgccagctg ggggatgttc cagattatgg gtttcaacta tgcagcctgt 
ggagagaaga gtgtcgacag ctttgtaaaa gctatgtgta tgagtgaatg tcgacaattg 
gtgctgtccg cccgctttat caaacaatcc ggaatgcttt ccgctttgca agccaaagac 
tgggcagagt ttgccaaacg ttataatggt cctgcttatg agcagaatca atatgataaa 
aaattagcag cggcttacca gaaattttcg tag 

<210> 1603 

<211> 195 

<212> DNA 

<213> B.fragilis 



540 
600 
660 
720 
780 
813 



60 
120 
180 
195 



<400> 1603 

aggcgtatcc tggtattggt agaggaacag aatcctcttt cccctcgggg gaggggaatt 
gcagttacgg atgctccgtt ctatgggaat ggtacgagtg aacgtttatt taatatcggt 
ctgtgtctgc cttcgggacc tacattgaca gatgaggata tcaggagagt ggtggatacg 
atcaggaaga tgtag 

<210> 1604 
<211> 756 
<212> DNA 
<213> B.fragilis 

<400> 1604 

aatgcagaca attacatgat tgataaaagc gaaatgattt tcggcgttcg tgccgtgatt 
gaagccattc aggctggtaa agagatagac aagattttgg tgaaaaaaga cattcagagt 120 
gacttgtcaa aagagctgtt tactgctctg aaaggtacgc tgattcctgt tcagcgtgtc 
ccggtggaac gtatcaaccg tatcacccgt aagaatcatc agggggtggt tgcgttcatc 
tcttcggtaa cgtatcagaa gacggaagat ttggtgcctt tccttttcga agaaggtaag 
aatcctttct ttgtcatgct tgatggaatt acggatgtgc gtaattttgg tgctatagcc 
cgtacttgcg aatgtgccgg agtagatgcc gtcattattc ctgcaaaagg aagcgttacg 
gtcaatgcgg atgcgatgaa gacttcggcc ggtgcattgc acactttgcc ggtttgccgt 
gaacagaatc tgaaaacaac cttgcaatat ctcaaagata gcggcttccg tattgtggct 
gctaccgaaa aaggagatta tgattatacg aaggcagatt ataccggccc gatgtgtatc 
attatggggg ctgaggatac cggtgtttcc tatgataatc ttgcactatg cgacgaatgg 
gtcaagattc cgatgctggg tagcattgaa tcactcaatg tatctgtagc cgcaggtatc 720 



60 



180 
240 
300 
360 
420 
480 
540 
600 
660 



640 

ctgatttatg aaggcgtgaa acaacgtaca aattaa 7 56 



<210> 1605 
<211> 831 
<212> DNA 
<213> B.fragilis 



<400> 1605 

aacttaaaaa atcatattat ggctcagtta agttcagtaa tcggctctat attgcgtgat 60 

atcgtttcgg cacaacacga agcaaatctt tattcgttgt cgcttggcga ctcttacgga 12 0 

aaagacggaa aggcgaaaga ttttcaattg cctaatgtta tggtaagcga tatggaactg 180 

gatttgaaat atggtgtgaa aagtgcatcg gaaagtcagc aacagtttaa tatcaagtat 240 

gataagtt cc gtcagttcct taaagaactg tgcgaacaag ttgccagggt agccattagc 3 00 

agtgctgtca ccacagtgat gacttcggat atagagagaa atgaaggaga gaaacacttc 3 60 

tttgaacggc ttaaaaaaga aaacaaactt catcaggaat tctgcacttt tctgagccgt 42 0 

aatatgagaa actctttccg aaataatctc tatgatgccg tagacagtag taatggttct 480 

gtgaataacg atgttgtgat tagcagactg acagatgtcg tacgtaaaaa atttctttac 540 

gatacagatc ttgatgatct ttttgccgga gaagatggag aaaaacttcg tgataccgct 600 

gaaaagaata ttataaaagc gatggaagct attgtaaaaa agctgtcggt agatgccaac 660 

tttaaaagtc ttcattcatt tccacagctg gatgtggcca tcacggctga tgaactgatg 72 0 

aatatgcctg aagaagcgat acacagtttt aagatcaagt tcagccctcg caattattca 780 

gtcagtcaaa cggatgatga ttcgttactg gaagattttg tgatgcgata a 831 



<210> 1606 
<211> 537 
<212> DNA 
<213> B.fragilis 



<400> 1606 

aatcaaaaaa acggattgac tacaatgaaa ttaagtaaat tcttatcgag cagaacagga 60 

aaacgtttct ataacctctg ttattgctgg ggagcctgtc tggttatttt gggagccgta 12 0 

ttcaaaatcg ctcacatgcc ttatgataat ctgtttttaa tgatcggatt atttacggag 180 

gtattcatct tcttcatctc cggatttgac gaaccggcaa gagagtacaa atgggaaagg 2 40 

gtgtttccgc tattgaatga taaaaacgca aacataaatc cccatacagg agtatcggat 3 00 

acactgatga cagaaaagta catacaacag ctgaaaagac tggaaaacaa cgtgtgtaaa 3 60 

ctcaatgaaa cgtacgaagc gcaaataaag ggaatgacgg aacacgctaa gtcgttgaac 42 0 

gagatgaatt cggaggaact gaaaaaggag acagaaaaaa tggcagcata catagaatta 48 0 

ctgaacaagc aatatagtca gatgctgaat gccatgaatg taaaaaccgg gaaataa 53 7 



<210> 1607 
<211> 192 
<212> DNA 
<213> B.fragilis 



<400> 1607 

ttatggaaaa aaaacgacaa gtttaagagt ccgaacgaat tgcttaaaga gttgtccgga 
caggtgtttg ccctggtgcg tgagcttccc aaaccgcttt cgagagaaga gatgcgggag 
ttgaaacggt tgtgccgctt cctgaacaat acggtgaagg atcaggagcg gaaacaggag 
gtgagaaaat aa 



<210> 1608 
<211> 243 
<212> DNA 
<213> B.fragilis 



<220> 

<221> unsure 
<222> (145) , (184) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



641 



240 
243 



<400> 1608 

aaaacacgaa aaagacttag acgtttccct cgaaacatcc aagtcttttc ctctaaactc 60 

ttaagtcttt tcggaaaagg cttaagagtt tgtcatacaa tcattggttg gtccggtgtt 12 0 

ttcgaagggc aaaagtccaa agggnaactc tatgagtcag tttccggcaa gccccgcgga 180 

gttntgatat acatatctcc atctttatca attattttta agctctatag tcctcatgaa 
taa 

<210> 1609 
<211> 606 
<212> DNA 
<213> B.fragilis 

<400> 1609 

agagtaggaa aaaataataa gaaatatatt atggatgaga aattattacc ttactttgag 60 
aatgttaatg atggaggaga acagggcaaa tacttaaaag aatttggaaa tgaagaaacg 12 0 
caaggaggta tttgtctaca cttatcaatt acttggttat atctatggca taacagcaca 
aataaagctc cgaatacgat atggcaggaa atgaaaactc ccactttaat tcaacaaata 
gcaagcaacc aaagaagtta ccaacaatat tatccgaata ttgcagataa tgtatcttta 
gctactcgta actcccttca tgtaacaggg actaacgcag gagaaattta tcagataacg 
accaatgcac tagtcaagag taacatgctt ttgtatgtca tcaatttaga aaaagaccat 
aagccagtcg gaagacatgc cattgcagca attgcaacaa gaggacgttt ctatttgtac 
gatcctaatg ttggtgtaat gtcagtgcct atgcccaata tgaaagaact aatagaaaaa 
atcccttata tatatggtaa gcattctctc aatattagtc agacttctgt ttataacata 



180 
240 
300 
360 
420 
480 
540 
600 
606 



60 



tcctaa 

<210> 1610 
<211> 345 
<212> DNA 
<213> B.fragilis 

<400> 1610 

aagagtatga atcctatatt gaataaaatg ggcgcaaatg ccaatgaaca gaaaaaactc 

ttgatggagt gtgtgtcaat gcttgaaaag tatgtgaaca gatttccggc agaaaaggga 12 0 

tgtgcttcat tctccggaga agatatgaag ctgtggaagg aagtttattt tccgaaactt 180 

gttcagacgg atattttgtt ggacggtaaa tttttctgtg gcacgtcgtc cggtaatagt 240 

ggtattggta cagacggtta ttttaccggt tatgaatttt tccagtttat ttatcgtgcc 300 

tacaaggcac tttatgaact ggaaaaggct tcacaaatga gatga 345 

<210> 1611 

<211> 972 

<212> DNA 

<213> B.fragilis 

<400> 1611 

tactcaatga aaaagagaga gtttaaaatt tcctttttcc tccatgtgtg ggaaaggaaa 60 

gcggaagaga tttcgctgga agagtttcat aatgacctca ggggagcacg ctggaaggtg 12 0 

cttgccgagt cgtaccggcg gtggatgcgg acgggcatga cagaggaggg caaaaggctg 180 

aaaggggctc tgaatgcggt ggtcgtggcc ggcaagtgcc ggggcggaca tgcggcgaac 240 

caggtgaccg agctgaacgg actggcgctg ttcgactttg atcattgcct cgagatgctg 



300 



gccgggatga aggagaaggc cggggcgctg ccttatgtgg tgggggcttt tgtcagtatc 3 60 

tcgggtgaag ggctgaagct gattgtgcgt atcgatgccg agaatgccgg gcagtatgcg 42 0 

gtggcttatc ctgtcgttgc ccgtgagttg gagcgggtgc tggggcatcc ttgcgatatg 480 

tcgtgccgcg atctgggacg ggcgtgctac gcttcgtatg atccggaggc gtactataat 540 

cccggtgccg gggtgtttcc gtggcgggag caggtggacg ggctgttgca ggcggaaggg 600 

gagtgttccg cgcagtcggt gggcaaggct tgtccggcgg gcgttgcttc cgaagcgggg 660 

gatggcttta tgcaggtttt cctgaatgat tttgatgccc ggaatccgtt tgtggcggga 720 

gggcgccatg cgtttgtgct gaagctggga cgtgttgccc ggtataaagg tttttcgccg 780 

gaagaaatgc ggctgttgca aaaagcagtg gttgagaaat acgcgcaggc tgatttcggg 840 



642 



agcggagaaa tagaaaaaac attatcgtct ggttatcagt atgtttctgt caggagggcg 
gacgctgtca tggcgagtca ggggccaaaa gtccaagggc cactctatgc tctcagagga 
gggggagagt ga 



<210> 1612 
<211> 246 
<212> DNA 
<213> B.fragilis 



<400> 1612 

attaagatga agcggataac ggacaggaca aatttttccg tcataaatac ggtggtgaac 

tatagcgaag ctccctatgc tgtctgtccc caatgtaata tccggacggc ctgcagaaga 

tttccatctt ccatacattc cgtccggcat ctgtacccgg acaggtcgac aacccaaaag 

cacaaatatg ataaagacaa aatacaaaaa aaagctgtaa tgattcctat ttttacccat 
atataa 



<210> 1613 
<211> 1350 
<212> DNA 
<213> B.fragilis 



<400> 1613 

aaagtatcga aaaagagaga aacaaaaaac agttttaata ttttcttaat gacaaaaatg 60 
attatgaaaa agagtgactt atttaaaata ggtgtgttgc tgatggcaac gaccttggga 12 0 
acaaccggat gctctttcgg agaagacgag aagaaaccgg aaattgtagt ggatcctgcc 180 
gaaaaaacaa tagaatacta cattgcaggt aaagtgacgg aaggaacgac cgcgctgtcc 240 
ggtgtagaag tgaaagccgg tgaagtaacg gctacgacgg atgcggaagg ggcttataaa 3 00 
ctgacagtgg acagcaagaa ggtgtacacc gtgacattca gcaaagaagg gtatatgagc 
atagacaatg caacggcaac catcgcagac aatgcggcaa accgcagtat ggtgagtctg 
agtgtgaaat taagcaagaa agctccggaa aaagaagtga aggccgatgc ggaagaagaa 
gtggtggtaa ccgataaagg agacagcaat atttctcagg cagaagcagc tgtaattatt 
cctcccaaag ccatagaaac aactacaacc gtaagcgtga ctccatatga agaaccggct 
gccgtgacaa caaccgtaac accgggaaat aatgtggaga ctccggtagc gatcgcaaac 
atcgaagtgg aaacagccca agaggtcact ctggccaaac cggtaacact ggcaatcata 720 
aacaaagctt cggaacatac aacgttcgaa aatgtggaag tgtacaatca gaaaacaacc 
acaagggccg gagaaaactg gaacaaagtg gcagatgcca tttatgactc ggaaacgaac 
agctataaat tcacattgcc cgcaggcgca tcactgtccg gaaaatattc gatgcgtgtc 
aagagtagca agaccacagg aaaagaacgg ataggcgaga caaacaagga agagaaaaaa 
agcaatgaag gcaatatgac tgccattccg gaatacaaaa tcaactttga ggctacggcc 102 0 
ggatgggaat atactgtcag tccggaaaag gcgctgatga atgcaggcgt agacgctgcg 1080 
gatgcccaag gcatgggcac gacgatcaac agtgccattg aagcgcagga aggaacgacg 1140 

• • ■ ■ 1200 



360 
420 
480 
540 
600 
660 



780 
840 
900 
960 



ggaacttata aagtggctca cgaactgata gcgggtatca gcggtaacca tatcctttat 

tacctgaatc aggctaaata ttgcgaaaag acatatacat tcaaaatcag tggcggaaga 12 60 

acagtgacca tcaccctgaa attctataca ggaatgcaga ttacttacac caacgtggaa 1320 

gcaagccagc actcgggagg taagatttaa 



1350 



<210> 1614 
<211> 1212 
<212> DNA 
<213> B.fragilis 



<220> 

<221> unsure 
<222> (250) 

<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 1614 

gttaaccgga aaataattat gctgaacgga aagaaaatta tactcggtat taccggtagc 
atagctgcct ataaagcttg ttacatcata cgtggcctga tcaaacaggg agctgaggta 



60 
120 



643 



caagtcgtaa ttactcccgc cggaaaagaa tttatcactc cgataactct ctctgcgttg 180 

accggcaaac ctgtcatcag tgaattcttt gctcaacgtg acggtacgtg gaatagccat 2 40 

gtagacctgn gattgtgggc ggatgctatg ttgatagccc ctgccacggc ttctaccatc 3 00 

ggaaaaatgg cgaacggcat agccgataat atgttgatta cgacttatct ttctgctaaa 3 60 

gcgccggttt ttgttgctcc ggctatggac ctggatatgt ttgcccaccc cagtactcaa 42 0 

aagaacctgg atacgcttcg ttcgtatggc aatcatatca ttgagccggc ttcgggtgaa 480 

ttggccagtc atctggtagg aaaaggccgt atggaagaac cggagaatat aatccgggta 540 

cttgatgaat tcttttcatc aacgggcgaa ctggcgggga aaaaagtgct gatcacggcc 600 

ggaccgactt atgaaaagat tgatccggtg cgcttcatcg gcaattattc ttccggtaaa 660 

atggggtttg ccttggctga ggagtgtgcc cgtcgcggag ccgatgtggt actgattgca 72 0 

gggccggtac aacagaaaac atatcattca catattaccc gcattgatgt ggagtccgct 780 
caggacatgt atgaagcagc catggcgcaa taccccttgg tcgatgccgg aatactgtgt 
gcagcggtag cggattttac tccggacgct gttgctgaca agaagataaa acgggaagga 
gacgagttgt tgctgcatct taaacccact cacgatattg ctgctgcatt gggcaagata 

aaaactccgg gacagaagtt aatcggtttt gctcttgaaa cgaatgacga gcagcgcaat 1020 

gccgaaggaa agctgatccg gaagaacttt gatttcattg tgctgaattc gttgaatgat 1080 

gctggtgcgg gattccgtta cgataccaat aagataagca ttcttagttg caggggcaga 1140 

accgattatc cgttaaaatc gaagacggaa gtagccagag atattattga tagaatgata 12 0 0 



840 
900 
960 



1212 



60 



aaagaaatgt ga 

<210> 1615 
<211> 1368 
<212> DNA 
<213> B.fragilis 

<400> 1615 

tcagatgctg aatgccatga atgtaaaaac cgggaaataa gacgaagaga catggctaaa 
tatacattgc cgccaaggca aaagatgatt aacctgctgt acgtggtatt gattgctatg 12 0 
ctggccatca atatatcgtc ggatgtctta gaggggtatg gacaaatgaa caacgactac 180 
cttccacaaa taaaaaagct ggaagaatat aaccggactt tactggaaag aattaacagc 2 40 
cgaaatgata aagcggcttt atctgcacag aacatagatg cggcggcagg aaaactaatg 300 
gatacactgg aggaactgaa agaagatatc gcccggaaag cggacaaaga gaaatatgaa 3 60 
gccggcaagc taaaggcaaa agatgacttg aacgctgtgc cggaggtatt tctgtcggtc 42 0 
accgggggga aagggaaagc actcaggctc tcactggata cattcaaaga agacgcttta 480 
tcgctgatca agaatgatgc acacagacaa ctggtaggca cttacctcaa tacggaaagt 
ccgggtaccg gaatatcctg ggaaaaggaa accttctctt atcttcctgc catcggtgga 
gtgacattta tcaataaaat gcaggaagag gtgttgctgt gcgtgaatga agtatatcgg 660 
tcactgctgt acgaagaggc agaagatgga aaaggcggag cttttgtatt catcaatgaa 72 0 
gaccagatga tagtaaataa agatggaacg gtggacctgc ctgtagtaca gatcacaccc 
gccttaacaa gtatcttgta taccgactat gaaaacccgc taaatatact gactgcggga 
ataccgttca acgaggtgac attccggatg acgaacggaa agatactcaa aagaggaaac 
cattgcatag ccgttcccga cgaaaaagca cagacagcga cagttaccgc cacacagata 
aaaaacgggg tggcaaggca actggccgaa taccggtata ccgtaaaggc actgcccgat 
ccgacacctt atatactctg cacggatgaa aacgggagaa cggtacaata ccggggaaat 
gtgcccatta acaaacggct ggtatccaac atgacacagc tgggagcttc aatcagcgat 1140 
ggtccgaaag ccaactacga gatcagcagc tttgaaatgg tattgatcaa aggaagcagt 1200 
aaagcggtaa cttcaatacc caacaccggg aacaaattct cggccaggca aatggaactg 12 60 
atcagacaat tggagaaagg agataaattc tatatcactt cgattgttgt gaccggtccg 13 2 0 
ggaaacaaaa agaaacagat tgcatcaatc aatgtcgtat taatataa 13 68 

<210> 1616 
<211> 1257 
<212> DNA 
<213> B.fragilis 



540 
600 



780 

840 

900 

960 

1020 

1080 



<400> 1616 

tgtattaatg aagataggat gatgaacgga atatttgaaa aattatatga tatgactgcc 

ttcagcaata ttgttgccga accgcagttt ctggtgatgt atgtcattgc cttcgttctc 

ttgtatctgg gtataaagaa acaatacgag cctcttttat tggtgccgat tgcctttgga 180 



60 
120 



644 



gtgctgttgg 
atgatcctgg 
catgaattgg 
ccgatcattt 
cgtctgtcta 
atcctgatgg 
ggacctacgg 
attgccgcct 
ttgtgtacca 
aaaacggaaa 
gtcgtggctc 
ctggtgaaag 
atgaatgcgg 
ttcctgaact 
atagcaggag 
ccgctgattg 
attgctttga 
tcgggagtga 



ctaacttccc 
tgaacggagt 
gactgatgaa 
ttatgggagt 
tattcggggc 
gatttacacc 
ccatctttac 
attcttatat 
agaaggaact 
ttaaaaacct 
tgtttgtacc 
agatcggtgc 
caaccatttt 
ggacgactat 
gtatcttctt 
gtgctacggg 
aatatgatcc 
tcgggtctgc 



cggcggagga 
aatgaagaat 
ctttgtgtat 
gggggccttg 
tgccgctcaa 
cagtgaagca 
caccatcaag 
ggcattggtt 
gagtatcaat 
ccgtgtattg 
gagtgcagtg 
caatactttc 
cctgggtttg 
cggtattgtg 
tgtgaaactg 
acttagtgcg 
taaaaatcat 
cgtagcggca 



atgggagtga 
atctgggaga 
tatatgctta 
acggacttcg 
ttgggtatct 
gcttctttgg 
ttggctccgc 
ccggttatca 
atgaaagagc 
aaaattattt 
cctttgatcg 
cgtctgtttg 
tcggtaggag 
gtaggaggat 
gtgaatctgt 
gttcctatgg 
gtattgcaat 
ggggtgctga 



tacaggctga 
tgcctctcca 
taaagacagg 
gaccgatgct 
ttactgtgtt 
gaattatcgg 
atctgttggg 
ttccactggt 
aggagaagaa 
tcccgattgt 
gtatgctgat 
atgcggcttc 
ccacgatgac 
tcctggcttt 
ttacgaagaa 
ccagccgtgt 
attgcatggc 
tctcttttct 



cgagaatggc 
tgatattgct 
gttccttccc 
tcgcaatctg 
gttggtagct 
tggtgcggac 
cccgatcgcg 
cgttcgtctt 
atatccatcg 
ggtgactacg 
gttcggtaac 
gaatagtatc 
aagtgaagct 
tgctttgtca 
aaagattaat 
agccaatgac 
cagcaatatc 
gtcttaa 



240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1257 



<210> 1617 
<211> 1197 
<212> DNA 
<213> B.fragilis 



<400> 1617 

gaagtttgtg 

aatttactgt 

tactggcata 

gccagaggaa 

gaaaatccat 

agttgtttta 

tatcgcacgt 

gaaagcatga 

tttaagttga 

tgcataatat 

gtaatcacca 

atatatgttg 

attttaagct 

tattggtttt 

attcaagtga 

ttacccaaac 

gagtctatat 

ggttactttg 

aaagtagcct 

agtgggcgta 



agacaaataa 
tattatatac 
atgccggatt 
atgattactt 
ttttctcagt 
tggtgtatgc 
atgccagata 
tacgccaggc 
aatttaacaa 
ttgccatact 
ccctctatat 
cgtgtgtcta 
ttgcagcaga 
cagaaaaggg 
taggctcttc 
attatgctct 
tcgtaaaact 
ctttagcaat 
atgtctgtct 
ctatgttcat 



gcgcattata 
attctggaaa 
gtgcgtaatc 
tgcgtactcc 
cattaatgaa 
gttcacattt 
tatgttccca 
attcagctac 
gccaaaggat 
aacattagcc 
attttggcgt 
catattacca 
cacaaacgaa 
tgagaacgat 
tgcattgatg 
gataacaatg 
agaaattttg 
agtagt at cc 
cctttggttt 
ctgggatacc 



atgaatttgg 
gccggtagaa 
ctattctcta 
agaattttcc 
ttactcagaa 
gcattgtgcg 
ctattcttga 
tcttttttct 
atattgcata 
atacacactg 
aaacctttcc 
catatattta 
cgtgcagcag 
cagtatgata 
tatttcggat 
cttaatacct 
catcgtatag 
tataaaacaa 
gtttattact 
cattatcctt 



accttgactt 
atatctcgca 
ttgtacaagg 
gtgaaggtag 
tagttggtat 
ccatgatttt 
taggcttcat 
tcctatattt 
accataaaaa 
gcaatattat 
agccacagtt 
atttcaattg 
aatatgtgaa 
aaaattttat 
atagactaat 
ttattattgg 
gacaaactct 
ttaaactaaa 
acgtaaaata 
ttttcaaatt 



catattacta 
aagcatggat 
ctgccgattt 
cctgcatgtc 
taatgagtat 
tatgaaagac 
gaacttcgaa 
gaaatatctc 
attaatatac 
aagcttattt 
tgccataccg 
gctggaaccc 
gaatgctgac 
cgtggaaatt 
catagaaaag 
tttgtgtata 
agatattgta 
acctatccaa 
cttattcttc 
tatataa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1197 



<210> 1618 
<211> 1182 
<212> DNA 
<213> B.fragilis 



<400> 1618 
ataacaatag 
ttacttaccc 
cataacgggt 
agtgatatta 
aaaaatccgg 
tctgctatga 
tcgcggacta 



atatggcagt 
agtacttcag 
tggaagcttt 
atatgccgga 
caatgaagtg 
acaagggagc 
ttgaaaaggc 



aaaaatattg 
gcgacaaata 
acagaaactt 
gatggatggg 
tataatggtc 
gtttgatttt 
gatcgaacag 



agtgtagatg 
cgaaaaggtg 
ctggaaactc 
ttgaccctat 
tctgcttatg 
gcaaccaaac 
gttcgctata 



atgaactcga 
aatacgagtt 
ccgatttcga 
tggctaaggt 
gagacatgga 
cgatcgattt 
tccgtgagtc 



tttagaagta 
cgcttttgcc 
tatcatcctg 
caatgaactg 
taacatacgt 
ggatgatctg 
acagcaggag 



60 

120 

180 

240 

300 

360 

420 



645 



cacaaccaac tggaatctat caaaaatgac ctggccattg cgggagaaat ccagcaaacg 480 

attcttcccc gttcctttcc tccttttccg gaactgacgg aagtggttga tatttatgct 540 

tccatgactc cggcaaaaga tgtaggtggc gatttttatg atttcttcca gattgacgat 600 

gaacgtatcg ggctggtgat tgctgacgta tctgggaaag gggtgccggc atccttgttc 660 

atggcggtta gtcggaccct gctccgtgca actgctcttc ggggtgtttc gtcggcagaa 720 

840 
900 
960 



tgccttactt atgccaataa gttactgtgt aaagagagcc tggactctat gtttgttacg 

gtcttttatg ggatttatca ttataaaacc ggcatgatgg actataccaa tgccgggcat 

aatccccctt atctccttcg cggcggacgg actgttgaat gcttgcctgt cgcttctaat 

tttgtggtag gcgtgttcga tgatattgaa tttgagagta atacattgac gttcggcatc 



ggtgacactt tacttctgta tacggatggt gtcacagagg cttttaacga caagcgggaa 102 0 

caattctcgg aaagtaactt acaggatata ttggcgtcta tgcacgaaag tagttccgca 1080 

aaagaggttg ttacgagtgt attgcagtct gttaagactt tctccggaga ctatcctcag 1140 

tccgatgaca taaccctgct ttctcttcaa cgaatcaaat aa 1182 

<210> 1619 
<211> 480 
<212> DNA 
<213> B.fragilis 

<400> 1619 

aaatcaaaag agaatatgtc aatgcatact tggtttgagt gtaagatccg ttacgagaaa 



60 



ttgatggaaa acggaatgaa caagaaagtt actgaacctt atctcgtgga tgcactcetgc 120 

240 
300 



tttacagaag cagaagcacg cattattgag gagatgaccc cgttcattac aggagaattt 

accgtatcgg atatcaaacg agccaactat agtgaacttt tccccagcga agaagaagct 

gctgaccgct ggtttaagtg taaactgatt tttatcacac tagacgaaaa aagtggtgct 

gagaaaaaaa cgtcgacaca agttctggta caggcagctg acctgcgcga tgctgtaaaa 3 60 

aagctggacg aaggcatgaa aggtacaatg gctgattatc agataggttc tgttgctgaa 420 

acagcaataa tggatgtata tccatatagt tctgagccta atgataaacc ggaagtataa 



480 



<210> 1620 
<211> 405 
<212> DNA 
<213> B.fragilis 

<400> 1620 

tttatgaaag agaattcaat caaaccgtat tgttattgtg gtgagtcaga aagctcattg 60 

gtcgataatg ctatttttgt ttattttggt gatgaataca gaagagtact tctggatgaa 12 0 

atcttatggc tggaagcatc cggcagttat tgtgtactct gtatggagaa cggtgcagag 180 

ataacagtca gctatccttt ggatcggatc ttcaataatg accttcctcg cggcaagttt 240 

cagagaattc atcgttctta cgctatcaat gtgttcaagg tgaccggatt tgcaggtaac 300 

tatgtacata taggaaagaa gatgttgccg gtcagtgaat ctcacaaaaa gaatttttta 3 60 

gcttgtttcc ataaaattta ctcaaagcgt gcattgggaa aataa 405 

<210> 1621 
<211> 621 
<212> DNA 
<213> B. fragilis 

<400> 1621 

tataatatgg ataataagaa agttaggagc actagtagcc aggtaatgga acttcagcaa 60 

ttgattgccg gtcctttgat tgcaactatt gaagcggatt cattatcttc acaaagatat 120 

ctggattatc tgatgaaaat cgcatttgaa tcctatgatc ctgtgacagg acggaccggt 180 

aagatacgta tgcttacgtt caactatcag agtcaggatg ccggtggggg aagaacgcaa 240 

agtgtaagta taccgatact gacattggta cctctgccac tgttgcaagt acaggaagca 3 00 

gatttcgatt tcgatattaa aattctggat gcactgtcgg aaacagctga agaaaaattt 3 60 
tcactggaag aaggtaaaag cgtgaatgag ccgcaaagtg gaggaggatt taaactccgg 
gcttcactgg ctcccaaaca gggagaaggc agcagcactt cgaatgtgca gcagagcttg 



420 
480 



tcggcaaata tgaaagtgaa agtgaagatg cgtcaggcag atatgcctgc ggggttgtct 540 
aatctgttac atctgacggc gagcaatatg caagtagaag aaactgaagc tgaagaaata 600 



646 



300 
360 
42 0 
480 



fi 0 1 

acggaaggag gaaataaatg a 

<210> 1622 
<211> 582 
<212> DNA 
<213> B.fragilis 

<400> 1622 

cgcattgcgg aaagttcaga atgcgcagga gaatttggag agaaaaagca agttcaacgg 60 

ttaaacggac gaggcatgaa attgactttt ttcaagcgga tgggggagaa gatccgccat 12 0 

ccgttccgaa aggaaattcc gaaaacaatt cccgttgtag aaactgcccc tcagccggta 180 

gcggataata caaccgaagc aacggcagaa gactcttccg tcataagatc ggcagatcaa 240 

tgtggggaac aggcacgtta ttttttacta agaaataaca agccggttgg taaacctttc °™ 

agttattatc atcccgagat acggatcgtt catgtcggta gttttgtaaa tgccttttta 

tttttcttgc gtatgtgcga tcagcgtctg ttgacctatc gccagaccgg agaatatctg 

cattgtacag ccgtttttcc ggatgaaagc ggtaatttgt atttcacgaa taaagtgact 

tgccgtaaca aggaaaatac tgttgcggtc ctgaaaattg attatgttgg ccttaagcca 54 0 

aaaatcactg aaattagatt tgaattaaat attaaaaaat ga 582 

<210> 1623 
<211> 573 
<212> DNA 
<213> B.fragilis 

<400> 1623 

cgtatccggg agcgaccctc cctccccttt ctcaaaccat actcccaatc agacatgaaa 60 

tcctggcttg ccgcctatgt ccgtctctat cacgaaaaga aaacccgtga ccgcctgacg 120 

gcaatgggca tcgaaagttt cctccccgtg caggaagaaa tccatcaatg gagcgaccgc 

cgcaaaaaga tcgagcgcgt agtcatcccg atgatgatct tcgtacacgt cgacccggca 

gaacgtgccg aagtgttgac cctttcgtcc gtcagccgct acatggtgtt gcgcggacaa 

agcacccccg ccgtcatccc cgacgagcag atggagcgct tccgcttcat gctcgactac 

tccgaagaag ccatcgaagt gtgctcctcc cccctcgccc ccggcgaaca ggtgcgagtc 

atcaaaggcc ccctcgccgg actggaaggc gagctggtga ccatcgacgg caaaagcaag 

gtggcggtaa ggctggatat gctgggctgc gcccatgtgg atatgccggt ggggttcgtg 540 

gagagagtgg gaaaaatgga ggcggtgaga tga 573 

<210> 1624 

<211> 1650 

<212> DNA 

<213> B.fragilis 



180 
240 
300 
360 
420 
480 



<400> 1624 

aggaatatac ttatggaaaa attgcatatc cggaagattg cttcattggg gttgatgctt 
tgctttttta cgggggtagg agcacagaca cctgtcaaag tggaaaaaag gaaagagcat 
aaatcgaata ctgtaatacc tgttgtcaag ggaaatgtga cagatacctt gtcacttgta 
tcatttaacg attttcatgg agcctttgcc tgcgataagg gtgttcccgg agccggccaa 
ctggtacaaa cagtgttgac acagaaagag aaaaataaaa ataccatcgt gctttctgtc 
ggagataatt tcagcggaag ttatttctca agaataacca gaggcaatcc gttaccggaa 
atgtttcagg aaatggatgt aaaaatgtcc gctgtaggca atcacgagtt tgattgggga 
ttaccctatc tgacggatac ggcgaaggta tatatgaatt ttgtggcagc caatattata 
acggatcggg gagatacgtt ggagtgggct aaaccttacc ggattgtgac tctgaatttg 540 
aagaatggag gaacggtgcg ggtggctttc gtagggttga caacgactga tacggcacat 
aaaacgagtc cggaaaatat aaagggactg gcttttgtgc atcctgtata tgcagcccgt 
gtcgagactg cctgtcggtt gaagaaagaa ggcaaagtgg atatggtagt actcttgatg 72 0 
catatcggca ctaacatgaa gaatagagat attatagaag aggagaatgc taaattgctg 
cctttcctga aaggagtgga cgcaatcatt tccgggcatt ctcacgaagt tgttcttagt 
aaggtgaacg atgtacccat tatccaggca ggggtaaacg gtactcatat cggtaagttg 
gattttagag tagtgaaaga agagggcggc aatcgcatct cttatatagg aggtgataca 
attcggacag aggggccgtc taatgcacat atcgattcgt tggtcgataa agtattggcg 102 0 



60 

12 0 

180 

240 

300 

360 

420 

480 



00 
660 



780 
840 
900 
960 



